BrainNet II - Creating A Neural Network Library
This article explains the concepts behind Backward Propagation Neural Networks in such a way that even a reader with no prior knowledge of neural networks can follow the required theory. The related project demonstrates the design and implementation of a fully working 'BackProp' Neural Network library, i.e., the BrainNet library, as I call it. You will find the theory, illustrations and concepts here, along with an explanation of the neural network library project. The full source code of the library and the related demo projects (a simple pattern detector, a handwriting detection pad, an XML based neural network processing language, etc.) is in the associated zip file.
Contents
- 1. Overview
- 2. Before We Begin.
- 3. Understanding Neural Networks
- 4. How A Neural Network Actually Works
- 5. Designing BrainNet Neural Network Library
- What is Next
- Appendix A: Small Dose Of Spiritual Programming!!
1. Overview
- Solution Architect: "Well, you learned something about neural networks?"
- (Dumb?) Developer: "No, I'm smart enough. I love using others' code."
- Solution Architect: "But, if you don't understand the concepts, how can you optimize and re-use others' code?"
- (Dumb?) Developer: "Err.. I feel that most others can code better than me, so why should I optimize?"
In my previous article, the focus was on what a neural network can do. In this article, we will see what a neural network is, and how to create one yourself. I will go a little deeper. After reading this article, you will be able to
- Understand the basic theory behind neural networks (backward propagation neural networks in particular)
- Understand how neural networks actually 'work'
- Understand in more detail, the design and source code of BrainNet library.
- Understand in more detail, how to use BrainNet Library in your projects.
- Think about new possibilities of neural network programming
- Put forward some concepts to optimize and generalize BrainNet library.
Now, let me answer some questions I got in the past.
- Q) Why did you select an object oriented programming model for this Neural Network Library?
- Answer - The focus is on the understandability of basic concepts, not on performance.
- Q) Is this neural network library fully optimized?
- Answer - Not yet, we are still in the beta stage. The focus is on readability, so the code is flattened so that even a beginner can understand it. Suggestions and modifications are always welcome. Send your modifications, hacks and suggestions to amazedsaint@gmail.com
- Q) Can this library be used in projects?
- Answer - You can use it, as long as your usage conforms to the specifications in the associated license notice (see the source code). Anyway, I request you to send me a notification (and the modified code), if you hack it or use it in any of your projects.
2. Before We Begin.
This article is complete by itself. It explains what a neural network is, and how to create one on your own. However, to get an idea of what a neural network can do, and to get a user level experience, please read the first part of this article series.
The first article in this series is titled "BrainNet Neural Network Library - Part I - Learn Neural Network Programming step by step And Develop a Simple Handwriting Detection System". You can read it and download the source code, either from Code Project or from my website.
If you are really a beginner, it will help you a lot, and may provide you a step by step approach towards understanding neural networks.
This is my second article about Neural Networks in general and the BrainNet Neural Network Library in particular. This article explains Neural Networks and their working in more detail, and in a very simple way. Then I will explain the design concepts of BrainNet library.
3. Understanding Neural Networks
One fascinating thing about artificial neural networks is that they are mainly inspired by the human brain. This doesn't mean that Artificial Neural Networks are exact simulations of the biological neural networks inside our brain, because the actual working of the human brain is still a mystery. The concept of artificial neural networks emerged in its present form from our very limited understanding of our own brain ("I know that I know nothing").
Brain Net Neural Network library is designed and implemented using Object Oriented Concepts.
Before understanding how neurons and neural networks actually work, let us revisit the structure of a neural network. As I mentioned earlier, a neural network consists of several layers, and each layer has a number of neurons in it. Neurons in one layer are connected to multiple or all neurons in the next layer. Input is fed to the neurons in the input layer, and output is obtained from the neurons in the last layer.
Fig: A Fully Connected 4-4-2 neural network with 4 neurons in input layer, 4 neurons in hidden layer and 2 neurons in output layer.
An artificial neural network can learn from a set of samples.
For training a neural network, first you provide a set of inputs and outputs. For example, if you need a neural network to detect fractures from an X-Ray of a bone, first you train the network with a number of samples. You provide an X-Ray, along with the information on whether that particular X-Ray shows a fracture or not. After training the network a number of times with a number of samples like this (probably thousands of samples), it is assumed that the neural network can 'detect' whether a given X-Ray indicates a fracture in the bone (this is just an example). The concept of training a network is detailed in my first article. Later, in this article, we will discuss the theory behind network learning.
As we already discussed, the basic component in a neural network is a neuron. First of all, let us have a very brief look towards biological neurons, and their corresponding artificial models.
3.1 Biological Neurons
First of all, let us have a look at a biological neuron. Frankly, I don't have much knowledge regarding the actual structure of a biological neuron. However, the following information is more than enough at this stage for us to get into the groove. A biological neuron will look somewhat similar to this.
The four basic components of a biological neuron are
- Dendrites - Dendrites are hair like extensions of a neuron, and each dendrite can bring some input to the neuron (from neurons in the previous layer). These inputs are given to the soma.
- Soma - The soma is responsible for processing these inputs, and the output is provided to other neurons through the axon and synapses.
- Axon - The axon is responsible for carrying the output of the soma to other neurons, through the synapses.
- Synapses - The synapses of one neuron are connected to the dendrites of neurons in the next layer. The connections between neurons are possible because of synapses and dendrites.
A single neuron is connected to multiple neurons (mostly, all neurons) in the next layer. Also, a neuron in one layer can accept inputs from more than one neuron (mostly, all neurons) in the previous layer.
3.2 Artificial Neurons
Now, let us have a look at the model of an artificial neuron.
An artificial neuron consists of various inputs, much like the biological neuron. Instead of Soma and Axon, we have a summation unit and a transfer function unit. The output of one neuron can be given as input to multiple neurons.
Please note that for an artificial neuron, we have a weight value associated with each input. Now, let us have a look at the working of a neuron.
Summation Unit
- When inputs are fed to the neuron, the summation unit will initially find the net value. For finding the net value, the product of each input value and the corresponding connection weight is calculated.
- I.e., the input value x(i) of each input to the neuron is multiplied with the associated connection weight w(i). In the simplest case, these products are summed and fed to the transfer function. See the pseudo code below, it is simpler to understand.
Also, a neuron has a bias value, which affects the net value. A bias of a neuron is set to a random value, when the network is initialized. We will change the connection weights and bias of all neurons in the network (other than neurons in the input layer), during training phase.
I.e, if x is the input, and w is the associated weight, then pseudo code for net value calculation is as follows.
netValue = 0
for i = 0 to neuron.inputs.count - 1
    netValue = netValue + x(i) * w(i)
next
netValue = netValue + Bias
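The same calculation can be sketched in a few lines of Python. This is only an illustration of the pseudo code above, not code from the BrainNet library; the function name `net_value` and the sample numbers are my own assumptions.

```python
def net_value(inputs, weights, bias):
    """Weighted sum of the inputs plus the neuron's bias."""
    if len(inputs) != len(weights):
        raise ValueError("each input needs exactly one connection weight")
    total = 0.0
    for x, w in zip(inputs, weights):
        total += x * w
    return total + bias


# Two inputs with made-up weights and a made-up bias
print(net_value([1.0, 0.0], [0.5, -0.25], 0.25))  # 1*0.5 + 0*(-0.25) + 0.25 = 0.75
```

Note that the bias is added once per neuron, after the loop, and not once per input.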
Transfer Function
Transfer function is a simple function, that uses the net value to generate an output. This output is then propagated to the neurons in the next layer. We can use various types of transfer functions as shown below.
Hard Limit Transfer Function: For example, a simple hard limit function will output 0 if the net value is less than 0.5, and will output 1 otherwise, as shown.
if (netValue < 0.5)
    output = 0
else
    output = 1
Sigmoid Transfer Function: Another type of transfer function is a sigmoid transfer function. A sigmoid transfer function will take a net value as input and produce an output between 0 and 1 as shown.
output = 1 / (1 + Exp(-netValue))
The implementation of summation unit and transfer function unit may vary in different networks.
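To get a feel for the difference between the two transfer functions, here is a small Python sketch of both. The function names and the `threshold` parameter are assumptions made for this illustration; they are not BrainNet identifiers.

```python
import math


def hard_limit(net_value, threshold=0.5):
    """Outputs 0 below the threshold and 1 otherwise."""
    return 0.0 if net_value < threshold else 1.0


def sigmoid(net_value):
    """Squashes any net value into the open range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-net_value))


for v in (-2.0, 0.0, 0.5, 2.0):
    print(v, hard_limit(v), round(sigmoid(v), 4))
```

The hard limit function jumps abruptly from 0 to 1, while the sigmoid changes smoothly, which is what the delta calculation in the training phase relies on. One caveat of this simple sketch: `math.exp` raises an `OverflowError` for very large negative net values, so production code would usually guard against that.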
Thus, a neural network is constructed from such basic models, called neurons, arranged together in layers, and connected to each other as explained earlier. Now let us see how all these neurons work together, inside a neural network.
4. How A Neural Network Actually 'Works'
Working with a neural network includes
- Training the network - by providing inputs and corresponding outputs.
- In this phase, we train a neural network with samples to perform a particular task.
- Running the network - by providing the input to obtain the output.
- In this phase, we will provide an input to the network, and obtain the output. The output may not be accurate always. Generally speaking, the accuracy of the output during running phase depends a lot on the samples we provided during the training phase, and the number of times we trained the network.
4.1. Training Phase
This section explains how the training takes place in a backward propagation neural network. In a backward propagation neural network, there are several layers, and each neuron in each layer is connected to all neurons in the next layer. For each connection, a random weight is assigned when the network is initialized. Also, a random bias value is assigned to each neuron during initialization.
Training is the process of adjusting the connection weights and bias of all neurons in the network (other than neurons in the input layer), to enable the network to produce expected output for all input sets.
Now, let us see how the training actually happens. Consider a small 2-2-1 network. We are going to train this network with the AND truth table. As you know, the AND truth table is
A | B | Output
0 | 0 | 0
0 | 1 | 0
1 | 0 | 0
1 | 1 | 1
Fig: A 2-2-1 Neural Network and Truth Table Of AND
In the above network, N1 and N2 are neurons in input layer, N3 and N4 are neurons in hidden layer, and N5 is the neuron in output layer. The inputs are fed to N1 and N2. Each neuron in each layer is connected to all neurons in next layer. We call the above network a 2-2-1 network, based on the number of neurons in each layer.
The above diagram will be used to illustrate the process of training.
First, let us see how we train our 2-2-1 network with the first condition in the truth table, i.e., when A=0 and B=0, then output=0.
Step 1 - Feeding The Inputs
Initially, we will feed the inputs to the neural network. This is done by simply setting the output of the neurons in Layer 1 to the input values we need to feed. I.e., as per the above example, our inputs are 0,0 and the expected output is 0. We will set the output of neuron N1 to 0, and the output of N2 to 0.
Have a look at this pseudo code, and it will make things clear. Inputs is the input array. The number of elements in the Inputs array should match the number of neurons in the input layer.
i = 0
For Each someNeuron In InputLayer
    someNeuron.OutputValue = Inputs(i)
    i = i + 1
Next
Step 2 - Finding the output of the network
We have already seen how we calculate the output of a single neuron. As per our above example, the output of neurons N1 and N2 will act as the inputs of N3 and N4.
Finding the output of neural network involves, calculating the outputs of all hidden layers and output layer. As we discussed earlier, a neural network can have a number of hidden layers.
'Find output of all neurons in all hidden layers
For Each layer In HiddenLayers
    For Each neuron In layer.Neurons
        neuron.UpdateOutput()
    Next
Next
'Find output of all neurons in output layer
For Each neuron In OutputLayer.Neurons
    neuron.UpdateOutput()
Next
UpdateOutput() function of a single neuron works exactly as we discussed earlier. First, net value is calculated by the summation unit, and then it is provided to a transfer function to obtain the output of the neuron. Pseudo code is again shown below.
Summation Unit works like this:
Dim netValue As Single = bias
For Each InputNeuron connected to ThisNeuron
    netValue = netValue + (Weight Associated With InputNeuron * Output of InputNeuron)
Next
I.e, as per our above example, let us calculate the net value of neuron N3. We know that N1 and N2 are connected to N3
- Net Value Of N3 = N3.Bias + (N1.Output * Weight Of Connection From N1 to N3) + (N2.Output * Weight Of Connection From N2 to N3)
Similarly, to calculate the net value of N4,
- Net Value Of N4 = N4.Bias + (N1.Output * Weight Of Connection From N1 to N4) + (N2.Output * Weight Of Connection From N2 to N4)
Activation Unit Or Transfer Unit:
Now, let us see how we are generating the output, using Transfer unit. Here, we are using the sigmoid transfer function. This is exactly as we discussed earlier.
Output of Neuron = 1 / (1 + Exp( - NetValue ))
Now, the output of N3 and N4 will be passed to each neuron in the next layer as inputs. This process of propagating the output of one layer as the input to the next layer is called forward propagation part in the training phase.
Thus, after step 2, we just found the output of each neuron in each layer - starting from the first hidden layer to the output layer. The output of the network is simply the output of all neurons in the output layer.
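Steps 1 and 2 together can be sketched in Python as follows. This is a minimal illustration and not the BrainNet implementation: the list-based representation of layers, the function names and all weight and bias values are assumptions.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def layer_output(inputs, weights, biases):
    """weights[j][i] is the weight of the connection from input i to neuron j."""
    outputs = []
    for neuron_weights, bias in zip(weights, biases):
        net = bias + sum(x * w for x, w in zip(inputs, neuron_weights))
        outputs.append(sigmoid(net))
    return outputs


def forward(inputs, layers):
    """layers is a list of (weights, biases) pairs, hidden layers first."""
    values = list(inputs)  # Step 1: the inputs act as the input layer's outputs
    for weights, biases in layers:  # Step 2: propagate layer by layer
        values = layer_output(values, weights, biases)
    return values


# Illustrative 2-2-1 network; all numbers are made up
hidden = ([[0.1, -0.2], [0.4, 0.3]], [0.0, 0.1])  # neurons N3 and N4
output = ([[0.5, -0.5]], [0.2])                   # neuron N5
print(forward([0.0, 0.0], [hidden, output]))
```

With inputs 0,0 the weighted terms vanish, so each hidden neuron's output depends only on its bias. That is why an untrained network will usually not produce the expected 0 for this row of the truth table.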
Step 3 - Calculating The Error or Delta
In this step, we will calculate the error of the network. Error or Delta can be stated as the difference between the expected output and the obtained output. For example, when we find the output value of the network for the first time, most probably the output will be wrong. We need to get 0 as the output for inputs A=0 and B=0. But the output may be, some other value like 0.55, based on the random values assigned to the bias and connection weights of each neuron.
Now let us see, how we can calculate the error. Let us see how to calculate the error or delta of each neuron in all the layers.
- First we will calculate the error or delta of each neuron in the output layer.
- The delta value thus calculated will be used to calculate the error or delta of neurons in the previous layer (i.e, the last hidden layer)
- The delta value of all neurons in the last hidden layer is used to calculate the error or delta of all neurons in the previous layer (i.e, second last hidden layer)
- This process is continued, till we reach the first hidden layer (delta of input layer is not calculated).
Please note one interesting point. In Step 2, we are propagating values forward - starting from the first hidden layer to the output layer, for finding the output. In Step 3, we are starting from the output layer, and propagating the error values backward - and hence, this neural network is called as a Backward Propagation neural network.
Time to see how things actually work. The general equation for finding the delta of a neuron is
Neuron.Delta = Neuron.Output * (1 - Neuron.Output) * ErrorFactor
Now, let us see how the error factor is calculated for each neuron. The Error Factor of neurons in output layer can be calculated directly (since we know the expected output of each neuron in output layer).
For a neuron in output layer,
ErrorFactor Of An Output Layer Neuron = ExpectedOutput - Neuron's Actual Output
i.e, with respect to our above example, if the output of N5 is 0.5 and the expected output is 0, then error factor = 0 - 0.5 = - 0.5
For a neuron in a hidden layer, the error factor calculation is somewhat different. To calculate the error factor of a neuron in a hidden layer,
- First the delta of each neuron to which this neuron is connected is multiplied with the weight of this connection
- These products are summed up together to obtain the error factor of a hidden layer neuron
Simply speaking, a neuron in a hidden layer is using the delta of all connected neurons in next layer, along with the corresponding connection weights, to find the error factor. This is because, we don't have any direct parameters for calculating the error of neurons in the hidden layer (as we did in the output layer neurons).
'Calculating the error factor of a neuron in a hidden layer
errorFactor = 0
For Each Neuron N to which ThisNeuron Is Connected
    'Sum up all the delta * weight
    errorFactor = errorFactor + (N.DeltaValue * Weight Of Connection From ThisNeuron To N)
Next
To illustrate this, consider a neuron x1 (ThisNeuron), which is a hidden layer neuron. X1 is connected to neurons y1, y2, y3 and y4 - and these are neurons in next layer.
i.e, to make things simple,
- Error Factor of X1 = (Y1.Delta * Weight Of Connection From X1 To Y1) + (Y2.Delta * Weight Of Connection From X1 To Y2) + (Y3.Delta * Weight Of Connection From X1 To Y3) + (Y4.Delta * Weight Of Connection From X1 To Y4)
Now, as we discussed earlier, the Delta of X1 can be calculated as,
- X1.Delta = X1.Output * (1 - X1.Output) * ErrorFactor Of X1
Thus, after finishing step 3, we have the Delta of all neurons.
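The two delta calculations of Step 3 can be sketched in Python like this. The function names and the connection weight 0.4 are assumptions made for the illustration; only the formulas come from the text above.

```python
def output_delta(output, expected):
    """Delta of an output layer neuron: error factor is expected - actual."""
    error_factor = expected - output
    return output * (1.0 - output) * error_factor


def hidden_delta(output, next_deltas, forward_weights):
    """Delta of a hidden neuron: error factor is the sum of delta * weight
    over all neurons in the next layer that this neuron feeds."""
    error_factor = sum(d * w for d, w in zip(next_deltas, forward_weights))
    return output * (1.0 - output) * error_factor


# N5 produced 0.5 but 0 was expected (the example from the text)
d5 = output_delta(0.5, 0.0)
print(d5)  # 0.5 * (1 - 0.5) * (0 - 0.5) = -0.125

# A hidden neuron with output 0.5, connected to N5 with an assumed weight of 0.4
print(hidden_delta(0.5, [d5], [0.4]))
```

The hidden neuron never sees the expected output directly. It only sees how much the neurons it feeds were wrong, scaled by how strongly it is connected to each of them.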
Step 4 - Adjusting The Weights and Bias
After calculating the delta of all neurons in all layers, we should correct the weights and bias with respect to the error or delta, to produce a more accurate output next time. Connection weights and bias together are called free parameters. Remember that a neuron usually has to update more than one weight, because, as we already discussed, there is a weight associated with each connection to a neuron.
See the pseudo code for updating the free parameters of all neurons in all layers
'Update free parameters of all neurons in hidden layers
For Each layer In HiddenLayers
    For Each neuron In layer.Neurons
        neuron.UpdateFreeParams()
    Next
Next
'Update free parameters of all neurons in output layer
For Each neuron In OutputLayer.Neurons
    neuron.UpdateFreeParams()
Next
UpdateFreeParams() function simply does two things.
- Find the new bias of a neuron, based on the delta we calculated above
- Update the connection weights based on the delta we calculated above
Finding the new bias value of a neuron is pretty simple. See the pseudo code, where Learning Rate is a constant (e.g., Learning Rate = 0.5).
New Bias Value = Old Bias Value + LEARNING_RATE * 1 * Delta
Now let us see how to update the connection weights. The new weight associated with an input neuron can be calculated as shown below.
New Weight = Old Weight + LEARNING_RATE * 1 * Output Of InputNeuron * Delta
As a neuron can have more than one input, the above step should be performed for all input neurons connected to this neuron.
I.e,
For Each InputNeuron N connected to ThisNeuron
New Weight of N = Old Weight of N + LEARNING_RATE * 1 * N.Output * ThisNeuron.Delta
Next
Now, after step 4, we have a better network. This process is repeated for all other entries in the AND truth table, probably more than a thousand times, to train the network 'well'.
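Putting the four steps together, here is a compact Python sketch that trains a 2-2-1 network on the AND truth table. It is not the BrainNet code: the dictionary layout, the function names, the initial value range of -1 to 1, the seed and the number of passes (5000) are all assumptions chosen for the illustration.

```python
import math
import random

LEARNING_RATE = 0.5


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_network(rng):
    """2-2-1 network with random weights and biases (range is an assumption)."""
    return {
        "hidden_w": [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)],
        "hidden_b": [rng.uniform(-1, 1) for _ in range(2)],
        "out_w": [rng.uniform(-1, 1) for _ in range(2)],
        "out_b": rng.uniform(-1, 1),
    }


def run(net, inputs):
    hidden = [sigmoid(b + sum(x * w for x, w in zip(inputs, ws)))
              for ws, b in zip(net["hidden_w"], net["hidden_b"])]
    out = sigmoid(net["out_b"] + sum(h * w for h, w in zip(hidden, net["out_w"])))
    return hidden, out


def train_once(net, inputs, expected):
    # Steps 1 and 2: feed the inputs and propagate forward
    hidden, out = run(net, inputs)
    # Step 3: deltas, output layer first, then the hidden layer
    out_delta = out * (1 - out) * (expected - out)
    hidden_deltas = [h * (1 - h) * (out_delta * w)
                     for h, w in zip(hidden, net["out_w"])]
    # Step 4: update biases and weights using the deltas found above
    net["out_b"] += LEARNING_RATE * out_delta
    net["out_w"] = [w + LEARNING_RATE * h * out_delta
                    for w, h in zip(net["out_w"], hidden)]
    for j, d in enumerate(hidden_deltas):
        net["hidden_b"][j] += LEARNING_RATE * d
        net["hidden_w"][j] = [w + LEARNING_RATE * x * d
                              for w, x in zip(net["hidden_w"][j], inputs)]


def total_error(net, samples):
    """Sum of squared differences between expected and actual outputs."""
    return sum((expected - run(net, inputs)[1]) ** 2 for inputs, expected in samples)


AND_TABLE = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]

net = make_network(random.Random(42))
error_before = total_error(net, AND_TABLE)
for _ in range(5000):
    for inputs, expected in AND_TABLE:
        train_once(net, inputs, expected)
error_after = total_error(net, AND_TABLE)
print("error before:", error_before, "error after:", error_after)
```

Note that all deltas are computed before any weight is changed, so the hidden layer deltas are based on the old connection weights, exactly as in the step order described above.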
4.2. Running The Network
Running the network involves,
- Providing the inputs to the network exactly as described earlier in Step 1 above
- Calculating the outputs as explained in Step 2 above
However, it is important to note that the network should be trained with sufficient samples (and a sufficient number of times) to obtain the desired results. Anyway, it is almost impossible to say that the output of a neural network will be 100% accurate for any input.
Now, let us see how these concepts are implemented in BrainNet Neural Network Library.
5. Designing BrainNet Neural Network Library
The fundamental challenge for any solution developer is to create, build or assemble a working program from his abstract concepts about a system. The quality of this transformation depends a lot on how well he understands the system. At this point, I would like to mention that the BrainNet library was not designed after a complete and thorough understanding of the various existing neural network models and emerging possibilities in the area of neural networks. Hence, I suspect that the present design of this framework is mostly biased towards the Backward Propagation systems I explained earlier, though it can be modified to create other neural network models also.
We are simply mapping the above concepts to the library. Hence, the following code and explanation is very easy to understand, if you read the above concepts regarding Neural Networks.
5.1. The UML Model
Now, let us have a look at some of the interfaces and classes in BrainNet library.
Have a look at the model below. Please note that this model holds only the major interfaces and classes within the framework.
Fig: A Partial Model of BrainNet Framework
As we discussed earlier, a Neural Network consists of various Neuron Layers, and each Neuron Layer has various Neurons. A Neuron has a strategy - which decides how it should perform tasks like summation, activation, error calculation, bias adjustment, weight adjustment etc.
To brief the UML diagram above,
- INeuron, INeuronStrategy, INeuralNetwork and INetworkFactory are interfaces.
- A Neuron should implement the INeuron interface.
- A Neural Network should implement the INeuralNetwork interface.
- A Neuron has a strategy, and a strategy should implement the INeuronStrategy interface. We have a concrete implementation of INeuronStrategy, called BackPropNeuronStrategy (for a backward propagation neural network).
- A Neural Network is initialized, and the connections between layers are made, by a neural network factory. A factory should implement the INetworkFactory interface. We have a concrete implementation of INetworkFactory, called BackPropNetworkFactory, for creating Backward Propagation neural networks.
The major interfaces in the model are briefed below.
INetworkFactory | An interface to define a neural network factory
INeuron | The interface for defining a neuron
INeuronStrategy | The interface for defining the strategy of the neuron
INeuralNetwork | The interface for defining a neural network
The major classes in the model are briefed below.
5.2. A Neuron In BrainNet Library
The INeuron interface provides an abstract interface that should be implemented to create a concrete neuron. I request you to refresh the concepts of an artificial neuron we discussed earlier.
The elements in the INeuron interface are detailed below.
'The interface for defining a neuron
Public Interface INeuron
    'The current bias of this neuron
    Property BiasValue() As Single
    'The current output of this neuron
    Property OutputValue() As Single
    'The current delta value of this neuron
    Property DeltaValue() As Single
    'A list of neurons to which this neuron is connected
    ReadOnly Property ForwardConnections() As NeuronCollection
    'Gets a list of neurons connected to this neuron
    ReadOnly Property Inputs() As NeuronConnections
    'Gets or sets the strategy of this neuron
    Property Strategy() As INeuronStrategy
    'Method to update the output of a neuron
    Sub UpdateOutput()
    'Method to find new delta value
    Sub UpdateDelta(ByVal errorFactor As Single)
    'Method to update free parameters
    Sub UpdateFreeParams()
End Interface
A concrete neuron will implement the INeuron interface. Neuron class is a concrete implementation of INeuron. The Strategy property of a Neuron holds its current strategy. Inputs property holds the references of Neurons (in previous layer) connected to this neuron. ForwardConnections holds references to the neurons (in next layer) to which this neuron is connected.
Now, have a look at the Neuron class by extracting the source code zip of the BrainNet library. Let us inspect three major functions implemented in the Neuron class - UpdateOutput, UpdateDelta and UpdateFreeParams. These functions are called by the NeuralNetwork class while training and running the network. We will see later how the functions in the NeuralNetwork class call these functions.
These functions use the current strategy object of the neuron to perform operations.
- UpdateDelta - Find the new delta of this neuron using the current strategy. Error factor (remember that this will vary based on the layer of a neuron) will be passed to the UpdateDelta function, from the functions in Neural Network class.
- UpdateOutput - Find the new output of the neuron, by finding the net value, and then by invoking the activation function - as defined in the current strategy.
- UpdateFreeParams - Updating free parameters includes calling the functions according to the current strategy of this neuron to find new bias and to update weights.
'Calculate the error value
Public Sub UpdateDelta(ByVal errorFactor As Single) Implements _
        NeuralFramework.INeuron.UpdateDelta
    If _strategy Is Nothing Then Throw New StrategyNotInitializedException("", Nothing)
    'Error factor is found and passed to this
    DeltaValue = Strategy.FindDelta(OutputValue, errorFactor)
End Sub
'Calculate the output
Public Sub UpdateOutput() Implements NeuralFramework.INeuron.UpdateOutput
    If _strategy Is Nothing Then Throw New StrategyNotInitializedException("..", Nothing)
    Dim netValue As Single = Strategy.FindNetValue(Inputs, BiasValue)
    OutputValue = Strategy.Activation(netValue)
End Sub
'Calculate the free parameters
Public Sub UpdateFreeParams() Implements NeuralFramework.INeuron.UpdateFreeParams
    If _strategy Is Nothing Then Throw New StrategyNotInitializedException("..", Nothing)
    BiasValue = Strategy.FindNewBias(BiasValue, DeltaValue)
    Strategy.UpdateWeights(Inputs, DeltaValue)
End Sub
5.3. The Strategy Of A Neuron
How a Neuron actually functions is decided by the strategy of a neuron. A concrete strategy should implement the INeuronStrategy interface. This interface is shown below. BackPropNeuronStrategy is a concrete implementation of INeuronStrategy interface.
The elements in INeuronStrategy interface, along with description is given below.
'The interface for defining the strategy of a neuron
Public Interface INeuronStrategy
    'Function to find the delta or error rate of this INeuron
    Function FindDelta(ByVal output As Single, ByVal errorFactor As Single) As Single
    'Activation function, or threshold function
    Function Activation(ByVal value As Single) As Single
    'Summation function for finding the net value
    Function FindNetValue(ByVal inputs As NeuronConnections, ByVal bias As Single) As Single
    'Function for calculating new bias
    Function FindNewBias(ByVal bias As Single, ByVal delta As Single) As Single
    'Method for updating weights
    Sub UpdateWeights(ByRef connections As NeuronConnections, ByVal delta As Single)
End Interface
Have a look at the BackPropNeuronStrategy class, in the code, and see how these functions are implemented as we described earlier. It is pretty easy to understand.
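If you prefer to see the idea before opening the source, the following Python sketch shows how a neuron can delegate its calculations to a strategy object. It is a hypothetical analogue and not a translation of BackPropNeuronStrategy: the class and method names are my own, the formulas are simply the ones from section 4, and the weight update is left out to keep it short.

```python
import math
from abc import ABC, abstractmethod


class NeuronStrategy(ABC):
    """Hypothetical Python analogue of a subset of INeuronStrategy."""

    @abstractmethod
    def find_net_value(self, inputs, weights, bias): ...

    @abstractmethod
    def activation(self, value): ...

    @abstractmethod
    def find_delta(self, output, error_factor): ...

    @abstractmethod
    def find_new_bias(self, bias, delta): ...


class BackPropStrategy(NeuronStrategy):
    """Uses the formulas from section 4 (assumed, not copied from the library)."""

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate

    def find_net_value(self, inputs, weights, bias):
        return bias + sum(x * w for x, w in zip(inputs, weights))

    def activation(self, value):
        return 1.0 / (1.0 + math.exp(-value))

    def find_delta(self, output, error_factor):
        return output * (1.0 - output) * error_factor

    def find_new_bias(self, bias, delta):
        return bias + self.learning_rate * delta


class Neuron:
    """Holds state only; every calculation is delegated to the strategy."""

    def __init__(self, strategy, bias=0.0):
        self.strategy = strategy
        self.bias = bias
        self.output = 0.0
        self.delta = 0.0

    def update_output(self, inputs, weights):
        net = self.strategy.find_net_value(inputs, weights, self.bias)
        self.output = self.strategy.activation(net)

    def update_delta(self, error_factor):
        self.delta = self.strategy.find_delta(self.output, error_factor)


n = Neuron(BackPropStrategy())
n.update_output([0.0, 0.0], [0.3, -0.7])
n.update_delta(0.0 - n.output)
print(n.output, n.delta)  # 0.5 -0.125
```

The benefit of this arrangement is that a different kind of network only needs a different strategy class; the neuron itself does not change.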
5.4. A Neural Network In BrainNet library
Now, let us see how the Neural Network is implemented. Any concrete neural network should implement the INeuralNetwork interface. INeuralNetwork interface is shown below.
Public Interface INeuralNetwork
    'Method to train a network
    Sub TrainNetwork(ByVal t As TrainingData)
    'This function can be used for connecting two neurons together
    Sub ConnectNeurons(ByVal source As INeuron, ByVal destination As INeuron, ByVal weight As Single)
    'This function can be used for connecting two neurons together with random weight
    Sub ConnectNeurons(ByVal source As INeuron, ByVal destination As INeuron)
    'This function can be used for connecting neurons in two layers together with random weights
    Sub ConnectLayers(ByVal layer1 As NeuronLayer, ByVal layer2 As NeuronLayer)
    'This function can be used for connecting all neurons in all layers together
    Sub ConnectLayers()
    'This function may be used for running the network
    Function RunNetwork(ByVal inputs As ArrayList) As ArrayList
    'This function may be used to obtain the output list
    Function GetOutput() As ArrayList
    'Gets all the layers in the network
    ReadOnly Property Layers() As NeuronLayerCollection
    'Gets the first (input) layer
    ReadOnly Property InputLayer() As NeuronLayer
    'Gets the last (output) layer
    ReadOnly Property OutputLayer() As NeuronLayer
End Interface
There are two interesting functions, TrainNetwork and RunNetwork, for training and running the network. The input to the TrainNetwork function is an object of TrainingData class. The TrainingData class has two properties of type ArrayList - Inputs and Outputs. To train the network, we put the input values to the Inputs array list, and corresponding output values are filled to the Outputs array list.
5.5. Training The Network
First of all, feed the inputs to all the neurons in the input layer. Then, the algorithm goes like this:
- Step1: Find the output of hidden layer neurons and output layer neurons
- Step2: Finding Delta
- 2.1) find the delta (error rate) of output layer
- 2.2) Calculate delta of all the hidden layers, backwards
- Step3: Update the free parameters of hidden and output layers
Have a look at how this goes, inside TrainNetwork function in the NeuralNetwork class, it is commented heavily. Some part of TrainNetwork function is shown below.
Dim i As Long
Dim someNeuron As INeuron
i = 0
'Give our inputs to the first layer. t is an object of TrainingData class
For Each someNeuron In InputLayer
    someNeuron.OutputValue = t.Inputs(i)
    i = i + 1
Next
'Step1: Find the output of hidden layer neurons and output layer neurons
Dim nl As NeuronLayer
Dim count As Long = 1
For count = 1 To _layers.Count - 1
    nl = _layers(count)
    For Each someNeuron In nl
        someNeuron.UpdateOutput()
    Next
Next
'Step2: Finding Delta
'2.1) Find the delta (error rate) of output layer
i = 0
For Each someNeuron In OutputLayer
    'Find the target-output value and pass it
    someNeuron.UpdateDelta(t.Outputs(i) - someNeuron.OutputValue)
    i = i + 1
Next
'2.2) Calculate delta of all the hidden layers, backwards
Dim layer As Long
Dim currentLayer As NeuronLayer
For i = _layers.Count - 2 To 1 Step -1
    currentLayer = _layers(i)
    For Each someNeuron In currentLayer
        Dim errorFactor As Single = 0
        Dim connectedNeuron As INeuron
        For Each connectedNeuron In someNeuron.ForwardConnections
            'Sum up all the delta * weight
            errorFactor = errorFactor + (connectedNeuron.DeltaValue * _
                connectedNeuron.Inputs.Weight(someNeuron))
        Next
        someNeuron.UpdateDelta(errorFactor)
    Next
Next
'Step3: Update the free parameters of hidden and output layers
For i = 1 To _layers.Count - 1
    For Each someNeuron In _layers(i)
        someNeuron.UpdateFreeParams()
    Next
Next
5.6. Running The Network
Running the network is pretty simple. For running the network, we just feed the inputs to the first layer, and calculate the outputs, just as explained earlier during the training phase. Here is some part of the RunNetwork function.
Dim someNeuron As INeuron
Dim i As Long = 0
For Each someNeuron In InputLayer
    someNeuron.OutputValue = CType(inputs(i), System.Single)
    i += 1
Next
'Step1: Find the output of each hidden neuron layer and the output layer
Dim nl As NeuronLayer
For i = 1 To _layers.Count - 1
    nl = _layers(i)
    For Each someNeuron In nl
        someNeuron.UpdateOutput()
    Next
Next
5.7. Creating A Network
Now, let us see how you can create a network easily. Here is a simple code that shows how to create a network. Let us assume that the input to the method is an array list which holds a list of long values that represent the number of neurons in each layer.
'Demo routine to create a network. The input parameter is a list of
'long values that represent the number of neurons in each layer
Public Sub CreateNetwork(ByVal neuronsInLayers As ArrayList)
    Dim bnn As New NeuralNetwork()
    Dim neurons As Long
    Dim strategy As New BackPropNeuronStrategy()
    'neuronsInLayers is an ArrayList which holds
    'the number of neurons in each layer
    For Each neurons In neuronsInLayers
        Dim layer As NeuronLayer
        Dim i As Long
        layer = New NeuronLayer()
        'Add the required number of neurons to this layer
        For i = 0 To neurons - 1
            layer.Add(New Neuron(strategy))
        Next
        bnn.Layers.Add(layer)
    Next
    'Connect all layers together
    bnn.ConnectLayers()
    'Now the network is ready, do other stuff here
End Sub
Better still, use the BackPropNetworkFactory class, which has two overloaded CreateNetwork functions for creating a neural network.
Some notes.
- This article is much like a 'Developers Guide' of BrainNet neural network library.
- Have a look at my previous article if you haven't done so yet. It is more or less a 'user's guide' for this library, with more information on how to use the BrainNet library in your own projects and how to see the demo projects in action.
What is Next?
Cheers!! That completes the second article about neural networks. Look back over it and make sure that you understood all the points clearly.
Experiment with the library yourself and try to optimize it a little, or even better, create a neural network of your own using this one as an example. In my next article, I will
- Explain how to create an XML-based language for creating, training and processing neural networks.
- Explain some classes in the framework that I haven't covered in this article (like the NXML interpreter, NetworkSerializer etc).
There are some 'Easter Eggs' in the BrainNet library source code that I haven't mentioned so far. For example, if you are feeling adventurous, start playing with the nxml tool, which is included in the associated zip along with the rest of the code. nxml is a command line tool that helps you create, train and run a neural network using XML. I'll explain it in detail in my next article. If you can't wait until then, compile the project and type nxml at the command prompt to see its usage :) Another demo project, a simple handwriting detection pad, is also available in the source code zip.
Also,
- You may visit my website http://amazedsaint.blogspot.com/ for a lot of tech resources, code and projects
- Read all the articles I have published so far at http://amazedsaint-articles.blogspot.com/. You'll find articles about Design Patterns, Neural Networks, Security, Hacking and more.
- You can subscribe to the XML Atom feed of my technical articles blog to track new posts.
If you come across any bugs while playing with the library, please report them.
Appendix A: Small Dose Of Spiritual Programming!!
This paragraph is about the life of a programmer, and not about technology.
We are always seeking joy in life, but few people find it permanently. We need to be intense in our life. When we were kids, we had a lot of intensity, grace and bliss in our approach; we were capable of extracting the whole juice, and we were 100% present in all our activities. But somewhere along the way we lost our innocence, bliss and intensity. Satisfaction and a smile are the cornerstone and symbol of success.
Here are some tips for my fellow programmers.
- You may go to http://www.artofliving.org/ and attend the Art Of Living healing breath workshop by contacting a center near you - and learn some Yoga and breathing practices to clear your body and mind. It is amazing.
- Drink plenty of water while you are at your computer, because if you don't have enough water in your body, you'll get depressed very easily. Drink two or three glasses of water each day when you get up. Normally, when you are working, your entropy goes high and you lose a lot of water.
- Take some Ayurveda medicines, this can stabilize your body and mind, to improve the quality of your intellect and clarity of your mind.
After spending quite a few years as a programmer/solution architect, I realized that practicing a little Yoga, Sudarshana Kriya, Pranayamas etc (I learned these processes after attending a program from the Art Of Living Foundation) can change my attitude and behavior a lot, so that I can create a lot of harmony and contentment inside and outside, and improve my productivity by leaps and bounds. This prompted me to recommend the Art Of Living Healing Breath Workshop to all my friends and fellow programmers.
Have a look at my Inspiring Intuitions blog at http://amazedsaint-intutions.blogspot.com/ for some more thoughts. Have a great time, enjoy coding - and don't forget to enjoy life. And what I recommend to you is some spiritual programming!!
Again, to conclude with some technology - you may proceed and read all the articles I published so far here, at http://amazedsaint-articles.blogspot.com/
7 comments:
Thanks a lot. I've been searching all over for a more concrete description of the whole neural network creation and training process and this is the best I have found.
Can't explain how this article helped me out... Really, thanks a lot! Our teacher's material is nothing compared to this.
Congratulations!
Thank you very much, I really needed such an example to understand the EBP-Alg better. In addition, you made it an example with a UML diagram -> respect.. respect.. :)
Best wishes to you!
Nice article and nice code. I've used the Neural Networks in Matlab for image recognition of number plates.
The network and training is nearly the same but this recognition uses the same number of output values for the amount of characters to recognize. (= a-z + 0-9 + special chars)
This technique works much better than using the binary value and guessing the value.
Therefore the change of equality of input and output is calculated.
so output 1 is: A; output 2 is: B; output 4 is: C; etc.
If the user 'say' the character is incorrect (or other intelligence) the next alternative can be used.
This could also be used in the example of 'Neutral gate'. It's just another way to use the neural network.
Besides image information, other information can also be used as input parameters, like the position in the text, the previous letter, the next letter or the type of character. Even the following parameters can be used: the time of day (because of the influence of light), the weather, the noise, etc.
All these parameters make the recognition quality better, but also slower!
Wow! This is really a great article. Thanks!
Truly excellent article - has provided lots of food for thought for my thesis! :)
the images seem to be broken (404). can you have a look at that?
thanks!