7. Statistical Prediction with Neural Networks

Neural nets fundamentals

The Perceptron

A type of artificial neuron developed in the 1950s and the 1960s

It takes several binary inputs $x_1,x_2,x_3$ and produces a single binary output

Each input has an associated weight $w_1,w_2,w_3$ indicating the importance of its input to the output.

To calculate the output: $output = \left\{ \begin{array}{ll} 0 & \text{if } \sum_jw_jx_j \leq \text{threshold}\\ 1 & \text{if } \sum_jw_jx_j \geq \text{threshold} \\ \end{array} \right.$

A network of perceptrons could weigh up evidence and make decisions, like computing logical functions with binary operations such as AND, OR or NAND gates.

From binary to sigmoid functions

Sigmoid neuron. Same structure as perceptron

The output is defined by the sigmoid function:

$\sigma(z)=\frac{1}{1+e^{-z}}$ $\sigma(w\cdot x+b)=\frac{1}{1+exp(-\sum_jw_jx_j-b)}$

Inputs $x_j$ and single output in the $[0,1]$ range.

Weights, $w_j$ tell us how important each input is.

Bias $b$ tell us how high the sum needs to be to activate the neuron.

Other activation functions: ReLU, Softax, etc

7. Statistical Prediction with Neural Networks

7. Statistical Prediction with Neural Networks

Learning objectives

Neural nets fundamentals

ANNs

Network architecture

The Perceptron

From binary to sigmoid functions

Backpropagation

Cost function

Gradient descent

Gradient descend

Gradient descend calculation

Algorithm

Again

Backpropagation implementation

Artificial Neural Networks