
Introduction to Artificial Neural Networks

Lecturer (主講人): 虞台文

Content

Fundamental Concepts of ANNs
Basic Models and Learning Rules
– Neuron Models
– ANN Structures
– Learning
Distributed Representations
Conclusions

Introduction to Artificial Neural Networks

Fundamental Concepts of ANNs

What is an ANN? Why ANNs?

ANN: Artificial Neural Networks
– Simulate the behavior of the human brain.
– A new generation of information-processing systems.

Applications

Pattern Matching
Pattern Recognition
Associative Memory (Content-Addressable Memory)
Function Approximation
Learning
Optimization
Vector Quantization
Data Clustering
. . .

Traditional computers are inefficient at these tasks, even though their raw computation speed is much higher.

The Configuration of ANNs

An ANN consists of a large number of interconnected processing elements (PEs) called neurons.
– A human brain consists of ~10^11 neurons of many different types.

How does an ANN work? Through collective behavior.

The Biological Neuron

[Figure: a biological neuron, labeling the dendrites (樹狀突), the axon (軸突), and the synapse, i.e., the junction between the nerve fibers of two neurons (二神經原之神經絲接合部分).]

Synaptic inputs are either excitatory or inhibitory.

The Artificial Neuron

[Figure: an artificial neuron with inputs x1, x2, …, xm, weights wi1, wi2, …, wim, integration function f(.), activation function a(.), threshold θi, and output yi.]

Integration:

    f_i = \sum_{j=1}^{m} w_{ij} x_j - \theta_i

Output:

    y_i(t+1) = a(f_i), \qquad a(f) = \begin{cases} 1, & f \ge 0 \\ 0, & \text{otherwise} \end{cases}

The Artificial Neuron

The sign of the weight wij determines the connection type:
– positive: excitatory
– negative: inhibitory
– zero: no connection

The Artificial Neuron

This model was proposed by McCulloch and Pitts [1943]; such units are called M-P neurons.

What can be done by M-P neurons?

A hard limiter. A binary threshold unit. Hyperplane (hyperspace) separation.

    y = \begin{cases} 1, & f(x_1, x_2) \ge 0 \\ 0, & \text{otherwise} \end{cases} \qquad f(x_1, x_2) = w_1 x_1 + w_2 x_2 - \theta

[Figure: a two-input M-P neuron (inputs x1, x2 with weights w1, w2, threshold θ, output y) and the decision line it realizes in the (x1, x2) plane.]

The neuron outputs 1 on one side of the line w_1 x_1 + w_2 x_2 = \theta and 0 on the other, so a single M-P neuron separates the input plane into two half-planes.
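To make the separation concrete, here is a minimal sketch of a two-input M-P neuron in Python. The AND-gate weights (w1 = w2 = 1, θ = 1.5) are an illustrative choice, not values given in the slides.

```python
# A minimal M-P (McCulloch-Pitts) neuron: hard-limiter activation over a
# weighted sum minus a threshold.

def mp_neuron(x, w, theta):
    """Return 1 if sum_j w[j]*x[j] - theta >= 0, else 0."""
    f = sum(wj * xj for wj, xj in zip(w, x)) - theta
    return 1 if f >= 0 else 0

# Example (illustrative): w1 = w2 = 1, theta = 1.5 realizes logical AND,
# since the line x1 + x2 = 1.5 separates (1,1) from the other corners.
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, mp_neuron(x, w=(1, 1), theta=1.5))
```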

What will ANNs be?

ANN: a neurally inspired mathematical model.
– Consists of a large number of highly interconnected PEs.
– Its connections (weights) hold the knowledge.
– The response of each PE depends only on local information.
– Its collective behavior demonstrates its computational power.
– It has learning, recall, and generalization capabilities.

Three Basic Entities of ANN Models

– Models of neurons or PEs.
– Models of synaptic interconnections and structures.
– Training or learning rules.

Introduction to Artificial Neural Networks

Basic Models and Learning Rules
– Neuron Models
– ANN Structures
– Learning

Processing Elements

[Figure: a PE with integration function f(.), activation function a(.), and threshold θi.]

What integration functions may we have?
What activation functions may we have?

Extensions of M-P neurons

Integration Functions

Quadratic function:

    f_i = \sum_{j=1}^{m} w_{ij} x_j^2 - \theta_i

Spherical function:

    f_i = \sum_{j=1}^{m} (x_j - w_{ij})^2 - \theta_i

Polynomial function:

    f_i = \sum_{j=1}^{m} \sum_{k=1}^{m} w_{ijk} x_j x_k + x_j + x_k - \theta_i

M-P neuron (linear):

    f_i = net_i = \sum_{j=1}^{m} w_{ij} x_j - \theta_i

Activation Functions

M-P neuron (step function):

    a(f) = \begin{cases} 1, & f \ge 0 \\ 0, & \text{otherwise} \end{cases}

[Plot: a step from 0 to 1 at f = 0.]

Activation Functions

Hard limiter (threshold function):

    a(f) = \operatorname{sgn}(f) = \begin{cases} +1, & f \ge 0 \\ -1, & f < 0 \end{cases}

[Plot: a step from -1 to +1 at f = 0.]

Activation Functions

Ramp function:

    a(f) = \begin{cases} 1, & f \ge 1 \\ f, & 0 \le f < 1 \\ 0, & f < 0 \end{cases}

[Plot: a ramp rising linearly from 0 to 1 over 0 ≤ f ≤ 1.]

Activation Functions

Unipolar sigmoid function:

    a(f) = \frac{1}{1 + e^{-\lambda f}}

[Plot: the unipolar sigmoid over f ∈ [-4, 4], rising from 0 to 1.]

Activation Functions

Bipolar sigmoid function:

    a(f) = \frac{2}{1 + e^{-\lambda f}} - 1

[Plot: the bipolar sigmoid over f ∈ [-4, 4], rising from -1 to 1.]
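These activation functions transcribe directly into code. Below is a minimal Python sketch of all five as defined above; treating λ as a parameter defaulting to 1 is my assumption, not a value from the slides.

```python
# The five activation functions, written directly from their formulas.
import math

def step(f):                       # M-P neuron
    return 1.0 if f >= 0 else 0.0

def hard_limiter(f):               # sgn(f)
    return 1.0 if f >= 0 else -1.0

def ramp(f):                       # clip f to [0, 1]
    return min(1.0, max(0.0, f))

def unipolar_sigmoid(f, lam=1.0):  # lam is the steepness parameter λ
    return 1.0 / (1.0 + math.exp(-lam * f))

def bipolar_sigmoid(f, lam=1.0):
    return 2.0 / (1.0 + math.exp(-lam * f)) - 1.0
```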

Example: Activation Surfaces

[Figure: three decision lines L1, L2, L3 partitioning the (x, y) plane, realized by a network with inputs x and y feeding three M-P neurons L1, L2, L3.]

Example: Activation Surfaces

The three lines are:
– L1: x - 1 = 0 (weights (1, 0), θ1 = 1)
– L2: y - 1 = 0 (weights (0, 1), θ2 = 1)
– L3: -x - y + 4 = 0 (weights (-1, -1), θ3 = -4)

Example: Activation Surfaces

[Figure: the three lines cut the (x, y) plane into seven regions, each labeled with its region code, i.e., the outputs of (L1, L2, L3): 001, 010, 011, 100, 101, 110, and 111; the triangle in the middle has code 111.]

Example: Activation Surfaces

[Figure: a second-layer neuron L4 combines the outputs of L1, L2, L3 to produce z; z = 1 inside the triangular region and z = 0 outside.]

L4 uses weights (1, 1, 1) and threshold θ4 = 2.5, so z = 1 only when all three region bits are 1 (region code 111).

Example: Activation Surfaces

With the M-P step activation

    a(f) = \begin{cases} 1, & f \ge 0 \\ 0, & \text{otherwise} \end{cases}

the activation surface z(x, y) is a flat plateau of height 1 over the triangle, with vertical walls.

Replacing the step with the unipolar sigmoid

    a(f) = \frac{1}{1 + e^{-\lambda f}}

smooths the surface; the slides plot it for λ = 2, 3, 5, and 10, with larger λ giving steeper walls.

Introduction to Artificial Neural Networks

Basic Models and Learning Rules
– Neuron Models
– ANN Structures
– Learning

ANN Structures (Connections)

Single-Layer Feedforward Networks

[Figure: inputs x1, x2, …, xm fully connected to output neurons y1, y2, …, yn through weights w11, w12, …, wnm.]

Multilayer Feedforward Networks

[Figure: inputs x1, x2, …, xm enter the input layer, pass through one or more hidden layers, and reach the output layer, which produces y1, y2, …, yn.]
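As a sketch of how such a network computes, the code below runs a forward pass in which each layer evaluates y = a(Wx - θ) with the unipolar sigmoid from the activation-function slides; the layer sizes and all weight values are illustrative assumptions, not values from the slides.

```python
# Forward pass through a multilayer feedforward network.
import math

def sigmoid(f, lam=1.0):
    return 1.0 / (1.0 + math.exp(-lam * f))

def forward(x, layers):
    """layers: list of (W, theta); W is a list of per-neuron weight rows."""
    for W, theta in layers:
        x = [sigmoid(sum(w * xi for w, xi in zip(row, x)) - t)
             for row, t in zip(W, theta)]
    return x

# A 2-3-1 network: 3 hidden neurons, 1 output neuron (arbitrary weights).
hidden = ([[0.5, -0.2], [0.1, 0.8], [-0.3, 0.4]], [0.0, 0.1, -0.2])
output = ([[1.0, -1.0, 0.5]], [0.3])
print(forward([1.0, 2.0], [hidden, output]))
```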

Multilayer Feedforward Networks

[Figure: a pattern-recognition pipeline; the input feeds hidden layers that perform the analysis, and the output layer produces the classification.]

Where does the knowledge come from? Learning.

Single Node with Feedback to Itself

[Figure: one neuron whose output is fed back to its own input through a feedback loop.]

Single-Layer Recurrent Networks

[Figure: inputs x1, x2, …, xm feed a single layer of neurons y1, y2, …, yn whose outputs are fed back as inputs to the same layer.]

Multilayer Recurrent Networks

[Figure: a multilayer network with inputs x1, x2, x3 and outputs y1, y2, y3, with feedback connections between layers.]

Introduction to Artificial Neural Networks

Basic Models and Learning Rules
– Neuron Models
– ANN Structures
– Learning

Learning

Consider an ANN with n neurons, each with m adaptive weights.

Weight matrix:

    W = \begin{bmatrix} \mathbf{w}_1^T \\ \mathbf{w}_2^T \\ \vdots \\ \mathbf{w}_n^T \end{bmatrix}
      = \begin{bmatrix}
          w_{11} & w_{12} & \cdots & w_{1m} \\
          w_{21} & w_{22} & \cdots & w_{2m} \\
          \vdots & \vdots & \ddots & \vdots \\
          w_{n1} & w_{n2} & \cdots & w_{nm}
        \end{bmatrix}

The goal is to “learn” this weight matrix. How?

Learning Rules

Supervised learning

Reinforcement learning

Unsupervised learning

Supervised Learning

Learning with a teacher; learning by examples.

Training set:

    \mathcal{T} = \{ (\mathbf{x}^{(1)}, \mathbf{d}^{(1)}), (\mathbf{x}^{(2)}, \mathbf{d}^{(2)}), \ldots, (\mathbf{x}^{(k)}, \mathbf{d}^{(k)}), \ldots \}

[Figure: the input x enters the ANN (weights W), which produces y; an error-signal generator compares y with the desired output d and feeds the error back to adjust W.]

Reinforcement Learning

Learning with a critic; learning by comments.

[Figure: the input x enters the ANN (weights W), which produces y; a critic-signal generator evaluates y and feeds a reinforcement signal back to adjust W.]

Unsupervised Learning

Self-organizing; clustering.
– Form proper clusters by discovering the similarities and dissimilarities among objects.

[Figure: the input x enters the ANN (weights W), which produces y; there is no external teacher or critic.]

The General Weight Learning Rule

[Figure: neuron i with inputs x1, x2, …, x(m-1), weights wi1, wi2, …, wi,m-1, threshold (bias) θi, and output yi.]

Input:

    net_i = \sum_{j=1}^{m-1} w_{ij} x_j - \theta_i

Output:

    y_i = a(net_i)

We want to learn the weights and the bias.

The General Weight Learning Rule

Absorb the bias into the weight vector: let x_m = -1 and w_{im} = \theta_i. Then

    net_i = \sum_{j=1}^{m} w_{ij} x_j

The General Weight Learning Rule

We want to learn the weight vector

    \mathbf{w}_i = (w_{i1}, w_{i2}, \ldots, w_{im})^T

How should it change at each step, i.e., what is \Delta \mathbf{w}_i(t)?

The General Weight Learning Rule

[Figure: the input x and weight vector wi produce output yi; a learning-signal generator combines wi, x, and the desired output di into the learning signal r.]

Learning signal:

    r = f_r(\mathbf{w}_i, \mathbf{x}, d_i)

Weight update (η is the learning rate):

    \Delta \mathbf{w}_i(t) = \eta \, r \, \mathbf{x}(t)

The General Weight Learning Rule

Discrete-Time Weight Modification Rule:

    \mathbf{w}_i^{(t+1)} = \mathbf{w}_i^{(t)} + \eta \, f_r(\mathbf{w}_i^{(t)}, \mathbf{x}^{(t)}, d_i^{(t)}) \, \mathbf{x}^{(t)}

Continuous-Time Weight Modification Rule:

    \frac{d\mathbf{w}_i(t)}{dt} = \eta \, r \, \mathbf{x}(t)
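A sketch of the discrete-time rule in Python: w ← w + η r x with a pluggable learning-signal function r = f_r(w, x, d). The perceptron-style signal r = d - y used in the example is an illustrative plug-in, not a rule stated on this slide.

```python
# One step of the general discrete-time weight learning rule.

def step(f):
    return 1.0 if f >= 0 else 0.0

def train_step(w, x, d, r_fn, eta=0.1):
    r = r_fn(w, x, d)                                  # learning signal
    return [wj + eta * r * xj for wj, xj in zip(w, x)] # w + eta*r*x

def perceptron_signal(w, x, d):
    y = step(sum(wj * xj for wj, xj in zip(w, x)))
    return d - y

# x includes the absorbed bias input x_m = -1.
w = [-0.5, 0.2, 0.3]
w = train_step(w, [1.0, 2.0, -1.0], d=1.0, r_fn=perceptron_signal)
print(w)   # -> approximately [-0.4, 0.4, 0.2]: moved toward firing on x
```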

Hebb’s Learning Law

• Hebb [1949] hypothesized that when an axonal input from neuron A repeatedly or persistently helps cause neuron B to fire,

• then the efficacy of that axonal input, in terms of its ability to help neuron B fire in the future, is somehow increased.

• Hebb’s learning rule is an unsupervised learning rule.

Learning signal:

    r = f_r(\mathbf{w}_i, \mathbf{x}, d_i) = a(\mathbf{w}_i^T \mathbf{x}) = y_i

Weight update:

    \Delta \mathbf{w}_i(t) = \eta \, y_i \, \mathbf{x}(t), \qquad \text{componentwise } \Delta w_{ij} = \eta \, y_i \, x_j

[Figure: the weight between two units is strengthened when both are active simultaneously.]
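A minimal sketch of the Hebbian update Δw_ij = η y_i x_j as stated above; the step activation and the toy input are illustrative choices.

```python
# One Hebbian learning step: the learning signal is the neuron's own output.

def hebb_step(w, x, eta=0.1):
    y = 1.0 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else 0.0
    return [wj + eta * y * xj for wj, xj in zip(w, x)], y

w = [0.2, -0.1]
for _ in range(3):
    w, y = hebb_step(w, [1.0, 0.5])
    print(w, y)   # the weights of co-active inputs keep growing
```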

Introduction to Artificial Neural Networks

Distributed Representations

Distributed Representations

• Distributed Representation:
– An entity is represented by a pattern of activity distributed over many PEs.
– Each processing element is involved in representing many different entities.

• Local Representation:
– Each entity is represented by one PE.

Example

       P0 P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 P13 P14 P15
Dog    +  _  +  +  _  _  _  _  +  +  +   +   +   _   _   _
Cat    +  _  +  +  _  _  _  _  +  _  +   _   +   +   _   +
Bread  +  +  _  +  _  +  +  _  +  _  _   +   +   +   +   _

Advantages

Act as a content-addressable memory: given only a partial pattern (a few of the 16 PEs set to "+"), the network can recall the full stored pattern. What is this?
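Here is a sketch of how such recall could work: score each stored pattern against the known positions and return the best match. The particular query positions are an illustrative choice, not the ones marked on the slide.

```python
# Content-addressable recall over the +/- patterns from the Example table.

PATTERNS = {
    "Dog":   "+_++____+++++___",
    "Cat":   "+_++____+_+_++_+",
    "Bread": "++_+_++_+__++++_",
}

def recall(partial):
    """partial: dict position -> '+' or '_' for the known PEs."""
    def score(p):
        return sum(p[i] == v for i, v in partial.items())
    return max(PATTERNS, key=lambda name: score(PATTERNS[name]))

# Four known positions, all '+': enough to single out one entity here.
print(recall({9: "+", 10: "+", 11: "+", 12: "+"}))   # -> Dog
```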

Advantages

(Stored patterns: Dog, Cat, Bread, as above.)

Act as a content-addressable memory.
Make induction easy.

       P0 P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 P13 P14 P15
Fido   +  _  _  +  _  _  _  _  +  +  +   +   +   +   _   _

A dog has 4 legs. How many legs does Fido have? Fido's pattern is close to Dog's, so the property carries over.

Advantages

Act as a content-addressable memory.
Make induction easy.
Make the creation of new entities or concepts easy (without allocating new hardware).

          P0 P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 P13 P14 P15
Doughnut  +  +  _  _  _  +  +  _  +  _  _  _   +   +   +   _

Add “Doughnut” simply by changing weights.

Advantages

Act as a content-addressable memory.
Make induction easy.
Make the creation of new entities or concepts easy (without allocating new hardware).
Fault tolerance.

If a few PEs break down, it causes no serious problem; the representation degrades gracefully.

Disadvantages

• How do we understand the representation?
• How do we modify it?

Learning procedures are required.
