﻿1
00:00:00,870 --> 00:00:04,770
‫Below we have discussed the individual sale.

2
00:00:04,920 --> 00:00:13,610
‫Now we are going to start these says to create network offsets just to avoid confusion with biological

3
00:00:13,610 --> 00:00:14,690
‫neuron.

4
00:00:14,750 --> 00:00:19,640
‫I'll be calling a neuron as perception runs only through perception.

5
00:00:19,640 --> 00:00:24,310
‫From now on means any artificial neuron.

6
00:00:24,440 --> 00:00:32,000
‫Now there are two ways we can stack cells violently or sequence in.

7
00:00:32,170 --> 00:00:43,420
‫Let's see what happens when this tax is badly hit as a single positron with three inputs and one output.

8
00:00:43,430 --> 00:00:52,370
‫Now we had a deposit brought bad luck to this cell also gets the same three inputs but it has a different

9
00:00:52,460 --> 00:00:58,690
‫output why do we can keep on adding more cells better to these ones.

10
00:00:58,820 --> 00:01:07,040
‫Maybe a third or fourth or even more than that we we'll just start getting new output or in other words

11
00:01:07,280 --> 00:01:15,920
‫we can predict for multiple output using the same input features for example when we are doing image

12
00:01:15,980 --> 00:01:20,670
‫recognition and we are trying to find out a face of a person.

13
00:01:21,020 --> 00:01:29,080
‫He may also want to find the x and y coordinate of that phase equal to for these two variables will

14
00:01:29,080 --> 00:01:36,530
‫become vital NYT although image definition needs a much more complex network.

15
00:01:36,630 --> 00:01:43,380
‫By giving this example I wanted to make the point that neural networks are not bound to only one output

16
00:01:44,810 --> 00:01:51,470
‫with the same input you can get multiple output because we can do parallel stacking of the artificial

17
00:01:51,470 --> 00:01:54,740
‫neurons.

18
00:01:54,740 --> 00:02:00,290
‫Now let's see going chilled stacking in the image above.

19
00:02:00,500 --> 00:02:09,390
‫We have five inputs which we input to three parallel separate now the output from this set of perceptions

20
00:02:09,690 --> 00:02:16,210
‫is taken and Fred as input to another set of valid perceptions.

21
00:02:16,430 --> 00:02:25,220
‫Here I am inputting the output of these three to these four plus approx again I take the four outputs

22
00:02:25,220 --> 00:02:30,940
‫of these perceptions and input these into this single perceptual.

23
00:02:31,020 --> 00:02:38,050
‫Lastly the singleton is giving out one single output which is the variable which we want to predict.

24
00:02:41,440 --> 00:02:50,170
‫So this is sequential stacking in which the output of one set of parallel least tagged neurons is sequentially

25
00:02:50,200 --> 00:02:54,220
‫given as input to the next set of parallels to act neurons.

26
00:02:56,640 --> 00:03:00,380
‫Let's first understand the benefit of doing this.

27
00:03:00,530 --> 00:03:09,390
‫That is why did we not just input all the five inputs into a single cell and use this output to predict

28
00:03:09,460 --> 00:03:20,250
‫variable like how is stacking these additional sets of neuron helpful to we have this type of beta.

29
00:03:20,310 --> 00:03:28,240
‫There are these two input variables maybe height and weight basis which we are trying to classify.

30
00:03:28,710 --> 00:03:36,960
‫If the anyone in the room is a cow or a dog so cows generally like it they have more weight and more

31
00:03:36,960 --> 00:03:38,520
‫heat than a dog.

32
00:03:38,640 --> 00:03:45,010
‫And dogs generally like it that is they are represented by the red dot.

33
00:03:45,140 --> 00:03:52,700
‫Now when we're classifying this sort of data we can have a linear separator that is a straight line

34
00:03:53,090 --> 00:03:54,890
‫to separate these two classes.

35
00:03:56,720 --> 00:04:03,310
‫Anything on the right side will be predicted as a call and anything on the left side will be predicted

36
00:04:03,450 --> 00:04:03,960
‫as a dog.

37
00:04:05,040 --> 00:04:13,790
‫This is the capability of a single perceptual single perception can find out the best straight line

38
00:04:14,150 --> 00:04:22,920
‫to classify the given data so if we have this problem using a single person wrong would suffice.

39
00:04:24,340 --> 00:04:32,500
‫But the situation is more complex in fact in real life situations we'd never use neural networks when

40
00:04:32,500 --> 00:04:38,770
‫we need to classify a photo situation as simple as this the real life situations for neural networks

41
00:04:38,980 --> 00:04:49,470
‫is often more complex let me complicate the example a little bit what if we wanted to classify objects

42
00:04:49,830 --> 00:04:56,770
‫which have this distribution so anything to the left of the first name and anything to the date of the

43
00:04:56,770 --> 00:05:06,940
‫second line is Class 8 or is it a dot and anything in between these two lines is Class B or if a green

44
00:05:06,940 --> 00:05:16,630
‫dot this type of classification situation cannot be handled by a single person on a netbook such as

45
00:05:16,960 --> 00:05:19,790
‫the one shown on the right can easily handle it.

46
00:05:21,230 --> 00:05:25,360
‫For example this bus neuron will fire.

47
00:05:25,560 --> 00:05:34,430
‫That is give output as one ready point lays to the left of lane one and the second neuron will give

48
00:05:34,430 --> 00:05:43,560
‫output as one ready point lays to divide off line to and this final neuron gives output as one when

49
00:05:43,770 --> 00:05:50,040
‫any one of the two inputs is one you can polish the video.

50
00:05:50,370 --> 00:05:59,220
‫Think about it for a couple of minutes and see how this small network is handling this special classification.

51
00:05:59,230 --> 00:06:03,630
‫This is the power of a neural network in the network.

52
00:06:03,630 --> 00:06:11,850
‫We created each neuron can focus on a particular feature of the object and not on the final output.

53
00:06:12,990 --> 00:06:20,480
‫The final output will be predicted based is the desert of these features in this way.

54
00:06:20,530 --> 00:06:26,890
‫Neural networks can do really sophisticated decision making with basic machine learning techniques such

55
00:06:26,890 --> 00:06:29,890
‫as linear regression cannot do with good accuracy

56
00:06:32,660 --> 00:06:34,360
‫before we move on.

57
00:06:34,520 --> 00:06:36,500
‫Let's take a minute to discuss this.

58
00:06:36,500 --> 00:06:39,630
‫Networks nominated.

59
00:06:39,850 --> 00:06:43,000
‫This is a neural network now.

60
00:06:43,050 --> 00:06:53,590
‫Each set of parallel neurons are called Live that first is the input layer the last is the output left

61
00:06:54,260 --> 00:07:04,080
‫and these in-between decoding live this network had five inputs three cells inherently and one for inherently

62
00:07:04,110 --> 00:07:10,050
‫two and one in the output layer so for brevity.

63
00:07:10,270 --> 00:07:14,250
‫This network can also be called as a 5 3 4 1.

64
00:07:14,290 --> 00:07:14,800
‫Network

65
00:07:18,630 --> 00:07:27,550
‫ultimate is that the process information in this network is flowing in only the forward detection which

66
00:07:27,550 --> 00:07:34,010
‫is why the network is also called a feed forward network in comparison.

67
00:07:34,050 --> 00:07:41,130
‫If the output of one of these cells of that layer goes back as input to end of the scale of that literally

68
00:07:41,700 --> 00:07:48,030
‫then it is called a cyclic network but it could have neural networks also known as audit.

69
00:07:48,020 --> 00:07:55,780
‫In the example of cyclic network auditing are used in natural language processing and language modeling.

70
00:07:55,970 --> 00:08:00,560
‫For now let's come back to standard feed for word network.

71
00:08:00,620 --> 00:08:09,330
‫Now you can notice here that output from this said is going to force it.

72
00:08:09,360 --> 00:08:11,700
‫These are not for different outputs.

73
00:08:11,700 --> 00:08:21,590
‫It is only one output the same output is going as input in all these pulses also note that every neuron

74
00:08:21,830 --> 00:08:30,340
‫in each layer is connected to every other neuron in the Edison forward layer therefore this network

75
00:08:30,340 --> 00:08:31,680
‫is fully connected.

76
00:08:33,120 --> 00:08:41,100
‫If somehow some links were missing we call it possibly connected but for most practical purposes we

77
00:08:41,100 --> 00:08:42,530
‫use a fully connected network.

78
00:08:45,680 --> 00:08:51,680
‫Before I close this LICHTER I would like to tell you that within this short span of time in which we

79
00:08:51,680 --> 00:08:59,510
‫covered this LICHTER We have entered devoid of deep learning such artificial neural networks is what

80
00:08:59,600 --> 00:09:02,980
‫deep learning is made up of basically.

81
00:09:03,150 --> 00:09:09,330
‫Think of this like a system which loans the relationship between input and output.

82
00:09:10,850 --> 00:09:19,190
‫The more layers we have in the system the more deep our system is more that is capable of establishing

83
00:09:19,260 --> 00:09:28,820
‫a complex relationship between input and output so I hope that you understand the basics of neural networks

84
00:09:30,010 --> 00:09:37,600
‫in the next lecture will go deeper and see how these networks process the output and find the optimum

85
00:09:37,810 --> 00:09:45,830
‫values of weight and biases to get good accuracy of prediction unanimously.

