1
00:00:00,990 --> 00:00:05,760
‫Now, this is the architecture we are going to use for our CNN model.

2
00:00:07,560 --> 00:00:15,450
‫If you remember last time when we ran ANN model on this problem, we used this side of the network.

3
00:00:16,200 --> 00:00:22,860
‫We first use flatten layer, then dense layered with 300 neuron, then dense layer with 100 neurons

4
00:00:23,010 --> 00:00:24,240
‫and then the output layer.

5
00:00:25,610 --> 00:00:29,220
‫We are going to add CNN layers before this.

6
00:00:30,810 --> 00:00:35,280
‫So this is our input, 28 by 28 pixel cross one.

7
00:00:36,390 --> 00:00:42,540
‫So 28 pixels as width, 28 pixels as height and one as the channel.

8
00:00:45,300 --> 00:00:49,890
‫We are going to use a filter size of three by three with the shride of one.

9
00:00:52,410 --> 00:00:55,650
‫And then this will be our convolutional layer.

10
00:00:56,700 --> 00:00:59,340
‫Since we are using filter size of three by three.

11
00:00:59,820 --> 00:01:02,670
‫And we are also using a padding as valid.

12
00:01:03,150 --> 00:01:06,110
‫We are ignoring one pixel from each side.

13
00:01:06,630 --> 00:01:08,070
‫Since our padding is valid

14
00:01:08,550 --> 00:01:17,310
‫So the convolutional layer, we are going to get this 26 by 26 into 32, we are going to use 32 different

15
00:01:17,310 --> 00:01:19,800
‫filters for this convolutional layer.

16
00:01:22,200 --> 00:01:25,390
‫After this, we are going to use to use Max pooling

17
00:01:27,030 --> 00:01:35,220
‫So finally, after this corn layer, we are going to get 13 by 13 into 32 cells in our pooling layer.

18
00:01:37,950 --> 00:01:40,410
‫Now, we have to add some dense layer as well.

19
00:01:41,310 --> 00:01:45,500
‫So we have to convert this 3D object into one dimensionality.

20
00:01:45,720 --> 00:01:47,820
‫So we are using flatten layer.

21
00:01:48,660 --> 00:01:50,790
‫Then we are going to use to dense layer.

22
00:01:51,120 --> 00:01:52,920
‫First one with 300 neuron.

23
00:01:53,430 --> 00:01:55,260
‫And next one with hundred neurons.

24
00:01:56,310 --> 00:02:01,200
‫And then we are going to use an output layer with 10 neurons.

25
00:02:01,290 --> 00:02:08,790
‫Since we have to identify ten different classes, the code is almost similar.

26
00:02:09,480 --> 00:02:11,370
‫First, we have to create a model object.

27
00:02:13,320 --> 00:02:16,800
‫Then we are going to add the convolutional layer.

28
00:02:18,690 --> 00:02:24,340
‫We'll write model, dot, add and then keras dot layers, dot com, 2d.

29
00:02:25,740 --> 00:02:28,370
‫And as parameters, we want to do filters.

30
00:02:29,550 --> 00:02:33,300
‫We want to have our filter of size three by three.

31
00:02:34,290 --> 00:02:35,190
‫So we have to provide

32
00:02:35,270 --> 00:02:38,030
‫Kernel size has three comma three.

33
00:02:38,580 --> 00:02:42,370
‫If you want a filter size of four by four you can write four comma four

34
00:02:44,040 --> 00:02:46,080
‫By default the stride value is one.

35
00:02:47,190 --> 00:02:49,590
‫We are going to use the same.

36
00:02:50,550 --> 00:02:53,220
‫And we are going to use padding as valid.

37
00:02:54,390 --> 00:02:56,240
‫The activation is always relu.

38
00:02:57,990 --> 00:03:00,760
‫And the input shape is 28.

39
00:03:00,820 --> 00:03:01,710
‫Cross 28.

40
00:03:01,710 --> 00:03:02,350
‫Cross one.

41
00:03:03,360 --> 00:03:08,880
‫If you are dealing with any Colored images, you have to give values for channel as well.

42
00:03:09,180 --> 00:03:11,520
‫So for colored images, this should be three.

43
00:03:12,870 --> 00:03:17,940
‫This should be the height, width
‫and then the number of channels.

44
00:03:19,980 --> 00:03:23,370
‫So this is a work on layer for our pulling layer.

45
00:03:24,360 --> 00:03:32,180
‫We can edit using this syntax model dot add then Keraf dot layers dot max pulling 2D.

46
00:03:32,850 --> 00:03:37,020
‫And then here you have to give the dimensions of your window.

47
00:03:37,650 --> 00:03:39,240
‫We are going to use deceptive.

48
00:03:39,300 --> 00:03:40,680
‫Field of two cross two.

49
00:03:41,020 --> 00:03:44,630
‫That's why we are giving two comma two as a parameter.

50
00:03:46,410 --> 00:03:51,390
‫After this, we want to convert our three dimensional matrix into one dimensional matrix.

51
00:03:51,720 --> 00:03:55,220
‫So we have to use flatten and after flatten.

52
00:03:55,260 --> 00:03:59,470
‫We can use our dense layers as we did in ANN.

53
00:04:00,960 --> 00:04:04,230
‫So we are adding another layer with 300 neurons.

54
00:04:05,100 --> 00:04:07,480
‫And then another layer with 200 neurons.

55
00:04:07,800 --> 00:04:12,510
‫And for both of these layers, we want our activation function to be of RELU.

56
00:04:13,950 --> 00:04:16,470
‫And the next layer is our output layer.

57
00:04:17,970 --> 00:04:20,340
‫We want to classify 10 different categories.

58
00:04:20,550 --> 00:04:22,650
‫That's why we are using 10 neurons.

59
00:04:23,160 --> 00:04:28,200
‫And the activation is softmax activation because this is a classification problem.

60
00:04:29,460 --> 00:04:30,780
‫Now, let's run this.

61
00:04:34,880 --> 00:04:36,230
‫No, similar to last time.

62
00:04:37,130 --> 00:04:42,260
‫We can use Dot summary method to get details.

63
00:04:42,350 --> 00:04:44,310
‫Of our model architecture.

64
00:04:46,880 --> 00:04:53,810
‫Here you can get information of all the layers or what is the shape of those layers and the number of parameters

65
00:04:53,870 --> 00:04:55,340
‫that you are going to train.

66
00:04:55,430 --> 00:04:57,080
‫Using this model architecture,

67
00:05:00,740 --> 00:05:07,190
‫now important thing here you can notice is that the number of parameters we are going to train is in

68
00:05:07,190 --> 00:05:07,820
‫millions.

69
00:05:09,110 --> 00:05:13,850
‫So in this dense layer, we have around one point six million parameters to train.

70
00:05:16,910 --> 00:05:23,450
‫Now, if we have not used Max fulling layer two, this would have been around four million or five million

71
00:05:23,450 --> 00:05:23,870
‫neurons.

72
00:05:23,870 --> 00:05:32,180
‫So so using max pooling has significantly reduced the number of parameters that we are going to train

73
00:05:32,950 --> 00:05:33,950
‫in the next lecture.

74
00:05:34,220 --> 00:05:38,360
‫We will try to train a model without a pooling layer.

75
00:05:38,810 --> 00:05:45,410
‫And there we can clearly see the difference in number of parameter and the execution time in the models

76
00:05:45,680 --> 00:05:48,320
‫with the pulling layer and without the pullying layer.

77
00:05:52,740 --> 00:06:03,150
‫The next step is to compile the model now since we have exclusive multi label classification problem.

78
00:06:04,050 --> 00:06:11,950
‫We are going to use a sparse, categorical and Cross and Groppi and we want to use Optimizer as a study.

79
00:06:12,380 --> 00:06:14,080
‫So class state, Verdean descent.

80
00:06:14,710 --> 00:06:16,800
‫And we want to calculate the accuracy.

81
00:06:17,040 --> 00:06:22,230
‫Since we are doing the classification problem, it's compiled this model.