1
00:00:00,090 --> 00:00:06,180
So previously we looked at the addition rule for probability, and now we want to look at the multiplication

2
00:00:06,180 --> 00:00:07,650
rule for probability.

3
00:00:07,740 --> 00:00:12,900
When we talk about the multiplication rule, we have to consider both independent and dependent events.

4
00:00:12,930 --> 00:00:17,790
We're going to use the multiplication rule in two different ways, depending on whether or not our events

5
00:00:17,790 --> 00:00:19,730
are independent or dependent.

6
00:00:19,740 --> 00:00:23,230
But first, let's just start with the multiplication rule itself.

7
00:00:23,250 --> 00:00:29,670
The multiplication rule tells us that the probability of events A and B both occurring is equivalent

8
00:00:29,670 --> 00:00:35,230
to the product of the probability that event occurs and the probability that event B occurs.

9
00:00:35,250 --> 00:00:38,600
And we've just written this multiplication rule in two different ways.

10
00:00:38,610 --> 00:00:44,250
The first way using this intersection notation and the second way using the word.

11
00:00:44,250 --> 00:00:50,430
And remember that this intersection probability symbol represents this idea of and probability.

12
00:00:50,430 --> 00:00:57,180
And so to write this as the probability of a intersection, B or the probability of A and B is really

13
00:00:57,180 --> 00:01:01,010
just expressing the same idea, just written in two different ways.

14
00:01:01,020 --> 00:01:07,470
Let's start looking at this multiplication rule when we use it to apply just to independent events.

15
00:01:07,470 --> 00:01:10,990
So independent events are events that don't affect each other.

16
00:01:11,010 --> 00:01:13,530
Think about two separate coin flips.

17
00:01:13,530 --> 00:01:19,200
The result of the first coin flip doesn't affect the result of the second coin flip or rolling a die,

18
00:01:19,230 --> 00:01:21,090
picking it up and rolling it again.

19
00:01:21,090 --> 00:01:23,200
Those two DI roles are independent of each other.

20
00:01:23,220 --> 00:01:26,670
What I get on the first roll doesn't affect what I get on the second roll.

21
00:01:26,670 --> 00:01:33,000
So if we have independent events and we want to find the probability that they both occur, we call

22
00:01:33,000 --> 00:01:38,880
that a joint occurrence and we multiply the individual probabilities of the two events.

23
00:01:38,910 --> 00:01:44,340
In other words, let's just take the classic example where I'm flipping a coin two times and I want

24
00:01:44,340 --> 00:01:48,200
to know the probability that both coin flips come up as tails.

25
00:01:48,210 --> 00:01:53,730
That means that event A is getting tails on the first flip and event B is getting tails on the second

26
00:01:53,730 --> 00:01:54,240
flip.

27
00:01:54,240 --> 00:02:02,370
So I could write that probability as the probability of tails and the intersection of that with tails.

28
00:02:02,370 --> 00:02:05,880
I could also express this as the probability of tails.

29
00:02:06,790 --> 00:02:13,720
And tales, or sometimes we'll see this written as the probability of just tails tails.

30
00:02:13,720 --> 00:02:19,480
But we're trying to indicate the same joint occurrence in each of these three representations.

31
00:02:19,480 --> 00:02:25,360
And no matter which representation we use, our multiplication rule formula tells us that the probability

32
00:02:25,360 --> 00:02:29,410
that we get tails on the first coin flip is, of course, just one half.

33
00:02:29,410 --> 00:02:35,500
Because if I flip a coin, assuming it's a fair coin, there's a 50% chance I'll get heads and a 50%

34
00:02:35,500 --> 00:02:36,610
chance I'll get tails.

35
00:02:36,610 --> 00:02:39,670
So I have a one in two chance of flipping tails.

36
00:02:39,670 --> 00:02:43,840
So the probability of event A is one half.

37
00:02:43,840 --> 00:02:49,750
And then I need to multiply that by the probability of event B, Well, event B is getting tails on

38
00:02:49,750 --> 00:02:50,920
the second coin flip.

39
00:02:51,100 --> 00:02:55,960
And that second coin flip, really, if I pick up the coin again after flipping it the first time,

40
00:02:55,960 --> 00:02:58,150
I'm now holding that coin in my hand.

41
00:02:58,150 --> 00:03:04,000
And it doesn't matter whether I flipped heads or tails previously, I now have a coin held in my hand.

42
00:03:04,000 --> 00:03:08,140
I'm about to flip it and it has no knowledge of how it previously landed.

43
00:03:08,140 --> 00:03:12,970
I have exactly a one and two chance of getting heads and a one and two chance of getting tails on that

44
00:03:12,970 --> 00:03:13,840
second flip.

45
00:03:14,020 --> 00:03:19,390
So the probability of getting tails on the second flip is also one half.

46
00:03:19,390 --> 00:03:22,480
So the probability of event A is one half.

47
00:03:22,480 --> 00:03:25,210
The probability of event B is one half.

48
00:03:25,210 --> 00:03:30,550
I multiply these together and I get one half times one half is one fourth.

49
00:03:30,550 --> 00:03:35,620
So the probability of flipping tails two times in a row is one fourth.

50
00:03:35,620 --> 00:03:41,740
We could also think about this coin flip scenario by imagining all of the possibilities when we flip

51
00:03:41,740 --> 00:03:42,760
a coin two times.

52
00:03:42,760 --> 00:03:47,860
So if we flip a coin two times, there are four equally likely outcomes.

53
00:03:47,860 --> 00:03:50,800
I could get heads and then heads.

54
00:03:50,830 --> 00:03:53,050
I could get heads and then tails.

55
00:03:53,050 --> 00:03:56,770
I could get tails and then heads or I could get tails.

56
00:03:56,770 --> 00:03:57,850
And then tails.

57
00:03:57,880 --> 00:04:03,790
These are the four possible scenarios, the four equally likely outcomes of flipping a coin two times

58
00:04:03,790 --> 00:04:04,480
in a row.

59
00:04:04,480 --> 00:04:11,350
And so because I'm looking for the specific scenario of tails and then tails, if I take this back to

60
00:04:11,350 --> 00:04:17,440
my addition rule formula, I can say that I have one matching outcome with the scenario I'm looking

61
00:04:17,440 --> 00:04:18,670
for tails, tails.

62
00:04:18,670 --> 00:04:25,960
So there's one matching outcome and there are four total outcomes one, two, three, four total outcomes.

63
00:04:25,960 --> 00:04:32,680
So the probability that I get tails, tails when I flip a coin four times is one in four or one fourth

64
00:04:32,680 --> 00:04:34,420
or a 25% chance.

65
00:04:34,420 --> 00:04:38,020
And of course this rule doesn't just work with coin flips.

66
00:04:38,020 --> 00:04:44,500
Let's take an example where maybe we'll say that the probability that a factory produces a defective

67
00:04:44,500 --> 00:04:51,520
part is 2%, and we want to know the probability of picking three defective parts in a row if we're

68
00:04:51,520 --> 00:04:57,010
just picking parts randomly off of the assembly line, Well, we could write that here as the probability

69
00:04:57,010 --> 00:05:03,340
of a defective part and then another defective part and then another defective part.

70
00:05:03,340 --> 00:05:08,710
In other words, we could almost think about this as events A, B, and C where event A, event B and

71
00:05:08,710 --> 00:05:11,470
event C are all picking a defective part.

72
00:05:11,710 --> 00:05:18,460
Well, we said that the probability of the factory producing a defective part was 2%, which means that

73
00:05:18,460 --> 00:05:23,920
the probability of event a being choosing a defective part is going to be 2%.

74
00:05:23,920 --> 00:05:27,480
Or we could write that as two over 100.

75
00:05:27,550 --> 00:05:32,230
Then the probability of choosing another defective part is again 2%.

76
00:05:32,230 --> 00:05:41,500
So multiply by 2% or two over 100 and the probability of choosing a third defective part is also 2%

77
00:05:41,500 --> 00:05:46,540
or two over 100, because at least as far as we can tell, these events are independent.

78
00:05:46,540 --> 00:05:51,490
I'm choosing completely randomly a part off of the assembly line.

79
00:05:51,490 --> 00:05:54,730
This factory produces defective parts at a 2% rate.

80
00:05:54,730 --> 00:05:59,440
So the chance that any one part that I choose is defective is 2%.

81
00:05:59,470 --> 00:06:04,450
Of course, we have to be careful here because we might think that these events are independent, but

82
00:06:04,450 --> 00:06:10,060
let's say we pick three parts in a row and maybe the group of employees who's working in that particular

83
00:06:10,060 --> 00:06:15,970
hour when we choose parts, actually produces defective parts at a higher rate than an employee group

84
00:06:15,970 --> 00:06:19,090
who works a different shift in the same factory.

85
00:06:19,090 --> 00:06:24,250
And we have to take that into account or there's maybe multiple assembly lines in the same factory and

86
00:06:24,250 --> 00:06:27,160
one line produces more defective parts than the other.

87
00:06:27,160 --> 00:06:31,210
So we have to be really careful about the independence of these events.

88
00:06:31,210 --> 00:06:35,740
But assuming we know these events are independent, this is how we would actually do the math.

89
00:06:35,740 --> 00:06:37,990
And when we multiply here, we get two times.

90
00:06:37,990 --> 00:06:42,370
Two times two is eight and then 100.

91
00:06:43,190 --> 00:06:44,990
Cubed is 1 million.

92
00:06:44,990 --> 00:06:54,230
If we simplify eight over 1 million, we get one over 125,000, which means there's a one in 125,000

93
00:06:54,230 --> 00:07:01,190
chance that randomly choosing three parts off of the assembly line out of the factory results in three

94
00:07:01,190 --> 00:07:02,480
defective parts.

95
00:07:02,510 --> 00:07:09,260
So this is how we calculate probability for a joint occurrence when we have independent events and we

96
00:07:09,260 --> 00:07:12,800
want to know the probability that those independent events all occur.

97
00:07:13,040 --> 00:07:17,030
But when we have dependent events, we use this set of formulas.

98
00:07:17,030 --> 00:07:23,690
So this set of formulas for dependent events, this is the same multiplication rule formula, except

99
00:07:23,690 --> 00:07:29,570
that you can see that we've changed one detail, which is that we have changed this probability of B

100
00:07:29,600 --> 00:07:35,870
piece of the multiplication formula into this expression here, The probability of B given that event

101
00:07:35,870 --> 00:07:37,450
A has already occurred.

102
00:07:37,460 --> 00:07:40,070
That's what this expression means here.

103
00:07:40,070 --> 00:07:47,510
When we separate B and A with this line, we're saying the probability that event B happens given that

104
00:07:47,510 --> 00:07:48,940
A has already occurred.

105
00:07:48,950 --> 00:07:54,740
So we're looking for the probability that this happens, given that this has already occurred, regardless

106
00:07:54,740 --> 00:08:00,770
of what A and B actually are, the probability that the first thing happens given that the second thing

107
00:08:00,770 --> 00:08:02,190
has already occurred.

108
00:08:02,210 --> 00:08:07,610
The classic example here is pulling cards out of a standard 52 card deck.

109
00:08:07,610 --> 00:08:14,210
For instance, let's say we want to know the probability of pulling a king and then another king.

110
00:08:14,210 --> 00:08:19,640
So a king and then another king from a deck of 52 cards.

111
00:08:19,640 --> 00:08:26,600
And remember, in a deck of 52 cards, there are four suits, and each suit contains 13 cards from two

112
00:08:26,600 --> 00:08:27,860
all the way up through ten.

113
00:08:27,860 --> 00:08:29,660
And then Jack, Queen, King, ace.

114
00:08:29,660 --> 00:08:33,980
So 13 cards in each suit for suits, 52 cards in total.

115
00:08:33,980 --> 00:08:39,500
And we want to know the probability that we pull one king and then another KING Well, we could do this

116
00:08:39,500 --> 00:08:45,830
as an independent events problem, assuming that after we pull the first king, we put it back into

117
00:08:45,830 --> 00:08:48,110
the deck before pulling another card.

118
00:08:48,110 --> 00:08:54,290
If we pull the first king and we leave it outside the deck and then pull another card, then we're talking

119
00:08:54,290 --> 00:09:01,040
about dependent events, because the probability of pulling a king on that second pull is dependent

120
00:09:01,040 --> 00:09:03,140
on what we polled in the first poll.

121
00:09:03,170 --> 00:09:12,140
Because if you imagine if we start out with 52 cards and four of those 52 cards are kings, if we pull

122
00:09:12,140 --> 00:09:18,200
a card from this 52 card deck and we don't replace it, then when we go to pull a second card, we're

123
00:09:18,200 --> 00:09:21,920
only going to have 51 cards to pull from because one of the cards is missing.

124
00:09:21,920 --> 00:09:23,870
We already pulled it in our first poll.

125
00:09:23,870 --> 00:09:27,110
There's only 51 cards left in the deck from which to poll.

126
00:09:27,110 --> 00:09:31,670
And so right there automatically, we know that we're looking at dependent events.

127
00:09:31,670 --> 00:09:37,220
If we don't replace that first card and we can think about that idea of dependent events, because this

128
00:09:37,220 --> 00:09:44,180
second event depends on this first event, it is changed by it is affected by whatever we do with this

129
00:09:44,180 --> 00:09:45,020
first event.

130
00:09:45,020 --> 00:09:52,400
Whereas if I flip a coin two times, that second flip is not affected in any way by the first flip.

131
00:09:52,490 --> 00:09:58,070
But if I'm pulling cards from a deck and I pull the card and don't replace it, then that second poll

132
00:09:58,070 --> 00:10:01,790
is going to be affected by the fact that I pulled a card already.

133
00:10:01,790 --> 00:10:04,550
So let's assume that's what we've done here.

134
00:10:04,550 --> 00:10:09,170
We're looking for the probability of getting a king on our first poll and a king on our second poll.

135
00:10:09,170 --> 00:10:13,070
And we are not replacing the first card after we pull it from the deck.

136
00:10:13,070 --> 00:10:18,800
Well, in that case, the probability of event A, the probability that we get a king on our first poll

137
00:10:18,830 --> 00:10:25,670
is going to be four in 52 because there are four kings in the deck, 52 cards in the deck in total.

138
00:10:25,670 --> 00:10:30,590
So there are four matching outcomes and 52 total outcomes.

139
00:10:30,590 --> 00:10:33,500
So probability of event A is.

140
00:10:34,300 --> 00:10:35,740
For over 52.

141
00:10:35,770 --> 00:10:42,010
Now, the probability of event B, given that A has already happened in our example, means the probability

142
00:10:42,010 --> 00:10:47,380
that we pull a king, given that we already pulled a king and didn't put it back in the deck.

143
00:10:47,380 --> 00:10:51,730
Well, if we already pulled a king, our deck is down one card.

144
00:10:51,730 --> 00:10:54,010
There's only 51 cards left in total.

145
00:10:54,460 --> 00:10:59,380
And if we already pull the king, then there are only three kings left in that deck.

146
00:10:59,410 --> 00:11:06,160
Which means the probability that we pull a king in that second poll is three out of 51 three matching

147
00:11:06,160 --> 00:11:08,860
outcomes out of 51 total outcomes.

148
00:11:08,860 --> 00:11:17,230
So we can multiply here by three over 51 three over 51 is the probability of event B given that A has

149
00:11:17,230 --> 00:11:22,690
already happened or the probability we pull a king, given that we've already pulled a king and we didn't

150
00:11:22,690 --> 00:11:23,530
put it back.

151
00:11:23,830 --> 00:11:33,520
So we can then simplify the math here, four over 52 is the same as one over 13 and three over 51 is

152
00:11:33,520 --> 00:11:35,680
the same as one over 17.

153
00:11:35,680 --> 00:11:41,560
So one over 13 times one over 17 is one over 221.

154
00:11:41,590 --> 00:11:48,610
There's a one in 221 chance that we pull a king and then another king, assuming we didn't replace the

155
00:11:48,610 --> 00:11:50,200
first king after we pulled it.

156
00:11:50,200 --> 00:11:54,070
So this is our classic dependent events example.

157
00:11:54,070 --> 00:12:02,800
What this shows us then is that we can use these two formulas to prove whether or not events are independent

158
00:12:02,800 --> 00:12:06,880
or dependent, because these two formulas here are identical.

159
00:12:06,880 --> 00:12:08,200
We talked about this earlier.

160
00:12:08,470 --> 00:12:15,580
These two formulas are identical, except that we're saying here P of B, the probability of B is getting

161
00:12:15,580 --> 00:12:22,510
replaced with the probability of B given A Other than these two things here, the formulas are identical,

162
00:12:22,510 --> 00:12:30,430
which means that if we can show that the probability of B is equal to the probability of B, given that

163
00:12:30,430 --> 00:12:31,570
A has already happened.

164
00:12:31,570 --> 00:12:37,660
In other words, if we can show that these two values are equivalent, then we know that the events

165
00:12:37,660 --> 00:12:43,870
have to be independent, because what this is saying is that the probability of Event B happening,

166
00:12:43,870 --> 00:12:48,610
given that A has already happened, is the same as the probability that B happens.

167
00:12:48,610 --> 00:12:55,480
In other words, event A happening had no effect on the probability of event B happening.

168
00:12:55,480 --> 00:13:00,190
If event A already happened, event B is totally unaffected.

169
00:13:00,190 --> 00:13:07,150
And if it's totally unaffected, that means that B must be independent from A because A has no effect

170
00:13:07,150 --> 00:13:07,780
on B.

171
00:13:07,780 --> 00:13:13,660
So if we can show that this relationship holds, then the events have to be independent.

172
00:13:13,660 --> 00:13:17,140
Think back to our coin flip example from earlier.

173
00:13:17,140 --> 00:13:26,590
If we say that event A is flipping heads on one flip and that event B is flipping heads.

174
00:13:27,380 --> 00:13:34,580
On a second flip, we know that the probability of event B flipping heads on this coin flip is one half,

175
00:13:34,580 --> 00:13:38,690
so the probability of event B is one half.

176
00:13:38,690 --> 00:13:45,410
And this probability here, the probability of event B given event A we can think about as the idea

177
00:13:45,410 --> 00:13:49,460
that we flipped the coin once and we got heads on that first flip.

178
00:13:49,460 --> 00:13:54,470
Now we're looking at event B, we've picked up the coin again and we're looking at the probability of

179
00:13:54,470 --> 00:13:58,670
flipping heads given that we flipped heads on the previous coin flip.

180
00:13:58,670 --> 00:14:04,760
Well, this second coin flip is totally unaffected by whether we got heads or tails on the last flip.

181
00:14:04,760 --> 00:14:10,370
So the probability of B given a meaning, the probability of flipping heads, given that we flipped

182
00:14:10,370 --> 00:14:12,800
heads last time, is still just one half.

183
00:14:12,800 --> 00:14:18,380
So we can say the probability of B given A is one half.

184
00:14:18,380 --> 00:14:26,510
And so then because these two values are equivalent, the resulting logical conclusion here is that

185
00:14:26,510 --> 00:14:29,630
A and B.

186
00:14:30,480 --> 00:14:32,400
Must be independent.

187
00:14:33,390 --> 00:14:34,110
Of one another.

188
00:14:34,110 --> 00:14:40,020
They must be independent events because we showed that this equation was true here, that these two

189
00:14:40,020 --> 00:14:46,380
probabilities were equal and therefore that the probability of B was totally unaffected by event A already

190
00:14:46,380 --> 00:14:46,950
happening.

191
00:14:46,950 --> 00:14:50,580
And therefore we know that events A and B must be independent.

192
00:14:50,580 --> 00:14:57,780
So this is another check we can use for confirming whether two different events are independent or dependent

193
00:14:57,780 --> 00:14:58,320
events.

194
00:14:58,320 --> 00:15:04,800
But this is how we use the multiplication rule to calculate the probability that two events both occur,

195
00:15:04,830 --> 00:15:10,590
the probability of event A and event B both occurring if the events are independent.

196
00:15:10,590 --> 00:15:12,150
We use this formula here.

197
00:15:12,150 --> 00:15:15,570
We just multiply the probabilities of the individual events.

198
00:15:15,690 --> 00:15:22,500
If the events are dependent, then we're calculating conditional probability because one event is conditional

199
00:15:22,500 --> 00:15:25,320
on the other, one event is dependent on the other.

200
00:15:25,320 --> 00:15:32,040
And so we replace this probability of event B expression with the probability of event B Given that

201
00:15:32,040 --> 00:15:39,810
event A has already happened in order to account for the dependence of one event on another, the probability

202
00:15:39,810 --> 00:15:42,810
of one event is conditional on the other event.

203
00:15:42,810 --> 00:15:49,710
And so we modify our formula in this way to calculate the probability that multiple events are occurring.

