1
00:00:00,120 --> 00:00:03,300
Like binomial and Bernoulli random variables.

2
00:00:03,330 --> 00:00:09,960
A Poisson random variable is another type of discrete random variable, because again with Poisson,

3
00:00:09,960 --> 00:00:13,680
we're focusing on a countable number of outcomes.

4
00:00:13,680 --> 00:00:18,150
So we'll always treat Poisson variables as discrete, not continuous.

5
00:00:18,180 --> 00:00:23,270
Now the difference between a Poisson random variable and a binomial random variable.

6
00:00:23,280 --> 00:00:30,330
If we remember about the binomial random variable, we indicate a probability of success and then we're

7
00:00:30,330 --> 00:00:35,000
looking for the probability of K successes in n trials.

8
00:00:35,010 --> 00:00:43,710
So a binomial random variable is always indicated this way B of NP or sometimes capital X of NP, where

9
00:00:43,710 --> 00:00:45,510
N is the number of trials for performing.

10
00:00:45,510 --> 00:00:52,350
That's always a fixed number of trials and P is the probability of success on any particular trial.

11
00:00:52,350 --> 00:00:57,690
And when we calculate probability, we're looking at the probability of a certain number of successes

12
00:00:57,690 --> 00:00:59,340
in those end trials.

13
00:00:59,340 --> 00:01:06,660
So if we're running an equals 20 trials and our probability of success is 30% or 0.3, then we could

14
00:01:06,660 --> 00:01:13,890
go on to calculate probability around seeing any number of successes from K equals zero successes all

15
00:01:13,890 --> 00:01:16,350
the way up to K equals 20 successes.

16
00:01:16,350 --> 00:01:19,050
Since we're running an equals 20 trials.

17
00:01:19,200 --> 00:01:28,050
But with the Poisson variable, we have this idea that the rate of occurrence of some event or object

18
00:01:28,050 --> 00:01:34,830
or something that's going on per unit of something else we call lambda, this Greek letter lambda,

19
00:01:34,830 --> 00:01:40,230
and then we're looking for or we're investigating probability of a certain number of occurrences or

20
00:01:40,230 --> 00:01:46,500
successes over one unit where these two units are referring to the same thing here.

21
00:01:46,500 --> 00:01:55,380
So an example could be the number of planes that fly over our house per hour of the day where the occurrence

22
00:01:55,380 --> 00:02:01,470
there is a plane flying over the house and the unit is time in particular ours.

23
00:02:01,470 --> 00:02:07,890
And so if we say that Lambda is equal to ten planes per hour, then we could investigate the probability

24
00:02:07,890 --> 00:02:12,660
of any number of planes flying over the house in one hour.

25
00:02:12,660 --> 00:02:15,690
But we can pick any kind of occurrence.

26
00:02:15,690 --> 00:02:18,900
And this unit doesn't have to be a unit of time.

27
00:02:18,900 --> 00:02:27,390
For instance, we could count the number of dogs that we pass per city block that we walk, and maybe

28
00:02:27,390 --> 00:02:32,460
on average we pass 1.5 dogs for every block that we walk.

29
00:02:32,460 --> 00:02:38,190
And then we could investigate the probability of passing any number of dogs, zero dogs, three dogs,

30
00:02:38,190 --> 00:02:41,400
17 dogs in any one city BLOCK.

31
00:02:41,400 --> 00:02:45,390
So again, this occurrence can be really anything we can imagine.

32
00:02:45,390 --> 00:02:50,580
And these units here can be distance, they can be area volume time.

33
00:02:50,580 --> 00:02:58,230
Any kind of occurrence per unit could in theory, be modeled by a Poisson random variable as long as

34
00:02:58,230 --> 00:03:00,330
we meet a certain set of conditions.

35
00:03:00,330 --> 00:03:02,970
And those conditions are these five.

36
00:03:02,970 --> 00:03:06,570
So again, we're looking for occurrences in an interval.

37
00:03:06,570 --> 00:03:14,610
We have to be counting a certain number of occurrences or events over some other interval unit measurement.

38
00:03:14,610 --> 00:03:16,230
That's what we described here.

39
00:03:16,230 --> 00:03:18,090
That's the first condition we have to meet.

40
00:03:18,090 --> 00:03:25,530
We have to be able to say that the mean is constant for each interval or each unit that we choose.

41
00:03:25,530 --> 00:03:31,380
So in our earlier example where we talked about planes passing over the House, this second condition

42
00:03:31,380 --> 00:03:38,550
might not hold because maybe planes only fly between certain hours of the day, maybe 6:00 am and 10:00

43
00:03:38,550 --> 00:03:39,120
PM.

44
00:03:39,120 --> 00:03:44,880
And so if we're looking at a 24 hour period, this mean is not constant across each interval.

45
00:03:44,880 --> 00:03:50,820
We only want to define a Poisson random variable when all of the intervals were investigating, as far

46
00:03:50,820 --> 00:03:54,510
as we know, at least theoretically can have the same mean.

47
00:03:54,510 --> 00:04:00,510
We also need the count of occurrences or events in each interval to be independent.

48
00:04:00,510 --> 00:04:07,590
So we need to be able to say that the number of events that we count in one interval in one unit doesn't

49
00:04:07,590 --> 00:04:10,470
affect the count we get in the other intervals.

50
00:04:10,470 --> 00:04:16,079
Our counts need to be independent across intervals, and if the count we get in one interval means that

51
00:04:16,079 --> 00:04:21,269
we're going to get a smaller or larger count in the next interval, then we don't have independent counts.

52
00:04:21,269 --> 00:04:23,070
We wouldn't meet this third condition.

53
00:04:23,070 --> 00:04:25,710
We need to have non-overlapping intervals.

54
00:04:25,710 --> 00:04:31,830
So if we're counting planes passing over the house per hour, we can't overlap our hours.

55
00:04:31,830 --> 00:04:38,490
So if we count planes from 9 to 10 a.m., then the next interval would be 10 to 11 and then 11 to 12.

56
00:04:38,490 --> 00:04:44,220
We can't count planes from 9 to 10 and then treat 930 to 1030 as another interval.

57
00:04:44,220 --> 00:04:45,960
Those intervals would overlap.

58
00:04:45,960 --> 00:04:51,720
And then finally, we have to be able to say that probability is proportional to interval size, which

59
00:04:51,720 --> 00:04:57,780
just means that if we determine that the rate at which planes pass over the House is, let's say, ten

60
00:04:57,780 --> 00:04:59,960
planes per hour, then we need to be able to.

61
00:05:00,040 --> 00:05:06,260
To expect that in any two hour interval we should see 20 planes pass over the House.

62
00:05:06,260 --> 00:05:09,350
In a three hour interval, we should see 30 planes pass over the House.

63
00:05:09,350 --> 00:05:17,990
So that probability that mean or rate lambda should scale up or scale down as we scale up or scale down

64
00:05:17,990 --> 00:05:19,210
the interval size.

65
00:05:19,220 --> 00:05:25,370
If all of these things hold, then we can model the situation with a Poisson random variable and its

66
00:05:25,370 --> 00:05:31,010
associated Poisson distribution, and we can use those two things to calculate probability of a certain

67
00:05:31,010 --> 00:05:35,440
number of K successes per unit or per interval.

68
00:05:35,450 --> 00:05:40,160
Now, when it comes to calculating probability, here's what that looks like.

69
00:05:40,580 --> 00:05:49,220
Our probability formula for the probability of K successes in one unit or one interval is lambda to

70
00:05:49,220 --> 00:05:54,740
the power of K, where K is an exponent here multiplied by e to the negative lambda.

71
00:05:54,740 --> 00:05:56,690
So negative lambda is the exponent.

72
00:05:56,690 --> 00:05:59,540
This value e here is Euler's number.

73
00:05:59,540 --> 00:06:00,860
Technically it's a constant.

74
00:06:00,860 --> 00:06:06,080
It's equal to about 2.7 approximately, but it's an infinite decimal number.

75
00:06:06,080 --> 00:06:13,280
Any calculator you use will have a button for Euler's number and then we're dividing by K factorial.

76
00:06:13,280 --> 00:06:20,120
So just to see what this looks like, let's say we're taking an example where we have a customer support

77
00:06:20,120 --> 00:06:26,720
team that receives a certain number of email support tickets per hour and they're responsible for processing

78
00:06:26,720 --> 00:06:28,310
those support tickets.

79
00:06:28,310 --> 00:06:34,640
Maybe we'll say that they receive on average four support tickets per hour such that Lambda is equal

80
00:06:34,640 --> 00:06:41,150
to four, and we then want to investigate probability around the team receiving zero support tickets

81
00:06:41,150 --> 00:06:44,780
in an hour, one support ticket in an hour to support tickets, etc..

82
00:06:44,780 --> 00:06:50,780
Well, if we first check to see that this is a Poisson process, let's bring back our criteria here.

83
00:06:50,900 --> 00:06:54,170
We are, in fact, looking for occurrences in an interval.

84
00:06:54,170 --> 00:06:58,580
We're looking for a certain number of email support tickets per hour.

85
00:06:58,580 --> 00:07:02,030
So we have email support tickets in an interval of time.

86
00:07:02,030 --> 00:07:04,310
That's the kind of situation we're modeling.

87
00:07:04,310 --> 00:07:05,690
So that checks out.

88
00:07:05,720 --> 00:07:10,070
We're saying based on data that we have, that the mean is constant in each interval.

89
00:07:10,070 --> 00:07:17,150
Maybe we have a global company and on average we do get about four support tickets per hour and that

90
00:07:17,150 --> 00:07:20,660
doesn't fluctuate significantly from one hour to the next.

91
00:07:20,660 --> 00:07:22,490
So we meet that condition.

92
00:07:22,490 --> 00:07:27,890
We'll say that the count in each interval is independent because the support tickets we receive in one

93
00:07:27,890 --> 00:07:34,190
hour don't necessarily indicate that we'll receive more or less support tickets in the next hour.

94
00:07:34,190 --> 00:07:40,160
So the interval counts are independent, Our intervals are non overlapping because we're going to look

95
00:07:40,160 --> 00:07:44,150
at each hour of the day without overlapping those hours.

96
00:07:44,150 --> 00:07:47,360
So we'll start maybe at the 8:00 hour in the morning.

97
00:07:47,360 --> 00:07:50,720
Then we'll look at the 9:00 hour, the 10:00 hour, 11:00 hour.

98
00:07:50,720 --> 00:07:53,180
We're not going to overlap our intervals.

99
00:07:53,180 --> 00:07:58,880
So we'll hold to that and we'll say that probability is proportional to interval size, because if we're

100
00:07:58,880 --> 00:08:04,010
getting about four support tickets per hour, we can expect eight support tickets every 2 hours, 12

101
00:08:04,010 --> 00:08:06,050
support tickets every 3 hours, roughly.

102
00:08:06,050 --> 00:08:08,840
So we'll say that we meet this condition as well.

103
00:08:08,840 --> 00:08:11,150
So that's kind of the idea there.

104
00:08:11,150 --> 00:08:17,570
If we say that this situation can be modeled by a Poisson random variable, then here's what probability

105
00:08:17,570 --> 00:08:19,250
looks like for this situation.

106
00:08:19,340 --> 00:08:25,250
So starting to calculate probability, what we want to look at is the probability that this customer

107
00:08:25,250 --> 00:08:29,840
support team gets zero support emails in an hour.

108
00:08:29,840 --> 00:08:33,080
Well, in that case, K is equal to zero.

109
00:08:33,080 --> 00:08:39,799
We already said that on average the support team receives Lambda equals four support tickets per hour.

110
00:08:39,799 --> 00:08:47,510
So with lambda equal to four and K equal to zero, we make substitutions into our probability formula

111
00:08:47,660 --> 00:08:54,710
and we say that we get four to the zero multiplied by E to the negative four divided by zero factorial.

112
00:08:54,740 --> 00:08:57,620
Remember that zero factorial is equal to one.

113
00:08:57,620 --> 00:09:00,320
We can use a calculator to find this value.

114
00:09:00,320 --> 00:09:07,190
If we round to four decimal places, it's approximately equal to 0.0183 or just under 2%.

115
00:09:07,190 --> 00:09:15,050
So there's a slightly less than 2% chance that the customer service team will receive exactly zero support

116
00:09:15,050 --> 00:09:18,650
emails when on average they receive four in an hour.

117
00:09:18,650 --> 00:09:22,970
So slightly less than 2% chance that they'll receive zero in any given hour.

118
00:09:22,970 --> 00:09:24,860
And then we keep everything else the same.

119
00:09:24,860 --> 00:09:29,960
We just change the value of K from 0 to 1 and then to two and then to three.

120
00:09:29,960 --> 00:09:35,570
And these are the probabilities that we get that are associated with the service team receiving exactly

121
00:09:35,570 --> 00:09:38,840
one customer support ticket in any given hour.

122
00:09:38,840 --> 00:09:39,530
Exactly.

123
00:09:39,530 --> 00:09:40,550
Two tickets.

124
00:09:40,550 --> 00:09:41,990
Exactly three tickets.

125
00:09:41,990 --> 00:09:49,190
And what you can see is that as we get closer to the lambda equals four mean value, the probability

126
00:09:49,190 --> 00:09:50,030
is rising.

127
00:09:50,030 --> 00:09:56,420
So since lambda was equal to four, the further we get away from four, the smaller probability we expect

128
00:09:56,420 --> 00:09:56,810
to see.

129
00:09:56,810 --> 00:09:59,680
So the probability that they get zero tickets is just.

130
00:09:59,740 --> 00:10:00,790
Under 2%.

131
00:10:00,880 --> 00:10:04,840
The probability that they get one ticket is a little over 7%.

132
00:10:04,870 --> 00:10:09,010
The probability they get two tickets is about 14 and one half percent.

133
00:10:09,010 --> 00:10:13,100
And the probability that they get three tickets is about 19 and a half percent.

134
00:10:13,120 --> 00:10:20,800
And then if we calculate probability for exactly four customer service emails and then five, six,

135
00:10:20,800 --> 00:10:26,500
seven, eight, we see that the probability that they get four emails is about nine and a half percent.

136
00:10:26,500 --> 00:10:32,800
And then just like we saw on this side of lambda equals four, on the other side of lambda equals four,

137
00:10:32,800 --> 00:10:35,200
we see probability start to drop off as well.

138
00:10:35,200 --> 00:10:42,820
And if we plot these probabilities in a distribution, what we see is the Poisson distribution or the

139
00:10:42,820 --> 00:10:48,400
probability mass function that's associated with this discrete random variable, this Poisson discrete

140
00:10:48,400 --> 00:10:49,360
random variable.

141
00:10:49,360 --> 00:10:55,810
And it makes sense that if Lambda is equal to four, we see these higher probability values clustered

142
00:10:55,810 --> 00:10:59,290
around that mean value of lambda equals four.

143
00:10:59,290 --> 00:11:05,780
And then on both sides of lambda equals four, the probability tapers off or decreases in both directions.

144
00:11:05,800 --> 00:11:09,640
Now realize here that on the left hand side of lambda equals four.

145
00:11:09,640 --> 00:11:14,680
We stopped at zero because of course it doesn't make sense to calculate the probability that the customer

146
00:11:14,680 --> 00:11:18,000
service team receives negative one emails in an hour.

147
00:11:18,010 --> 00:11:19,390
That's nonsensical.

148
00:11:19,420 --> 00:11:23,110
The fewest emails that they can receive is zero emails.

149
00:11:23,110 --> 00:11:27,400
So that's sort of the left hand bound of our probability distribution.

150
00:11:27,490 --> 00:11:31,120
But on the right hand side, the distribution is unbounded.

151
00:11:31,120 --> 00:11:36,700
So we could have actually continued to calculate the probability that they get nine emails, ten emails,

152
00:11:36,700 --> 00:11:43,210
11 emails, 12 emails, etc. on to infinity, and we could have found smaller and smaller and smaller

153
00:11:43,210 --> 00:11:45,100
probability values forever.

154
00:11:45,100 --> 00:11:50,680
But because those values get so insignificant at a certain point, it makes sense just to cut it off.

155
00:11:50,680 --> 00:11:55,960
So this last value that we calculated here was the probability that K is greater than eight.

156
00:11:55,960 --> 00:12:01,930
We calculated the probability that they receive exactly eight emails, and then instead of calculating

157
00:12:01,930 --> 00:12:08,110
the probability that they receive nine emails, remember that because probability always sums to one,

158
00:12:08,110 --> 00:12:15,340
we took one and then we subtracted all of these values that we calculated up through K equals eight.

159
00:12:15,340 --> 00:12:27,220
So we took one -0.0183 -0.0733 -0.1465, etc. Continuing to subtract values all the way until we got

160
00:12:27,220 --> 00:12:30,550
to this last value and we subtracted 0.0298.

161
00:12:30,550 --> 00:12:37,510
And the result of subtracting all of these values from K equals zero to K equals eight from one was

162
00:12:37,510 --> 00:12:39,280
0.0 to 1 three.

163
00:12:39,280 --> 00:12:46,090
So the remaining probability, the probability that they receive nine or more emails in any given hour,

164
00:12:46,090 --> 00:12:51,970
the probability that they receive nine emails or ten emails or 11 emails all the way on to literally

165
00:12:51,970 --> 00:12:57,670
an infinitely large number of emails, all that probability summed up together is a little more than

166
00:12:57,670 --> 00:12:59,930
0.02 or 2%.

167
00:12:59,950 --> 00:13:04,600
So there's just over a 2% chance that they receive nine or more emails.

168
00:13:04,600 --> 00:13:09,370
And we've summarized that here with this last line, and we've also summarized it here in the distribution

169
00:13:09,370 --> 00:13:13,990
to show the probability that they receive nine or more emails in any given hour.

170
00:13:13,990 --> 00:13:20,260
So those are the conditions under which we would use a Poisson random variable and how it differs from

171
00:13:20,260 --> 00:13:22,840
the binomial random variable we looked at earlier.

172
00:13:22,840 --> 00:13:29,620
And then this is the formula we use to calculate probability of K successes in any of the intervals

173
00:13:29,620 --> 00:13:36,400
that we defined, and then how to take all of those probability calculations and use them to create

174
00:13:36,400 --> 00:13:40,780
the Poisson distribution for the Poisson random variable.

