1
00:00:00,090 --> 00:00:07,080
We've already talked about the idea of a population and a sample where a sample is a subset of the population,

2
00:00:07,080 --> 00:00:13,620
and we're trying to take a sample from the population that is representative of that population.

3
00:00:13,620 --> 00:00:19,110
But now we want to talk about this idea of comparing information about the sample to information about

4
00:00:19,110 --> 00:00:23,430
the population, for instance, the population mean versus the sample mean.

5
00:00:23,430 --> 00:00:29,190
So the first thing we want to say is that when we talk about information that's related to the population,

6
00:00:29,430 --> 00:00:32,310
we call those the parameters of the population.

7
00:00:32,310 --> 00:00:40,680
So the population has the parameters of size mean and standard deviation given by capital nn mu and

8
00:00:40,680 --> 00:00:42,090
sigma respectively.

9
00:00:42,090 --> 00:00:47,490
And in comparison, if we talk about these same characteristics or the same set of information that

10
00:00:47,490 --> 00:00:51,480
relates to the sample, we call those values statistics.

11
00:00:51,480 --> 00:00:58,110
So characteristics of the sample are statistics, characteristics of the population are parameters.

12
00:00:58,110 --> 00:01:03,870
And we already looked at these values before the size of the population is given by capital N, whereas

13
00:01:03,870 --> 00:01:09,780
the size of the sample is given by lowercase M, we indicate the mean of the population and sample with

14
00:01:09,780 --> 00:01:15,660
MU and x bar respectively, and the standard deviations of population and sample are given by Sigma

15
00:01:15,660 --> 00:01:23,340
and S, And the whole idea behind the field of statistics is that we are sampling, we've talked about

16
00:01:23,340 --> 00:01:24,360
sampling already.

17
00:01:24,360 --> 00:01:26,640
We are sampling from the population.

18
00:01:26,850 --> 00:01:30,360
We are collecting statistics from that sample.

19
00:01:30,360 --> 00:01:33,840
For instance, we're looking at the mean and the standard deviation of the sample.

20
00:01:33,870 --> 00:01:41,040
And our goal, our hope is to be able to use these statistics to make inferences about the corresponding

21
00:01:41,040 --> 00:01:42,870
parameters from the population.

22
00:01:42,870 --> 00:01:48,600
In other words, we want to be able to look at the sample mean and use the sample mean to get what we

23
00:01:48,600 --> 00:01:51,180
hope will be an accurate estimate of the population.

24
00:01:51,180 --> 00:01:55,860
Mean because after all, it's these population parameters that we're really interested in.

25
00:01:55,860 --> 00:02:00,390
We want to know maybe the mean and standard deviation for the entire population.

26
00:02:00,390 --> 00:02:06,450
And if we could actually calculate that value specifically by collecting data or surveying the entire

27
00:02:06,450 --> 00:02:08,250
population, we certainly would.

28
00:02:08,250 --> 00:02:14,430
We're only sampling because it's very difficult or impossible to collect data from the entire population.

29
00:02:14,430 --> 00:02:20,880
So the best we can do is sample gather statistics and then use these statistics to make inferences about

30
00:02:20,880 --> 00:02:22,350
the corresponding parameters.

31
00:02:22,350 --> 00:02:30,120
The problem is that if we just collect one sample from the population and we compute a mean and a standard

32
00:02:30,120 --> 00:02:36,390
deviation for that single sample, the values we find for sample mean and sample standard deviation

33
00:02:36,390 --> 00:02:42,090
may or may not be good estimates of the mean and standard deviation of the population.

34
00:02:42,090 --> 00:02:47,490
In other words, if the sample is very representative of the population, if it does a good job representing

35
00:02:47,490 --> 00:02:53,520
the population, then our statistics might be good estimates of the parameters, but we might just happen

36
00:02:53,520 --> 00:02:58,530
to pick a sample that is not very representative of the population or does a bad job representing the

37
00:02:58,530 --> 00:03:05,400
population, in which case our sample statistics are going to do a bad job estimating their corresponding

38
00:03:05,400 --> 00:03:06,690
population parameters.

39
00:03:06,690 --> 00:03:13,320
Now to try to prevent this problem where the statistics we find are bad estimates of the corresponding

40
00:03:13,320 --> 00:03:19,020
parameters, we can think about the idea of taking many samples instead of just one sample.

41
00:03:19,020 --> 00:03:24,810
You can imagine that if we take just one sample, that sample will have its own mean and maybe we call

42
00:03:24,810 --> 00:03:31,290
that sample mean x sub one like this because it's the sample mean for the first sample and the standard

43
00:03:31,290 --> 00:03:36,540
deviation of that first sample will call s sub one, but then we might take another sample.

44
00:03:36,540 --> 00:03:43,560
And that sample mean we might write like this X to with that standard deviation for that second sample

45
00:03:43,590 --> 00:03:45,390
being as two.

46
00:03:45,390 --> 00:03:53,130
And maybe we take a third sample and we compute the mean and standard deviation for the third sample.

47
00:03:53,130 --> 00:03:59,460
And obviously we could continue on and eventually if we keep sampling over and over and over collecting

48
00:03:59,460 --> 00:04:08,550
many, many samples, we end up with a whole set of sample means x1x2x3x4x5 and on and on through many

49
00:04:08,550 --> 00:04:09,090
samples.

50
00:04:09,090 --> 00:04:14,910
So we have this whole set of sample means and it turns out, and this might feel somewhat intuitive,

51
00:04:14,910 --> 00:04:21,990
that as we take many, many sample means more of those sample means will turn out to be closer to the

52
00:04:21,990 --> 00:04:28,830
population mean mu and fewer of those sample means will turn out to be further away from the population.

53
00:04:28,830 --> 00:04:37,170
Mean mu and in fact that set of sample means will form its own probability distribution around the population.

54
00:04:37,170 --> 00:04:44,520
Mean mu, and that probability distribution of sample means will almost always be a normal distribution.

55
00:04:44,520 --> 00:04:51,360
This normal distribution has its own mean and if you think about it, we could call that mean the mean

56
00:04:51,360 --> 00:04:53,100
of all the sample means right.

57
00:04:53,100 --> 00:04:57,900
This is the probability distribution of the sample means the means of all those different samples that

58
00:04:57,900 --> 00:04:59,940
we took and we could calculate a.

59
00:04:59,960 --> 00:05:02,180
I mean, of all of these sample means.

60
00:05:02,180 --> 00:05:08,600
So this probability distribution has its mean right at the center here as the mean of all these sample

61
00:05:08,600 --> 00:05:11,450
means and that mean of sample means.

62
00:05:11,450 --> 00:05:19,550
If we put it in maybe right here we call mu sub x bar because x bar is how we indicate a sample mean.

63
00:05:19,550 --> 00:05:23,000
And so this is the mean of the sample means.

64
00:05:23,000 --> 00:05:29,870
And it turns out that this probability distribution is centered at or has its own mean the mean of the

65
00:05:29,870 --> 00:05:34,850
sample means and that this mean of sample means will be equal to.

66
00:05:34,850 --> 00:05:42,020
And this is the amazing part the population mean and this is in fact the conclusion of the central limit

67
00:05:42,020 --> 00:05:44,810
theorem which is an amazing conclusion.

68
00:05:44,810 --> 00:05:51,410
It tells us that if we continue to take samples and we find a sample mean for each one of those samples

69
00:05:51,410 --> 00:05:54,290
and so we get the sample means x1x2x3.

70
00:05:54,290 --> 00:06:02,060
If we plot those sample means along a number line, then it turns out that more of these sample means

71
00:06:02,060 --> 00:06:04,820
are going to cluster around the population mean.

72
00:06:04,820 --> 00:06:11,210
So we might calculate the first sample mean and find that its value is let's say right here.

73
00:06:11,210 --> 00:06:15,890
And then we calculate a second sample mean and we find out that its value is here.

74
00:06:15,920 --> 00:06:19,820
We calculate a third sample mean and we find out that its value is here.

75
00:06:19,970 --> 00:06:25,610
And as we continue calculating more sample means taking more samples and finding the mean of each sample.

76
00:06:25,610 --> 00:06:32,660
We see that those means start to cluster more heavily around this one value.

77
00:06:32,690 --> 00:06:33,320
Here.

78
00:06:33,320 --> 00:06:41,030
Of course, there's some all over this distribution, but most of them are clustered in the center here

79
00:06:41,030 --> 00:06:42,230
at this value.

80
00:06:42,230 --> 00:06:45,980
And then fewer of them out to the right here and fewer of them out to the left here.

81
00:06:45,980 --> 00:06:51,020
But most of them are clustered around the mean of the sample means, which is going to be equal to the

82
00:06:51,020 --> 00:06:52,100
population mean.

83
00:06:52,100 --> 00:06:58,460
And so this is amazing here because what we're saying is that we have a population and we don't know

84
00:06:58,460 --> 00:07:01,730
the population mean we have no idea what it is.

85
00:07:01,730 --> 00:07:06,860
But with enough sampling, we can create this distribution here.

86
00:07:06,860 --> 00:07:12,320
And this distribution is almost always going to be a normal curve, a technically normal curve, that

87
00:07:12,320 --> 00:07:15,560
bell shaped curve that is symmetrical with its mean.

88
00:07:15,560 --> 00:07:23,860
Here at the center, we call this distribution the sampling distribution of the sample mean or SDS.

89
00:07:23,930 --> 00:07:29,510
M Which makes sense because we're creating a distribution by sampling, and this is the distribution

90
00:07:29,510 --> 00:07:31,040
of all of the sample means.

91
00:07:31,040 --> 00:07:33,650
So it's the sampling distribution of the sample mean.

92
00:07:33,650 --> 00:07:38,690
And the central limit theorem tells us that this distribution will be a normal curve.

93
00:07:38,690 --> 00:07:45,800
So what this is really allowing us to do is turn a population that is non normal, that is not normal

94
00:07:45,800 --> 00:07:49,580
into a distribution that follows a normal curve here.

95
00:07:49,580 --> 00:07:54,980
And once we have this normal distribution, we can convert it into the standard, normal distribution

96
00:07:54,980 --> 00:08:01,610
and use Z scores to answer all kinds of probability questions related to this distribution and in that

97
00:08:01,610 --> 00:08:09,020
way make educated guesses about how accurate our statistics are as estimates of their corresponding

98
00:08:09,020 --> 00:08:10,400
population parameters.

99
00:08:10,400 --> 00:08:15,470
Now, a couple of things we should say before we just move on from the idea that we have a normal curve

100
00:08:15,470 --> 00:08:17,630
here for the sampling distribution of the sample mean.

101
00:08:17,810 --> 00:08:23,660
The first is that if the original population is normally distributed, then the sampling distribution

102
00:08:23,660 --> 00:08:29,540
of the sample mean will also be normally distributed regardless of the sample size that we use.

103
00:08:29,570 --> 00:08:35,600
In other words, if the population is normal, then we don't really have to worry about the size n of

104
00:08:35,600 --> 00:08:36,559
our sample.

105
00:08:36,559 --> 00:08:40,909
We know that the sampling distribution of the sample mean will also be a normal distribution.

106
00:08:40,909 --> 00:08:41,990
It'll be a normal curve.

107
00:08:41,990 --> 00:08:48,020
But if the population is not normally distributed, or if we just don't know whether or not it's normally

108
00:08:48,020 --> 00:08:53,180
distributed, then the sampling distribution of the sample mean is only guaranteed to be normal if we

109
00:08:53,180 --> 00:08:56,900
use a sample size of at least 30.

110
00:08:56,900 --> 00:09:03,920
So n has to be greater than or equal to 30 in order to ensure that this curve is a normal curve.

111
00:09:03,920 --> 00:09:10,610
So if at all possible, it's good practice to use a sample size of 30 or greater to ensure that we're

112
00:09:10,610 --> 00:09:12,200
working with the normal curve here.

113
00:09:12,200 --> 00:09:16,700
If we don't know whether or not the population itself is normal, which will often be the case, we

114
00:09:16,700 --> 00:09:21,200
often will not know for sure whether or not the population is normally distributed.

115
00:09:21,200 --> 00:09:26,480
So now that we have a sampling distribution of the sample mean, we recognize that this is a normal

116
00:09:26,480 --> 00:09:27,020
curve.

117
00:09:27,020 --> 00:09:31,940
And so we can talk about the mean and standard deviation of this specific distribution.

118
00:09:31,940 --> 00:09:37,760
Specifically, we can say that the mean of this distribution, which we said we can think about as the

119
00:09:37,760 --> 00:09:41,450
mean of the sample means is equal to the population mean.

120
00:09:41,450 --> 00:09:43,550
So that's our first point there.

121
00:09:43,550 --> 00:09:51,230
Then the variance of this distribution is going to be equal to sample variance divided by sample size.

122
00:09:51,230 --> 00:09:57,500
So the variance of the sampling distribution of the sample mean will be equal to the sample variance

123
00:09:57,500 --> 00:09:59,630
of the single sample that we took.

124
00:09:59,930 --> 00:10:03,380
Divided by the sample size for that single sample.

125
00:10:03,380 --> 00:10:08,270
And therefore, remember, that standard deviation is always the square root of variance.

126
00:10:08,270 --> 00:10:14,210
So as you might expect, if we take the square root of SE squared divided by MN, the square root of

127
00:10:14,210 --> 00:10:18,920
SE squared is just SW, and the square root of N is the square root of MN.

128
00:10:18,950 --> 00:10:24,590
So the standard deviation of the sampling distribution of the sample mean is se divided by square root

129
00:10:24,590 --> 00:10:30,650
of MN or the standard deviation of our single sample, divided by the square root of the sample size.

130
00:10:30,650 --> 00:10:36,800
And we write that standard deviation as the standard deviation with x bar here, because it's the standard

131
00:10:36,800 --> 00:10:39,470
deviation of the sampling distribution of sample means.

132
00:10:39,470 --> 00:10:43,670
And specifically, this value is important enough that we give it a special name.

133
00:10:43,670 --> 00:10:45,800
We call it the standard error.

134
00:10:45,800 --> 00:10:51,620
In other words, the standard error is another name for the standard deviation of sample means or the

135
00:10:51,620 --> 00:10:53,330
standard deviation of the sampling.

136
00:10:53,330 --> 00:10:54,830
Distribution of the sample means.

137
00:10:54,860 --> 00:10:59,720
Now, of course, just like with any other standard deviation value we've talked about before, when

138
00:10:59,720 --> 00:11:05,480
this standard deviation is larger, when the standard error is larger, it tells us that the sample

139
00:11:05,480 --> 00:11:11,870
means in the sampling distribution of the sample mean are more spread out away from the mean, which

140
00:11:11,870 --> 00:11:19,040
means that any one particular sample mean is less likely to be an accurate representation of the true

141
00:11:19,040 --> 00:11:20,000
population mean.

142
00:11:20,000 --> 00:11:26,450
Whereas when standard error is smaller, when this standard deviation is smaller, it means that all

143
00:11:26,450 --> 00:11:32,990
these sample means are more tightly clustered around this mean here, which means that any one sample

144
00:11:32,990 --> 00:11:38,390
mean that we take is more likely to be an accurate representation of the actual population.

145
00:11:38,390 --> 00:11:38,810
Mean.

146
00:11:38,810 --> 00:11:45,080
In other words, standard error gives us an idea of how likely it is that any given sample mean is an

147
00:11:45,080 --> 00:11:47,750
accurate representation of the true population mean.

148
00:11:47,750 --> 00:11:55,130
So ideally, what we want is a small standard error because the smaller we can get the value of standard

149
00:11:55,130 --> 00:12:01,730
error, the more likely it is that our sample mean for our one single sample that we took from the population,

150
00:12:01,730 --> 00:12:06,290
the more likely it is that that sample mean is a good representation of the population mean, which

151
00:12:06,290 --> 00:12:09,470
is ultimately what we want because we want to know population mean.

152
00:12:09,470 --> 00:12:15,770
So we're hoping that when we take a sample the mean of that sample will be close to the population mean

153
00:12:15,770 --> 00:12:21,080
and the smaller the value of standard error, the more likely that is to be true, the more likely it

154
00:12:21,080 --> 00:12:26,720
is that the mean of the single sample we took is close to the population mean.

155
00:12:26,840 --> 00:12:31,580
So then the idea becomes how do we decrease the value of standard error?

156
00:12:31,580 --> 00:12:33,890
How do we get a really small standard error?

157
00:12:33,890 --> 00:12:39,710
Well, when you have a fraction like this one, we have a sample standard deviation divided by square

158
00:12:39,710 --> 00:12:40,790
root of sample size.

159
00:12:40,790 --> 00:12:47,540
When you have a fraction, there are two ways to decrease the value of the entire fraction.

160
00:12:47,540 --> 00:12:53,720
We can either decrease the value of the numerator, which in this case would mean decreasing sample

161
00:12:53,720 --> 00:13:00,410
standard deviation and or we can increase the size of the denominator, which in this case means increasing

162
00:13:00,410 --> 00:13:01,730
the size of the sample.

163
00:13:01,730 --> 00:13:07,520
Now, there's not a whole lot we can do about the sample standard deviation, because the value of sample

164
00:13:07,520 --> 00:13:11,120
standard deviation is just a result of the data that we collect from the sample.

165
00:13:11,120 --> 00:13:15,980
It just turns out to be whatever value it is based on the sample that we get.

166
00:13:15,980 --> 00:13:22,280
What we can do is increase the sample size and that should make sense to us at an intuitive level.

167
00:13:22,280 --> 00:13:28,910
The larger our sample, the more accurate our statistics are going to be as representations of the parameters.

168
00:13:28,910 --> 00:13:36,920
If our total population, let's say, is 1000 people and we take a sample of just ten people, intuitively

169
00:13:36,920 --> 00:13:40,580
we know that that sample could turn out to be really inaccurate.

170
00:13:40,580 --> 00:13:46,970
It may not be representative of the population at all, but if we have a population of 1000 people and

171
00:13:46,970 --> 00:13:53,300
our sample is, let's say 700 people, we drastically increase the sample size, being able to collect

172
00:13:53,300 --> 00:13:59,990
information from 700 of the 1000 total people, a 700 person sample of a 1000 person population.

173
00:13:59,990 --> 00:14:04,940
We're probably going to get a pretty accurate idea of what that population looks like with a sample

174
00:14:04,940 --> 00:14:05,660
that big.

175
00:14:05,660 --> 00:14:12,830
So as we increase the sample size, we probably got a more accurate sample mean and therefore a smaller

176
00:14:12,830 --> 00:14:14,420
sample standard deviation.

177
00:14:14,420 --> 00:14:21,080
And so our standard error is smaller and it's more likely that our sample mean is an accurate representation

178
00:14:21,080 --> 00:14:22,460
of the population mean.

179
00:14:22,460 --> 00:14:28,610
So the conclusion there is that taking a larger sample is going to help improve the accuracy of the

180
00:14:28,610 --> 00:14:31,070
sample mean as a reflection of the population mean.

181
00:14:31,070 --> 00:14:38,030
But of course taking a larger sample might mean that we have to spend more time or money on our sampling

182
00:14:38,030 --> 00:14:38,480
process.

183
00:14:38,480 --> 00:14:40,580
It might be more difficult to take a larger sample.

184
00:14:40,580 --> 00:14:43,220
So in the real world, we're always balancing these things.

185
00:14:43,220 --> 00:14:48,680
Maybe we want to take as big of a sample as possible, but we're limited by our manpower or money,

186
00:14:48,680 --> 00:14:52,160
our resources, and so the sample we take can only be so big.

187
00:14:52,160 --> 00:14:58,160
Now, the other thing we want to say about these formulas for variance and standard deviation is that

188
00:14:58,160 --> 00:14:59,570
depending on the size.

189
00:14:59,650 --> 00:15:05,680
With our population and the size of our sample, we may need to apply what's called the finite population

190
00:15:05,680 --> 00:15:07,840
correction factor, which looks like this.

191
00:15:07,840 --> 00:15:09,880
So the finite population correction factor.

192
00:15:09,940 --> 00:15:15,820
FPC for short, we'll talk about when we need to apply it, but when we do need to apply it, the formula

193
00:15:15,820 --> 00:15:19,150
for variance changes to this formula.

194
00:15:19,150 --> 00:15:25,420
Here we keep the SE squared over n, but we have to multiply by capital n minus lowercase n.

195
00:15:25,420 --> 00:15:31,480
In other words, population size minus sample size, divided by population size minus one.

196
00:15:31,600 --> 00:15:38,170
And then the formula for standard error or standard deviation of the sample means is the same as divided

197
00:15:38,170 --> 00:15:38,770
by square root.

198
00:15:38,770 --> 00:15:39,210
N.

199
00:15:39,220 --> 00:15:44,470
But then we have to multiply that by square root of this population size minus sample size, divided

200
00:15:44,470 --> 00:15:46,480
by population, size minus one.

201
00:15:46,480 --> 00:15:49,870
Now, when do we have to apply the finite population correction factor?

202
00:15:49,870 --> 00:15:56,740
Well, we have to do it when we are sampling without replacement and or when we're sampling from more

203
00:15:56,740 --> 00:16:00,160
than 5% of a finite population.

204
00:16:00,160 --> 00:16:01,840
So here's what that means.

205
00:16:01,840 --> 00:16:05,800
Remember previously we talked about taking a simple random sample.

206
00:16:05,800 --> 00:16:11,740
Well, whenever we're sampling, ideally we're sampling with replacement, which means that in theory,

207
00:16:11,740 --> 00:16:15,520
the same person can be picked multiple times for our sample.

208
00:16:15,520 --> 00:16:22,180
So, for instance, to take a simple example, let's say our population is all the people who live in

209
00:16:22,180 --> 00:16:25,240
our neighborhood, which maybe is 1500 people.

210
00:16:25,240 --> 00:16:31,030
If we're sampling with replacement, it means that we randomly choose one person in the neighborhood

211
00:16:31,030 --> 00:16:36,730
and we record them as part of our sample, but then we sort of throw them back into the pot and we pick

212
00:16:36,730 --> 00:16:39,340
another person to be included in our sample.

213
00:16:39,340 --> 00:16:43,720
We throw them back in the pot and then we pick a third person, throw them back in the pot, pick a

214
00:16:43,720 --> 00:16:46,990
fourth person, until eventually we have our full sample.

215
00:16:46,990 --> 00:16:52,450
What that means is that because every time we choose to include someone in our sample and then we sort

216
00:16:52,450 --> 00:16:58,060
of put them back in the population, in theory, we could put that same person multiple times for our

217
00:16:58,060 --> 00:16:58,570
sample.

218
00:16:58,570 --> 00:17:03,730
Two times, three times, four times, really, because it's random chance we could pick them every

219
00:17:03,730 --> 00:17:04,960
time for our sample.

220
00:17:04,960 --> 00:17:11,200
That's the idea of sampling with replacement, The same subject object person, whatever it is in the

221
00:17:11,200 --> 00:17:17,440
population can be chosen multiple times for the sample, but sometimes we won't be able to sample with

222
00:17:17,440 --> 00:17:18,160
replacement.

223
00:17:18,190 --> 00:17:25,390
Maybe we're working for a political campaign and we're asking people as they leave their voting location

224
00:17:25,390 --> 00:17:30,520
about how they voted and we're collecting that information for our sample.

225
00:17:30,520 --> 00:17:34,180
Well, obviously that person's only going to leave the voting location one time.

226
00:17:34,180 --> 00:17:37,270
We're going to ask them our questions on the way out the door and then we're never going to see them

227
00:17:37,270 --> 00:17:37,720
again.

228
00:17:37,720 --> 00:17:39,970
In that case, we're sampling without replacement.

229
00:17:39,970 --> 00:17:44,350
It's not possible for us to pick the same person for our sample multiple times.

230
00:17:44,350 --> 00:17:46,300
And so we would be sampling without replacement.

231
00:17:46,300 --> 00:17:51,010
If that's how we're collecting our sample, then we would need to make sure that we include this finite

232
00:17:51,010 --> 00:17:56,110
population correction factor in the formulas for variance and standard deviation.

233
00:17:56,110 --> 00:17:58,570
For this sampling distribution a sample means.

234
00:17:58,570 --> 00:18:04,720
And then the other sampling condition when we apply the FPC is when we are sampling for more than 5%

235
00:18:04,720 --> 00:18:07,030
of essentially a finite population.

236
00:18:07,030 --> 00:18:12,790
So again, going back to our neighborhood example, when our population is the 500 people who live in

237
00:18:12,790 --> 00:18:17,680
our neighborhood, 5% of 4500 is 75 people.

238
00:18:17,680 --> 00:18:25,450
So if we are taking a sample that is 75 people or larger, then we would want to apply this finite population

239
00:18:25,450 --> 00:18:26,410
correction factor.

240
00:18:26,410 --> 00:18:27,940
In that case as well.

241
00:18:27,970 --> 00:18:34,660
We only apply the correction factor in these conditions because outside of these conditions, it turns

242
00:18:34,660 --> 00:18:40,480
out that this finite population correction factor, this value here, turns out to be very, very close

243
00:18:40,480 --> 00:18:41,650
to one.

244
00:18:41,650 --> 00:18:47,710
And when its value is close to one, obviously it's not going to affect the value of variance or standard

245
00:18:47,710 --> 00:18:48,280
deviation.

246
00:18:48,280 --> 00:18:50,320
And so we don't have to include it.

247
00:18:50,320 --> 00:18:57,220
The value of the correction factor drifts away from the value one under these conditions and therefore

248
00:18:57,220 --> 00:19:01,570
does start to have an effect on variance in standard deviation, which is why we have to include it

249
00:19:01,570 --> 00:19:03,370
under these particular conditions.

250
00:19:03,370 --> 00:19:06,670
So that's the idea behind the central limit theorem.

251
00:19:06,850 --> 00:19:13,240
And it's critical because what it allows us to do is take one sample from our population and with this

252
00:19:13,240 --> 00:19:18,700
justification of the central limit theorem telling us that the sampling distribution of sample means

253
00:19:18,700 --> 00:19:25,210
is normal, a normally distributed curve around the mean of sample means, which is equal to the population,

254
00:19:25,210 --> 00:19:31,060
mean What we'll be able to do with that conclusion a little later on is from the one sample.

255
00:19:31,270 --> 00:19:37,180
Calculate the sample mean and sample standard deviation for that one sample.

256
00:19:37,180 --> 00:19:43,870
And then because we have this normal curve and we know how to use Z scores to answer probability questions

257
00:19:43,870 --> 00:19:50,590
about data that's normally distributed, we'll be able to make a statement about how likely it is we'll

258
00:19:50,590 --> 00:19:56,980
be able to give a specific percentage about the likelihood that this sample mean the mean of our one

259
00:19:56,980 --> 00:19:58,900
sample that we took falls within.

260
00:19:59,010 --> 00:20:02,100
Some interval around the population mean.

261
00:20:02,100 --> 00:20:09,630
In other words, we might find a sample mean let's say our sample mean is 14 and then we'll learn later

262
00:20:09,630 --> 00:20:13,380
on about how to calculate an interval around this sample mean.

263
00:20:13,380 --> 00:20:20,520
But let's say our interval is one and one half units to each side of this sample mean, which takes

264
00:20:20,520 --> 00:20:25,980
us to 12.5 on the left and to 15.5 on the right.

265
00:20:26,070 --> 00:20:34,950
And so we end up with this interval of 12.5 to 15.5 and we might be able to make a statement along the

266
00:20:34,950 --> 00:20:45,870
lines of we are 95% confident that the population mean falls somewhere in the interval 12.5 to 15.5,

267
00:20:45,870 --> 00:20:49,530
which of course is a very powerful kind of conclusion.

268
00:20:49,530 --> 00:20:57,360
We can say how confident we are or how likely it is that the actual real population mean will fall within

269
00:20:57,360 --> 00:21:02,790
some particular interval, even though we have no idea what the actual population mean is.

270
00:21:02,790 --> 00:21:09,750
And all we did was take one single sample from that population and calculate a sample mean of 14.

271
00:21:09,750 --> 00:21:16,740
But using just that one data point, just that one statistic, one sample mean because of the power

272
00:21:16,740 --> 00:21:21,270
of and the conclusion of the central limit theorem as it relates to the sampling distribution of the

273
00:21:21,270 --> 00:21:27,780
sample mean we can make a statement about our confidence level that the population mean falls within

274
00:21:27,780 --> 00:21:32,850
some particular interval around the sample mean that we calculated.