1
00:00:03,690 --> 00:00:08,700
The normal gasoline distribution is one of the most important probability density functions when it

2
00:00:08,700 --> 00:00:14,670
comes to data fusion, the distribution is a very natural way of expressing an estimated value and the

3
00:00:14,670 --> 00:00:16,980
associated uncertainty of the estimate.

4
00:00:17,490 --> 00:00:22,590
The whole distribution is continuous, which makes it a mathematically nice function to operate with.

5
00:00:23,280 --> 00:00:27,510
The shape of the distribution is shown here along with its function.

6
00:00:28,140 --> 00:00:31,650
The complete distribution is fully described by two parameters.

7
00:00:31,650 --> 00:00:36,780
It is described by the main and the variance, and it's usually notated up here.

8
00:00:36,780 --> 00:00:41,550
So we have the capital N for the normal distribution has the main and has a variance.

9
00:00:42,240 --> 00:00:47,910
Now the fact that it's the food distribution is described by only these two parameters to get this complete

10
00:00:47,910 --> 00:00:52,100
shape makes it a very compact way of describing the distribution.

11
00:00:53,220 --> 00:00:59,470
As usual, the complete area under the curve under the distribution has to equal one for it to be a

12
00:00:59,490 --> 00:01:00,820
valid PDF.

13
00:01:01,680 --> 00:01:07,770
This is why there's a normalization factor out of this distribution as variance of the distribution

14
00:01:07,770 --> 00:01:08,450
shrinks.

15
00:01:08,460 --> 00:01:14,640
So as the edges get closer, the likelihood or value of the distribution has to increase to maintain

16
00:01:14,640 --> 00:01:16,380
the unit area under the curve.

17
00:01:16,980 --> 00:01:21,930
The majority of the random values produced by the distribution are clustered around the main or the

18
00:01:21,930 --> 00:01:27,750
peak value, and they become less probable as they move further out either side of the main.

19
00:01:28,380 --> 00:01:34,740
As the main position shifts the take shifts as a variance, Frinks regrows the spread of the distribution

20
00:01:34,740 --> 00:01:39,510
shrinks or growers are calling a very useful property of the Gaussian distribution.

21
00:01:39,510 --> 00:01:44,960
Is the Three Sigma rule the probability that the random value is within one signal of the main.

22
00:01:45,240 --> 00:01:49,790
So if the value lies in this area, here is 68 percent.

23
00:01:50,220 --> 00:01:57,690
So if we integrate the PADF between around the main so the main minus one sigma all the way up to the

24
00:01:57,690 --> 00:02:03,000
main plus one sigma, if we integrate the probability density function, we get point six eight.

25
00:02:03,630 --> 00:02:05,140
So sixty eight percent.

26
00:02:05,850 --> 00:02:10,680
Now if we do the same thing for two sigma, so if we integrate the area under the curve between these

27
00:02:10,680 --> 00:02:15,990
two bands here, we then get a probability of point nine, five or 95 percent.

28
00:02:16,470 --> 00:02:21,040
And then again, if we do it with three sigma, so we integrate the complete area between these two

29
00:02:21,040 --> 00:02:25,980
elements here we end up with a probability of point nine, nine or 99 percent.

30
00:02:26,870 --> 00:02:32,160
This gives us a likely bound on the random number that might be produced by the random variable, it

31
00:02:32,180 --> 00:02:37,700
conversely also gives us a measure of how likely any value might be consistent with the random variable

32
00:02:37,700 --> 00:02:38,570
or distribution.

33
00:02:39,080 --> 00:02:44,060
We call it the three sigma rule, because pretty much any value produced by the distribution is most

34
00:02:44,060 --> 00:02:48,220
likely to be contained within plus or minus three sigma of the main value.

35
00:02:48,380 --> 00:02:51,950
And we can be 99 percent confident that this value is.

36
00:02:53,190 --> 00:02:58,710
One of the most important properties of the Gaussian distribution is the fact that any linear transformation

37
00:02:58,710 --> 00:03:02,580
of the Gaussian distribution is another Gaussian distribution.

38
00:03:03,150 --> 00:03:08,430
This will be explored later on in depth as it is a key property for many of the estimation processes

39
00:03:08,430 --> 00:03:08,990
later on.

40
00:03:09,570 --> 00:03:15,630
But basically this allows the computationally expensive operations involving convolutions or integrations

41
00:03:15,630 --> 00:03:22,050
of the probability density functions to be simplified down to a transformation of the main and various

42
00:03:22,050 --> 00:03:27,220
parameters themselves, rather than carrying out the complete transformation of the Paideia function.

43
00:03:27,990 --> 00:03:30,000
And again, we will look at this in more detail.

44
00:03:30,000 --> 00:03:33,120
But this is a very important concept of the Gaussian distribution.

45
00:03:34,300 --> 00:03:39,140
So now let's have a look at an estimation example to highlight the usefulness of this distribution.

46
00:03:39,910 --> 00:03:42,700
Imagine that we're trying to estimate a position of a car.

47
00:03:43,090 --> 00:03:48,730
So the estimation process gives us a Gaussian distribution for the estimate of the car described by

48
00:03:48,730 --> 00:03:50,310
a mean and variance.

49
00:03:50,320 --> 00:03:56,290
So the distribution for the position estimate looks something like this and it can be described using

50
00:03:56,290 --> 00:03:57,150
these numbers here.

51
00:03:57,160 --> 00:03:59,380
So imagine we have an estimate of the precision.

52
00:03:59,890 --> 00:04:06,250
It has a main value of one hundred and twenty five meters and a variance of four square meters squared.

53
00:04:07,360 --> 00:04:09,470
So we know the most likely position of the car.

54
00:04:09,490 --> 00:04:11,290
And we also know how good the estimate is.

55
00:04:11,560 --> 00:04:17,800
So using the three sigma rule, we know that the composition has to be within plus or minus three sigma

56
00:04:17,800 --> 00:04:22,280
of the main and we can be ninety nine point eight percent sure of this confidence.

57
00:04:23,230 --> 00:04:25,530
This gives us a bounce on a position estimate.

58
00:04:25,900 --> 00:04:31,900
So therefore the estimated position must be one hundred and twenty five meters, plus or minus three

59
00:04:31,900 --> 00:04:33,260
sigma or 12 meters.

60
00:04:33,790 --> 00:04:38,440
So this is a very useful way of expressing an estimated position and it's uncertainty.

61
00:04:40,250 --> 00:04:44,960
So hopefully you can see why the Gaussian distribution is a very nice way of expressing an estimated

62
00:04:44,960 --> 00:04:50,690
value and its associated uncertainty, and this is going to be very useful later on when we look in-depth

63
00:04:50,690 --> 00:04:52,520
at different estimation processes.

64
00:04:54,610 --> 00:05:00,370
We can extend the Gaussian distribution from a one dimensional example using a single random variable

65
00:05:01,300 --> 00:05:05,230
into a higher order, Gaussian distribution using a random vector.

66
00:05:06,040 --> 00:05:12,610
So now if we use a random vector, we can describe the distribution or the Gaussian distribution based

67
00:05:12,610 --> 00:05:17,430
on our main vector X bar and covariance matrix, C of X.

68
00:05:17,950 --> 00:05:21,980
And when we do this, we get this equation here for the Gaussian distribution.

69
00:05:22,450 --> 00:05:26,350
So for a two dimensional Gaussian distribution, we'll end up with something that looks like this.

70
00:05:26,860 --> 00:05:30,380
So this is an example, distribution for a third Gaussian.

71
00:05:30,760 --> 00:05:37,000
So basically, instead of having a single dimension, we now have multiple dimensions and we can work

72
00:05:37,000 --> 00:05:40,240
out a Gaussian distribution of any dimension that we want.

73
00:05:40,360 --> 00:05:43,610
So it could be a three dimensional, four dimensional, five dimensional Gaussian.

74
00:05:44,230 --> 00:05:45,330
It could be any number.

75
00:05:45,370 --> 00:05:49,850
It just becomes very difficult to visualize higher order Gaussian density functions.

76
00:05:49,870 --> 00:05:52,000
So in a third case is fairly simple.

77
00:05:52,660 --> 00:05:58,750
In the three case is a 3D ellipsoid, but in the higher order terms, it's very difficult to visualize

78
00:05:58,750 --> 00:05:58,900
it.

79
00:06:00,320 --> 00:06:06,320
In a multidimensional Gaussian distribution, the main shifts the center of the distribution, the variance

80
00:06:06,320 --> 00:06:11,630
controls the spread in the different axes, while the crosscourt variances control the orientation of

81
00:06:11,630 --> 00:06:12,440
the distribution.

82
00:06:13,040 --> 00:06:20,300
So if we look at a two dimensional example here, so we have a variance in the X and the variance in

83
00:06:20,300 --> 00:06:20,750
the Y.

84
00:06:21,970 --> 00:06:26,940
We have the main being, this case going to be zero zero, so that shifts where the origin is.

85
00:06:27,400 --> 00:06:33,490
We also have the covariance and that covariance controls the angle in this case for the 2D example of

86
00:06:33,490 --> 00:06:34,420
the Ellipse here.

87
00:06:34,450 --> 00:06:39,430
So this ellipse drawing here is going to be the one sigma uncertainty ellipse.

88
00:06:39,880 --> 00:06:45,850
If we look at the two dimensional Gaussian distribution and trace a line where sigma equals one, we

89
00:06:45,850 --> 00:06:46,930
end up with an ellipse.

90
00:06:48,250 --> 00:06:53,290
So in this case, we're going to have a non-zero across convergences because it's Ellipse is not aligned

91
00:06:53,290 --> 00:06:57,340
with the X or Y axis, it is shifted or is being rotated.

92
00:06:57,520 --> 00:07:02,830
So there's some cross correlations between the X and Y axis of this Gaussian distribution.

93
00:07:03,850 --> 00:07:09,520
The more dimensional Gaussian distribution becomes very important when we start looking at data fusion

94
00:07:10,370 --> 00:07:15,790
so we can look at a Gaussian distribution to describe the uncertainty of our estimate, where each state

95
00:07:16,000 --> 00:07:19,690
is basically a random value inside a random variable vector.

96
00:07:20,830 --> 00:07:27,340
This means we can describe the accuracy of the estimation process using a multidimensional Gaussian

97
00:07:27,340 --> 00:07:28,060
distribution.