1
00:00:09,140 --> 00:00:14,510
The extended common Philidor, or JKF, is a variant of the linnear common photo that we've covered

2
00:00:14,510 --> 00:00:21,110
before, this is a modified version of the linear common filter that can operate on non-linear systems.

3
00:00:23,670 --> 00:00:29,670
The nonlinear system that it is operating on is approximated as a linear system, using a first order

4
00:00:29,670 --> 00:00:33,020
approximation around a given state and covariance.

5
00:00:33,300 --> 00:00:35,790
This is known as the linear ization point.

6
00:00:37,080 --> 00:00:43,140
The state and the covariance that this approximation is made about is recursively updated as the state

7
00:00:43,140 --> 00:00:46,780
and covariance are updated as time goes on and the system evolves.

8
00:00:47,370 --> 00:00:54,450
This allows us to always maintain a linear system or a linear approximation that is approximating the

9
00:00:54,450 --> 00:00:57,540
nonlinear system, even though the system is changing.

10
00:00:59,070 --> 00:01:04,740
So since the filter now works on non-linear systems, we want to look at the discrete time nonlinear

11
00:01:04,740 --> 00:01:07,810
system models that the system is going to be operating on.

12
00:01:08,610 --> 00:01:13,800
So the nonlinear process model that we're going to be using in this filter can be represented as a function

13
00:01:13,800 --> 00:01:14,600
that looks like this.

14
00:01:14,940 --> 00:01:18,000
So we're going to have a function that calculates to current state.

15
00:01:18,450 --> 00:01:20,350
So this is the current state vector of time.

16
00:01:21,090 --> 00:01:26,040
And this function is going to have a number of inputs is going to take in the previous state for time

17
00:01:26,040 --> 00:01:32,790
K minus one is going to take in the current control input for Time K and the current noise input for

18
00:01:32,790 --> 00:01:33,500
time K.

19
00:01:34,260 --> 00:01:37,470
We're also going to have a different model for the measurement model.

20
00:01:37,500 --> 00:01:43,080
So now we're going to have a nonlinear function and this nonlinear function is going to take in the

21
00:01:43,080 --> 00:01:49,110
current state tomake the noise input on the measurement at TOMKA and this is going to give us the measurement

22
00:01:49,230 --> 00:01:49,860
at times.

23
00:01:50,700 --> 00:01:55,450
So again, this is now going to be a nonlinear model rather than having a linear matrix model.

24
00:01:55,470 --> 00:01:59,430
We're going to have a nonlinear model to represent anything that we want.

25
00:02:01,420 --> 00:02:05,920
So the definition of the different matrices and what we call them haven't changed, we still have the

26
00:02:05,920 --> 00:02:06,770
state vector.

27
00:02:06,790 --> 00:02:08,640
We still have the control input vector.

28
00:02:08,650 --> 00:02:10,480
We have the process model noise.

29
00:02:10,480 --> 00:02:12,580
And we have the measurement model noise.

30
00:02:13,240 --> 00:02:17,950
And again, we still have the same assumptions that we use in the linear coming filter.

31
00:02:18,230 --> 00:02:20,730
We know the properties of the noise parameters.

32
00:02:21,220 --> 00:02:27,670
We know that the noise vector for our presence model is going to be a normal distribution with zero

33
00:02:27,670 --> 00:02:29,020
main and covariance.

34
00:02:30,670 --> 00:02:36,020
We know that the measurement model noise out the K is going to be another normal distribution of zero

35
00:02:36,030 --> 00:02:39,080
amount and has a covariance matrix of OK.

36
00:02:39,700 --> 00:02:44,200
So these assumptions are the same and we also make the same assumptions about the correlations.

37
00:02:44,200 --> 00:02:50,290
So we know that these two processes are not correlated with the time and they're independent from each

38
00:02:50,290 --> 00:02:50,470
other.

39
00:02:50,480 --> 00:02:54,420
So there's no correlation between the process of noise and the measurement model noise.

40
00:02:54,910 --> 00:02:58,750
So these assumptions are standard for the linear comment here and there.

41
00:02:59,020 --> 00:03:02,560
They are as equally applicable for the extended common filter.

42
00:03:04,560 --> 00:03:10,830
So when we compare the linear material to the extent Carmencita, the linear coming field, it assumes

43
00:03:10,830 --> 00:03:11,870
linear dynamics.

44
00:03:11,910 --> 00:03:13,640
So we've seen this equation before.

45
00:03:13,650 --> 00:03:17,130
This is the process model for the linear common field.

46
00:03:17,610 --> 00:03:23,970
So we have these process model matrixes, F, G and L that take in the different measurement vectors

47
00:03:23,970 --> 00:03:28,110
and vectors and calculate the new state for the extended coming.

48
00:03:28,530 --> 00:03:30,680
We're going to assume a nonlinear dynamics.

49
00:03:30,690 --> 00:03:35,730
So we're going to have this general function f which can be any function, it could be a linear function

50
00:03:35,730 --> 00:03:37,230
or it could be a nonlinear function.

51
00:03:38,370 --> 00:03:42,620
Same thing happens for the measurement model in the linear coming through.

52
00:03:42,660 --> 00:03:47,340
We've assumed a linear measurement model model, so we assumed that we had this sense of measurement

53
00:03:47,340 --> 00:03:54,990
matrix, height, cheer and noise model matrix m and that the current measurement is a linear combination

54
00:03:54,990 --> 00:03:58,770
of the state and the noise to extend the common field.

55
00:03:59,190 --> 00:04:04,470
We're going to assume that we have a nonlinear measurement model, so we assume that we have nonlinear

56
00:04:04,470 --> 00:04:05,030
measurements.

57
00:04:05,040 --> 00:04:11,850
So our measurement said K, is given by a function which can be a nonlinear function of our current

58
00:04:11,850 --> 00:04:13,650
state and our current noise.

59
00:04:14,490 --> 00:04:20,130
So just like before the linear common filter and the extent of Kamasutra both make the same assumptions

60
00:04:20,130 --> 00:04:22,260
about the noise properties of the system.

61
00:04:22,620 --> 00:04:28,950
We assume that we have our galaxy in noise for our process model and for our measurement model, and

62
00:04:28,950 --> 00:04:34,070
that they have known variances and they're independent and non correlated with time.

63
00:04:36,090 --> 00:04:39,030
So we can compare the properties of the two different filters.

64
00:04:39,270 --> 00:04:45,370
Now, the linear common field, we know from past that it is the best linear estimator for the system.

65
00:04:45,870 --> 00:04:48,740
However, this cannot be said for the extended incoming filter.

66
00:04:49,080 --> 00:04:53,130
Now that we're operating on a nonlinear system, it is no longer the best estimate.

67
00:04:53,610 --> 00:04:56,760
It is, however, the best linear estimate of the system.

68
00:04:58,570 --> 00:05:02,950
The Linnear coming through had some very nice properties, it was stable for any initial conditions

69
00:05:02,950 --> 00:05:09,030
and any probation's in state, which is basically saying that no matter where we initialize to filter,

70
00:05:09,220 --> 00:05:11,830
the filter would always converge to the truth.

71
00:05:12,370 --> 00:05:17,650
Unfortunately, since we're using linear approximations for the extent to come and filter, this no

72
00:05:17,650 --> 00:05:18,790
longer can be said.

73
00:05:19,330 --> 00:05:26,200
So the extend current filter, unfortunately, does not make any guarantees on stability of the system,

74
00:05:26,200 --> 00:05:30,670
and it can diverge if the current state gets too far away from the true state.

75
00:05:31,570 --> 00:05:37,540
So what this actually means is that the estimated state must always be close to the true state or the

76
00:05:37,540 --> 00:05:43,480
filter can diverge, which means the errors can't grow too large and the initial conditions must be

77
00:05:43,480 --> 00:05:44,470
close to the truth.

78
00:05:44,470 --> 00:05:46,600
So we can't just set any initial condition.

79
00:05:46,780 --> 00:05:52,180
We have to have some information about the true state of the system so that we can correctly initialize

80
00:05:52,180 --> 00:05:54,730
the filter so the filter doesn't diverge.

81
00:05:55,390 --> 00:06:01,390
Despite its limitations, the IQF has shown itself to be very capable in the real world, especially

82
00:06:01,390 --> 00:06:07,630
for navigation models or kinetic models such as for inertial navigation or estimating the states of

83
00:06:07,630 --> 00:06:08,590
dynamic systems.

84
00:06:09,310 --> 00:06:14,290
This is why the extended common filter is one of the most widely used filters in engineering at the

85
00:06:14,290 --> 00:06:14,700
moment.

86
00:06:16,830 --> 00:06:22,350
Now investigate a bit more into the actual definition or mathematical definition of the extent to come

87
00:06:22,350 --> 00:06:26,330
and filter the extended comment filter is a recursive estimate.

88
00:06:26,670 --> 00:06:32,340
So is a predictor correct, to form, which means we make predictions and then we use extra information

89
00:06:32,340 --> 00:06:34,590
to correct that prediction, to minimize the error.

90
00:06:35,250 --> 00:06:41,600
And this solves the linear quadratic estimation problem using the minimum means squared error method.

91
00:06:42,300 --> 00:06:46,110
So mathematically, this can be expressed as this equation here.

92
00:06:46,680 --> 00:06:51,700
Now, this might look complicated, but is actually just explaining a very simple concept.

93
00:06:52,830 --> 00:06:54,720
So what this equation is calculating.

94
00:06:54,720 --> 00:06:57,770
It is calculating the state estimate for TimeStep.

95
00:06:57,780 --> 00:07:01,470
I given all the information up into TimeStep J.

96
00:07:01,890 --> 00:07:03,930
So this is a state estimate for time.

97
00:07:03,930 --> 00:07:08,290
I given all the information on measurements up to time J.

98
00:07:08,730 --> 00:07:10,970
Now I and J don't have the same number.

99
00:07:11,010 --> 00:07:17,340
They can be the different intervals and the coming for that works is spelled out by a optimization process

100
00:07:17,490 --> 00:07:24,770
or working out the minimum expected main squared error over here inside the expectation operator.

101
00:07:24,930 --> 00:07:26,870
We have our EXI.

102
00:07:26,880 --> 00:07:29,590
So this is the current true state of the system.

103
00:07:30,270 --> 00:07:36,660
This is the true state and we're subtracting our estimate of here to get the estimation error.

104
00:07:36,930 --> 00:07:41,970
So what we want to do is we want to make this estimation error as small as possible, which is why this

105
00:07:41,970 --> 00:07:43,150
is the ARG min.

106
00:07:43,350 --> 00:07:50,610
So this is working out the value of our state estimate to minimize this error such that as we do this,

107
00:07:50,610 --> 00:07:53,580
our estimate should go towards our true state.

108
00:07:54,450 --> 00:07:58,190
This term here is just looking at the estimation error squared.

109
00:07:58,200 --> 00:07:59,850
So we're just taking the same time here.

110
00:07:59,850 --> 00:08:06,090
Transposing it is just a fancy way of calculating the estimation error squared over here.

111
00:08:06,120 --> 00:08:12,720
We want to work out this expectation of of this probabilistic distribution given all these measurements.

112
00:08:12,750 --> 00:08:18,840
So we want to calculate, given all the information from measurement, said one all the way up to measurement,

113
00:08:18,840 --> 00:08:19,980
said J.

114
00:08:20,490 --> 00:08:23,190
So these eye injuries are given over here.

115
00:08:23,850 --> 00:08:30,060
So this is what the common filter is actually doing, is calculating the state estimate that minimizes

116
00:08:30,060 --> 00:08:34,510
this equation, which is the expectation of the estimation error squared.

117
00:08:34,860 --> 00:08:37,020
Given all these measurement information.

118
00:08:39,360 --> 00:08:45,540
So let's recap the different estimation terms starting off, we have our X, which is the true side

119
00:08:45,540 --> 00:08:46,300
of the system.

120
00:08:46,320 --> 00:08:49,010
This is what we don't know, but this is what we want to estimate.

121
00:08:49,620 --> 00:08:53,100
Our estimate of the true state is going to be our X hat.

122
00:08:53,430 --> 00:08:58,410
So this is going to give us our estimate of the system and we want to make our estimate as close to

123
00:08:58,410 --> 00:08:59,930
the true state as possible.

124
00:09:00,450 --> 00:09:05,490
So if we look at the difference between the true state and estimate the state, we get our estimation

125
00:09:05,490 --> 00:09:05,870
error.

126
00:09:05,880 --> 00:09:09,180
And this is going to be indicated by the Tillett or over the top.

127
00:09:11,210 --> 00:09:16,580
So we know that the common filter expresses our state estimate as a probability density function, it

128
00:09:16,580 --> 00:09:18,760
has a mean and it has a covariance.

129
00:09:19,190 --> 00:09:26,580
So when we look at the main of the state estimates, we have our predicted state and we have our update

130
00:09:26,600 --> 00:09:27,090
of the state.

131
00:09:27,830 --> 00:09:35,210
So what this means is that our predicted state for TimeStep K or our apre state is going to be the state

132
00:09:35,210 --> 00:09:40,490
estimate for times, given all the information up to time K minus one.

133
00:09:40,760 --> 00:09:42,380
So this is where the minus comes in.

134
00:09:42,860 --> 00:09:49,270
And what this is, is the expected value of our probability density function of our state, given all

135
00:09:49,280 --> 00:09:56,150
the measurements up to K minus one and of course our updated state estimate our posterior, I estimate

136
00:09:56,390 --> 00:10:04,130
our X plus is going to be a state estimate of TimeStep K given all the information up to TimeStep K,

137
00:10:04,460 --> 00:10:10,220
which is also going to be our expected value of the probability distribution of our state estimate,

138
00:10:10,550 --> 00:10:13,180
given all our measurements up to Tom K.

139
00:10:13,820 --> 00:10:16,030
So this is the predicted state.

140
00:10:16,040 --> 00:10:21,830
This includes all the information up to K minus one, while this is the updated state which uses all

141
00:10:21,830 --> 00:10:23,990
the information up to the current timestep.

142
00:10:25,370 --> 00:10:27,950
So that gives the main of the probability density function.

143
00:10:27,980 --> 00:10:32,010
So now we can look at the covariance of the probability distribution function.

144
00:10:32,450 --> 00:10:38,570
So again, we have our predicted state covariance, our P minus, which is the covariance for Tompsett.

145
00:10:38,570 --> 00:10:45,950
K, given all the information up to TimeStep K minus one, which is simply the expectation of the probability

146
00:10:45,950 --> 00:10:49,220
density function of the estimation error squared.

147
00:10:49,790 --> 00:10:56,420
Given our measurements up to time K minus one hour, Abdellah covariance is simply denoted again by

148
00:10:56,420 --> 00:11:03,350
this P plus the simply the probability density function covariance for our state estimation error for

149
00:11:03,350 --> 00:11:03,650
a time.

150
00:11:03,650 --> 00:11:10,880
Sepak given all the information up to times of K, which is again simply expressed as an expected operator

151
00:11:11,120 --> 00:11:15,260
for our estimation error, given all the information up to TimeStep K.

152
00:11:16,100 --> 00:11:19,810
So again, to recap, we can look at what this means in draft form.

153
00:11:19,820 --> 00:11:25,070
So we have a probability density function, which is a conditional density function of what our state

154
00:11:25,070 --> 00:11:28,280
estimate is going to be given the set of measurements.

155
00:11:29,090 --> 00:11:32,560
And we're going to assume that it's a normal distribution.

156
00:11:32,930 --> 00:11:39,320
So we assume that this can be represented as a normal distribution of main exper and variance sigma

157
00:11:39,320 --> 00:11:39,690
squared.

158
00:11:40,310 --> 00:11:43,870
So our estimate, the state is going to be the main of this distribution.

159
00:11:43,880 --> 00:11:46,070
So this is what these are calculating here.

160
00:11:46,490 --> 00:11:51,830
So our estimate, the state is going to be the main of this density function, which is the expected

161
00:11:51,830 --> 00:11:54,950
value of the probability density function of X.

162
00:11:54,950 --> 00:12:01,130
Given all the measurements said and our covariance of this function here or the spread of the data is

163
00:12:01,130 --> 00:12:06,410
going to be the variance squared, which is going to be the expected operator on the estimation error

164
00:12:06,410 --> 00:12:09,470
squared, given all the measurements.

165
00:12:11,000 --> 00:12:16,790
So to recap, all these terms are exactly the same as the linear common field for the extended common

166
00:12:16,790 --> 00:12:17,080
photo.

167
00:12:17,540 --> 00:12:22,430
But we want to make sure that we understand the basic concepts between what they represent and the difference

168
00:12:22,430 --> 00:12:29,870
between the a priori state and the posterior state, as well as how the states and the covariance map

169
00:12:29,880 --> 00:12:32,750
to the probability density function of our state estimate.
