1
00:00:11,060 --> 00:00:17,180
OK, so in this video, we are going to continue our discussion of Arche, so let's start with the one

2
00:00:17,870 --> 00:00:18,530
you can think of.

3
00:00:18,530 --> 00:00:22,750
This is analogous to the R-1 process with some minor differences.

4
00:00:23,300 --> 00:00:30,260
Specifically, we're going to say Epsilon of T is equal to zero T times the square root of Omega plus

5
00:00:30,260 --> 00:00:32,990
Alpha one times Epsilon at T minus one.

6
00:00:33,860 --> 00:00:36,310
So let's break down what each part of this means.

7
00:00:37,280 --> 00:00:43,070
Of course, the left side, we know this is just the noise time series we're trying to model on the

8
00:00:43,070 --> 00:00:43,640
right side.

9
00:00:43,640 --> 00:00:46,430
We start with ZT in March.

10
00:00:46,430 --> 00:00:53,420
This of T takes on the role of the unpredictable noise will assume it has mean zero and variance one,

11
00:00:54,050 --> 00:00:57,340
for example, it could be in zero one the standard normal.

12
00:00:58,010 --> 00:01:02,750
In fact, most of the time we'll just assume that it's in zero one, although it's not necessary.

13
00:01:03,860 --> 00:01:04,910
Next we have Omega.

14
00:01:05,540 --> 00:01:07,790
You can think of Omega like the bias term.

15
00:01:08,630 --> 00:01:10,620
After Omega, we have Alpha one.

16
00:01:11,540 --> 00:01:16,100
So in March we use Alpha as our symbol for the auto regressive coefficients.

17
00:01:16,550 --> 00:01:21,290
Since this is a large one, we'll just have one auto regressive coefficient called out for what?

18
00:01:22,400 --> 00:01:27,170
Finally, we have the previous Noisome and the Time series, which is Epsilon at T minus one.

19
00:01:32,000 --> 00:01:37,790
Now, one alternative way to look at the arch model is to simply square all the terms when we do it

20
00:01:37,790 --> 00:01:40,340
this way, it looks a lot more like an air one.

21
00:01:41,150 --> 00:01:46,490
Basically, instead of building a linear model of Epsilon of T, we're building a linear model of Epsilon

22
00:01:46,490 --> 00:01:47,450
of T squared.

23
00:01:48,200 --> 00:01:53,480
Furthermore, the unpredictable noise terms of T is multiplicative rather than additive.

24
00:01:58,190 --> 00:02:03,720
So at this point, we're going to introduce a new symbol, sigma of tea, as you might expect.

25
00:02:03,740 --> 00:02:07,990
We want sigma squared of tea to represent the variance of epsilon of tea.

26
00:02:08,930 --> 00:02:11,460
So to make sure that this relationship is true.

27
00:02:11,780 --> 00:02:16,580
We're going to say that Epsilon of tea is equal to Z of Tea Times, Sigma of Tea.

28
00:02:17,480 --> 00:02:19,100
Now, why does this make sense?

29
00:02:19,670 --> 00:02:23,840
Well, if it doesn't make sense to you right away, I would encourage you to try this experiment in

30
00:02:23,840 --> 00:02:24,470
Python.

31
00:02:25,920 --> 00:02:31,230
Firstly, I think we can all agree that if you call NPE not random, not random, then you will get

32
00:02:31,230 --> 00:02:36,060
a random sample from the standard normal distribution with mean zero and variance one.

33
00:02:37,020 --> 00:02:41,720
Now, suppose I take this random sample and I multiply it by some number, say three.

34
00:02:42,450 --> 00:02:48,480
If I do this, the result will be that this new random sample will have variants of three squared or

35
00:02:48,480 --> 00:02:48,960
nine.

36
00:02:49,830 --> 00:02:55,160
Now, again, I encourage you to try this yourself, draw a bunch of random samples using and beat out

37
00:02:55,170 --> 00:02:58,740
random random and multiply all those samples by three.

38
00:03:00,240 --> 00:03:04,690
You should be able to confirm that the variance of these samples is approximately nine.

39
00:03:05,400 --> 00:03:11,280
And because of this, we can conclude that if we take Z of T, which is a sample from the standard normal

40
00:03:11,520 --> 00:03:17,580
and we multiply that by Sigma of T, the result will be a sample from the normal with mean zero and

41
00:03:17,580 --> 00:03:19,200
variance sigma squared of T.

42
00:03:23,920 --> 00:03:25,810
So what else can we do with this new symbol?

43
00:03:26,680 --> 00:03:32,310
Well, let's take the model we had before and let's suppose that we move the zaev down to the left side,

44
00:03:33,070 --> 00:03:36,080
of course, given the relationship we just learned about.

45
00:03:36,280 --> 00:03:40,660
We know that Epsilon of T divided by society is just Sigma T..

46
00:03:41,410 --> 00:03:47,830
So what we really end up with is that sigma squared of T is equal to this linear equation of epsilon

47
00:03:47,830 --> 00:03:49,180
squared of T minus one.

48
00:03:49,990 --> 00:03:51,580
So what is this really saying?

49
00:03:52,120 --> 00:03:55,870
It's saying that the arch model is really a linear model of variance.

50
00:03:56,350 --> 00:04:00,270
The variance is a linear function of previous squared errors.

51
00:04:04,900 --> 00:04:09,710
Now, it's important to recognize that Gargash models are not directly analogous to Arima.

52
00:04:10,330 --> 00:04:14,710
We know that there must be some constraints since this variance must be non-negative.

53
00:04:15,340 --> 00:04:17,770
In particular, notice how we have a square root.

54
00:04:18,760 --> 00:04:23,740
Since we want the thing inside the square root to be non-negative, we must have that omega's greater

55
00:04:23,740 --> 00:04:27,010
than zero and alpha one is greater than or equal to zero.

56
00:04:28,290 --> 00:04:34,020
In addition, if we want the air to be stationary with finite variance, we need Alpha1 to be strictly

57
00:04:34,020 --> 00:04:34,990
less than one.

58
00:04:36,330 --> 00:04:42,030
So you can see that if one were to be greater than or equal to one, the variance could explode.

59
00:04:46,710 --> 00:04:51,280
OK, so at this point, we've determined how we can model that conditional variants of Epsilon.

60
00:04:51,960 --> 00:04:57,150
That is what is the variance of Epsilon at time t given the previous values of Epsilon.

61
00:04:58,110 --> 00:05:02,540
One further question we can ask is what is the unconditional variance of Epsilon?

62
00:05:03,240 --> 00:05:08,410
That is, if we do not know anything about the other values, what is the variance of Epsilon?

63
00:05:09,630 --> 00:05:15,170
Another way of thinking about this is what is the variance of epsilon infinitely far into the future?

64
00:05:15,810 --> 00:05:22,710
Let's call the results simply Sigma Squared without any subscript t in this case, we can simply take

65
00:05:22,710 --> 00:05:27,390
the unconditional expectation of both sides on the right side.

66
00:05:27,390 --> 00:05:32,340
Since Zaev in Ypsilanti are independent, we can separate these expectations.

67
00:05:33,120 --> 00:05:35,930
Furthermore, recall that the variance of ZT is one.

68
00:05:37,050 --> 00:05:40,190
Finally, we can rearrange and isolate sigma squared.

69
00:05:40,800 --> 00:05:42,450
We get that sigma squared.

70
00:05:42,450 --> 00:05:47,950
The unconditional variance of epsilon is omega divided by one minus Alpha one.

71
00:05:48,930 --> 00:05:52,080
So this is another reason for Alpha one to be less than one.

72
00:05:52,710 --> 00:05:56,250
If it were not less than one than this result would be nonsensical.

73
00:05:57,300 --> 00:06:00,660
As a side note, please be aware that I'm leaving out some details.

74
00:06:00,960 --> 00:06:06,210
When Alpha one is one, the variance is infinite, but you may still get a stationary series.

75
00:06:06,720 --> 00:06:11,820
In any case, this and most of the math behind Gurche is considered to be outside the scope of this

76
00:06:11,820 --> 00:06:12,540
cause.

77
00:06:17,100 --> 00:06:22,380
One thing we'll be able to observe in the code is that if we forecast for a long time into the future,

78
00:06:22,590 --> 00:06:25,170
our forecasts will actually converge to this value.

79
00:06:25,950 --> 00:06:31,590
So what this means is that Arche and Gurche are useful for short term forecasts of the variance.

80
00:06:32,130 --> 00:06:37,680
As with all other financial times, there is models we typically cannot forecast far into the future.

81
00:06:38,370 --> 00:06:43,650
This makes sense since we can't expect a model like this to predict something like Elon Musk making

82
00:06:43,650 --> 00:06:48,060
a tweet about Bitcoin or some country banning a cryptocurrency exchange.

83
00:06:48,690 --> 00:06:51,060
Another obvious example is covid-19.

84
00:06:51,690 --> 00:06:55,550
Clearly, Gargash is not going to be able to predict these world scale events.

85
00:06:56,130 --> 00:06:59,360
These events are really what caused large deviations in the variance.

86
00:06:59,730 --> 00:07:04,490
But of course, without insider trading, you probably wouldn't know about them beforehand.

87
00:07:09,220 --> 00:07:13,180
OK, so the final thing I want to discuss in this lecture is the RFP.

88
00:07:14,050 --> 00:07:19,840
Basically, it's a very simple extension of the arch one where we simply let Epsilon of t depend on

89
00:07:19,840 --> 00:07:21,920
people's legs instead of just one.

90
00:07:22,570 --> 00:07:25,370
So you can think of this as being analogous to AARP.

91
00:07:26,320 --> 00:07:31,320
As usual, we can use tools like the AIC to help us determine the best value of P.
