1
00:00:11,100 --> 00:00:16,890
So in this lecture, we will summarize everything we learned in this section and also discuss some ways

2
00:00:17,140 --> 00:00:20,560
has been extended in case you wish to further your studies.

3
00:00:21,330 --> 00:00:24,270
So this section focused on a technique called Gurche.

4
00:00:24,750 --> 00:00:30,150
Unlike all of the other time series methods we learned in this course, Gaja is not focused on learning

5
00:00:30,150 --> 00:00:31,600
the mean of a Time series.

6
00:00:32,040 --> 00:00:36,110
Instead, it's focused on learning about the volatility of the errors.

7
00:00:36,600 --> 00:00:41,160
This makes it completely unique compared to everything else we studied in this course.

8
00:00:41,760 --> 00:00:48,030
It turns out that Gurche is a very popular model for Financial Times series because volatility is one

9
00:00:48,030 --> 00:00:52,410
of the few characteristics of Financial Times series that have predictable structure.

10
00:00:57,130 --> 00:01:02,650
Specifically, we learn that although stock prices look like a random walk, when you plot the action

11
00:01:02,660 --> 00:01:08,440
of the returns, they do not look like a random walk when you plot the action of the squared returns.

12
00:01:08,980 --> 00:01:13,270
This is one factor that differentiates stock prices from random walks.

13
00:01:13,870 --> 00:01:19,480
What we notice when we look at a time series of stock returns is that they tend to exhibit volatility

14
00:01:19,480 --> 00:01:20,090
clustering.

15
00:01:20,890 --> 00:01:25,660
This means that large returns tend to be surrounded by other large returns.

16
00:01:26,530 --> 00:01:30,210
Small returns tend to be surrounded by other small returns.

17
00:01:30,760 --> 00:01:35,000
And of course, this is exactly the kind of behavior that garden is designed for.

18
00:01:35,350 --> 00:01:37,420
It's one of the main properties of garbage.

19
00:01:41,950 --> 00:01:47,890
In this section, we studied the arch one in depth, and we learn that it has many of the same characteristics

20
00:01:47,890 --> 00:01:52,300
of stock returns which make them an appropriate model for Financial Times series.

21
00:01:52,930 --> 00:01:57,940
We then learned about Gurche, which extends the arch by improving its ability to have the variants

22
00:01:57,940 --> 00:02:02,480
persist after looking at the theory behind Arch and Gargash.

23
00:02:02,710 --> 00:02:04,940
We then looked at how to use Gartland Python.

24
00:02:05,800 --> 00:02:10,540
This was hopefully a bit surprising because it's very much unlike the other models we've studied in

25
00:02:10,540 --> 00:02:11,380
this course.

26
00:02:11,950 --> 00:02:18,250
We noted that the forecast isn't really a direct forecast of a time series, but instead a forecast

27
00:02:18,250 --> 00:02:19,090
of variance.

28
00:02:20,470 --> 00:02:23,740
What's interesting about this is we don't really have a target.

29
00:02:24,400 --> 00:02:29,030
That is, there's nothing to compare it to, to check if our model predictions are correct.

30
00:02:29,800 --> 00:02:35,200
So we didn't follow the usual process of training a model and making a forecast and then checking some

31
00:02:35,200 --> 00:02:36,170
metric like the May.

32
00:02:38,150 --> 00:02:43,010
The next thing we did in this section was to look at how garbage can be fitted if you were to do it

33
00:02:43,010 --> 00:02:44,270
yourself from scratch.

34
00:02:44,810 --> 00:02:49,550
This was mostly conceptual, but of course, there's nothing stopping you from actually implementing

35
00:02:49,550 --> 00:02:50,510
what you learned.

36
00:02:51,230 --> 00:02:56,480
One thing we observed was that this opens the door for us to build much more complex models, including

37
00:02:56,480 --> 00:02:58,270
those that use deep neural networks.

38
00:02:58,820 --> 00:03:03,510
So as long as you know how to build a custom lost function in your favorite deep learning library.

39
00:03:03,800 --> 00:03:05,300
This is something you can try.

40
00:03:10,050 --> 00:03:15,780
The final topic of this lecture is to explore some extensions of garbage, although garbage is already

41
00:03:15,780 --> 00:03:16,630
very powerful.

42
00:03:16,890 --> 00:03:20,640
There are some ways it can be improved depending on what you are trying to model.

43
00:03:21,600 --> 00:03:23,620
One such model is called a park.

44
00:03:24,150 --> 00:03:27,520
In fact, this model introduces two new improvements.

45
00:03:28,110 --> 00:03:31,890
The first improvement is that it model something called the leverage effect.

46
00:03:32,490 --> 00:03:38,930
Essentially, what this means is that large negative returns tend to increase volatility compared to

47
00:03:38,940 --> 00:03:41,580
large positive returns of the same magnitude.

48
00:03:42,270 --> 00:03:47,550
This can not be modeled with a simple gurche because garbage can only depend on past the squared error

49
00:03:47,550 --> 00:03:50,250
terms and the square function is symmetric.

50
00:03:50,790 --> 00:03:56,450
That is Epsilon at T minus one has the same influence, whether it is positive or negative.

51
00:03:57,840 --> 00:04:03,810
The way that a park handles this is it introduces this new function where we take the absolute value

52
00:04:03,810 --> 00:04:08,490
of the old error and subtract a parameter gamma multiplied by the old error.

53
00:04:09,300 --> 00:04:12,930
Now this is probably confusing, so it helps to simply look at a picture.

54
00:04:13,740 --> 00:04:14,670
In this picture.

55
00:04:14,670 --> 00:04:20,430
We can see what happens for different values of gamma when gamma equals zero, we can see that both

56
00:04:20,430 --> 00:04:23,280
positive and negative values are symmetric.

57
00:04:23,730 --> 00:04:29,070
When Gamma equals zero point nine, we can see that on the negative side, the influence of these values

58
00:04:29,070 --> 00:04:31,650
are much higher compared to the positive side.

59
00:04:32,220 --> 00:04:37,370
Thus, with a positive gamma negative, returns will have a stronger influence on Sigma.

60
00:04:38,760 --> 00:04:43,480
The second obvious thing to notice about this is that we are not squaring the terms.

61
00:04:43,890 --> 00:04:46,240
Instead, we raise them to the power delta.

62
00:04:46,980 --> 00:04:51,000
So this allows for more flexibility by giving you another parameter to tune.

63
00:04:55,760 --> 00:05:02,030
Yet another way to extend Gargash is to use the multivariate garbage, that is to consider multiple

64
00:05:02,030 --> 00:05:03,590
asset returns at once.

65
00:05:04,310 --> 00:05:09,990
This makes sense since it's very possible that returns of one asset could affect the returns of another.

66
00:05:10,820 --> 00:05:16,010
For example, if there's a sudden movement in Bitcoin, this could transfer over to either as well.

67
00:05:16,700 --> 00:05:19,520
However, this is challenging for several reasons.

68
00:05:20,240 --> 00:05:23,370
One reason is that it gives us many more values to estimate.

69
00:05:24,200 --> 00:05:29,930
As you recall, in multiple dimensions, it's not just the variance we need to know, but the covariance

70
00:05:29,930 --> 00:05:30,740
matrix.

71
00:05:31,130 --> 00:05:37,340
If we have the assets, then we need to estimate the Times D plus one over two values in the covariance

72
00:05:37,340 --> 00:05:38,120
matrix.

73
00:05:38,570 --> 00:05:44,140
This is not quite squared since this matrix is symmetric, but it still does grow quite dramatically.

74
00:05:46,030 --> 00:05:51,880
Another issue is that, as you recall, the volatility is unobserved, so we're basically trying to

75
00:05:51,880 --> 00:05:57,670
estimate a quadratic number of new unobserved variables with only a linear amount of new data.

76
00:05:58,510 --> 00:06:03,220
Now, it's not just a matter of adding more data in the past, since, of course, the stock market

77
00:06:03,220 --> 00:06:06,960
is dynamic, the relationships between assets are always changing.

78
00:06:07,570 --> 00:06:12,210
So the correlation and returns 20 years ago is not the same as it is today.

79
00:06:14,070 --> 00:06:19,900
Finally, we also have to account for the fact that covariance matrices need to be positive, definite.

80
00:06:20,610 --> 00:06:25,170
This is the matrix equivalent of saying that the variance needs to be bigger than zero.

81
00:06:25,650 --> 00:06:30,780
So we can't just pick any numbers for the covariance matrix, but they need to have a special structure.