WEBVTT

00:04.240 --> 00:09.560
In this lecture, we will present the seven variables which we want to extract from the MrBeast YouTube

00:09.560 --> 00:11.600
channel for each of his videos.

00:12.640 --> 00:17.280
Here we have an example of the seven video parameters we are interested in for a particular video.

00:18.120 --> 00:23.240
Going through them briefly: video ID is a unique identifier for each video, title is

00:23.240 --> 00:27.600
the title of the video, and published at is when the video was published.

00:28.000 --> 00:32.280
Duration is how long the video is, and the view, like, and comment counts

00:32.320 --> 00:34.600
we have already seen in the previous lecture.

00:34.880 --> 00:40.040
Now, in order to get these variables when using the YouTube API, we have to clear a few hurdles.

00:40.720 --> 00:47.000
The first hurdle is getting the ID of the playlist that contains all of the channel's uploaded videos.

00:47.320 --> 00:51.080
The second hurdle is using the playlist ID we just found

00:51.120 --> 00:58.160
to get the list of unique video IDs. Once we have these unique IDs, we can then finally get

00:58.160 --> 00:59.720
all the variables we are after.

01:00.200 --> 01:04.920
To help us conceptualize this better, we can use the Russian nesting doll analogy.

01:05.480 --> 01:10.800
If you're not familiar with these dolls, they are a set of usually wooden dolls of decreasing size

01:10.840 --> 01:12.720
placed one inside another.

01:13.040 --> 01:18.100
So if we use this analogy and what we learned in the "Getting Familiar with the YouTube API"

01:18.140 --> 01:18.700
lecture,

01:18.700 --> 01:22.540
where we mentioned resources, we can summarize this lecture as follows.

01:22.900 --> 01:28.260
The biggest doll represents the first hurdle, where we will use the channels resource to get the playlist

01:28.260 --> 01:28.620
ID.

01:28.820 --> 01:34.020
Once we have the playlist ID, we can open the biggest doll, where we will find the next biggest

01:34.020 --> 01:34.380
one.

01:34.900 --> 01:40.620
This represents the list of unique video IDs, which we will get from the playlistItems resource.

01:40.900 --> 01:46.540
When we finally have the list of video IDs, we reach the smallest Russian doll, which relates to the videos

01:46.540 --> 01:47.940
resource,

01:47.940 --> 01:51.340
where we will get the final seven video variables.
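
NOTE
The three dolls described above can be sketched as three small Python helpers.
This is only an illustration, not the code written later in the course: the
function names and the snake_case output keys are hypothetical. The helpers
take already-parsed JSON responses (plain dicts) and assume the field layout
of the YouTube Data API v3 channels, playlistItems, and videos resources.
```python
# Sketch only: pure helpers that read already-parsed API responses.
def uploads_playlist_id(channels_response):
    """Doll 1: the channels resource holds the uploads playlist ID."""
    item = channels_response["items"][0]
    return item["contentDetails"]["relatedPlaylists"]["uploads"]
#
def video_ids(playlist_items_response):
    """Doll 2: the playlistItems resource holds one page of video IDs."""
    items = playlist_items_response["items"]
    return [item["contentDetails"]["videoId"] for item in items]
#
def video_variables(videos_response):
    """Doll 3: the videos resource holds the seven variables per video."""
    rows = []
    for item in videos_response["items"]:
        # Some counts can be absent from a response, so read them defensively.
        stats = item.get("statistics", {})
        rows.append({
            "video_id": item["id"],
            "title": item["snippet"]["title"],
            "published_at": item["snippet"]["publishedAt"],
            "duration": item["contentDetails"]["duration"],
            "view_count": stats.get("viewCount"),
            "like_count": stats.get("likeCount"),
            "comment_count": stats.get("commentCount"),
        })
    return rows
```
Each helper consumes the output of one request, so the playlist ID from the
first feeds the second request, and the video IDs feed the third.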

01:52.340 --> 01:55.380
That's all the theory that we will need to cover. In the coming four lectures,

01:55.380 --> 01:58.940
we will translate what we learned in the previous lectures to Python code.

01:59.340 --> 02:04.860
So if we were to summarize, the first three out of the four remaining lectures represent the three

02:04.900 --> 02:07.180
Russian dolls we introduced in this lecture.

02:07.500 --> 02:14.460
So part one will focus on getting the playlist ID, part two the video IDs, and part three the video

02:14.460 --> 02:14.980
data.

02:15.300 --> 02:21.140
And in the fourth and final lecture, we will save the data we extracted to JSON, ready to be ingested

02:21.140 --> 02:22.260
into the data warehouse.

02:22.940 --> 02:25.940
That's all for now, and I can't wait to start coding this up with you.
