WEBVTT

00:00.810 --> 00:02.160
Greetings, everyone.

00:02.160 --> 00:07.380
Recently I had a question about how to scrape a page with authentication.

00:07.380 --> 00:10.230
So that's what I'm going to look at in this section.

00:10.230 --> 00:14.430
We're going to look at two methods to scrape a page with authentication.

00:14.430 --> 00:19.860
The first is with request, and the other one is going to be with Puppeteer.

00:20.010 --> 00:25.620
So let's take a look first at the page that we're going to access with authentication and scrape.

00:25.620 --> 00:30.750
So it's going to be our good old faithful Craigslist that we all know by now.

00:30.900 --> 00:36.870
And we're going to log in to Craigslist from "my account" over here.

00:38.090 --> 00:42.080
And if you don't have an account already, just go ahead and create one.

00:42.080 --> 00:44.330
It's really fast and easy to do.

00:44.600 --> 00:54.770
So once we log in, we get to this page where we can access our postings

00:54.770 --> 00:55.810
that we made.

00:55.820 --> 01:06.110
And our objective in this section is basically going to be to log in and then access the billing page,

01:06.110 --> 01:08.030
the tab we have over here.

01:09.320 --> 01:15.380
So that's going to be our objective, and it's going to be similar to other scraping

01:15.380 --> 01:21.830
projects that you might do in the future where you need to log in first and then access some other page

01:21.830 --> 01:23.390
when you are logged in.

01:23.600 --> 01:30.410
So now first I'm going to talk about how Craigslist is authenticating us, and then we're going to look

01:30.440 --> 01:33.350
at how to do this inside of request.
