WEBVTT

00:01.180 --> 00:02.590
What is up, everyone?

00:02.590 --> 00:11.230
In this section we are going to look into how to scrape the guest allowed and the bedrooms, beds and

00:11.230 --> 00:12.220
baths.

00:12.220 --> 00:22.180
Now, um, in the previous section we got the price per night using this CSS selector, which has a

00:22.180 --> 00:25.240
lot of do's and spans and so on.

00:25.240 --> 00:30.970
But I found out that you can't really do the same thing with the guest allowed.

00:31.530 --> 00:43.120
Um, instead we have to use some regular expressions because we can't just define an how many where,

00:43.390 --> 00:51.220
where the information is in the Dom tree because I think there's different versions you can get.

00:51.220 --> 00:56.200
There's seems to be three different versions of the site you can get.

00:56.200 --> 01:03.680
So sometimes I found out that one version would work, sometimes another one would work.

01:03.680 --> 01:08.240
So I came up to three different versions for each of these.

01:08.840 --> 01:13.820
And, um, yeah, so you could of course do that.

01:13.940 --> 01:21.770
But I think it would be a lot nicer if we started using regular expressions instead to get this data

01:21.890 --> 01:24.110
we want down here.

01:25.050 --> 01:28.650
So basically the page is changing a bit.

01:28.650 --> 01:36.450
So we can't use the these very generic CSS selectors without any classes.

01:36.450 --> 01:43.020
We can't just select it through the Dom tree because the page is changing at random.

01:43.140 --> 01:50.340
I think it has something to do with, for example, an assignment or excuse me, an announcement like

01:50.340 --> 01:52.110
this one we have here.

01:52.800 --> 02:00.180
Um, so yeah, that is why we are going to use regular expressions now to get the guest allowed and

02:00.180 --> 02:01.920
the bedrooms and so on.
