WEBVTT

00:00.050 --> 00:07.160
So now for a really short intro to those of you who might not know how an HTML table actually is coded,

00:07.190 --> 00:09.560
then I'm just going to go through it briefly here.

00:09.830 --> 00:12.590
So we have an HTML element here.

00:12.590 --> 00:16.370
That's the parent element, the table element up here.

00:16.370 --> 00:19.970
And this contains the table rows.

00:20.690 --> 00:23.450
So each table row is the first one we have.

00:23.450 --> 00:24.530
Here is the headers.

00:24.530 --> 00:26.710
That's the one with the color, the green color.

00:26.720 --> 00:29.140
That's just some styling we added after.

00:29.150 --> 00:37.580
So we have a first table row here which has the elements and the table row elements is just TR that's

00:37.580 --> 00:42.970
the the table table headers, company contact and country.

00:42.980 --> 00:45.020
That's the one that comes up here.

00:45.440 --> 00:50.810
And then we have the table row again, which this time is the TD element.

00:52.210 --> 00:59.470
So the element is table data basically, or like a table cell or table data is probably an easy way

00:59.470 --> 01:00.400
to remember it.

01:02.000 --> 01:06.830
So they contain all of the data inside of the table and that's it.

01:06.830 --> 01:10.710
That's that's all there is to a table in HTML.

01:10.730 --> 01:13.460
We have a table parent element.

01:13.490 --> 01:16.310
It contains all of these elements.

01:16.310 --> 01:22.370
It has a closing element down here as well, and it contains all of the table rows with the header in

01:22.370 --> 01:22.670
top.

01:22.670 --> 01:28.460
Now keep in mind that these HTML tables can be quite strange arranged.

01:28.550 --> 01:34.750
They can have multiple columns, maybe two columns inside, one row and so on.

01:34.760 --> 01:40.700
So they might not be as simple as this one, but this is just a simple example to get you started and

01:40.700 --> 01:48.950
to see how we can scrape a table and most of the tables we see anyway pretty much look like this is

01:48.950 --> 01:54.230
arranged this way with just some columns and some rows, basically in this simple way.

01:55.410 --> 01:58.260
So I'm not going to talk anymore about Dcml now.

01:58.290 --> 02:04.320
Now let's move on to see how our data structure for this data we're going to escape is going to look

02:04.320 --> 02:04.830
like.
