WEBVTT

00:01.260 --> 00:06.900
When it comes to online investigations, one thing that's always a problem is that if we're investigating

00:06.900 --> 00:14.010
a website, a forum post or a social media post, that post might be deleted.

00:14.430 --> 00:22.320
Web site may be deleted, altered or in some other way modified from the original content.

00:23.580 --> 00:29.880
No, if we if we're able to get to the site or the forum post before that happens, we always could

00:29.880 --> 00:31.440
do things like screenshots.

00:32.220 --> 00:38.040
We might be able to do news program like H Drag to copy the web site, but what do we do if we get to

00:38.040 --> 00:39.330
the page at a later date?

00:40.760 --> 00:46.340
We can use things such as the Internet Archive to potentially pull up a archived version of that or

00:46.340 --> 00:47.420
even create her own.

00:48.230 --> 00:49.370
So let's take a look at it.

00:50.150 --> 00:57.140
So if we go to the website archive.org, we could find a non-profit basically internet library here.

00:58.470 --> 01:00.190
So as we see here, it's a nonprofit.

01:00.660 --> 01:03.990
They have millions of free books, movies, software, web sites and more.

01:05.040 --> 01:06.060
And we can scroll down here.

01:06.060 --> 01:15.690
We can see various things like on the Android APK archive, we could see music, we could see religious

01:16.020 --> 01:19.080
religion, television archives and so on and so forth.

01:20.280 --> 01:22.050
So this is a pretty cool website.

01:22.060 --> 01:30.510
And if I type in something in the Wayback Machine here and say, I look through Yahoo Yahoo.com, I

01:30.510 --> 01:41.160
can eat the energy and in here I'll show the various archives they have for Yahoo dating back from 1996

01:41.160 --> 01:41.940
up to the present.

01:43.680 --> 01:50.580
So we could see in this chart here in the denser the graph, the more the more snapshots they have of

01:50.580 --> 01:50.720
it.

01:50.730 --> 01:59.070
So if I go back to 1996 here and click in here and we could see a calendar here and we see that there's

01:59.070 --> 02:02.010
nothing from January through September here.

02:02.040 --> 02:04.740
However, October we see some snapshots in here.

02:05.520 --> 02:08.790
If we mouse over here, we could see one snapshot of October 20th.

02:09.760 --> 02:15.730
And we could see seven snapshots on October 17, 1996, so let's go ahead and take a look at this.

02:16.720 --> 02:17.740
So if we click in here.

02:20.870 --> 02:23.870
We could take a look at what Yahoo would look like in 1996.

02:24.830 --> 02:29.510
And as you can see here, the site looks pretty different from what it did on currently.

02:29.570 --> 02:31.040
So this is actually really cool.

02:31.040 --> 02:37.820
So we could actually look at historical data because the Internet Archive or Wayback Machine actually

02:37.820 --> 02:44.810
took these different snapshots of the website during that time, which again makes it really handy because

02:44.810 --> 02:51.470
someone could actually just delete out a website, delete a web page or whatnot, and normally it would

02:51.470 --> 02:54.170
be gone with so on archived and we have access to it.

02:54.920 --> 02:58.570
Well, with the Internet Archive, we could potentially grab those archives.

02:58.580 --> 03:05.290
So like in here, we see this is July 31st, 2011, and we could actually go in here.

03:05.300 --> 03:09.110
We can see the sidebar here wouldn't look like we could see the various headlines.

03:09.110 --> 03:09.920
And then in here.

03:11.620 --> 03:13.000
So pretty handy.

03:13.990 --> 03:16.270
So let's take a look at another way we can do this.

03:16.840 --> 03:22.360
So here I have the our website disposable heroes diggers.

03:23.170 --> 03:30.010
And if I go to Wayback, the Wayback Machine, that web dot archive.org, we can create our own snapshot

03:30.010 --> 03:30.400
in here.

03:31.420 --> 03:35.860
So I just need to enter a URL that we're going to, that we're going to copy here.

03:35.880 --> 03:37.180
I'm just going to copy this here.

03:39.290 --> 03:39.660
OK.

03:40.520 --> 03:45.320
And we're just going to paste in here and we could do save page.

03:46.110 --> 03:46.880
Now it's.

03:49.030 --> 03:56.720
Now, we need to be sure that if the pages are HTP s that you want to remove that, I'm just going to

03:56.720 --> 03:59.740
uncheck the save error page and I'm going to click Save.

04:01.990 --> 04:06.270
OK, and this is going to take a while, so it's going to start archiving that page for it.

04:06.280 --> 04:12.160
So if anything happened to that page, we could actually grab a copy of it.

04:12.670 --> 04:15.310
So I do that ahead of time because this could take a while.

04:16.270 --> 04:22.710
I've seen this take anywhere between 15 minutes to a couple hours for it to actually archive.

04:22.960 --> 04:24.880
So fortunately, this was actually pretty quick.

04:24.930 --> 04:27.460
The Dugas pages are really huge.

04:28.150 --> 04:31.650
And here is a here's another one here.

04:31.660 --> 04:32.830
Let me just click on here.

04:33.880 --> 04:35.470
And we could take a look at the page.

04:37.360 --> 04:43.990
So now if I delete this page, if I modify this page from this point on, people could potentially go

04:43.990 --> 04:44.410
in here.

04:44.410 --> 04:48.820
They can grab that snapshot and see what the page look like at that time and date.

04:50.080 --> 04:57.070
So again, having the historical data is actually really cool and really important for for ocean investigations.

04:57.280 --> 05:02.370
So again, this was the Internet Archive, and we could find if we want to make a copy, we can go to

05:02.440 --> 05:04.300
Web Dot Archive.org.

05:04.780 --> 05:11.370
Otherwise we could find the Wayback Machine at at the archive.org.

05:11.380 --> 05:12.610
So thank you for watching.

05:12.640 --> 05:13.180
I'll see you next.
