Fix Broken Links Web Crawls : Free Web : Download & Streaming : Inter…

archived 27 Feb 2018 05:19:47 UTC
Skip to main content
Search the history of over 310 billion web pages on the Internet.
Wayback Machine

Fix Broken Links Web Crawls

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites.

Share This Collection

73,562
RESULTS
rss


Media Type
4
collections
73,558
web
Year
2,063
2018
20,890
2017
26,524
2016
10,741
2015
8,544
2014
4,796
2013
More right-solid
Topics & Subjects
73,558
crawldata
36,228
no404
18,420
wordpress
17,056
wikipedia
752
search
1
GDELT
More right-solid
Collection
More right-solid
Creator
73,558
internet archive
SHOW DETAILS
up-solid
down-solid
eye
Title
Date Archived
Creator
303.8M 304M
Wikipedia Near Real Time (from IRC)
collection
16,414
ITEMS
303.8M
VIEWS
Sep 23, 2013 09/13
collection
eye 303.8M
This is a collection of web page captures from links added to, or changed on, Wikipedia pages. The idea is to bring a reliability to Wikipedia outlinks so that if the pages referenced by Wikipedia articles are changed, or go away, a reader can permanently find what was originally referred to. This is part of the Internet Archive's attempt to rid the web of broken links .
Topics: Wikipedia, Wikimedia
149.9M 150M
Wordpress Blogs and the Pages They Link To
collection
18,420
ITEMS
149.9M
VIEWS
Sep 11, 2013 09/13
collection
eye 149.9M
This is a collection of pages and embedded objects from WordPress blogs and the external pages they link to. Captures of these pages are made on a continuous basis seeded from a feed of new or changed pages hosted by Wordpress.com or by Wordpress pages hosted by sites running a properly configured Jetpack wordpress plugin.
Topics: Wordpress.com, blogs, jetpack
110.7M 111M
GDELT
collection
37,306
ITEMS
110.7M
VIEWS
Aug 27, 2014 08/14
collection
eye 110.7M
A daily crawl of more than 200,000 home pages of news sites, including the pages linked from those home pages. Site list provided by The GDELT Project
Topics: GDELT, News
Wikipedia Near Real Time (from IRC)
2M 2.0M
web
eye 2M
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri May 5 03:05:22 PDT 2017 to Fri May 5 20:22:10 PDT 2017.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
615,581 616K
web
eye 615,581
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Wed Oct 30 21:19:56 PDT 2013 to Wed Oct 30 15:58:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
553,652 554K
web
eye 553,652
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jun 27 02:35:05 PDT 2015 to Fri Jun 26 21:13:31 PDT 2015.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
351,366 351K
web
eye 351,366
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 01:48:56 PDT 2013 to Tue Sep 10 19:39:40 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
297,715 298K
web
eye 297,715
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Nov 4 01:08:35 PST 2013 to Sun Nov 3 18:31:38 PST 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
264,123 264K
web
eye 264,123
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Fri Nov 8 18:07:43 PST 2013 to Fri Nov 8 11:24:54 PST 2013.
Topics: no404, wordpress, crawldata
GDELT
246,860 247K
web
eye 246,860
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Jun 5 12:10:10 PDT 2016 to Sun Jun 5 06:42:33 PDT 2016.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
244,010 244K
web
eye 244,010
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Oct 7 06:39:20 PDT 2013 to Mon Oct 7 01:07:00 PDT 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
243,117 243K
web
eye 243,117
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Oct 9 13:36:49 PDT 2013 to Wed Oct 9 07:59:25 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
229,629 230K
web
eye 229,629
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Oct 11 18:57:27 PDT 2013 to Fri Oct 11 18:27:42 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
221,133 221K
web
eye 221,133
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 18:12:47 PST 2013 to Fri Nov 8 11:19:28 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
219,623 220K
web
eye 219,623
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Jan 11 20:40:00 PST 2015 to Sun Jan 11 17:30:17 PST 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
217,259 217K
web
eye 217,259
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Apr 13 23:31:41 PDT 2015 to Mon Apr 13 18:01:32 PDT 2015.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
217,210 217K
web
eye 217,210
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 2 21:56:25 PST 2013 to Mon Dec 2 15:29:07 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
215,783 216K
web
eye 215,783
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jan 10 20:28:55 PST 2015 to Sat Jan 10 14:35:49 PST 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
209,250 209K
web
eye 209,250
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Thu Mar 13 15:39:54 PDT 2014 to Thu Mar 13 10:29:53 PDT 2014.
Topics: no404, wikipedia, crawldata
GDELT
206,506 207K
web
eye 206,506
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 16 10:27:47 PDT 2015 to Thu Jul 16 04:43:26 PDT 2015.
Topic: crawldata
Wikipedia Near Real Time (from IRC)
199,397 199K
web
eye 199,397
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Nov 2 18:33:00 PDT 2013 to Sat Nov 2 13:05:27 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
196,688 197K
web
eye 196,688
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 02:25:02 PDT 2013 to Tue Sep 10 20:16:15 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
195,580 196K
web
eye 195,580
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Nov 8 17:40:25 PST 2013 to Fri Nov 8 17:19:48 PST 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
192,523 193K
web
eye 192,523
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 02:06:46 PDT 2013 to Fri Oct 11 20:57:12 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
179,987 180K
web
eye 179,987
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Sep 26 10:30:15 PDT 2014 to Fri Sep 26 16:41:01 PDT 2014.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
177,803 178K
web
eye 177,803
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 17:30:01 PST 2013 to Fri Nov 8 10:39:57 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
176,575 177K
web
eye 176,575
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 05:10:05 PDT 2013 to Fri Oct 11 23:33:01 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
175,252 175K
web
eye 175,252
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 26 06:02:13 PDT 2014 to Sun Oct 26 00:44:36 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
172,521 173K
web
eye 172,521
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 01:17:32 PDT 2013 to Fri Oct 11 19:35:18 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
171,067 171K
web
eye 171,067
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 9 02:29:07 PST 2013 to Sun Dec 8 19:51:18 PST 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
170,721 171K
web
eye 170,721
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Tue Oct 28 01:11:50 PDT 2014 to Mon Oct 27 19:49:07 PDT 2014.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
170,252 170K
web
eye 170,252
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 23:03:50 PDT 2013 to Sun Sep 22 17:38:17 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
169,934 170K
web
eye 169,934
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Sep 27 21:12:16 PDT 2013 to Fri Sep 27 15:37:52 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
166,923 167K
web
eye 166,923
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 25 13:38:01 PDT 2014 to Sat Oct 25 08:30:18 PDT 2014.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
166,410 166K
web
eye 166,410
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 09:26:30 PDT 2013 to Wed Sep 11 03:51:47 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
165,617 166K
web
eye 165,617
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 03:09:48 PDT 2013 to Fri Oct 11 21:36:24 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
165,289 165K
web
eye 165,289
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Dec 30 23:11:37 PST 2014 to Tue Dec 30 17:11:10 PST 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
163,459 163K
web
eye 163,459
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 22:05:29 PDT 2013 to Sun Oct 13 16:25:33 PDT 2013.
Topics: no404, wikipedia, crawldata
GDELT
161,827 162K
web
eye 161,827
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 30 02:42:20 PDT 2015 to Wed Jul 29 20:39:49 PDT 2015.
Topic: crawldata
GDELT
159,807 160K
web
eye 159,807
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 30 00:44:10 PDT 2015 to Wed Jul 29 18:56:39 PDT 2015.
Topic: crawldata
GDELT
159,323 159K
web
eye 159,323
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl892.us.archive.org:gdelt from Wed Feb 3 05:18:05 PST 2016 to Wed Feb 3 01:37:40 PST 2016.
Topic: crawldata
Wikipedia Near Real Time (from IRC)
158,422 158K
web
eye 158,422
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 04:03:47 PDT 2013 to Fri Oct 11 22:24:49 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
157,299 157K
web
eye 157,299
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 02:43:39 PDT 2013 to Sat Sep 21 21:49:05 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
157,242 157K
web
eye 157,242
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 22:25:59 PDT 2013 to Sat Sep 21 18:13:45 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
156,434 156K
web
eye 156,434
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 23:53:08 PDT 2013 to Sat Sep 21 19:42:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
155,613 156K
web
eye 155,613
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 25 12:43:51 PDT 2014 to Sat Oct 25 07:04:01 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
151,034 151K
web
eye 151,034
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Feb 15 10:36:00 PST 2015 to Sun Feb 15 05:04:15 PST 2015.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
148,573 149K
web
eye 148,573
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 19:29:44 PST 2013 to Fri Nov 8 12:30:08 PST 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
147,620 148K
web
eye 147,620
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Thu Oct 10 04:04:33 PDT 2013 to Wed Oct 9 22:07:10 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
147,482 147K
web
eye 147,482
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 13:26:15 PDT 2013 to Sat Oct 12 07:59:33 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
145,923 146K
web
eye 145,923
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Sep 9 20:54:31 PDT 2013 to Mon Sep 9 22:15:49 PDT 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
142,593 143K
web
eye 142,593
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 02:57:31 PDT 2013 to Tue Sep 10 20:46:44 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
141,782 142K
web
eye 141,782
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 07:38:37 PDT 2013 to Sat Oct 12 02:15:16 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
141,451 141K
web
eye 141,451
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 12:19:50 PDT 2013 to Sat Sep 21 07:00:20 PDT 2013.
Topics: no404, wikipedia, crawldata
Fix Broken Links Web Crawls
141,179 141K
web
eye 141,179
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl810.us.archive.org:wideaux from Mon Apr 4 10:54:40 PDT 2016 to Tue Apr 5 04:55:27 PDT 2016.
Topics: no404, search, crawldata
Wikipedia Near Real Time (from IRC)
138,289 138K
web
eye 138,289
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 26 05:06:16 PDT 2014 to Sat Oct 25 23:40:30 PDT 2014.
Topics: no404, wikipedia, crawldata
GDELT
138,181 138K
web
eye 138,181
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Oct 2 13:26:12 PDT 2015 to Fri Oct 2 07:16:23 PDT 2015.
Topic: crawldata
Wikipedia Near Real Time (from IRC)
134,274 134K
web
eye 134,274
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Feb 2 10:16:09 PST 2014 to Sun Feb 2 04:00:31 PST 2014.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
133,393 133K
web
eye 133,393
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Mar 22 04:54:50 PDT 2016 to Mon Mar 21 23:42:16 PDT 2016.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
132,687 133K
web
eye 132,687
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 11:08:48 PDT 2013 to Sat Oct 12 06:01:41 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
132,354 132K
web
eye 132,354
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Dec 15 11:40:44 PST 2013 to Sun Dec 15 05:32:44 PST 2013.
Topics: no404, wikipedia, crawldata
GDELT
131,508 132K
web
eye 131,508
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat May 6 16:20:29 PDT 2017 to Sat May 6 10:25:37 PDT 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
129,691 130K
web
eye 129,691
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 01:00:19 PDT 2013 to Tue Sep 10 19:09:02 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
129,237 129K
web
eye 129,237
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 06:01:08 PDT 2013 to Sat Oct 12 00:24:12 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
128,793 129K
web
eye 128,793
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Nov 4 03:20:28 PST 2013 to Sun Nov 3 20:09:08 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
128,488 128K
web
eye 128,488
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Thu Sep 26 18:48:25 PDT 2013 to Thu Sep 26 13:29:27 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
127,494 127K
web
eye 127,494
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 10:13:29 PDT 2013 to Sat Oct 12 04:53:58 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
126,933 127K
web
eye 126,933
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Dec 8 22:11:01 PST 2013 to Sun Dec 8 16:34:40 PST 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
126,470 126K
web
eye 126,470
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 12:22:18 PDT 2013 to Sat Oct 12 06:45:42 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
126,274 126K
web
eye 126,274
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Tue Oct 8 00:49:08 PDT 2013 to Mon Oct 7 18:50:02 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
125,512 126K
web
eye 125,512
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 15:39:00 PDT 2013 to Sun Oct 13 10:13:45 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
124,068 124K
web
eye 124,068
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 23:06:38 PDT 2013 to Sun Oct 13 17:41:00 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
122,290 122K
web
eye 122,290
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Fri Nov 8 17:19:15 PST 2013 to Fri Nov 8 10:31:17 PST 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
121,237 121K
web
eye 121,237
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Oct 18 06:02:22 PDT 2013 to Fri Oct 18 00:35:25 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
120,644 121K
web
eye 120,644
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 07:51:48 PDT 2013 to Sun Oct 13 02:21:02 PDT 2013.
Topics: no404, wikipedia, crawldata
MORE RESULTS
Fetching more results
DESCRIPTION
These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites.
ACTIVITY

Created on
September 12
2013
ARossi
Archivist
ADDITIONAL CONTRIBUTORS
VIEWS

594,804,514

ITEMS

73,536

TOP REGIONS (LAST 30 DAYS) – BETA

(data not available)
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%