13.1M
13M
Feb 22, 2019
02/19
Feb 22, 2019
Crawls performed by the Internet Archive in 2019 on behalf of the National Library of Australia.
Topics: nla, web, 2019
3M
3.0M
Jan 18, 2019
01/19
Jan 18, 2019
Domain crawl of the Luxembourg web domain (.lu) performed by Internet Archive on behalf of the National Library of Luxembourg / Bibliothèque nationale de Luxembourg in January 2019.
Topic: web
36
36
Sep 20, 2018
09/18
Sep 20, 2018
From October 2 to November 20, 2017, a working group of individuals representing multiple NDSA member institutions and interest groups conducted a survey of organizations in the United States actively involved in, or planning to start, programs to archive content from the Web.
Topics: NDSA, web archiving, digital preservation
8.8M
8.8M
Sep 18, 2018
09/18
Sep 18, 2018
This crawl of online resources of the 115th US Congress was performed on behalf of The United States National Archives & Records
Topic: crawldata
5.1M
5.1M
Jul 10, 2018
07/18
Jul 10, 2018
Domain crawl of the Luxembourg web domain (.lu) performed by Internet Archive on behalf of the National Library of Luxembourg / Bibliothèque nationale de Luxembourg in July 2018.
Topic: web
22.6M
23M
Feb 27, 2018
02/18
Feb 27, 2018
Crawls performed by the Internet Archive in 2018 on behalf of the National Library of Australia.
Topics: nla, web, 2018
17.5M
18M
Jan 18, 2018
01/18
Jan 18, 2018
Domain crawl of the New Zealand web domain (.nz) performed by Internet Archive on behalf of the National Library of New Zealand in January-February, 2018.
Topics: web, nlnz, 2018
5.7M
5.7M
Jun 27, 2017
06/17
Jun 27, 2017
Domain crawl of the Luxembourg web domain (.lu) performed by Internet Archive on behalf of the National Library of Luxembourg / Bibliothèque nationale de Luxembourg in June and July of 2017.
Topics: BNL, web, 2017
354,528
355K
Jun 22, 2017
06/17
Jun 22, 2017
Derivative files from crawling done for the News Measures Research Project, between July 27, 2016 and September 27, 2016. Public access to the archived websites is available here: https://archive-it.org/collections/7520. For this project, Archive-It partnered with researchers at Rutgers University’s School of Communication & Information and the Dewitt Wallace Center for Media and Democracy at Duke University in a project designed to evaluate the health of local media ecosystems as part of...
Topic: web data
6.8M
6.8M
Apr 26, 2017
04/17
Apr 26, 2017
Crawls performed by the Internet Archive of the .id (Indonesia) web domain. This data is not currently publicly accessible.
Topics: web, 2017
35.3M
35M
Feb 3, 2017
02/17
Feb 3, 2017
Crawls performed by the Internet Archive in 2017 on behalf of the National Library of Australia.
Topic: nla web 2017