The twenty-first crawl of the Portuguese web (incremental), performed in 2016 (AWP21).
- Publication date
- Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications, pwacrawlid:AWP21, 2016
- Portuguese Web Archive
All items of the AWP21 incremental crawl are identified by the custom field pwacrawlid:AWP21. The AWP21 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/). The previous complete crawl, performed without DeDuplicator, is identified by pwacrawlid:AWP20. Files in AWP21 that duplicated files already in AWP20 were not archived again. To see the complete content of AWP21 (e.g. pages containing images that duplicate those in AWP20), it must be combined with AWP20.
- 2017-02-19 10:23:46
- 193 212 877 web files (7.2 TB) incrementally crawled between 30 May 2016 and 3 August 2016, mainly from the .PT domain and from web sites crawled during the previous crawl.
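The need to combine AWP21 with AWP20 can be illustrated with a minimal sketch. It assumes each crawl is reduced to a lookup table keyed by URL and content digest; the names `CrawlIndex`, `resolve_capture`, `awp20` and `awp21` and the toy data are hypothetical and are not part of the Portuguese Web Archive tooling or of DeDuplicator. A capture is looked up in the incremental crawl first and, if it was skipped there as a duplicate, in the previous complete crawl.

```python
from typing import Dict, Optional, Tuple

# Hypothetical capture key: (URL, content digest).
# Real archive indexes carry more fields (timestamp, file offset, etc.).
CaptureKey = Tuple[str, str]
CrawlIndex = Dict[CaptureKey, bytes]


def resolve_capture(
    key: CaptureKey, incremental: CrawlIndex, base: CrawlIndex
) -> Optional[bytes]:
    """Return the payload for a capture.

    Look in the incremental crawl first; fall back to the previous
    complete crawl when the file was skipped as a duplicate.
    """
    if key in incremental:
        return incremental[key]
    return base.get(key)


# Toy data, invented for illustration (not taken from the crawls).
awp20: CrawlIndex = {("http://example.pt/logo.png", "sha1:AAA"): b"logo-bytes"}
awp21: CrawlIndex = {("http://example.pt/index.html", "sha1:BBB"): b"<img src='logo.png'>"}

# The page is new in the incremental crawl; its image is only in the base crawl.
page = resolve_capture(("http://example.pt/index.html", "sha1:BBB"), awp21, awp20)
logo = resolve_capture(("http://example.pt/logo.png", "sha1:AAA"), awp21, awp20)
```

In this sketch the page is found in the incremental index while its unchanged image is only found in the base index, which is why AWP21 alone does not give the complete content.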
In collections: Arquivo.pt: the Portuguese web-archive
Uploaded by Daniel Gomes on