The tenth crawl of the Portuguese web (incremental), performed in 2011 (AWP10).
- Publication date
- Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications, pwacrawlid:AWP10, 2011
- Portuguese Web Archive
All items of the AWP10 incremental crawl are identified by the custom field pwacrawlid:AWP10. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/). The previous complete crawl, performed without DeDuplicator, is identified by pwacrawlid:AWP7. Files in AWP10 that duplicated files already in AWP7 were not archived. To see the complete content of AWP10 (e.g. pages containing images duplicated from AWP7), it must be combined with AWP7.
- 2014-07-11 02:37:54
- 76 710 879 web files (2.1 TB), incrementally crawled between 17 May 2011 and 17 June 2011, mainly from the .PT domain and from web sites crawled during the previous crawl.
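The need to combine AWP10 with AWP7 can be illustrated with a minimal sketch. It assumes two hypothetical in-memory indexes (`awp7_full` and `awp10_incremental`) that map a URL to its archived bytes; the names, the example URLs, and the `resolve` helper are illustrative assumptions and are not part of the Portuguese Web Archive or DeDuplicator. The sketch also simplifies by treating every URL missing from AWP10 as a possible duplicate, without checking whether it was actually visited during the AWP10 crawl.

```python
from typing import Optional

# Hypothetical stand-in for the complete AWP7 crawl: URL -> archived bytes.
awp7_full = {
    "http://example.pt/": b"<html>home v1</html>",
    "http://example.pt/logo.png": b"PNG-bytes",
}

# Hypothetical stand-in for the incremental AWP10 crawl.
awp10_incremental = {
    # The page changed since AWP7, so a new copy was archived.
    "http://example.pt/": b"<html>home v2</html>",
    # logo.png was unchanged (a duplicate), so it has no entry here.
}


def resolve(url: str) -> Optional[bytes]:
    """Return the AWP10 copy of a URL, falling back to AWP7 for duplicates."""
    if url in awp10_incremental:
        return awp10_incremental[url]
    # Not stored in AWP10: use the AWP7 copy if one exists, else None.
    return awp7_full.get(url)


if __name__ == "__main__":
    for url in ("http://example.pt/", "http://example.pt/logo.png"):
        print(url, resolve(url))
```

In this sketch the changed page is served from the incremental index, while the unchanged image is only available through the fallback to the complete crawl.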
In collections: Arquivo.pt: the Portuguese web-archive
Uploaded by Daniel Gomes on