Incremental crawl of the Portuguese web performed between 30 December 2011 and 28 February 2012 mainly from .PT domain. The AWP12 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP12 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 17 May 2011 and 17 June 2011 mainly from .PT domain. The AWP10 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP7 as baseline. Thus, the files that remained unchanged from the AWP7 complete crawl were not archived (duplicated) on the AWP10 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Sixth collection FAWP. With Deduplicator.
Topics: Frequent crawl of news media from Portuguese web, Portuguese Web Archive, Portuguese online...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 30 May 2016 and 3 August 2016 mainly from .PT domain. The AWP21 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP20 as baseline. Thus, the files that remained unchanged from the AWP20 complete crawl were not archived (duplicated) on the AWP21 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty Three collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Sixth collection FAWP. With Deduplicator.
Topics: Frequent crawl of news media from Portuguese web, Portuguese Web Archive, Portuguese online...
Complete crawl of the Portuguese web performed between 5 November 2013 and 13 January 2014 mainly from .PT domain. The AWP15 crawl did NOT use DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/).
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 23 September 2014 and 24 October 2014 mainly from .PT domain. The AWP16 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP15 as baseline. Thus, the files that remained unchanged from the AWP15 complete crawl were not archived (duplicated) on the AWP16 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty one collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty one collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Complete crawl of the Portuguese web performed between 1 January 2017 and 7 May 2017 mainly from .PT domain. The AWP23 crawl did NOT use DeDuplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-fourth collection AWP. With Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 23 September 2014 and 24 October 2014 mainly from .PT domain. The AWP16 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP15 as baseline. Thus, the files that remained unchanged from the AWP15 complete crawl were not archived (duplicated) on the AWP16 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 23 September 2014 and 24 October 2014 mainly from .PT domain. The AWP16 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP15 as baseline. Thus, the files that remained unchanged from the AWP15 complete crawl were not archived (duplicated) on the AWP16 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 23 September 2014 and 24 October 2014 mainly from .PT domain. The AWP16 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP15 as baseline. Thus, the files that remained unchanged from the AWP15 complete crawl were not archived (duplicated) on the AWP16 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 13 August 2015 and 5 November 2015 mainly from .PT domain. The AWP18 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP17 as baseline. Thus, the files that remained unchanged from the AWP17 complete crawl were not archived (duplicated) on the AWP18 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Complete crawl of the Portuguese web performed between 5 November 2013 and 13 January 2014 mainly from .PT domain. The AWP15 crawl did NOT use DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/).
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 30 May 2016 and 3 August 2016 mainly from .PT domain. The AWP21 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP20 as baseline. Thus, the files that remained unchanged from the AWP20 complete crawl were not archived (duplicated) on the AWP21 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Thirty two collection AWP. Without Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Incremental crawl of the Portuguese web performed between 12 November 2015 and 5 January 2015 mainly from .PT domain. The AWP19 crawl is incremental because it was performed using DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/) taking the content of AWP18 as baseline. Thus, the files that remained unchanged from the AWP18 complete crawl were not archived (duplicated) on the AWP19 incremental crawl.
Topics: Incremental crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Complete crawl of the Portuguese web performed between 5 November 2013 and 13 January 2014 mainly from .PT domain. The AWP15 crawl did NOT use DeDuplicator (http://landsbokasafn.github.io/DeDuplicator/).
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...
Twenty-seventh collection AWP. No Deduplicator.
Topics: Complete crawl of the Portuguese web, Portuguese Web Archive, Portuguese online publications,...