*** Welcome to piglix ***

List of Web archiving initiatives


This page contains a list of Web archiving initiatives worldwide. For easier reading, the information is divided in three tables: web archiving initiatives, archived data, and access methods.

Access: Wayback, Lucene

The Web Archiving Bucket provides set of tools to help archivists and professionals in their daily work.

mitigation as well as the legal function. On-demand manual capture provides clients with the ability to capture a fully functioning page or series of pages from a website or social media property as needed through the Reed Tech Web Preserver plug-in. This approach tends to be used to support the legal, marketing and competitive intelligence functions.

Deduplication: using WARCrefs tool to deduplicate web archive contents in BA cluster
OpenWayback: handling big data indexing by using ZipNumCluster to locate a certain URI in compressed CDX files

av_tools and p2 platform for parallel processing. It was replaced by a simpler access and direct method that enables automatic access to files but no platform for processing.

Enhanced access to Human Rights collection available at: Human Rights Web Archive.


...
Wikipedia

...