technical-report

Rules of Acquisition for Mementos and Their Content

Rules of Acquisition for Mementos and Their Content

by Shawn M. Jones, Harihar Shankar

Text extraction from web pages has many applications, including web crawling optimization and document clustering. Though much has been written about the acquisition of content from live web pages, content acquisition of archived web pages, known as mementos, remains a relativ...

Read More
Bringing Web Time Travel to MediaWiki: An Assessment of the Memento MediaWiki Extension

Bringing Web Time Travel to MediaWiki: An Assessment of the Memento MediaWiki Extension

by Shawn M. Jones, Michael L. Nelson, Harihar Shankar, Herbert Van de Sompel

We have implemented the Memento MediaWiki Extension Version 2.0, which brings the Memento Protocol to MediaWiki, used by Wikipedia and the Wikimedia Foundation. Test results show that the extension has a negligible impact on performance. Two 302 status code datetime negotiatio...

Read More