Hypercane: Toolkit for Summarizing Large Collections of Archived Webpages

by Shawn M. Jones, Michele C. Weigle, Michael L. Nelson

In the Dark and Stormy Archives (DSA) project, we focus on storytelling techniques to summarize collections of archived web pages. Since collections can have hundreds or even thousands of seeds (initial URLs) and each seed can be recrawled many times, with each version separat...

Hypercane: Intelligent Sampling for Web Archive Collections

by Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson

Humans can choose individual documents from a web archive collection, but doing so is difficult if they are unfamiliar with the collection. The issue is scale. Most web archive collections consist of thousands of documents. Hypercane is a tool that automates the selection of d...

It's All About The Cards: Sharing on Social Media Probably Encouraged HTML Metadata Growth

by Shawn M. Jones, Valentina Neblitt-Jones, Michele C. Weigle, Martin Klein, and Michael L. Nelson

In a perfect world, all articles consistently contain sufficient metadata to describe the resource. We know this is not the reality, so we are motivated to investigate the evolution of the metadata that is present when authors and publishers supply their own. Because applying ...

It's All About The Cards: Sharing on Social Media Encouraged HTML Metadata Growth

In a perfect world, all articles consistently contain sufficient metadata to describe the resource. We know this is not the reality, so we are motivated to investigate the evolution of the metadata that is present when authors and publishers supply their own. Because applying ...

From Student To Researcher III

After graduating, I officially accepted a position in Los Alamos National Laboratory’s Information Sciences Division (CCS-3) working for Diane Oyen. On October 4, 2021, I will no longer be a member of the Los Alamos National Laboratory (LANL) Research Library and I will inste...

Improving Collection Understanding for Web Archives with Storytelling: Shining Light Into Dark and Stormy Archives

by Shawn M. Jones

Collections are the tools that people use to make sense of an ever-increasing number of archived web pages. As collections themselves grow, we need tools to make sense of them. Tools that work on the general web, like search engines, are not a good fit for these collections be...

Improving Collection Understanding For Web Archives With Storytelling

In a perfect world, all articles consistently contain sufficient metadata to describe the resource. We know this is not the reality, so we are motivated to investigate the evolution of the metadata that is present when authors and publishers supply their own. Because applying ...

Interoperability for Accessing Versions of Web Resources with the Memento Protocol

by Shawn M. Jones, Martin Klein, Herbert Van de Sompel, Michael L. Nelson, and Michele C. Weigle

Used by a variety of researchers, web archive collections have become invaluable sources of evidence. If a researcher is presented with a web archive collection that they did not create, how do they know what is inside so that they can use it for their own research? Search eng...

Automatically Selecting Striking Images for Social Cards

To allow previewing a web page, social media platforms have developed social cards: visualizations consisting of vital information about the underlying resource. At a minimum, social cards often include features such as the web resource’s title, text summary, striking image, a...

Automatically Selecting Striking Images for Social Cards

by Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson

To allow previewing a web page, social media platforms have developed social cards: visualizations consisting of vital information about the underlying resource. At a minimum, social cards often include features such as the web resource’s title, text summary, striking image, a...
