2016-04-27: Mementos in the Raw

While analyzing mementos in a recent experiment, we discovered problems processing archived content. Many web archives augment the mementos they serve with additional archive-specific information, including HTML, text, and JavaScript. We were attempting to compare content across many web archives, and had to develop custom solutions to remove these augmentations. Most archives augment their mementos to provide user experience features, such as navigation to additional mementos; they do so by rewriting links and adding discovery tools. From an end-user perspective, these augmented mementos enhance the usability and overall experience of web archives and are the default case for user access to mementos. An example from the PRONI web archive is shown below, with the augmentations outlined in red. Other archives must differentiate archived content from live content because they expose archived content to web search engines. Below, we see that a Goog