Posts

2016-05-31: Can I find this story? API: Yes, Google: Maybe, Native Search: No

Image
A story on Storify titled: "Lecture on Academic Freedom"  (capture date: 2016-05-31) The story on Storify titled: "Lecture on Academic Freedom" could not be found on Google   (capture date: 2016-05-31) The story on Storify titled: "Lecture on Academic Freedom" could not be found on Storify native search  (capture date: 2016-05-31) A part of our research ( funded by IMLS ) to build collections for stories or events involves exploring content curation sites like Storify  in order to determine if they hold quality (news worthy, timely, etc.) content. Storify is a social network service used to create stories which consists of text and multimedia content, as well as content from other social media sites like Twitter , Facebook and Instagram . Our exploration involved collecting stories from Storify over a period in other to manually inspect the stories to determine their newsworthiness. This exploration was dual natured: we collected latest

2016-04-27: Mementos in the Raw

Image
While analyzing mementos in a recent experiment, we discovered problems processing archived content .  Many web archives augment the mementos they serve with additional archive-specific information, including HTML, text, and JavaScript.  We were attempting to compare content across many web archives, and had to develop custom solutions to remove these augmentations. Most augment their mementos in order to provide additional user experience features, such as navigation to additional mementos, by rewriting links and providing additional discovery tools. From an end-user perspective, these  augmented  mementos enhance the usability and overall experience of web archives and are the default case for user access to mementos.  An example from the PRONI web archive is shown below, with the augmentations outlined in red. Others have  requirements to differentiate archived content from live content , because they expose archived content to web search engines. Below, we see that a Goog

2016-04-24: WWW 2016 Trip Report

Image
I was fortunate to present a poster at the 25th International World Wide Web Conference , held from April 11, 2016 - April 15, 2016. Though my primary mission was to represent both the WS-DL and the LANL Prototyping Group , I gained a better appreciation for the state of the art of the World Wide Web.  The conference was held in MontrĂ©al, Canada at the Palais des congrĂ©s de MontĂ©al . SAVE-SD 2016 I began the conference at the  SAVE-SD  workshop, focusing on the semantics, analytics, and visualization of scholarly data.  They had 6 full research papers, 2 position papers, and 2 poster papers.  The acceptance rate for this conference is relatively high.  The conference was kicked off by Alejandra Gonzales-Beltran and Francesco Osborne. They encouraged the use of Research Articles in Simplified HTML . Alex Wade gave us an introduction to the Microsoft Academic Service (MAS) and a sneak peek at the new features offered by Microsoft Academic , such as the Microsoft Academ