Posts

Showing posts from March, 2020

2020-03-26: Memento Compliance Audit of PyWB

Image
This document is an audit report of the latest development version of PyWB , a Web archive replay sytem, for its Memento (RFC 7089) compliance. As a growing number of public Web archives are moving towards deploying PyWB, it becomes critical to comply with standards to ensure that tools in the archiving ecosystem continue to function as expected. To audit the Memento compliance of PyWB I established the following setup: Captured example.com five times in separate WARC files with the gap of a few minutes each using warcio Created various test instances of PyWB's develop branch, which is one commit ahead of the v-2.4.0-rc6-test version (commit hash: 92e459bda52a2b03f33a4b0b8094ed424248d2a5 ) Initialized a collection named example and loaded freshly captured warc files in it for replay Placed multiple custom configuration files that are loaded by setting PYWB_CONFIG_FILE environment variable for each test instance Preserved the state of the relevant folder tree in pywb

2020-03-24: StoryGraph at Computation + Journalism Symposium 2020 Non-Trip Report

Image
Click to expand: Overview of StoryGraph illustrating the process of generating a news similarity graph is four primary steps. The four steps are explained in the StoryGraph Tech Report We never did give StoryGraph  a proper introduction. Over three years, I have tweeted, created a Twitter account ( @storygraphbot ) for StoryGraph, and published two blogposts that utilized the StoryGraph service to determine the top news stories of 2018 and 2019 . But I never really introduced and motivated the need for StoryGraph. I hoped that the  Computation + Journalism Symposium  would provide the opportunity for giving StoryGraph a proper introduction, but the COVID-19 pandemic interrupted it. To commemorate #ElectionDay , Same domain story linking results for October 2018 (for @BreitbartNews @FoxNews @CNN @HuffPost ) shows stories news media focused on 1 month before the elections. pic.twitter.com/Xw196Y66Mi — StoryGraph (@storygraphbot) November 6, 2018 The Computation + Jou

2020-03-18: Travis Reid (Computer Science PhD Student)

Image
My name is Travis Reid and I joined the Web Science and Digital Libraries (WS-DL) research group at Old Dominion University (ODU) as a Ph.D. student during the Spring 2020 semester. I decided on joining the PhD program, because I wanted to further my education, learn more about different research areas, and see how it is like being a researcher. My advisers are Dr. Michele Weigle and Dr. Michael Nelson . Some of my research interests are information visualization and digital libraries. I have enjoyed my first semester in the PhD program, because there is a variety of tasks that I do. During this semester I have been able to take CS 800 and CS 864, I am a full-time Teaching Assistant (TA), and I am working on a task for the WS-DL group. In CS 800 , I have learned more about graduate school and how it is like being a researcher. In CS 864, I have learned about blockchains, cryptocurrencies, and smart contracts. For the full-time TA position I have graded assignments and instr

2020-03-07: At the nexus of the CNI keynote and Rosenthal’s response: “It's not an easy thing to meet your maker.”

Image
Author's note: This document is a public companion to a private document that outlines a number of security vulnerabilities and exploits.  Both documents were shared in October, 2019 with the Internet Archive, Rhizome, and other colleagues. This document is being published in January, 2020 at both blog.dshr.org and ws-dl.blogspot.com , but currently there are no plans to publish the private document. At the nexus of the CNI keynote and Rosenthal’s response: “It's not an easy thing to meet your maker.” Michael L. Nelson 2019-10-11 One of the high points of my career is the invitation from Cliff Lynch for the Spring 2019 CNI Keynote and his gracious introduction prior to my speaking.  Unfortunately, I’ve since discovered that the antidote for a Cliff Lynch “high” is a negative review from David Rosenthal, a revered researcher with a storied career, a valued colleague, and occasional co-author.  For example, in our research group our graduate students keep track of “D