Posts

2018-04-24: Let's Get Visual and Examine Web Page Surrogates

Why visualize individual web pages?

A variety of visualizations of individual web pages exist, but why do we need them when we can just choose a URI from a list and put it in our web browser? URIs are intended to be opaque: text from the underlying web resource does not need to exist in the URI. Consider http://dx.doi.org/10.1007/s00799-016-0200-8. Where does it go? Should we click on it? What content exists under the veil of the URI? Will it meet our needs? Now consider this web page surrogate produced by embed.ly for the same URI:

Avoiding spoilers: wiki time travel with Sheldon Cooper
A variety of fan-based wikis about episodic fiction (e.g., television shows, novels, movies) exist on the World Wide Web. These wikis provide a wealth of information about complex stories, but if...

If we were looking for research papers about avoiding spoilers for TV shows, then we know that clicking on this surrogate will take us to something that meets our information needs. If we ...

2018-04-23: "Grampa, what's a deleted tweet?"

Took screen shot, just in case, but I fear #Breitbart is well beyond the point of decency and shame that they would delete this insane tweet. #INTL4335 #Islamophobia pic.twitter.com/ipo1MhDmNI
- Cas Mudde 🌪️ (@CasMudde) February 5, 2018

In early February 2018, Breitbart News made a splash with its inflammatory tweet suggesting Muslims will end Super Bowl, which they deleted twelve hours later, stating it did not meet their editorial standards. The deleted tweet contained an imaginary conversation between a Muslim child and a grandparent about the Super Bowl and linked to one of its articles on the declining TV ratings of the National Football League (NFL) for the annual championship game. News articles from The Hill, Huffington Post, Politico, Independent, etc., talked about the deleted tweet controversy in detail.

We have deleted a tweet that did not meet our editorial standards.
- Breitbart News (@BreitbartNews...

2018-04-13: Web Archives are Used for Link Stability, Censorship Avoidance, and Traffic Siphoning

ISIS members immolating captured Jordanian pilot

Web archives have been used for purposes other than digital preservation and browsing historical data. These purposes can be divided into three categories:

1. Uploading content to web archives to ensure continuous availability of the data.
2. Avoiding governments' censorship or websites' terms of service.
3. Using URLs from web archives, instead of direct links, for news sites with opposing ideologies to avoid increasing their web traffic and deprive them of ad revenue.

1. Uploading content to web archives to ensure continuous availability of the data

Web archives, by design, are intended to solve the problem of digital data preservation so people can access data when it is no longer available on the live web. In this paper, Who and What Links to the Internet Archive (Yasmin AlNoamany, Ahmed AlSum, Michele C. Weigle, and Michael L. Nelson, 2013), the authors show that the percentage of the requested archived pag...