2015-12-22: 60% of Web Annotations are Orphaned or in Danger of Being Orphaned
Figure 1. An Annotation is defined by OAC as a set of connected resources

In our TPDL paper, we studied 6281 highlighted text annotations (out of 7744 annotations) available in the Hypothes.is annotation system in January 2015. The main goal was to investigate the prevalence of orphaned annotations, where neither a live Web page nor an archived copy of the Web page contains the text that had previously been annotated.

Recently, we applied the same analysis as in our TPDL paper to a larger number of annotations. Figure 2 illustrates that the number of annotations in Hypothes.is has been increasing since July 2013. Our TPDL paper focused on the 7744 annotations available in January 2015. Our updated paper (available at arXiv.org) analyzed the 20,133 highlighted text annotations (out of 33,946 total annotations) available in August 2015. In this post, I will focus on reporting the results of our arXiv paper.

Figure 2. January 2015 - dataset used in TPDL paper August 2015
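To make the definition of an orphaned annotation concrete, here is a minimal Python sketch. It is not the code used in the papers: the function name `is_orphaned`, its parameters, and the whitespace-normalized substring match are all assumptions made for illustration, and the page texts are assumed to have been fetched and extracted beforehand.

```python
from typing import Iterable, Optional


def is_orphaned(highlight: str,
                live_text: Optional[str],
                archived_texts: Iterable[str]) -> bool:
    """Return True when the highlighted text appears neither on the live
    page nor in any archived copy of that page."""

    def normalize(text: str) -> str:
        # Collapse runs of whitespace so line breaks do not block a match.
        return " ".join(text.split())

    def contains(page_text: Optional[str]) -> bool:
        # Assumed matching rule: plain substring test on normalized text.
        # A page that could not be retrieved is passed as None.
        if page_text is None:
            return False
        return normalize(highlight) in normalize(page_text)

    if contains(live_text):
        return False  # still attached to the live page
    # Orphaned only if no archived copy contains the text either.
    return not any(contains(text) for text in archived_texts)
```

Under these assumptions, an annotation whose text is missing from the live page but present in at least one archived copy is not counted as orphaned, since it could be reattached to that copy.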