2017-04-24: Pushing Boundaries
Since the advent of the web, more elements of scholarly communication are occurring online . A world that once consisted mostly of conference proceedings, books, and journal articles now includes blog posts, project websites, datasets, software projects, and more. Efforts like LOCKSS , CLOCKSS , and Portico preserve the existing journal system, but there is no similar dedicated effort for the web presence of scholarly communication . Because web-based scholarly communication is born on the web, it can benefit from web archiving . This is complicated by the complexity of scholarly objects. Consider a dataset on the website Figshare , whose landing page is shown in Fig. 1. Each dataset on Figshare has a landing page consisting of a title, owner name, brief description, licensing information, and links to bibliographic metadata in various forms. If an archivist merely downloads the dataset and ignores the rest, then a future scholar using their holdings is denied context and addit