2016-06-23: Joint Conference on Digital Libraries (JCDL) 2016 Trip Report

Good morning Newark! Beautiful start for the doctoral consortium and tutorials at #jcdl2016 pic.twitter.com/kEBdJtvIho

— Michele Weigle (@weiglemc) June 19, 2016

The ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL) is a major international conference that offers the opportunity to address technical, practical, and social issues associated with digital libraries. This annual conference was held at the Paul Robeson Campus Center, Rutgers University Newark, New Jersey, between June 19-23, 2016. Here is a list of the accepted papers and posters/demos.

The theme for this year's conference was Big Libraries, Big Data, Big Innovation. Computer/Information scientists, Librarians, Archivists, Social scientists, Historians and other participants from around the world and other disciples involved in digital library research and practice presented papers and posters, specialized workshops (see Mat's blog post about the WADL workshop), tutorials, panels, and a doctoral consortium (see Sawood's blog post about the doctoral consortium). Also joining this distinguished collection of professionals were five members of the ws-dl group: Dr. Michael Nelson, Dr. Michele Weigle, Sawood Alam, Mat Kelly, and myself (Alexander Nwala).

.@WebSciDL on our way to #jcdl2016 @machawk1 @weiglemc @phonedude_mln @acnwala @ibnesayeed @jcdl2016 pic.twitter.com/xI7zeDQeAd
— Michael L. Nelson (@phonedude_mln) June 18, 2016

We drove from Norfolk, Virginia to Lewes, Delaware, took a ferry from Lewes to Cape May, New Jersey, and finally drove to Newark, New Jersey.

@elunca @WebSciDL @machawk1 @weiglemc @acnwala @ibnesayeed @jcdl2016 yes! Took the ferry to make it longer! ;-) pic.twitter.com/wbJENY334O
— Michael L. Nelson (@phonedude_mln) June 18, 2016

Aboard the Cape Henlopen ferry en route Newark, New Jersey. From left to right Dr. Michele Weigle, Alexander Nwala, Dr. Michael Nelson, Mat Kelly, and Sawood Alam

The first day kicked off at 9am until 5pm (June 19, 2016) with two concurrent events - Tutorials and a Doctoral Consortium at the Paul Robeson Campus Center at Rutgers University. We attended the Doctoral Consortium in which Sawood presented his work. The tutorials presented were as follows:

Introduction to Digital Libraries, presented by Edward A. Fox (Virginia Tech).
Introduction to the Digital Public Library of America (DPLA) API, presented by Unmil P. Karadkar, (The University of Texas at Austin), Audrey Altman and Mark Breedlove (Digital Public Library of America).
Information Extraction for Scholarly Digital Libraries, presented by Kyle Williams, Jian Wu, Zhaohui Wu and C. Lee Giles (Pennsylvania State University).

Day 1 (June 20, 2016):

The conference officially began on the second day with a keynote titled Future Digital Libraries: Research and Responsibilities by Maria Zemankova of the National Science Foundation in which she talked about libraries, archives, museums, and collections. The talk began with a brief history of libraries, before exploring digital libraries of today, and the future of digital libraries, etc.

JCDL 2016_keynote - Maria Zemankova

Two concurrent paper sessions (from 11am - 12:30pm) followed the keynote. The first paper session was about Wikipedia and Newspaper Analysis, and the second was about Curation and Education.

The first paper session chaired by Dr. Herbert Van de Sompel (Los Alamos National Laboratory), consisted of the following presentations and was attended by the five members of the ws-dl group present:

Querylog-based Assessment of Retrievability Bias in a Large Newspaper Corpus, by Myriam Traub, Thaer Samar, Jacco van Ossenbruggen, Jiyin He, Arjen de Vries and Lynda Hardman: Myriam Traub et al. addressed the problem of bias in digital libraries by measuring the effectiveness of retrievability measure using a large collection of digital newspapers.

@MyriamCTraub from @CWInl is now presenting her PhD work at #jcdl2016
View with us: https://t.co/1Zvu9WE7PY https://t.co/detL6Nru4j
— Arjen P. de Vries (@arjenpdevries) June 20, 2016

Very high Gini coefficient means many documents are never in top 10 results #jcdl2016 pic.twitter.com/jdAqn9gYhu
— Gary Munnelly (@russano22) June 20, 2016

Digital History Meets Wikipedia: Analyzing Historical Persons in Wikipedia, by Adam Jatowt, Daisuke Kawai, and Katsumi Tanaka: Adam Jatowt et al. conducted a temporal analysis about historical persons in Wikipedia, by examining the hyperlink structure of documents in order to understand the relationship between time, link structure and article popularity.

Super cool. From missing birth dates in Wikipedia, get a sense of how much of past that's lost to us. #jcdl2016 pic.twitter.com/mBwqvsXnbC
— Ian Milligan (@ianmilligan1) June 20, 2016

Public historians: lobby to get your topics made into Google Doodles to interest the most people. 😉 #jcdl2016 pic.twitter.com/pu9Z4LXaKt
— Ian Milligan (@ianmilligan1) June 20, 2016

Fascinating fig from the Wikipedia & history article (https://t.co/d5md9SnU3M). Different PageRanks/views. #jcdl2016 pic.twitter.com/5itpSzAuYg
— Ian Milligan (@ianmilligan1) June 20, 2016

Quality assessment of Wikipedia articles without feature engineering, by Quang Vinh Dang and Claudia-Lavinia Ignat: Given the popularity of Wikipedia and concern about the quality of Wikipedia documents, Quang Vinh Dang et al. addressed the problem of assessing Wikipedia articles for quality by not engineering a list of features which indicate quality. Instead, they assessed Wikipedia articles for quality by analyzing their content, rather than considering a feature set. This was achieved through a deep learning/natural language processing framework.

Automatically assessing the quality of Wikipedia articles. How badly did you mess up by citing that page? #jcdl2016 pic.twitter.com/NZSPYuLgz7
— Gary Munnelly (@russano22) June 20, 2016
Glyph Miner: A System for Efficiently Extracting Glyphs from Early Prints in the Context of OCR, by Benedikt Budig, Thomas C. Van Dijk and Felix Kirchner: Benedikt Budig et al. devised a system that replaces a common part of the OCR training pipeline with a more efficient workflow. Given a set of scanned historical documents, their user-interactive system extracts large numbers of glyphs selected by the user.

Benedikt Budig describes Glyph Miner--creates training models for identifying glyphs by removing false pos https://t.co/S99pTRovQY #jcdl2016
— Mat Kelly (@machawk1) June 20, 2016

The second paper session chaired by Dr. Edward A. Fox (Virginia Tech) consisted of the following presentations:

A short break followed the paper sessions, after which two concurrent sessions were conducted. The first panel session titled Issues of Dealing with Fluid Data in Digital Libraries, was chaired by Byron Marshall (Oregon State University) and consisted of the following panelists:

Soo-yeon Hwang - School of Communication and Information, Rutgers University
Melissa Cragin - National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign
Michael Lesk - School of Communication and Information, Rutgers University
Yu-Hung Lin - Rutgers University Libraries
Daniel O'Connor - School of Communication and Information, Rutgers University

@WebSciDL ready for action #jcdl2016 @weiglemc @machawk1 @ibnesayeed @acnwala pic.twitter.com/Hpx0KUXvpS
— Martin Klein (@mart1nkle1n) June 20, 2016

The third paper session (ws-dl members present) was titled Web Archiving. The paper session was chaired by Martin Klein (University of California, Los Angeles) and consisted of the following presentations:

Routing Memento Requests Using Binary Classifiers, by Nicolas J. Bornand, Lyudmila Balakireva and Herbert Van de Sompel. Nicolas J. Bornand et al. explored the use of binary and archive-specific classifiers to determine whether or not to query an archive for a given URI. This method was showed to significantly decrease the number of requests and the overall response time of the aggregator, without compromising recall.

@hvdsomp^3 Routing Memento Requests Using Binary Classifiers #jcdl2016 pic.twitter.com/8jMnQrWsF7
— Martin Klein (@mart1nkle1n) June 20, 2016

Van de Sompel: Memento aggregator searches multiple web archives at the same time (right now, 21 archives) #jcdl2016
— Jennifer Vinopal (@jvinopal) June 20, 2016

.@hvdsomp: 80% of URI-Rs were stored in 0-2 web archives. attempted to predict whether to look at a specific WAs by looking URI-R #jcdl2016
— Mx A. Matienzo (@anarchivist) June 20, 2016

The Dawn of Today's Popular Domains: A Study of the Archived German Web over 18 Years, by Helge Holzmann, Wolfgang Nejdl and Avishek Anand: In an effort to see what the future of the Web is, Helge Holzmann et al. embarked on a longitudinal study to see how websites evolved over time by studying a German Web collection to retrospectively analyze how the popular Web evolved over the past 18 years.

Holzmann: first Q "how long do web sites live?" --- existential angst at #jcdl2016
— Paula Goodale (@PaulaGoodale) June 20, 2016

Holzmann: developed 3 types of statistic aggregations on yrly granularity: evolution, relative to domain age, relative to URL age #jcdl2016
— Mx A. Matienzo (@anarchivist) June 20, 2016

.@helgeho describes the data cleaning & initial analysis done on CDX files received from @internetarchive #jcdl2016 pic.twitter.com/MEIRtEfA49
— Mat Kelly (@machawk1) June 20, 2016

German University websites tend to live longer than other German sites -@helgeho #jcdl2016
— Mat Kelly (@machawk1) June 20, 2016

Holzmann: finding: domains grow exponentially, doubling their volume every 2 years #jcdl2016
— Jennifer Vinopal (@jvinopal) June 20, 2016

Holzmann: domains grow exponentionally, doubling volume every 2 years #jcdl2016
— Mx A. Matienzo (@anarchivist) June 20, 2016

ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation, by Helge Holzmann, Vinay Goel and Avishek Anand: Researchers exploring digital libraries require tools that provides efficient access to Web archive data for extraction and derivation of smaller datasets. To fulfill this need, Helge Holzmann et al. proposed ArchiveSpark; a framework for efficient and distributed Web archive processing.

And now up is @helgeho on ArchiveSpark, a #webArchiving analysis framework. https://t.co/DWr7ADXHGW #jcdl2016
— Ian Milligan (@ianmilligan1) June 20, 2016

Important point from @helgeho: many think that web archives are the Wayback, but it’s really Big Data. & requires Big Data tools. #jcdl2016
— Ian Milligan (@ianmilligan1) June 20, 2016

When I began my work, what I would have given for ArchiveSpark (https://t.co/DWr7ADXHGW) or Warcbase (https://t.co/iOyD3AGI81). #JCDL2016
— Ian Milligan (@ianmilligan1) June 20, 2016

.@helgeho: example usecase for ArcihveSpark: sentiment analysis/reactions previous election cycle; support filtering decisions #jcdl2016
— Mx A. Matienzo (@anarchivist) June 20, 2016

Demo time! @helgeho is using a Jupyter notebook to interact with ArchiveSpark for their cluster running at L3S #jcdl2016
— Mx A. Matienzo (@anarchivist) June 20, 2016

Helge doing a fantastic job presenting our work at #jcdl2016 https://t.co/bSW2m7IdI0
— Vinay Goel (@vinaygo) June 20, 2016

Minute madness starts in five mins. This horn is surprisingly loud... honks if you speak over a minute! 😈 #jcdl2016 pic.twitter.com/MqhIkwxprc
— Ian Milligan (@ianmilligan1) June 20, 2016

@ianmilligan1 being mean (tough as a Canadian) during minute madness #jcdl2016 pic.twitter.com/LVqDQsMZwk
— Martin Klein (@mart1nkle1n) June 20, 2016

@clarellewellyn built a Twitter archive on #brexit #jcdl2016 pic.twitter.com/JUzsLGQXGk
— Martin Klein (@mart1nkle1n) June 20, 2016

Cool poster on #learninganalytics #jcdl2016 pic.twitter.com/I5HBcKNRoi
— Robert H. McDonald (@mcdonald) June 20, 2016

@damirah on Semantometrics #jcdl2016 pic.twitter.com/X6o1I8679T
— Martin Klein (@mart1nkle1n) June 20, 2016

https://t.co/TxvEziz1Mp cool tool for capturing use data for recommender options #jcdl2016 pic.twitter.com/nd2fphVwi7
— Robert H. McDonald (@mcdonald) June 20, 2016

@acnwala on scholarly vs non-scholarly web queries #jcdl2016 pic.twitter.com/zbu0bglFX4
— Martin Klein (@mart1nkle1n) June 20, 2016

De-anonymized, this is Unmil @unmil #jcdl2016 pic.twitter.com/nujZeirhQZ
— Martin Klein (@mart1nkle1n) June 20, 2016

#IamNotaGator @ibnesayeed #jcdl2016 pic.twitter.com/zlU70bxBsD
— Martin Klein (@mart1nkle1n) June 20, 2016

#iamnotagator #jcdl2016 pic.twitter.com/Qv9u63udou
— Mx A. Matienzo (@anarchivist) June 20, 2016

.@machawk1 presenting https://t.co/15NHyGNJlz #jcdl2016 @WebSciDL pic.twitter.com/25oNQAz89m
— Michael L. Nelson (@phonedude_mln) June 20, 2016

#jcdl2016 poster session underway. I'm very proud of helping chair this track! pic.twitter.com/jlCgfjwT7R
— Ian Milligan (@ianmilligan1) June 20, 2016

@zxie on cloud cost for data repos #jcdl2016 pic.twitter.com/uBHMy2ggP0
— Martin Klein (@mart1nkle1n) June 20, 2016

@clarellewellyn lecturing on her Twitter dataset and how to avoid the drunkard's search #jcdl2016 pic.twitter.com/po1NvSCOWf
— Martin Klein (@mart1nkle1n) June 20, 2016

@damirah explaining how to evaluate research output based on full text #jcdl2016 pic.twitter.com/f7EwlApEVB
— Martin Klein (@mart1nkle1n) June 20, 2016

Day 2 (June 21, 2016):

Day 2 of the conference, just like Day 1 began with a keynote address by Rachel Frick (DPLA), titled The State of Practice and Use of Digital Collections: the Digital Public Library of America as a platform for research, in which she talked about the DPLA, exploring its history as well as the past and current efforts.

Day two keynote by @rlfrick on history of @dpla #jcdl2016 @jcdl2016 pic.twitter.com/galAthQUbr
— Robert H. McDonald (@mcdonald) June 21, 2016

.@rlfrick @dpla Desire to make large-scale (rather than incremental) change #jcdl2016
— Jennifer Vinopal (@jvinopal) June 21, 2016

Dpla's goal was reuse. Wanted to see 80% of hits coming via API with only 20% using the portal. Reality is about 60:40 #jcdl2016
— Gary Munnelly (@russano22) June 21, 2016

The fourth paper session followed the keynote. It was chaired by Xiaozhong Liu (Indiana University) and consisted of the following presentations:

Low-cost semantic enhancement to Digital Library metadata and indexing: Simple yet effective strategies, by Annika Hinze, David Bainbridge, Sally Jo Cunningham and J. Stephen Downie: Annika Hinze et al. addressed accessing digital libraries non-disruptively and cheaply by using the results of semantic analysis and disambiguation, while retaining a keyword-based search and lexicographic index.

Shout out to @vphill's in-depth analysis of @DPLA metadata, including subject analysis: https://t.co/joHXfyPjI9 #JCDL2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

Hinze: textualising a semantic intdex is surprisingly effective; addresses false negatives but adds false positives #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

Demo video of Capisco system presented by Hinze available: https://t.co/3IoqFuYuvU #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities, by Andrew Jackson, Jimmy Lin, Ian Milligan and Nick Ruest: Andrew Jackson et al. described an exploratory search interface to web archives for humanities scholars and social scientists by presenting their initial implementation and discussed their findings in terms of a desiderata for the system.

Title slide of my #jcdl2016 paper – so happy to have been able to collaborate with such a fantastic team. #jcdl2016 pic.twitter.com/MfZVSkd3tR
— Ian Milligan (@ianmilligan1) June 21, 2016

.@ianmilligan1: Historians should care about WWW as a source of traces; demonstrated 100x growth between 1995-7of Geocities #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

Historian @ianmilligan1 quoting Schneiderman @benbendc at a digital libraries conference #jcdl2016 pic.twitter.com/MqAns33h8K
— Michele Weigle (@weiglemc) June 21, 2016

Web Archives for Longitudinal Knowledge (WALK) Portal , https://t.co/opdafRo8pq demo by @ianmilligan1 #jcdl2016
— Philipp Mayr (@Philipp_Mayr) June 21, 2016

.@ianmilligan1's comment: Storytelling for summarizing web archive collections, https://t.co/27edPIzWUz @yasmina_anwar @weiglemc #jcdl2016
— Michael L. Nelson (@phonedude_mln) June 21, 2016

Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses, by Ian Milligan, Nick Ruest and Jimmy Lin: Ian Milligan et al. addressed the question: "what should we archive?" by a case study about the 2015 Canadian federal elections by comparing a broad ("gatekeepers") crawl approach to a "the masses" crawl approach. Through their study, they recommended a hybrid approach that combines social media and more traditional curatorial methods.

.@ruebot: focused on Canadian Politlcal Parties/PIG Archive-IT collection from UToronto; fetched data into WARCBase #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

.@ruebot: Collected 3.9M tweets using @edsu's twarc; collects in raw json and very easy to get started w/ collecting #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

.@ruebot: took all URIs and compared them; in CPP only 0.341% were in Twitter dataset; in Twitter; only 0.269% were in CPP dataset #jcdl2016
— Mx A. Matienzo (@anarchivist) June 21, 2016

Towards Better Understanding of Academic Search, by Madian Khabsa, Zhaohui Wu and C. Lee Giles: Madian Khabsa et al. studied the distribution of queries that are received by an academic search engine. They also introduced a machine learning approach to identify navigational academic queries.

Lee Giles talking about academic search in citeseer. User queries, not bot queries. @jcdl2016 #jcdl2016 https://t.co/XrjqIdMKtI
— Philipp Mayr (@Philipp_Mayr) June 21, 2016
Investigating Cluster Stability when Analyzing Transaction Logs, by Paul Clough and Daniel Grech: Paul Clough et al. computed stability based on the Jaccard coefficient to investigate the cluster stability when using different subsets of transaction log data from WorldCat.org.

Paul Clough presenting "Investigating Cluster Stability when Analyzing Transaction Logs" https://t.co/d4spAMkFDP #jcdl2016 @jcdl2016
— Philipp Mayr (@Philipp_Mayr) June 21, 2016

After an hour and half lunch break, two concurrent events began - a second panel session titled Preserving Born-Digital News (chaired by Vivek Singh, Rutgers University), and a fifth paper session on Q&A and Gaming (chaired by Sally Jo Cunningham, University of Waikato).

The participants of the second panel session (attended by the ws-dl members) consisted of the following panel participants:

Edward McCain (Organizer) - Donald W. Reynolds Institute, University of Missouri Libraries
Matthew Weber - School of Communication and Information, Rutgers University
Martin Klein - University of California Los Angeles Libraries

.@e_mccain References @CoMissourian learning a hard lesson about digital data preservation and the memory hole left bc of lack of #jcdl2016
— Mat Kelly (@machawk1) June 21, 2016

.@mart1nkle1n talking about “Collecting, analyzing, and linking TV news and social media collections,” based on work at UCLA. #jcdl2016
— Ian Milligan (@ianmilligan1) June 21, 2016

my news media & research presentation slides for #jcdl2016 available here https://t.co/pMvizdAchK
— Matthew Weber (@docmattweber) June 21, 2016

.@mart1nkle1n presenting on linking TV news and tweets #jcdl2016 pic.twitter.com/xTQiN2610h
— Michael L. Nelson (@phonedude_mln) June 21, 2016

.@docmattweber Much of what happens as news on Twitter is based off of what happens on TV, real life, etc. #jcdl2016
— Mat Kelly (@machawk1) June 21, 2016

Fantastic @docmattweber visualization of link profile correlations - how papers duplicate content. #jcdl2016 pic.twitter.com/05ToLZr0ap
— Ian Milligan (@ianmilligan1) June 21, 2016

from @docmattweber prez: "combat operations" vs. "major combat operations" https://t.co/fTTrqsDkGS vs. https://t.co/ZyjxtzJJgw #jcdl2016
— Michael L. Nelson (@phonedude_mln) June 21, 2016

re: Hurricane Katrina summary page vs. replay from archive, see: https://t.co/SN1W5cUzMf from me, @weiglemc #jcdl2016
— Michael L. Nelson (@phonedude_mln) June 21, 2016

The fifth paper session titled Q&A and Gaming and chaired by Sally Jo Cunningham (University of Waikato) consisted of the following papers and corresponding presenters:

Experimental Evaluation of Affective Embodied Agents in an Information Literacy Game, by Yan Ru Guo, Dion Hoe-Lian Goh, Hurizan Bin Hussain Muhamad, Boon Kuang Ong and Zichao Lei.
Evaluating the Quality of Educational Answers in Community Question-Answering, by Long Le, Chirag Shah and Erik Choi.
Music Information Seeking via Social Q&A: An Analysis of Questions in Music StackExchange Community, by Hengyi Fu and Yun Fan.

The sixth paper session followed the previous paper/panel sessions. It was titled Publication Mining, chaired by Giorgio Maria Di Nunzio (University of Padua), and consisted of the following papers and corresponding presenters:

PDFFigures++: Mining Figures from Research Papers, by Christopher Clark and Santosh Divvala: Christopher Clark et al. presented their tool/algorithm (PDFFigures 2.0) which extracts figures, tables, and captions from scholarly documents. Their algorithm showed impressive results (94% precision at 90% recall) on the test dataset, and surpassed the state of the art.

PDFFigures 2.0: Mining Figures from Research Papers https://t.co/GWx9eVXbEy #jcdl2016 @jcdl2016 great paper
— Philipp Mayr (@Philipp_Mayr) June 21, 2016

@Philipp_Mayr @jcdl2016 "PDFFigures 2.0 powers the figure extraction feature in Semantic Scholar" https://t.co/152iJqEw7Y #jcdl2016
— Philipp Mayr (@Philipp_Mayr) June 21, 2016

Comparing Published Scientific Journal Articles to Their Pre-print Versions, by Martin Klein, Peter Broadwell, Sharon Farb and Todd Grappone: U.S. academic libraries paid $1.7 billion for serial subscriptions in 2008 alone to academic publishers. Consequently, the analysis of Martin Klein et al. revealed that the text contents of scientific papers generally changed very little from their pre-print to final published versions. Thereby providing information to facilitate economic decision targeting subscriptions.

Now @mart1nkle1n on Comparing Published Scientific Journal Articles to Their Pre-print Versions. https://t.co/BfDZ4EsjEG #jcdl2016
— Ian Milligan (@ianmilligan1) June 21, 2016

.@mart1nkle1n presenting "Comparing published articles to preprints" https://t.co/OOjmDgpZft #jcdl2016 pic.twitter.com/w4rG4lfWb3
— Michael L. Nelson (@phonedude_mln) June 21, 2016

How much does 1.7 billion buy you ? "Comparing Journal Articles to Their Preprint Versions" https://t.co/eoqUkHTXzF @mart1nkle1n #jcdl2016
— Herbert (@hvdsomp) June 21, 2016

Slides for "Comparing Published Scientific Journal Articles to Their Pre-print Versions"
#jcdl2016 https://t.co/iecMdxE4Lt
— Martin Klein (@mart1nkle1n) June 21, 2016

Paper at: https://t.co/8dmJ9RDXZo #jcdl2016
— Martin Klein (@mart1nkle1n) June 21, 2016

Extracting Academic Genealogy Trees from the Networked Digital Library of Theses and Dissertations, by Wellington Dores, Fabricio Benevenuto and Alberto Laender: Given the decentralized storage of research theses and dissertations across local digital libraries, exploring the genealogy of researches over time is challenging. Thus, Wellington Dores et al. presented a first step towards building a large repository that records the academic genealogy of researchers across different fields and countries.

Good example of NLP meets Scientometrics "Extracting Academic Genealogy" by Alberto Laenderhttps://t.co/XEGQRwKAnw #jcdl2016 @jcdl2016
— Philipp Mayr (@Philipp_Mayr) June 21, 2016

Alberto Laender, 1 of our WOSP speakers now up on the main stage. Come to our mining workshop tomorrow to hear more from him! #jcdl2016
— OpenMinTeD (@openminted_eu) June 21, 2016

Probabilistic Assignment of Medical Subject Headings to PubMed Records Based on References and Abstract Similarity, by Adam Kehoe and Vetle Torvik: Adam K. Kehoe et al. described a method for assigning Medical Subject Headings (MeSH) to unlabeled documents by combining abstract similarities and direct citations to labeled MEDLINE records.

@mart1nkle1n citing @gemmahersh on "radical" changes in manuscripts due to publisher editing, demonstrating almost no editing done #jcdl2016
— Petr Knoth (@petrknoth) June 21, 2016

After the sixth paper session of the conference a banquet at the Newark Museum followed.

Conference banquet at the Newark Museum #jcdl2016 pic.twitter.com/XtR5ea8QyJ
— Michele Weigle (@weiglemc) June 21, 2016

A night at the museum.#jcdl2016 #newarkmuseum #diederich #chandelier pic.twitter.com/P3V6lQJ4Fq
— Giorgio M Di Nunzio (@airamoigroig) June 22, 2016

.@WebSciDL taking in the art at #jcdl2016 @phonedude_mln @mart1nkle1n @machawk1 @acnwala @ibnesayeed pic.twitter.com/zEVNXE06NU
— Michele Weigle (@weiglemc) June 22, 2016

Loved the #jcdl2016 conference dinner at @NewarkMuseum. Fantastic venue, food and company! Plus the Ballantine House was spectacular.
— Paula Goodale (@PaulaGoodale) June 22, 2016

In this banquet, the winners/runner-ups of the best paper and poster were announced and recognized. The Vannevar Bush best paper award went to Comparing Published Scientific Journal Articles to Their Pre-print Versions by Martin Klein, Peter Broadwell, Sharon E. Farb, and Todd Grappone.

Fun #jcdl2016 banquet... and the Vannevar Bush best paper award goes to @mart1nkle1n et al! Congrats! pic.twitter.com/DZ4m5R5Sgt
— Ian Milligan (@ianmilligan1) June 21, 2016

.@mart1nkle1n's paper just won Vannevar Bush best paper award at #jcdl2016 https://t.co/Ob8zA4POBn
— Herbert (@hvdsomp) June 21, 2016

The best poster, third place went to my poster: A Supervised Learning Algorithm for Binary DomainClassification of Web Queries using SERPs

Best Poster for #jcdl2016 3rd Place A Supervised Learning Algorithm... by @acnwala and @phonedude_mln
— JCDL 2016 (@jcdl2016) June 22, 2016

The second place by one vote went to Avoiding the Drunkard's Search: Investigating Collection Strategies for Building a Twitter Dataset, by Clare Llewellyn, Laura Cram, and Adrian Favero

Best Poster for #jcdl2016 2nd Place Avoiding the Drunkard's Search: Investigating Collection Strategies for Building a Twitter Dataset
— JCDL 2016 (@jcdl2016) June 22, 2016

The best poster, first place went to Semantometrics: Towards Fulltext-based Research Evaluation, by Drahomira Herrmannova and Petr Knoth.

Best Poster for #jcdl2016 1st Place Semantometrics: Towards Fulltext-based Research Evaluation
— JCDL 2016 (@jcdl2016) June 22, 2016

The best student paper went to Evaluating the Quality of Educational Answers in Community Question-Answering by Long Le, Chirag Shah, Erik Choi

#jcdl2016 Best Student Paper:Evaluating the Quality of Educational Answers in Community Question-Answering by Long Le, Chirag Shah,Erik Choi
— JCDL 2016 (@jcdl2016) June 22, 2016

Here is a complete list of all the winners and nominees.

Day 3 (June 22, 2016):

The third day of the conference was split into three sections - the seventh paper session titled Recommendation and Prediction, a keynote by Stephen Bury (New York Art Resources Consortium - NYARC), and four workshops.

The seventh paper session was chaired by Edie Rasmussen (University of British Columbia), and consisted of the following presentations:

Profiling vs. Time vs. Content: What does Matter for Top-k Publication Recommendation based on Twitter Profiles?, by Chifumi Nishioka and Ansgar Scherp: To address the lack of clarity about how different factors of a scientific publication recommender system (based on users' tweets) have an influence on the recommendation performance, Chifumi Nishioka et al. examined three different factors - profiling method, temporal decay, and richness of content.

Nishioka presents a new personalised approach for research paper recommendation based on Twitter profiles (#coldstart problem) #jcdl2016
— Petr Knoth (@petrknoth) June 22, 2016

Chifumi Nishioka from U Kiel on Publication Recommendation based on Twitter Profiles. Looking fwd to demo #jcdl2016 pic.twitter.com/RXcnmnCENY
— Martin Klein (@mart1nkle1n) June 22, 2016

Early Prediction of Scholar Popularity, by Masoumeh Nezhadbiglari, Marcos Goncalves and Jussara Almeida: Masoumeh Nezhadbiglari et al. tackle the problem of predicting the popularity of scholars by attempting to make the predictions both as earlier and accurate as possible.
Evaluating Link-based Recommendations for Wikipedia, by Malte Schwarzer, Moritz Schubotz, Norman Meuschke, Corinna Breitinger, Volker Markl and Bela Gipp: Malte Schwarzer et al. reported on the first large-scale investigation about the the performance of the Co-Citation Proximity Analysis method of generating recommendations for Wikipedia. They analyzed links instead of citations to generate article recommendations.

Stephen Bury gave the final keynote titled, The ENERGY OF DELUSION: THE NEW YORK ART RESOURCES CONSORTIUM (NYARC) & THE DIGITAL.

We're delighted to have Stephen Bury from @fricklibrary and @nyarcist presenting the final keynote at #jcdl2016 pic.twitter.com/jvdVfzMcRs
— Michele Weigle (@weiglemc) June 22, 2016

Stephen Bury of @nyarcist gives the final keynote of #jcdl2016, describing the rel. of his org. with local museums. pic.twitter.com/CxLEu0hWAl
— Mat Kelly (@machawk1) June 22, 2016

Stephen Bury demonstrating the @nyarcist/@Frick_DAHL ARIES "digital lightbox" environment. #jcdl2016 pic.twitter.com/64VK424iUV
— Pete Broadwell (@PeterBroadwell) June 22, 2016

The main conference ended following the presentations from the seventh paper session, but not before Ian Milligan invited us to attend JCDL 2017 in Canada!

JCDL 2017 will be held at @UofT in Toronto, Canada. #jcdl2016 #jcdl2017 pic.twitter.com/lFpTPD8Q2m
— Mat Kelly (@machawk1) June 22, 2016

-- Nwala (@acnwala)

Search This Blog

Web Science and Digital Libraries Research Group

2016-06-23: Joint Conference on Digital Libraries (JCDL) 2016 Trip Report

Comments

Post a Comment