2022-01-26: Review of WS-DL's 2021

Capturing the essence of 2021: unmasked post-vax but inbetween Delta & Omicron waves, and a mix of online and f2f meetings.

2021 was the second year of the Covid-19 pandemic and although things improved from 2020, there was continued impact on our research, teaching, and service here at the for us at the Web Science and Digital Libraries (WS-DL) Research Group.  We graduated one Ph.D. student and three MS students, and added five graduate students and two undergraduates to the group.

Students and Faculty

Unlike last year when we had three Ph.D. defenses, this year we had only a single defense.  Sadly, this one was also on Zoom. 

We did not add additional faculty to @WebSciDL this year, standing pat with six members.  We continued to offer an exciting array of web science related courses, with six courses offered in both Spring 2021 and Fall 2021.


We added five graduate students this year:

 And two undergraduate students:

We also had three people advance from Ph.D. "student" to "candidate" (but sadly, no crush board pic):

  • Corren McCoy became a candidate on 2021-12-13: "A Relevance-Based Model for Ranking Cybersecurity Vulnerabilities"
  • Muntabir Choudhury became a candidate on 2021-11-17: "Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations".
  • Gavindya Jayawardena became a candidate on 2021-10-25: "Automated Filtering of Eye Gaze Metrics from Dynamic Areas of Interest"

Publications and Presentations

Given the size of our research group, manually compiling a list of publications has become unwieldy, but our best estimate for 2021 is: six journal articles, one book chapter, two patents, 26 conference/workshop publications, and 11 technical reports or other contributions.   We've been working on a software package, Scholar Groups, that scrapes Google Scholar profiles, dedups the entries, and then merges them into a single list.   There are still a few issues that we are resolving, and anticipate making a full release in early 2022.  Since it scrapes Google Scholar, it is subject to any errors or limitations that might be in the Google Scholar entries.  I've included an expandable/collapsable gist below that is our 2021 publications list according to Scholar Groups. 

Collapse/expand the list of WSDL's 2021 publications

Web Science and Digital Libraries Research Group 2021 Publications

  1. M Aturban, ML Nelson, MC Weigle, Where Did the Web Archive Go?, International Conference on Theory and Practice of Digital Libraries, 73-84, 2021.

  2. S Rajtmajer, C Griffin, J Wu, R Fraleigh, L Balaji, A Squicciarini, ..., A Synthetic Prediction Market for Estimating Confidence in Published Work, arXiv preprint arXiv:2201.06924, 2021.

  3. TR Griffin, EJ Healy, IV Ramakrishnan, VG Ashok, System and Method Associated with Determining Physician Attribution Related to In-Patient Care Using Prediction-Based Analysis, US Patent App. 17/399,511, 2021.

  4. M Ling, J Chen, T Möller, P Isenberg, T Isenberg, M Sedlmair, ..., Document Domain Randomization for Deep Learning Document Layout Extraction, arXiv preprint arXiv:2105.14931, 2021.

  5. S Uddin, B Banerjee, J Wu, WA Ingram, EA Fox, Building A Large Collection of Multi-domain Electronic Theses and Dissertations, 2021 IEEE International Conference on Big Data (Big Data), 6043-6045, 2021.

  6. A Mabe, ML Nelson, MC Weigle, Extending Chromium: Memento-Aware Browser, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 310-311, 2021.

  7. SY Kahu, WA Ingram, EA Fox, J Wu, Scanbank: A benchmark dataset for figure extraction from scanned electronic theses and dissertations, arXiv preprint arXiv:2106.15320, 2021.

  8. M Hasan Choudhury, HR Jayanetti, J Wu, WA Ingram, EA Fox, Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations, arXiv e-prints, arXiv: 2107.00516, 2021.

  9. SM Jones, M Klein, HV Sompel, ML Nelson, MC Weigle, Interoperability for Accessing Versions of Web Resources with the Memento Protocol, The Past Web, 101-126, 2021.

  10. S De Silva, SU Dayarathna, G Ariyarathne, D Meedeniya, S Jayarathna, fMRI feature extraction model for ADHD classification using convolutional neural network, International Journal of E-Health and Medical Communications (IJEHMC) 12 (1 …, 2021.

  11. HN Lee, V Ashok, IV Ramakrishnan, Bringing Things Closer: Enhancing Low-Vision Interaction Experience with Office Productivity Applications, Proceedings of the ACM on Human-Computer Interaction 5 (EICS), 1-18, 2021.

  12. JS Downie, D McKay, H Suleman, DM Nichols, F Poursardar, 2021 ACM/IEEE Joint Conference on Digital Libraries, .

  13. C Rane, SM Subramanya, DS Endluri, J Wu, CL Giles, ChartReader: Automatic Parsing of Bar-Plots, 2021 IEEE 22nd International Conference on Information Reuse and Integration …, 2021.

  14. SM Jones, V Neblitt-Jones, MC Weiglet, M Klein, ML Nelson, It's All About The Cards: Sharing on Social Media Encouraged HTML Metadata Growth, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 110-119, 2021.

  15. B Mahanama, G Jayawardena, S Jayarathna, Analyzing Unconstrained Reading Patterns of Digital Documents Using Eye Tracking, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 282-283, 2021.

  16. S De Silva, S Dayarathna, G Ariyarathne, D Meedeniya, S Jayarathna, ..., Computational Decision Support System for ADHD Identification, International Journal of Automation and Computing 18 (2), 233-255, 2021.

  17. J Ferdous, S Uddin, V Ashok, Semantic table-of-contents for efficient web screen reading, Proceedings of the 36th Annual ACM Symposium on Applied Computing, 1941-1949, 2021.

  18. W Wang, F Xia, J Wu, Z Gong, H Tong, BD Davison, Scholar2vec: vector representation of scholars for lifetime collaborator prediction, ACM Transactions on Knowledge Discovery from Data (TKDD) 15 (3), 1-19, 2021.

  19. M Gong, X Wei, D Oyen, J Wu, M Gryder, L Yang, Recognizing Figure Labels in Patents., SDU@ AAAI, 2021.

  20. S Alam, MC Weigle, ML Nelson, Profiling Web Archival Voids for Memento Routing, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 150-159, 2021.

  21. YJ Ko, A Putkonen, AS Aydin, S Feiz, Y Wang, V Ashok, IV Ramakrishnan, ..., Modeling Gliding-based Target Selection for Blind Touchscreen Users, Proceedings of the 23rd International Conference on Mobile Human-Computer …, 2021.

  22. D Patel, AC Nwala, ML Nelson, MC Weigle, What Did It Look Like: A service for creating website timelapses using the Memento framework, arXiv preprint arXiv:2104.14041, 2021.

  23. J Wu, S Rohatgi, SRR Keesara, J Chhay, K Kuo, AM Menon, S Parsons, ..., Building an Accessible, Usable, Scalable, and Sustainable Service for Scholarly Big Data, 2021 IEEE International Conference on Big Data (Big Data), 141-152, 2021.

  24. S Rohatgi, CL Giles, J Wu, What Were People Searching For? A Query Log Analysis of An Academic Search Engine, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 342-343, 2021.

  25. SS Teja Lanka, S Rajtmajer, J Wu, CL Giles, Extraction and Evaluation of Statistical Information from Social and Behavioral Science Papers, Companion Proceedings of the Web Conference 2021, 426-430, 2021.

  26. SA Modukuri, SM Rajtmajer, AC Squicciarini, J Wu, CL Giles, Understanding and Predicting Retractions of Published Work., SDU@ AAAI, 2021.

  27. B Kandimalla, S Rohatgi, J Wu, CL Giles, Large scale subject category classification of scholarly papers with deep attentive neural networks, Frontiers in research metrics and analytics 5, 31, 2021.

  28. SM Jones, V Neblitt-Jones, MC Weigle, M Klein, ML Nelson, It's All About The Cards: Sharing on Social Media Probably Encouraged HTML Metadata Growth, arXiv preprint arXiv:2104.04116, 2021.

  29. Y Jayawardana, G Jayawardena, AT Duchowski, S Jayarathna, Metadata-driven eye tracking for real-time applications, Proceedings of the 21st ACM Symposium on Document Engineering, 1-4, 2021.

  30. HN Lee, V Ashok, Towards Enhancing Blind Users' Interaction Experience with Online Videos via Motion Gestures, Proceedings of the 32nd ACM Conference on Hypertext and Social Media, 231-236, 2021.

  31. J Wu, R Nivargi, SST Lanka, AM Menon, SA Modukuri, N Nakshatri, X Wei, ..., Predicting the Reproducibility of Social and Behavioral Science Papers Using Supervised Learning Models, arXiv preprint arXiv:2104.04580, 2021.

  32. M Ling, J Chen, T Möller, P Isenberg, T Isenberg, M Sedlmair, R Laramee, ..., Three benchmark datasets for scholarly article layout analysis, .

  33. S Rohatgi, J Wu, CL Giles, Ranked List Fusion and Re-ranking with Pre-trained Transformers for ARQMath Lab, .

  34. AS Aydin, YJ Ko, U Uckun, IV Ramakrishnan, V Ashok, Non-Visual Accessibility Assessment of Videos, Proceedings of the 30th ACM International Conference on Information …, 2021.

  35. S Jones, M Weigle, M Klein, ML Nelson, Automatically Selecting Striking Images for Social Cards, 13th ACM Web Science Conference 2021, 36-45, 2021.

  36. G Jayawardena, S Jayarathna, J Wu, Analysis of Reading Patterns of Scientific Literature Using Eye-Tracking Measures, .

  37. SM Jones, MC Weigle, ML Nelson, Hypercane: toolkit for summarizing large collections of archived webpages, ACM SIGWEB Newsletter, 1-14, 2021.

  38. A Sefid, J Wu, P Mitra, L Giles, Extractive Research Slide Generation Using Windowed Labeling Ranking, arXiv preprint arXiv:2106.03246, 2021.

  39. AC Nwala, MC Weigle, ML Nelson, Garbage, Glitter, or Gold: Assigning Multi-dimensional Quality Scores to Social Media Seeds for Web Archive Collections, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 80-89, 2021.

  40. K Garg, HR Jayanetti, S Alam, MC Weigle, ML Nelson, Replaying Archived Twitter: When your bird is broken, will it bring you down?, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 160-169, 2021.

  41. N Alipourfard, B Arendt, DJ Benjamin, N Benkler, M Bishop, M Burstein, ..., Systematizing Confidence in Open Research and Evidence (SCORE), SocArXiv, 2021.

  42. G Jayawardena, S Jayarathna, Automated Filtering of Eye Movements Using Dynamic AOI in Multiple Granularity Levels, International Journal of Multimedia Data Engineering and Management (IJMDEM …, 2021.

  43. S Sazzed, S Jayarathna, Ssentia: a self-supervised sentiment analyzer for classification from unlabeled data, Machine Learning with Applications 4, 100026, 2021.

  44. SM Jones, MC Weigle, M Klein, ML Nelson, Hypercane: Intelligent Sampling for Web Archive Collections, 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 316-317, 2021.

In 2021, most conferences were online, but we still issued trip reports for the ones that we "attended":

Research presentations and outreach

2021 brought a mix of online, f2f, and hybrid meetings and outreach.  For example, in between the Delta and Omicron waves, in October Dr. Wu attended the CCI Workshop "Role of Cybersecurity on Curbing the Spread of Disinformation and Misinformation and Disinformation" in Charlottesville, VA, supporting his successful CCI proposal for which he is the PI: "The Acceptance and Effectiveness of Explainable AI-Powered Credibility Labeler for Scientific Misinformation and Disinformation". 

During the summer, Dr. Jayarathna held two workshops on campus:

Dr. Jayarathna repeated the successful "Trick or Research" event, which was virtual again this year.  Other outreach events included:

Software, data sets, services

Scholarly contributions are not limited to conventional publications or presentations: we also advance the state of the art through releasing software, data sets, and proof-of-concept services & demos.  Some of the software, data sets, and services that we either initially released or made significant updates to in the course of 2021 include: 

Other contributions

Our research group made a number of other contributions in 2021 that do not neatly fit into any of the above categories.  Some of the highlights include:

Awards and Recognition

Many members of our group received awards and other forms of recognition:

Funding 

2021 was a much better year for funding.  We were fortunate to have received 14 external grants (> $1.7M) and two internal grants ($11k). We are especially proud of Dr. Sampath Jayarathna winning the prestigious NSF CAREER award!

For internal grants:

  • Drs. Jayarathna & Poursardar as well as others in the CS department received $10k for "U-REACT: Undergraduate Research Experience and Career Training" from the College of Science Undergraduate Research Program Grant (COSURP).
  • Drs. Poursardar & Jayarathna received a $750 Faculty Innovator Grant for "Drone Programming Studio" from the Center for Learning and Teaching (CLT).

2022 -- it can only get better, right? right?!

For most, 2021 was an improvement over 2020 and the benefit of vaccines enabled some travel, f2f meetings, and in-person teaching.  Yet the pandemic continues, and it is becoming less clear when or if there will be a return to how things were before March 2020 (i.e., "normal").  Covid has impacted many members of our research group, and it is likely that in 2022 it will continue to impact more of us. 

Despite the many challenges, 2021 was a good year for us: we graduated four students (1 PhD, 3 MS), but were joined by five graduate and two undergraduate students.  We continued to publish in prestigious venues, and we more than tripled 2020's level of external funding.  Our PhD students continue to get great jobs in academia, government, and industry.  Despite our challenges, we are not just surviving, we are thriving.  I am proud of all members of WSDL and their progress in 2021.

If you would like to join WSDL, please get in touch.  To get a feel for our recent activities, please review our previous WS-DL annual summaries (2020, 2019, 2018, 2017, 2016, 2015, 2014, and 2013) and be sure to follow us at @WebSciDL to keep up to date with publications, software releases, trip reports and other activities.  We especially would like to thank all those who have publicly complimented some aspect of the Web Science and Digital Libraries Research Group: our code, services, papers, or students and faculty.  We really appreciate the feedback, some of which we include below.


--Michael

 

 

Comments