Posts

2019-09-09: Information Reuse & Integration for Data Science (IRI) 2019 Trip Report

The 20th IEEE Information Reuse and Integration for Data Science (IRI) 2019 was held in Los Angles, CA this year. Given the emerging global Information-centric IT landscape that has tremendous social and economic implications, effectively processing and integrating humungous volumes of information from diverse sources to enable effective decision making and knowledge generation have become one of the most significant challenges of current times. Information Reuse and Integration for Data Science (IRI) seeks to maximize the reuse of information by creating simple, rich, and reusable knowledge representations and consequently explores strategies for integrating this knowledge into systems and applications. The IEEE IRI conference serves as a forum for researchers and practitioners from academia, industry, and government to present, discuss, and exchange ideas that address real-world problems with real-world solutions. Theoretical and applied papers are both included. The conference progr

2019-09-09: Introducing sumgram, a tool for generating the most frequent conjoined ngrams

Image
Comparison of top 20 (first column) bigrams, top 20 (second column) six-grams, and top 20 (third column) sumgrams (conjoined ngrams) generated by sumgram for a collection of documents about the 2014 Ebola Virus Outbreak . Proper nouns of more than two words (e.g., "centers for disease control and prevention") are split when generating bigrams, sumgram strives to remedy this. Generating six-grams surfaces non-salient six-grams. Click image to expand. A Web archive collection consists of groups of webpages that share a common topic e.g., “Ebola virus” or “Hurricane Harvey.” One of the most common tasks involved in understanding the "aboutness" of a collection is generating the top k (e.g., k = 20) ngrams. For example, given a collection about Ebola Virus , we could generate the top 20 bigrams as presented in Fig. 1. This simple operation of calculating the most frequent bigrams unveils useful bigrams that help us understand the focus of the collection, and m

2019-08-07: Invited Talk at ODU CS Summer Research Workshop: Introducing EEG in Data Science

The Department of Computer Science at ODU has been conducting summer workshops over the past few years for selected undergraduate student groups from India. During this period, they are provided with on-premise accommodation and are arranged to work closely with research groups. Researchers from various groups at ODU present their work and ongoing research to them. Some students who participated in this program in the past have already joined graduate programs at ODU. Ajay Gupta, the Director of Computer Resources at the Department of Computer Science, ODU plays a significant role in conducting this event. Overall, the program has been a great encouragement for students to engage in research. Just shared my summer workshop presentation "Introduction to Data Science with EEG" on SlideShare. @WebSciDL https://t.co/MHuyU4k8dO — Yasith Jayawardana (@yasithmilinda) August 7, 2019 This year, the summer workshop group comprised of 25 undergraduates from three universities:

2019-09-05: How to Become a Tenure-Track Assistant Professor - Part II (job ads, CV, teaching and research statement, LOR and cover letter)

Image
This is a three-part write-up, in this second post, I’ll talk about how to find tenure-track positions, how to shortlist your target schools, CV, teaching statement, research statement and cover letters. I’ll do another blog post later about, how to prepare for interviews (skype/phone, onsite), what to do and not to do during your on-campus interviews, offer negotiations, two-body problem etc.  How to Become a Tenure-Track Assistant Professor - Part I (publications, research, teaching and service)  How to Become a Tenure-Track Assistant Professor - Part II (job ads, CV, teaching and research statement, LOR and cover letter)  How to Become a Tenure-Track Assistant Professor - Part III (interview prep, on-campus interview, offer negotiations, two-body problem)  Where to find jobs:  There are number of options for you to find a job advertisement for a tenure-track positions (or non-tenure track teaching positions and postdoc opportunities). I primarily used the ACM and CRA fo

2019-09-04: Invited Talk at ODU CS Summer Research Workshop: Eye Tracking for Predicting ADHD

Image
This summer 2019, ten students from B.N.M Institute of technolog y , fourteen students from  Acharya Institute of Technology , and one student from  Ramaiah Institute of Technology participated for the Summer Research Workshop organized by  Ajay Gupta  and the CS department at ODU. Over the past few years, this workshop has enabled participants to collaborate with various research groups and join ODU for graduate degrees. One of the main goals of this annual workshop is to  encourage the undergraduate students to actively engage in research activities. ⁦ @Gavindya2 ⁩ presenting about Eye Tracking in predicting ADHD. At 2120, E&CS Building pic.twitter.com/q5bJ9LuX4Q — Yasith Jayawardana (@yasithmilinda) July 12, 2019 I was invited to give a talk in one of the session on the topic of "Eye Tracking for Predicting ADHD".  The slides are available at:  https://www.slideshare.net/GavindyaJayawardena/eye-tracking-for-predicting-adhd . I was able to make it in

2019-09-02: So Long, and Thanks for All the Frogs

Image
Mat Kelly has received his PhD. This is the Final Blog Post                                                                                                                                                                                                                                                                                                                                                                           ⓖⓞⓖⓐⓣⓞⓡⓢ On May 7th, 2019, after a very long trek as a PhD student, I successfully defended my dissertation, "Aggregating Private and Public Web Archives Using the Mementity Framework" ( slides ). The tome (physical height still to be determined), originally titled, " A Framework for Aggregating Private and Public Web Archives " consisted of exactly that and a bit more. The crux of the work was originally presented in the best-paper-nominated paper with the latter named (hence the change) at JCDL 2018 ( arXiv ). The extended version addressed is

2019-08-30: Where did the archive go? Part 1: Library and Archives Canada

Image
Web archives are established with the objective of providing permanent access to archived web pages, or mementos. However, in our 14-month study of 16,627 mementos from 17 public web archives, we found that three web archives changed their base URLs and did not leave a machine readable method of locating their new URLs.  We were able to manually discover the three new URLs for the archives. A fourth archive has partially ceased operations. (1) Library and Archives Canada ( collectionscanada.gc.ca ) Around  May 2018 , mementos in this archive were moved to a new archive ( webarchive.bac-lac.gc.ca ) which has a different domain name. We noticed that 49 mementos (out of 351) can not be found in the new archive. (2)  The  National Library of Ireland (NLI)   Around  May 2018,  the  European Archive ( europarchive.org )  was shut down and the domain name was purchased by another entity. The National Library of Ireland (NLI)  collection preserved by this archive was moved to anothe