2018-07-18: HyperText and Social Media (HT) Trip Report

Leaping Tiger statue next to the College of Arts at Towson University

From July 9 - 12, the 2018 ACM Conference on Hypertext and Social Media (HT) took place at the College of Arts at Towson University in Baltimore, Maryland. Researchers from around the world presented the results of complete or ongoing work in tutorial, poster, and paper sessions. Also, during the conference I had the opportunity to present a full paper: "Bootstrapping Web Archive Collections from Social Media" on behalf of co-authors Dr. Michele Weigle and Dr. Michael Nelson.

Day 1 (July 9, 2018)

The first day of the conference was dedicated to a tutorial (Efficient Auto-generation of Taxonomies for Structured Knowledge Discovery and Organization) and three workshops:

I attended the Opinion Mining, Summarization and Diversification workshop. The workshop started with a talk titled: "On Reviews, Ratings and Collaborative Filtering," presented by Dr. Oren Sar Shalom, principal data scientist at Intuit, Israel. Next, Ophélie Fraisier, a PhD student studying stance analysis on social media at Paul Sabatier University, France, presented: "Politics on Twitter : A Panorama," in which she surveyed methods of analyzing tweets to study and detect polarization and stances, as well as election prediction and political engagement.

@SyrupType at @ACMHT presenting: "Politics on Twitter: A Panorama" that explores
- the relationship between homophily and echo chambers
- correlation between retweets and political ideology
- (limitations of) election prediction with Twitter#acmht18 pic.twitter.com/nkpF6tA55W
— Alexander C. Nwala (@acnwala) July 9, 2018

Next, Jaishree Ranganathan, a PhD student at the University of North Carolina, Charlotte, presented: "Automatic Detection of Emotions in Twitter Data - A Scalable Decision Tree Classification Method."

@WebSciDL Jaishree Ranganathan presents her decision tree classifier method for assigning emotion labels to tweets.#acmht18 pic.twitter.com/QVZdis4569
— Alexander C. Nwala (@acnwala) July 9, 2018

Finally, Amin Salehi, a PhD student at Arizona State University, presented: "From Individual Opinion Mining to Collective Opinion Mining." He showed how collective opinion mining can help capture the drivers behind opinions as opposed to individual opinion mining (or sentiment) which identifies single individual attitudes toward an item.

@WebSciDL Amin Salehi presents: "From Individual Opinion Mining to Collective Opinion Mining"#acmht18 pic.twitter.com/78z9z0UTN7

— Alexander C. Nwala (@acnwala) July 9, 2018

Day 2 (July 10, 2018)

The conference officially began on day 2 with a keynote: "Lessons in Search Data" by Dr. Seth Stephens-Davidowitz, a data scientist and NYT bestselling author of: "Everybody Lies."

I can now officially present my book, Everybody Lies -- insights from new, internet data. I thank a million people! https://t.co/I3quPp6nw3
— Seth Stephens-Davidowitz (@SethS_D) May 9, 2017

In his keynote, Dr. Stephens-Davidowitz revealed insights gained from search data ranging from racism to child abuse. He also discussed a phenomenon in which people are likely to lie to pollsters (social desirability bias) but are honest to Google ("Digital Truth Serum") because Google incentivizes telling the truth. The paper sessions followed the keynote with two full papers and a short paper presentation.

@SethS_D keynote @ACMHT: African American Turnout was low correlating with @Google searches in mostly African-American areas.#acmht18 pic.twitter.com/jwwXdxYSaV
— Alexander C. Nwala (@acnwala) July 10, 2018

@SethS_D keynote @ACMHT: Not enough researchers are using Google Trends (https://t.co/QqK3kcE9tk)

#acmht18
— Alexander C. Nwala (@acnwala) July 10, 2018

Interesting #ACMHT18 Keynote by Seth Stephens-Davidowitz 'Lessons in Search Data' . Google search data as "digital truth serum" - while reporting of child abuse go down at the recession time, Google search data indicates that real child abuse increases https://t.co/DQQoAotZqB
— Peter Brusilovsky (@peterpaws) July 10, 2018

Opening talk at #acmht18 by @SethS_D:#Google: “Digital Truth Serum”#Facebook: “Digital Brag to My Friends About How Good My Life is Serum” pic.twitter.com/GOVHxX6KRP
— Claus Atzenbeck 🇪🇺 (@clausatz) July 10, 2018

More interesting findings from search data: what is the best age to become fun of a baseball team? #acmht18 pic.twitter.com/ewmRi52ctV
— Peter Brusilovsky (@peterpaws) July 10, 2018

No doubt, the #acmht18 opening talk by @SethS_D is very interesting; quite well done individual studies, nicely presented.

However, it feels more like a research talk rather than a #keynote. Though still interesting, I’d rather hear about a #vision for this area of #research.
— Claus Atzenbeck 🇪🇺 (@clausatz) July 10, 2018

The first (full) paper of day 2 in the Computational Social Science session: "Detecting the Correlation between Sentiment and User-level as well as Text-Level Meta-data from Benchmark Corpora," was presented by Shubhanshu Mishra, a PhD student at the iSchool of the University of Illinois at Urbana-Champaign. He showed correlations between user-level and tweet-level metadata by addressing two questions: "Do tweets from users with similar Twitter characteristics have similar sentiments?" and "What meta-data features of tweets and users correlate with tweet sentiment?"

First paper of Day 2 at @ACMHT by @TheShubhanshu: Detecting the correlation between sentiment and user-as well as text-level meta-data from benchmark corpora, exploring using additional tweet metadata such as rts, follower/following count to predict sentiment.#acmht18 pic.twitter.com/ln3ha61rou
— Alexander C. Nwala (@acnwala) July 10, 2018

Next, Dr. Fred Morstatter presented a full paper: "Mining and Forecasting Career Trajectories of Music Artists," in which he showed that their dataset generated from concert discovery platforms can be used to predict important career milestones (e.g., signing by a major music label) of musicians.

@fredmorstatter at @ACMHT presenting: "Mining and Forecasting Career Trajectories of Music Artists." They researched:

- Forecasting artist success
- Event prediction
- Discovery of important artists and venues

Code: https://t.co/ycz7iWKZrn #acmht18 pic.twitter.com/3GADA8z7RI
— Alexander C. Nwala (@acnwala) July 10, 2018

Next, Dr. Nikolaos Aletras, a research associate at the University College London, Media Futures Group, presented a short paper: "Predicting Twitter User Socioeconomic Attributes with Network and Language Information." He described a method of predicting the occupational class and income of Twitter users by using information extracted from their extended networks.

Can your Twitter data be used to predict your income status? @nikaletras at @ACMHT presenting: "Predicting Twitter User Socioeconomic Attributes with Network and Language Information"#acmht18 pic.twitter.com/mU5fRnZtyM
— Alexander C. Nwala (@acnwala) July 10, 2018

After a break, the Machine Learning session began with a full paper (Best Paper Runner-Up): "Joint Distributed Representation of Text and Structure of Semi-Structured Documents," presented by Samiulla Shaikh, a software engineer and researcher at IBM India Research Labs.

@SamiullaShaikh begins the Machine Learning session at @ACMHT with:
"Joint Distributed Representation of Text and Structure of Semi-Structured
Documents."

He explored the use of vector representation to capture the semantic information of semi-structured documents#acmht18 pic.twitter.com/8DkOpwqt0N

— Alexander C. Nwala (@acnwala) July 10, 2018

Next, Dr. Oren Sar Shalom presented a short paper titled: "As Stable As You Are: Re-ranking Search Results using Query-Drift Analysis," in which he presented the merits of using query-drift analysis for search re-ranking. This was followed by a short paper presentation titled: "Embedding Networks with Edge Attributes," by Palash Goyal, a PhD student at University of Southern California. In his presentation, he showed a new approach to learn node embeddings that uses the edges and associated labels.

Oren Sar Shalom @ @ACMHT presents "As Stable As You Are: Re-ranking Search Results using Query-Drift Analysis:" addressed the query drift problem by:
- effective selection of "promising" query aspects
- considering each document's role with such potential query drift#acmht18 pic.twitter.com/SyaSlbDykd
— Alexander C. Nwala (@acnwala) July 10, 2018

@palashiitkgp at @ACMHT presents: "Embedding Networks with Edge Attributes"#acmht18 pic.twitter.com/TKRNV1IQBB
— Alexander C. Nwala (@acnwala) July 10, 2018

Another short paper presentation (Recommendation System session) by Dr. Oren Sar Shalom followed. It was titled: "A Collaborative Filtering Method for Handling Diverse and Repetitive User-Item Interactions." He presented a collaborative filtering model that captures multiple complex user-item interactions without any prior domain knowledge.

@oren_sarshalom is already presenting his second #acmht18 paper! Now https://t.co/oiUb2qXI8d and previously https://t.co/ylkRpushdP
— Ingmar Weber (@ingmarweber) July 10, 2018

Oren Sar Shalom (again) starts the Recommendation System presentations at @ACMHT: "A Collaborative Filtering Method for Handling Diverse and Repetitive User-Item Interactions."#acmht18 pic.twitter.com/XXduX0G4OR
— Alexander C. Nwala (@acnwala) July 10, 2018

Next, Ashwini Tonge, a PhD student at Kansas State University presented a short paper titled: "Privacy-Aware Tag Recommendation for Image Sharing," in which she presented a means of tagging images on social media in order to improve the quality of user annotations while preserving user privacy sharing patterns.

Can we recommend tags for images considering their privacy to help image-sharing privacy settings?

Ashwini Tonge at @ACMHT presents: "Privacy-Aware Tag Recommendation for Image Sharing"#acmht18 pic.twitter.com/mk9rtP3zFV
— Alexander C. Nwala (@acnwala) July 12, 2018

Coffee break at #acmht18! #FRAvsBEL pic.twitter.com/25y8VpvM31
— Ujwal Gadiraju (@UjLaw) July 10, 2018

#ht2018conf coffee break pic.twitter.com/IyTNroLCC7
— Nuno Nunes (@njn) July 10, 2018

Finally, Palash Goyal presented another short paper titled: "Recommending Teammates with Deep Neural Networks."

The day 2 closing keynote by Leslie Sage, director of data science at DevResults followed after a break that featured a brief screening of the 2018 World Cup semi-final game between France and Belgium. In her keynote, she presented the challenges experienced in the application of big data toward international development.

@palashiitkgp (again) at @ACMHT presents: "Recommending Teammates with Deep Neural Networks." He proposed a recommendation system that makes recommendations of players to maximize performance.#acmht18 pic.twitter.com/tznJp2ZBdX

— Alexander C. Nwala (@acnwala) July 10, 2018

@lesliemsage wraps up day 2 at @ACMHT with a keynote: "Data and Design in International Development." She explored the scope of challenges, approaches and remaining hurdles in applying big data toward international development.#acmht18 pic.twitter.com/vFcnXHxmBG
— Alexander C. Nwala (@acnwala) July 10, 2018

Day 3 (July 11, 2018)

Day 3 of the conference began with a keynote: "Insecure Machine Learning Systems and Their Impact on the Web" by Dr. Ben Zhao, Neubauer Professor of Computer Science at University of Chicago. He highlighted many milestones of machine learning by showing problems they have solved in natural language processing and computer vision. But showed that opaque machine learning systems are vulnerable to attack by agents with malicious intents, and he expressed the idea that these critical issues must be addressed especially given the rush to deploy machine learning systems.

@ravenben starts day 3 at @ACMHT: "Insecure Machine Learning Systems and Their Impact on the Web:" neural networks are doing things that we thought were exclusive to humans

- Explores the impact of machine learning on information/(dis)misinformation on the web#acmht18 pic.twitter.com/l1RWt5CV59
— Alexander C. Nwala (@acnwala) July 11, 2018

Kick off at #acmht18 day 2 - and it’s into fake news land pic.twitter.com/91BgFKUfGY
— David Millard (@hoosfoos) July 11, 2018

Identify the teacher model. #acmht18 pic.twitter.com/R2YBsyIhLF
— Shubhanshu Mishra (@TheShubhanshu) July 11, 2018

@ravenben’s conclusion about #MachineLearning at #acmht18:

→ Deep learning a powerful tool for both sides
→ Worrisome implications for journalists
→ Need more attention on security side on the Web

His paper can be found at the ACM DL:https://t.co/QcWLnwDEdk pic.twitter.com/2r4kAphSLg
— Claus Atzenbeck 🇪🇺 (@clausatz) July 11, 2018

Following the keynote, I present our full paper: "Bootstrapping Web Archive Collections from Social Media" in the Temporal session. I highlighted the importance of web archive collections as a means of preserving the historical record of important events, and the seeds (URLs) from which they are formed. The seeds are collected by experts curators, but we do not have enough experts to collect seeds in a world of rapidly unfolding events. Consequently, I proposed exploiting the collective domain expertise of web users by generating seeds from social media collections and showed through a novel suite of measures, that seeds generated from social media are similar to those generated by experts.

Generate seed uri from social media. Uses @Wikipedia reference as seed urls. #acmht18 @ACMHT
Uses @Twitter moments to extract seeds along with @Storify and @reddit . pic.twitter.com/bowjwGsA9K
— Shubhanshu Mishra (@TheShubhanshu) July 11, 2018

Next, Paul Mousset, a PhD student at Paul Sabatier University, presented a full paper: "Studying the Spatio-Temporal Dynamics of Small-Scale Events in Twitter," in which he presented his work into the granular identification and characterization of event types on Twitter.

A nice exploratory of small-scale events in Twitter - by @PaulMousset @PitYo and @LyndaTamine - https://t.co/g26CthY5XX #acmht18 https://t.co/UDg9Oak97B
— Laure Soulier (@LaureSoulier) July 11, 2018

Next, Dr. Nuno Moniz, invited Professor at the Sciences College of the University of Porto, presented a short paper: "The Utility Problem of Web Content Popularity Prediction." He demonstrated that state-of-the-art approaches for predicting web content popularity have been optimized for improving the predictability of average behavior of data: items with low levels of popularity.

@nunompmoniz at @ACMHT presents: "The Utility Problem of Web Content Popularity Prediction"

Paper: https://t.co/CDyyaegENx
Code: https://t.co/LBWkWEeILY #acmht18 pic.twitter.com/jq9lQGztes
— Alexander C. Nwala (@acnwala) July 11, 2018

Next, Samiulla Shaikh (again), presented the first full paper (Nelson Newcomer Award winner) of the Semantic session: "Know Thy Neighbors, and More! Studying the Role of Context in Entity Recommendation," in which he showed how to efficiently explore a knowledge graph for the purpose of entity recommendation by utilizing contextual information to help in the selection of a subset of a entities in a knowledge graph.

Given Steve Jobs (entity) and the Pixar (context) what are the related entities?@SamiullaShaikh at @ACMHT presents: "Know Thy Neighbors, and More! Studying the Role of Context in Entity Recommendation"

Paper: https://t.co/YXzgcvOAlq #acmht18 pic.twitter.com/4pktFoXHKi
— Alexander C. Nwala (@acnwala) July 11, 2018

Samiulla Shaikh (again), presented a short paper: "Content Driven Enrichment of Formal Text using Concept Definitions and Applications," in which he showed a method of making formal text more readable to non-expert users by text enrichment e.g., highlighting definitions and fetching of definitions from external data sources.

How can you make formal text more readable to non-expert users?@SamiullaShaikh (again) at @ACMHT presents: "Content Driven Enrichment of Formal Text Using Concept Definitions and Applications"#acmht18 pic.twitter.com/r1WvAxOPCs
— Alexander C. Nwala (@acnwala) July 11, 2018

Next, Yihan Lu, a PhD student at Arizona State University, presented a short paper: "Modeling Semantics Between Programming Codes and Annotations." He presented the results from investigating a systematic method to examine annotation semantics and its relationship with source codes. He also showed their model which predict concepts in programming code annotation. Such annotations could be useful to new programmers.

Yihan Lu at @ACMHT presents: "Modeling Semantics Between Programming Codes and Annotations," which attempt so help new programmers learn from examples and connect code with linguistic features in annotation#acmht18 pic.twitter.com/q5tD66eae9
— Alexander C. Nwala (@acnwala) July 11, 2018

Following a break, the User Behavior session began. Dr. Tarmo Robal, a research scientist at the Tallinn University of Technology, Estonia, presented a full paper: "IntelliEye: Enhancing MOOC Learners' Video Watching Experience with Real-Time Attention Tracking." He introduced IntelliEye, a system that monitors students watching video lessons and detects when they are distracted and intervenes in an attempt to refocus their attention.

Can loss of attention when watching video lessons be automatically detected and interventions activated (visual and audio)?

Tarmo Robal at @ACMHT presents "IntelliEye: Enhancing MOOC Learners' Video Watching Experience with RealTime Attention Tracking"#acmht18 pic.twitter.com/g10MmIFpxj
— Alexander C. Nwala (@acnwala) July 11, 2018

More links on the IntelliEye presentation at #acmht18: https://t.co/r2xsj2cvO4 https://t.co/CeTuleIxlT https://t.co/J7VSABHDbc By @CharlotteHase et al.
— Ingmar Weber (@ingmarweber) July 11, 2018

Next, Dr. Ujwal Gadiraju, a postdoctoral researcher at L3S Research Center, Germany, presented a full paper: "SimilarHITs: Revealing the Role of Task Similarity in Microtask Crowdsourcing." He presented his findings from investigating the role of task similarity in microtask crowdsourcing on platforms such as Amazon Mechanical Turk and its effect on market dynamics.

@UjLaw at @ACMHT presents: "SimilarHITs: Revealing the Role of Task Similarity in Microtask Crowdsourcing"

Paper: https://t.co/hpOfl8ZlF0 #acmht18 pic.twitter.com/LOssjqqRMy
— Alexander C. Nwala (@acnwala) July 11, 2018

Next, Xinyi Zhang, a computer science PhD candidate at UC Santa Barbara, presented a short paper: "Penny Auctions are Predictable: Predicting and profiling user behavior on DealDash." She showed that penny auction sites such as DealDash are vulnerable to modeling and adversarial attacks by showing that both the timing and source of bids are highly predictable and users can be easily classified into groups based on their bidding behaviors.

Are there identifiable patterns in penny auctions? (yes)
What are the common strategies used in penny auctions?

Xinyi Zhang at @ACMHT presents: "Penny Auctions are Predictable: Predicting and profiling user behavior on DealDash"#acmht18 pic.twitter.com/BLVRtJ6OdH
— Alexander C. Nwala (@acnwala) July 11, 2018

Shortly after another break, the Hypertext paper sessions began. Dr. Charlie Hargood, senior lecturer at Bournemouth University, UK and Dr. David Millard, associate Professor at the University of Southampton, UK, presented a full paper: "The StoryPlaces Platform: Building a Web-Based Locative Hypertext System." They presented StoryPlaces, an open source authoring tool designed for the creation of locative hypertext systems.

@drchargood & @hoosfoos at @ACMHT present: "The StoryPlaces Platform: Building a Web-Based Locative Hypertext System," where they introduced StoryPlaces - a locative hypertext platform and authoring tool

Website: https://t.co/UsLbCmsJkT
Paper: https://t.co/qk7eOyjh2N #acmht18 pic.twitter.com/qevwxaxJlp
— Alexander C. Nwala (@acnwala) July 11, 2018

Giving a conference presentation during an English World Cup semifinal must be some sort of test #acmht18 but I think we just about managed it pic.twitter.com/7w7N6lI3LI
— David Millard (@hoosfoos) July 11, 2018

Next, Sharath Srivatsa, a Masters student at International Institute of Information Technology, India, presented a full paper: "Narrative Plot Comparison Based on a Bag-of-actors Document Model." He presented an abstract "bag-of-actors" document model for indexing, retrieving, and comparing documents based on their narrative structures. The model resolves the actors in the plot and their corresponding actions.

Sharath Srivatsa at @ACMHT presents: "Narrative Plot Comparison Based on a Bag-of-actors Document Model"#acmht18 pic.twitter.com/W3uOwAa7oe
— Alexander C. Nwala (@acnwala) July 11, 2018

Next, Dr. Claus Atzenbeck, professor at Hof University, Germany, presented a short paper: "Mother: An Integrated Approach to Hypertext Domains." He stated that the Dexter Hypertext Reference Model which was developed to provide a generic model for node-link hypertext systems does not match the need of Component-Based Open Hypermedia Systems (CB-OHS), and proposed how this can be remedied by introducing Mother, a system that implements link support.

Today’s final #acmht18 session is entitled “#Hypertext”. I’m about to present our work “Mother – An Integrated Approach to Hypertext Domains” (co-authored by @dnlrssnr & @tzagara). The paper is available as #OpenAccess at the ACM DL:https://t.co/cAVurtOM2L
— Claus Atzenbeck 🇪🇺 (@clausatz) July 11, 2018

It wouldn’t be #acmht18 without these three (brought to you by @clausatz ) pic.twitter.com/1F5CcOkkvt
— David Millard (@hoosfoos) July 11, 2018

The final (short) paper of the day, "VAnnotatoR: A Framework for Generating Multimodal Hypertexts," was presented by Giuseppe Abrami. He introduced a virtual reality and augmented reality framework for generating multimodal hypertexts called VAnnotatoR. The framework enables the annotation and linkage of texts, images and their segments with walk-on-able animations of places and buildings.

Last of day 3 but not least at @ACMHT, presentation from Giuseppe Abrami: "VAnnotatoR: A Framework for Generating Multimodal Hypertexts"#acmht18 pic.twitter.com/gJFma4Of9N
— Alexander C. Nwala (@acnwala) July 11, 2018

Wow, the second Open Hypermedia Systems seen in the wild in one day #acmht18 pic.twitter.com/ge9V7Ke6gf
— David Millard (@hoosfoos) July 11, 2018

The conference banquet at Rusty Scupper followed the last paper presentation. The next HyperText conference was announced at the banquet.

Banquet location for #acmht18 pic.twitter.com/1OElzjyaEm
— Ingmar Weber (@ingmarweber) July 11, 2018

I‘m pleased to announce that the next ACM Hypertext conference 2019 (#acmht19) will take place at #HofUniversity, Germany.https://t.co/kmacR2LG2T @ACMHT @iisys_de #acmht18 pic.twitter.com/mBBre1UzVH
— Claus Atzenbeck 🇪🇺 (@clausatz) July 12, 2018

Day 4 (July 12, 2018)

The final day of the conference featured multiple papers presentations such as:

Intelligent Generative Locative Hyperstructure (Hargood et al.)
As We May Hear: Our Slaves Of Steel II (Bernstein)
A Villain's Guide to Social Media and Web Science (Bernstein and Cooper)

The day began with a keynote "The US National Library of Medicine: A Platform for Biomedical Discovery and Data-Powered Health," presented by Elizabeth Kittrie, strategic advisor for data and open science at the National Library of Medicine (NLM). She discussed the role the NLM serves such as provider of health data for biomedical research and discovery. She also discussed the challenges that arise from the rapid growth of biomedical data, shifting paradigms of data sharing, as well as the role of libraries in providing access to digital health information.

By 2025, the total amount of genomics data alone is expected to equal or exceed totals from the three major producers of large amounts of data: Astronomy, @YouTube, and @Twitter - @NIH Strategic Plan for Data Science (2018-06)

Elizabeth Kittrie (@nlm_news) at @ACMHT #acmht18 pic.twitter.com/c1LiPfmyJi
— Alexander C. Nwala (@acnwala) July 12, 2018

.@KittrieE of @nlm_news presents the "All of Us" initiative: https://t.co/hej9vQa1fR https://t.co/192tHdLIp8 #acmht18
— Ingmar Weber (@ingmarweber) July 12, 2018

The Privacy session of exclusively full papers followed the keynote. Ghazaleh Beigi, a PhD student at Arizona State University presented: "Securing Social Media User Data - An Adversarial Approach." She showed a privacy vulnerability that arises from the anonymization of social media data by demonstrating an adversarial attack specialized for social media data.

Are structural and textual anonymization of social media data sufficient? (no because of the heterogeneity of social media data)

Ghazaleh Beigi at @ACMHT presents: "Securing Social Media User Data - An Adversarial Approach"#acmht18 pic.twitter.com/Ih57m3iv4e
— Alexander C. Nwala (@acnwala) July 12, 2018

Next, Mizanur Rahman, a PhD student at Florida International University, presented: "Search Rank Fraud De-Anonymization in Online Systems." The bots and automatic methods session with two full paper presentations followed.

Find the crowdsourcing fraudsters who control user accounts that posted fraudulent activities for a product.

Mizanur Rahman at @ACMHT presents: "Search Rank Fraud De-Anonymization in Online Systems"

Paper: https://t.co/2ocA61uaZr
Code: https://t.co/LqhHeNK9pi #acmht18 pic.twitter.com/P2xe8p8yzJ
— Alexander C. Nwala (@acnwala) July 12, 2018

Diego Perna, a researcher at the University of Calabria, Italy, presented: "Learning to Rank Social Bots." Given recent reports about the use of bots to spread misinformation/disinformation on the web in order to sway public opinion, Diego Perna proposed a machine-learning framework for identifying and ranking online social network accounts based on their degree similarity to bots.

Diego Perna presents "Learning to rank social bots" at @ACMHT #acmht18 pic.twitter.com/p5YN0VqQXV
— Alexander C. Nwala (@acnwala) July 12, 2018

Next, David Smith, a researcher at University of Florida, presented: "An Approximately Optimal Bot for Non-Submodular Social Reconnaissance." He showed that studies that show how social bots befriend real users as part of an effort to collect sensitive information operate with the premise that the likelihood of users accepting bot friend requests is fixed, a constraint contradicted by empirical evidence. Subsequently, he presented his work which addressed this limitation.

David Smith at @ACMHT presents: "An Approximately Optimal Bot for Non-Submodular Social Reconnaissance"#acmht18 pic.twitter.com/Ok20ek5a2X
— Alexander C. Nwala (@acnwala) July 12, 2018

The News session began shortly after a break with a full paper (Best Paper Award) presentation from Lemei Zhang, a PhD candidate from Norwegian University of Science and Technology: "A Deep Joint Network for Session-based News Recommendations with Contextual Augmentation." She highlighted some of the issues news recommendation system suffer such as fast updating rate of news articles and lack of user profiles. Next, she proposed a news recommendation system that combines user click events within sessions and news contextual features to predict the next click behavior of a user.

Lemei Zhang begins the News session at @ACMHT with: "A Deep Joint Network for Session-based News Recommendations with Contextual Augmentation"#acmht18 pic.twitter.com/8dIX5Q0CY8
— Alexander C. Nwala (@acnwala) July 12, 2018

Next, Lucy Wang, senior data scientist at Buzzfeed, presented a short paper: "Dynamics and Prediction of Clicks on News from Twitter."

Lucy Wang at @ACMHT introduces a minimalist click prediction model that only uses publicly available, aggregated data from the first hour of a link's lifecyle: "Dynamics and Prediction of Clicks on News from Twitter"#acmht18 pic.twitter.com/fcuwMBNQmp
— Alexander C. Nwala (@acnwala) July 12, 2018

Next, Sofiane Abbar, senior software/research engineer at Qatar Computing Research Institute, presented via a YouTube video: "To Post or Not to Post: Using Online Trends to Predict Popularity of Offline Content." He proposed a new approach for predicting the popularity of news articles before they are published. The approach is based on observations regarding the article similarity and topicality and complements existing content-based methods.

Can we predict the popularity of news stories before they go online?@SofianeAbbar at @ACMHT presents via video: "To Post or Not to Post: Using Online Trends to Predict Popularity of Offline Content"

Video: https://t.co/gYFHy1CTR6
Questions? Post comment below video#acmht18 pic.twitter.com/SIX9hmkdyp
— Alexander C. Nwala (@acnwala) July 12, 2018

Next, two full papers (Community Detection session) where presented by Ophélie Fraisier and Amin Salehi. Ophélie Fraisier presented: "Stance Classification through Proximity-based Community Detection." She proposed the Sequential Community-based Stance Detection (SCSD) model for stance (online viewpoints) detection. It is a semi-supervised ensemble algorithm which considers multiple signals that inform stance detection. Next, Amin Salehi presented: "Sentiment-driven Community Profiling and Detection on Social Media." He presented a method of profiling social media communities based on their sentiment toward topics and proposed a method of detecting such communities and identifying motives behind their formation.

Account for textual, social, and geographic proximities in order to detect stances.@SyrupType at @ACMHT presents: "Stance Classification through Proximity -based Community Detection"#acmht18 pic.twitter.com/eKINrghjUh
— Alexander C. Nwala (@acnwala) July 12, 2018

Use sentiments (positive & negative) to identify and profile social media communities.

Amin Salehi at @ACMHT presents: "Sentiment-driven Community Profiling and Detection on Social Media"#acmht18 pic.twitter.com/3IU1iX9ZcB
— Alexander C. Nwala (@acnwala) July 12, 2018

For pictures and notes complementary to this blogpost see Shubhanshu Mishra's notes.

I would like to thank the organizers of the conference, the hosts, Towson University College of Arts, as well as IMLS for funding our research.

-- Nwala (@acnwala)

Search This Blog

Web Science and Digital Libraries Research Group