Posts

2019-04-01: Creating a data set for 116th Congress Twitter handles

Senators from Alabama in the 115th Congress

Any researcher studying Twitter and the US Congress might think, "How hard could it be to create a data set of Twitter handles for the members of Congress?" At any given time, we know how many members the US Congress has and who they are. Creating such a data set might therefore seem like an easy task, but it turns out to be a lot more challenging than expected. We present the challenges involved in creating a data set of Twitter handles for the members of the 116th US Congress and provide a data set of Twitter handles for the 116th US Congress.

Brief about the US Congress

The US Congress is a bicameral legislature comprising the Senate and the House of Representatives. The Congress consists of: 100 senators, two from each of the fifty states. 435 representatives, with seats distributed by population across the f

2019-03-18: Cookie Violations Cause Archived Twitter Pages to Simultaneously Replay in Multiple Languages

Figure 1: Mixed language blocks on a memento of a Twitter timeline. Blue boxes highlight Portuguese, orange English, and red Urdu. Dotted borders indicate the template present in the original HTML response, while solid borders indicate lazily loaded content.

Would you be surprised if I told you that Twitter is a multilingual website, supporting 47 different international languages? How about if I told you that a usual Twitter timeline page can contain tweets in whatever languages the owner of the handle chooses to tweet in, but can also show the navigation bar and various sidebar blocks in many different languages simultaneously? Surprised now? Well, while it makes no sense, it may actually happen in web archives when a memento of a timeline is accessed, as shown in Figure 1. Spoiler alert! Cookies are to blame, once again. Last month, I was investigating a real-life version of "Ron Burgundy will read anything on the