Saturday, August 28, 2010

2010-08-28: A Lookup for Nicknames and Diminutive Names

I created a simple lookup file that contains United States given names (first names) and their associated nicknames or diminutive names. For example, "gregory" -> "greg", or "geoffrey" -> "geoff". The file can be downloaded and contributed to from here.

This lookup was started from a source that is used for genealogy purposes. It was a good source to start from, but because of its genealogy focus there are some pretty old names in there. There was also a significant effort to make it machine readable, i.e. separating names with commas and removing human-readable conventions like "rickie(y)", which was split into two different names, "rickie" and "ricky".
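As a rough illustration of that cleanup step, here is a minimal Python sketch. It is not the script used to build the file. It assumes the only convention to expand is a trailing "ie(y)", as in the "rickie(y)" example; the function names and the "richard" row in the usage note are made up for illustration.

```python
import re

# Assumption: the only human-readable convention handled here is a
# trailing "ie(y)", meaning the name is spelled with either ending.
_IE_Y = re.compile(r"^(?P<stem>[a-z]+)ie\(y\)$")


def expand_variants(token):
    """Return the plain spellings that one raw entry stands for."""
    token = token.strip().lower()
    match = _IE_Y.match(token)
    if match:
        stem = match.group("stem")
        return [stem + "ie", stem + "y"]
    return [token]


def to_csv_row(given_name, raw_nicknames):
    """Build one comma-separated line: the given name, then its nicknames."""
    names = [given_name.strip().lower()]
    for raw in raw_nicknames:
        for variant in expand_variants(raw):
            if variant not in names:  # drop duplicates, keep first-seen order
                names.append(variant)
    return ",".join(names)
```

Calling `to_csv_row("richard", ["rick", "rickie(y)"])` with that made-up input would give the line `richard,rick,rickie,ricky`.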

This is a large list with about 700 entries. Any help from people to clean this list up and add to it is greatly appreciated. Think of it as a wiki where you can contribute or change it as needed.  CSV was the easiest format to use. Maybe I'll release this in XML or something later, or maybe a kind soul who uses this library wants to contribute another format they converted it into?
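For anyone who wants to use the CSV from code, here is a minimal Python sketch of reading it into a dictionary. The row layout is an assumption (given name first, its nicknames after it on the same line), the function names are hypothetical, and the sample data is just the two examples from this post rather than the real file.

```python
import csv
import io
from collections import defaultdict

# Two sample rows taken from the examples in this post; the layout
# (given name first, nicknames after) is an assumption about the file.
SAMPLE = "gregory,greg\ngeoffrey,geoff\n"


def load_nicknames(handle):
    """Map each given name to the set of nicknames listed on its row."""
    lookup = defaultdict(set)
    for row in csv.reader(handle):
        fields = [field.strip().lower() for field in row if field.strip()]
        if len(fields) < 2:
            continue  # skip blank rows and names without any nickname
        lookup[fields[0]].update(fields[1:])
    return dict(lookup)


def given_names_for(lookup, nickname):
    """Reverse lookup: every given name that lists this nickname."""
    nickname = nickname.strip().lower()
    return sorted(name for name, nicks in lookup.items() if nickname in nicks)


lookup = load_nicknames(io.StringIO(SAMPLE))
```

To read the downloaded file instead of the sample, pass an open file handle, e.g. `open(path, newline="")`, where `path` is wherever you saved it.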

I was rather surprised that I couldn't find anything like this on the web.  The best I could find was the pdNickname database and it costs $500.  So, I created my own and released it as open source so that others could benefit from my work.


Wednesday, August 18, 2010

2010-08-18: Fall 2010 Classes

There will be two WS-DL classes offered for Fall 2010. CS 418/518 "Web Programming" will be taught by Martin Klein, but it will be similar in format and content to prior offerings, especially with respect to the focus on LAMP. This class involves significant programming, developing a single project throughout the semester. It is a good complement to CS 495/595 "Web Server Development", which was last taught by Martin in Spring 2010. 2010-08-30 edit: The class page for CS 418/518 is now available.

I will teach CS 895 "Time on the Web", a new class that will explore the issues of Web resources evolving through time and how we interact with them. Aside from the canonical background readings, we will focus on current and recent projects such as our own Memento & Synchronicity, as well as OAC, Zoetrope, The Re:Search Engine, ADAPT, Past Web Browser, and other projects and papers to be determined. This class will be heavily oriented to research and will require the students to explore and investigate topics on their own, develop prototypes, and present the results to the rest of the class.

I'll update this entry when class pages are available.

2010-08-30 edit: CS 895 will begin on September 8th (not Sept 1), 4:20-7:00 PM, r. 3316.
2010-09-08 edit: the CS 895 class page is now available.