Posts

Showing posts from August, 2020

2020-08-30: Google Translate + Stanford NERC produce comparable results to Arabic Linguistic Pipeline (ALP)

Image
Arabic Named Entity Recognition and Classification Named Entity Recognition and Classification (NERC) is very important for many text processing tasks. However, Arabic NERC research is not popular because  only three percent of online content is in Arabic . Furthermore, NERC for Arabic is a challenging task due to Arabic's lack of capitalization, multiple types, different writing styles, complex morphology, ambiguity, and lack of resources. There has not been many Arabic NERC systems available for the same reasons. In this post, I evaluate the performance of  ALP  (Arabic linguistic pipeline), one of the Arabic NERC tools that provides state-of-the-art precision and recall  [1] and compare the  results  to a new approach that relies on  Google Translate  and one of the rich and mature English NERC tools,  Stanford Named Entity Recognizer and Classifier (NERC) . I expected the combination of  Google Translate  and  Stanford NERC  to have reduced the precision and recall in comparis

2020-08-27: A 25 Year Retrospective on D-Lib Magazine

Image
Authors’ note: This document is also available as https://arxiv.org/abs/2008.11680 .  In this HTML version, the footnotes were converted to either hyperlinks or endnotes.  A 25 Year Retrospective on  D-Lib Magazine Michael L. Nelson  Old Dominion University  Norfolk, VA USA  mln@cs.odu.edu Herbert Van de Sompel Data Archiving and Networked Services  The Hague, Netherlands  herbert.van.de.sompel@dans.knaw.nl Abstract In July, 1995 the first issue of D-Lib Magazine was published as an on-line, HTML-only, open access magazine, serving as the focal point for the then emerging digital library research community.  In 2017 it ceased publication, in part due to the maturity of the community it served as well as the increasing availability of and competition from eprints, institutional repositories, conferences, social media, and online journals -- the very ecosystem that D-Lib Magazine nurtured and enabled.  As long-time members of the digital library community