2020-08-30: Google Translate + Stanford NERC produce comparable results to Arabic Linguistic Pipeline (ALP)

Arabic Named Entity Recognition and Classification Named Entity Recognition and Classification (NERC) is very important for many text processing tasks. However, Arabic NERC research is not popular because only three percent of online content is in Arabic . Furthermore, NERC for Arabic is a challenging task due to Arabic's lack of capitalization, multiple types, different writing styles, complex morphology, ambiguity, and lack of resources. There has not been many Arabic NERC systems available for the same reasons. In this post, I evaluate the performance of ALP (Arabic linguistic pipeline), one of the Arabic NERC tools that provides state-of-the-art precision and recall [1] and compare the results to a new approach that relies on Google Translate and one of the rich and mature English NERC tools, Stanford Named Entity Recognizer and Classifier (NERC) . I expected the combination of Google Translate and Stanford NERC to have reduced the precision and recall in comparis