Language-Independent Named Entity Recognition at CoNLL-2003
Notes: This dataset is a manual annotation of a subset of
RCV1 (Reuters Corpus Volume 1).
The annotation itself is available free of charge from the CoNLL site,
subject to a licensing agreement. The raw text of the RCV1 documents must be
requested from NIST (also free of charge and also subject to a licensing agreement).
Keywords: Computational Linguistics,
Natural Language Processing, NLP,
Natural Language Understanding, Natural Language Analysis,
Natural Language Generation, Information Retrieval, IR,
Artificial Intelligence, AI,
Machine Learning, Corpus Linguistics, Algorithm Design,
Text Mining, Text Data Mining, Named Entity Recognition,
Disambiguation