
Automatic Annotation of Bibliographical References with Target Language

Sourcetitle: Coling 2008: Proceedings of MMIES-2: Workshop on Multi-source, Multilingual Information Extraction and Summarization; August 2008, Manchester
Year of publication: 2008
PublicationType: Conference paper - peer reviewed

In a large-scale project to list bibliographical references to all of the approximately 7,000 languages of the world, the need arises to automatically annotate the bibliographical entries with ISO 639-3 language identifiers. The task can be seen as a special case of a more general Information Extraction problem: to classify short text snippets in various languages into a large number of classes. We will explore supervised and unsupervised approaches motivated by distributional characteristics of the specific domain and availability of data sets. In all cases, we make use of a database with language names and identifiers. The suggested methods are rigorously evaluated on a fresh representative data set.
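
To make the task concrete, here is a minimal sketch of the simplest conceivable baseline: looking up language names from a name-to-identifier database in the title of a bibliographical entry. It is not the method of the paper, whose supervised and unsupervised approaches are not detailed in this abstract. The table `LANGUAGE_NAMES` and the function `annotate` are hypothetical names introduced for illustration, and the five entries stand in for a full database.

```python
import re

# Hypothetical miniature of a language-name database (name -> ISO 639-3 code).
# A real database would cover thousands of languages and alternate names.
LANGUAGE_NAMES = {
    "basque": "eus",
    "hausa": "hau",
    "navajo": "nav",
    "yoruba": "yor",
    "zulu": "zul",
}


def annotate(entry_title):
    """Return the sorted ISO 639-3 codes whose language names occur in the title.

    Assumption: language names appear as single, literally spelled words.
    Multi-word names, spelling variants and ambiguous names are not handled.
    """
    # Split the lowercased title into runs of letters.
    tokens = re.findall(r"[^\W\d_]+", entry_title.lower())
    return sorted({LANGUAGE_NAMES[t] for t in tokens if t in LANGUAGE_NAMES})


print(annotate("A Grammar of Hausa"))
print(annotate("Zulu and Yoruba tonology: a comparison"))
```

A lookup of this kind returns nothing for titles that never name the language, which is one reason the task is framed as classification of short snippets rather than plain string matching.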

http://www.aclweb.org/anthology/W08-1409

Sourcepages: 57-64

Page updated: 2012-01-30 14:01
