• Home
  • Semantic Relation Mining of Solid Compounds in Medical Corpora.

Semantic Relation Mining of Solid Compounds in Medical Corpora.

Sourcetitle: 
Proceedings of the 21th Conference on the European Federation for Medical Informatics (MIE 2008)
Year of publication: 
2008
PublicationType: 
Conference paper - peer reviewed

In the context of scientific and technical texts, meaning is usually embedded in noun compounds and the semantic interpretation of these compounds deals with the detection and semantic classification of the relation that holds between the compound’s constituents. Semantic relation mining, the technology applied for marking up, interpreting, extracting and classifying relations that hold between pairs of words, is an important enterprise that contribute to deeper means of enhancing document understanding technologies, such as Information Extraction, Question Answering, Summarization, Paraphrasing, Ontology Building and Textual Entailment. This paper explores the application of assigning semantic descriptors taken from a multilingual medical thesaurus to a large sample of solid (closed form) compounds taken from large Swedish medical corpora, and determining the relation(s) that may hold between the compound constituents. Our work is inspired by previous research in the area of using lexical hierarchies for identifying relations between two-word noun compounds in the medical domain. In contrast to previous research, Swedish, as other Germanic languages, require further means of analysis, since compounds are written as one sequence with no white space between the words, e.g. virus diseases vs. virussjukdomar, which makes the problem more challenging, since solid compounds are harder to identify and segment.

To the top

Page updated: 2012-01-30 14:04

Send as email
Print page
Show as pdf

X
Loading