Using UMLS Concept Unique Identifiers (CUIs) for word sense disambiguation in the biomedical domain.

Bridget T. McInnes, Ted Pedersen, John V Carlis

Research output: Contribution to journalArticlepeer-review

42 Scopus citations

Abstract

This paper explores the use of Concept Unique Identifiers (CUIs) as assigned by MetaMap as features for a supervised learning approach to word sense disambiguation of biomedical text. We compare the use of CUIs that occur in abstracts containing an instance of the target word with using the CUIs that occur in sentences containing an instance of the target word. We also experiment with frequency cutoffs for determining which CUIs should be included as features. We find that a Naive Bayesian classifier where the features represent CUIs that occur two or more times in abstracts containing the target word attains accuracy 9% greater than Leroy and Rindflesch's approach, which includes features based on semantic types assigned by MetaMap. Our results are comparable to those of Joshi, et. al. and Liu, et. al., who use feature sets that do not contain biomedical information.

Original languageEnglish (US)
Pages (from-to)533-537
Number of pages5
JournalAMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
StatePublished - 2007

Fingerprint

Dive into the research topics of 'Using UMLS Concept Unique Identifiers (CUIs) for word sense disambiguation in the biomedical domain.'. Together they form a unique fingerprint.

Cite this