Knowledge-based method for determining the meaning of ambiguous biomedical terms using information content measures of similarity.

Bridget T. McInnes, Ted Pedersen, Ying Liu, Genevieve B. Melton, Serguei V. Pakhomov

Research output: Contribution to journal › Article › peer-review

21 Scopus citations

Abstract

In this paper, we introduce a novel knowledge-based word sense disambiguation method that determines the sense of an ambiguous word in biomedical text using semantic similarity or relatedness measures. These measures quantify the degree of similarity between concepts in the Unified Medical Language System (UMLS). The objective of this work was to develop a method that can disambiguate terms in biomedical text by exploiting similarity information extracted from the UMLS and to evaluate the efficacy of information content-based semantic similarity measures, which augment path-based information with probabilities derived from biomedical corpora. We show that information content-based measures obtain a higher disambiguation accuracy than path-based measures because they weight the path based on where it exists in the taxonomy coupled with the probability of the concepts occurring in a corpus of text.
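The contrast the abstract draws between path-based and information content-based measures can be illustrated with a small sketch. This is not the authors' implementation: the taxonomy, the concept names, and the corpus counts below are invented for illustration and are not drawn from the UMLS. Resnik's measure stands in for the family of information content-based measures, and summing similarity over the context concepts is an assumed scoring rule, since the abstract does not specify one.

```python
import math

# Hypothetical toy is-a taxonomy (child -> parent). Not real UMLS content.
PARENT = {
    "entity": None,
    "disorder": "entity",
    "phenomenon": "entity",
    "respiratory_infection": "disorder",
    "common_cold": "respiratory_infection",
    "influenza": "respiratory_infection",
    "cold_temperature": "phenomenon",
}

# Made-up counts of direct mentions of each concept in a corpus.
COUNTS = {
    "entity": 0,
    "disorder": 10,
    "phenomenon": 5,
    "respiratory_infection": 20,
    "common_cold": 30,
    "influenza": 25,
    "cold_temperature": 10,
}


def ancestors(concept):
    """Return the concept and its ancestors, ordered from concept to root."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = PARENT[concept]
    return chain


def propagated_counts():
    """Add each concept's count to itself and to every one of its ancestors."""
    totals = {c: 0 for c in PARENT}
    for concept, n in COUNTS.items():
        for a in ancestors(concept):
            totals[a] += n
    return totals


TOTALS = propagated_counts()
N = TOTALS["entity"]  # the root subsumes every mention


def information_content(concept):
    """IC(c) = -log p(c), with p(c) estimated from propagated counts."""
    return -math.log(TOTALS[concept] / N)


def lcs(a, b):
    """Least common subsumer: the deepest ancestor shared by a and b."""
    shared = set(ancestors(b))
    for candidate in ancestors(a):
        if candidate in shared:
            return candidate
    return None


def path_sim(a, b):
    """Path-based similarity: 1 / (number of nodes on the shortest path)."""
    subsumer = lcs(a, b)
    nodes = ancestors(a).index(subsumer) + ancestors(b).index(subsumer) + 1
    return 1.0 / nodes


def resnik_sim(a, b):
    """Resnik similarity: information content of the least common subsumer."""
    return information_content(lcs(a, b))


def disambiguate(senses, context, sim):
    """Pick the sense with the highest summed similarity to the context
    concepts (an assumed scoring rule for this sketch)."""
    return max(senses, key=lambda s: sum(sim(s, c) for c in context))


if __name__ == "__main__":
    senses = ["common_cold", "cold_temperature"]
    context = ["influenza"]
    print(disambiguate(senses, context, path_sim))
    print(disambiguate(senses, context, resnik_sim))
```

In this toy setup both measures choose the same sense, but they score it differently: the path measure depends only on the number of nodes between two concepts, while the Resnik score also depends on how often the shared ancestor and its descendants occur in the corpus, so a rare, specific shared ancestor counts for more than a frequent, general one.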

Original language: English (US)
Pages (from-to): 895-904
Number of pages: 10
Journal: AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
Volume: 2011
State: Published - 2011
