Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts

Siddharth Patwardhan, Ted Pedersen

Research output: Contribution to conferencePaperpeer-review

300 Scopus citations

Abstract

In this paper, we introduce a WordNet-based measure of semantic relatedness by combining the structure and content of WordNet with co–occurrence information derived from raw text. We use the co–occurrence information along with the WordNet definitions to build gloss vectors corresponding to each concept in WordNet. Numeric scores of relatedness are assigned to a pair of concepts by measuring the cosine of the angle between their respective gloss vectors. We show that this measure compares favorably to other measures with respect to human judgments of semantic relatedness, and that it performs well when used in a word sense disambiguation algorithm that relies on semantic relatedness. This measure is flexible in that it can make comparisons between any two concepts without regard to their part of speech. In addition, it can be adapted to different domains, since any plain text corpus can be used to derive the co–occurrence information.

Original languageEnglish (US)
Pages1-8
Number of pages8
StatePublished - 2006
Externally publishedYes
Event2006 Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together - Trento, Italy
Duration: Apr 4 2006 → …

Conference

Conference2006 Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together
Country/TerritoryItaly
CityTrento
Period4/4/06 → …

Bibliographical note

Publisher Copyright:
© EACL 2006 - 11th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together. All rights reserved.

Fingerprint

Dive into the research topics of 'Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts'. Together they form a unique fingerprint.

Cite this