Improving Correlation with Human Judgments by Integrating Semantic Similarity with Second–Order Vectors

Bridget T. McInnes, Ted Pedersen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Vector space methods that measure semantic similarity and relatedness often rely on distributional information such as co–occurrence frequencies or statistical measures of association to weight the importance of particular co–occurrences. In this paper, we extend these methods by incorporating a measure of semantic similarity based on a human curated taxonomy into a second–order vector representation. This results in a measure of semantic relatedness that combines both the contextual information available in a corpus–based vector space representation with the semantic knowledge found in a biomedical ontology. Our results show that incorporating semantic similarity into a second order co–occurrence matrices improves correlation with human judgments for both similarity and relatedness, and that our method compares favorably to various different word embedding methods that have recently been evaluated on the same reference standards we have used.

Original languageEnglish (US)
Title of host publicationBioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages107-116
Number of pages10
ISBN (Electronic)9781945626593
StatePublished - 2017
Event16th SIGBioMed Workshop on Biomedical Natural Language Processing, BioNLP 2017 - Vancouver, Canada
Duration: Aug 4 2017 → …

Publication series

NameBioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop

Conference

Conference16th SIGBioMed Workshop on Biomedical Natural Language Processing, BioNLP 2017
Country/TerritoryCanada
CityVancouver
Period8/4/17 → …

Bibliographical note

Publisher Copyright:
© 2017 Association for Computational Linguistics

Fingerprint

Dive into the research topics of 'Improving Correlation with Human Judgments by Integrating Semantic Similarity with Second–Order Vectors'. Together they form a unique fingerprint.

Cite this