Dependent bigram identification

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Scopus citations

Abstract

Dependent bigrams are two consecutive words that occur together in a text more often than would be expected purely by chance. Identifying such bigrams is an important issue since they provide valuable clues for machine translation, word sense disambiguation, and information retrieval. Minimum sensitivity is one of the proposals made to identify these lexical pairs. It is simple to compute and is free from the underlying distributional assumptions made by significance tests. Experimental results show that minimum sensitivity results in the identification of bigrams that are largely made up of content words. The tendency of minimum sensitivity to filter out bigrams containing non-content words is an important quality in language processing applications.
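
The abstract does not give the formula, so the following is only a minimal sketch. It assumes minimum sensitivity for a bigram (w1, w2) is the smaller of the two conditional relative frequencies, min(n11 / n1+, n11 / n+1), where n11 counts the bigram, n1+ counts bigrams starting with w1, and n+1 counts bigrams ending with w2. The function name `minimum_sensitivity` and the toy sentence are illustrative and not taken from the paper.

```python
from collections import Counter

def minimum_sensitivity(tokens):
    """Score each adjacent word pair by min(n11 / n1+, n11 / n+1).

    Assumed definition (not stated in the abstract):
      n11 = count of the bigram (w1, w2)
      n1+ = count of bigrams whose first word is w1
      n+1 = count of bigrams whose second word is w2
    """
    bigrams = Counter(zip(tokens, tokens[1:]))
    first = Counter(tokens[:-1])   # every token except the last opens a bigram
    second = Counter(tokens[1:])   # every token except the first closes a bigram
    return {
        (w1, w2): min(n11 / first[w1], n11 / second[w2])
        for (w1, w2), n11 in bigrams.items()
    }

# Toy text for illustration only; real use needs a large corpus.
tokens = ("new york is big but the new york subway is old "
          "and the city is loud").split()
scores = minimum_sensitivity(tokens)

# List bigrams from highest to lowest score.
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 3))
```

Because the score is the minimum of two ratios, a bigram scores high only when each word is a strong predictor of the other. A frequent function word such as "the" pairs with many different neighbours, which pulls one of the ratios down. On a text this small, pairs that occur only once can still reach the maximum score, so the numbers are meaningful only on a large corpus.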

Original language: English (US)
Title of host publication: Innovative Applications of Artificial Intelligence - Conference Proceedings
Editors: Anon
Publisher: AAAI
Number of pages: 1
State: Published - Jan 1 1998
Event: Proceedings of the 1998 10th Conference on Innovative Applications of Artificial Intelligence, IAAI - Madison, WI, USA
Duration: Jul 26 1998 → Jul 30 1998

Other

Other: Proceedings of the 1998 10th Conference on Innovative Applications of Artificial Intelligence, IAAI
City: Madison, WI, USA
Period: 7/26/98 → 7/30/98
