Abstract
SenseClusters is a freely available system that clusters similar contexts. It can be applied to a wide range of problems, although here we focus on word sense and name discrimination. It supports several different measures for automatically determining the number of clusters into which a collection of contexts should be grouped. These can be used to discover the number of senses in which a word is used in a large corpus of text, or the number of entities that share the same name. Three of these measures are based on clustering criterion functions, and a fourth is based on the Gap Statistic.
| Original language | English (US) |
|---|---|
| Pages | 276-279 |
| Number of pages | 4 |
| State | Published - 2006 |
| Event | 2006 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2006 - New York City, United States. Duration: Jun 4 2006 → Jun 9 2006 |
Conference
| Conference | 2006 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2006 |
|---|---|
| Country/Territory | United States |
| City | New York City |
| Period | 6/4/06 → 6/9/06 |
Bibliographical note
Publisher Copyright: © 2006 Association for Computational Linguistics.