Abstract
We present a corpus-based approach to word-sense disambiguation that only requires information that can be automatically extracted from untagged text. We use unsupervised techniques to estimate the parameters of a model describing the conditional distribution of the sense group given the known contextual features. Both the EM algorithm and Gibbs Sampling are evaluated to determine which is most appropriate for our data. We compare their disambiguation accuracy in an experiment with thirteen different words and three feature sets. Gibbs Sampling results in small but consistent improvement in disambiguation accuracy over the EM algorithm.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the National Conference on Artificial Intelligence |
Editors | Anon |
Publisher | AAAI |
Pages | 800-805 |
Number of pages | 6 |
State | Published - Jan 1 1998 |
Event | Proceedings of the 1998 15th National Conference on Artificial Intelligence, AAAI - Madison, WI, USA Duration: Jul 26 1998 → Jul 30 1998 |
Other
Other | Proceedings of the 1998 15th National Conference on Artificial Intelligence, AAAI |
---|---|
City | Madison, WI, USA |
Period | 7/26/98 → 7/30/98 |