Abstract
We propose an agent for exploring and categorizing documents on the World Wide Web. At the heart of the agent is automatic categorization of a set of documents, combined with a process that generates new queries to search for related documents and filters the results to extract those most closely related to the starting set. The document categories are not given a priori. We present the overall architecture and describe two novel algorithms that provide significant improvement over traditional clustering algorithms and form the basis for the query generation and search component of the agent.
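
The query generation and filtering steps mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's two algorithms, so this sketch substitutes plain term-frequency counts and cosine similarity as stand-ins. All names here (`generate_query`, `filter_results`, `STOPWORDS`, the `threshold` value) are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not taken from the paper.
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to"}


def term_vector(text):
    """Bag-of-words term-frequency vector for one document."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w not in STOPWORDS)


def centroid(docs):
    """Sum of the term vectors of all documents in a cluster."""
    total = Counter()
    for doc in docs:
        total.update(term_vector(doc))
    return total


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def generate_query(cluster_docs, num_terms=3):
    """Hypothetical query generator: the cluster's most frequent terms."""
    counts = centroid(cluster_docs)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:num_terms]]


def filter_results(cluster_docs, candidates, threshold=0.3):
    """Keep only candidates similar enough to the cluster centroid."""
    center = centroid(cluster_docs)
    return [doc for doc in candidates
            if cosine(term_vector(doc), center) >= threshold]


if __name__ == "__main__":
    cluster = ["neural network training",
               "training deep neural network models"]
    found = ["neural network training tips", "cooking pasta recipes"]
    print(generate_query(cluster))
    print(filter_results(cluster, found))
```

In this sketch the query terms would be sent to a search engine, and the filter would keep only the returned pages that resemble the starting cluster. The search call itself is left out because the abstract does not say which engine or interface the agent uses.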
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the International Conference on Autonomous Agents |
Editors | Anon |
Pages | 408-415 |
Number of pages | 8 |
State | Published - Jan 1 1998 |
Event | Proceedings of the 1998 2nd International Conference on Autonomous Agents - Minneapolis, MN, USA; Duration: May 9 1998 → May 13 1998 |
Other
Other | Proceedings of the 1998 2nd International Conference on Autonomous Agents |
---|---|
City | Minneapolis, MN, USA |
Period | 5/9/98 → 5/13/98 |