A client-side web agent for document categorization

Daniel Boley, Maria Gini, Kyle Hastings, Bamshad Mobasher, Jerry Moore

Research output: Contribution to journalArticle

5 Scopus citations

Abstract

The authors propose a client-side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web using a usual Web browser, this agent is designed to aid the user by classifying the documents the user finds most interesting into clusters. The agent carries out the task completely automatically and autonomously, with as little user intervention as the user desires. The principal novel components in this agent that make it possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. In this paper, the overall architecture of this agent is described and the details of the algorithms within its key components are discussed.

Original languageEnglish (US)
Pages (from-to)387-399
Number of pages13
JournalInternet Research
Volume8
Issue number5
DOIs
StatePublished - Jan 1 1998

Fingerprint Dive into the research topics of 'A client-side web agent for document categorization'. Together they form a unique fingerprint.

  • Cite this