Pattern Directed Mining of Sequence Data

Valery Guralnik, Duminda Wijesekera, Jaideep Srivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

Sequence data arise naturally in many applications, and can be viewed as an ordering of events, where each event has an associated time of occurrence. An important characteristic of event sequences is the occurrence of episodes, i.e. a collection of events occurring in a certain pattern. Of special interest are frequent episodes, i.e. episodes occurring with a frequency above a certain threshold. In this paper, we study the problem of mining for frequent episodes in sequence data. We present a framework for efficient mining of frequent episodes which goes beyond previous work in a number of ways. First, we present a language for specifying episodes of interest. Second, we describe a novel data structure, called the sequential pattern tree (SP Tree), which captures the relationships specified in the pattern language in a very compact manner. Third, we show how this data structure can be used by a standard bottom-up mining algorithm to generate frequent episodes in an efficient manner. Finally, we show how the SP Tree can be optimized by sharing common conditions, and evaluating each such expression only once. We present the results of an evaluation of the proposed techniques.

Original languageEnglish (US)
Title of host publicationProceedings of the 4th International Conference on Knowledge Discovery and Data Mining, KDD 1998
PublisherAAAI press
Pages51-57
Number of pages7
ISBN (Electronic)1577350707, 9781577350705
StatePublished - 1998
Externally publishedYes
Event4th International Conference on Knowledge Discovery and Data Mining, KDD 1998 - New York City, United States
Duration: Aug 27 1998Aug 31 1998

Publication series

NameProceedings of the 4th International Conference on Knowledge Discovery and Data Mining, KDD 1998

Conference

Conference4th International Conference on Knowledge Discovery and Data Mining, KDD 1998
Country/TerritoryUnited States
CityNew York City
Period8/27/988/31/98

Bibliographical note

Publisher Copyright:
Copyright © 1998, American Association for Artificial Intelligence (www.aaai.org). All rights reserved.

Fingerprint

Dive into the research topics of 'Pattern Directed Mining of Sequence Data'. Together they form a unique fingerprint.

Cite this