Abstract. Many business and scientific domains require the collection and analysis of sequences of events and time series data. Although statistical approaches have been long applied to time series, most of these approaches assume the time series is stationary and typically must be applied globally to the sequence. Thus, other methods are needed to solve many types of problems that occur in sequential business and scientific applications. One such problem is when there is no global correlation between sequences, but there are periodic occurrences when the signature of one sequence is present in other sequences. This can be solved by using association rules that relate the sequences of events, as described in this dissertation. Discovering association rules relating the sequences of events is an important data mining problem first addressed by Das et al. in [16]. In this dissertation, we build on their work by using representative association rules [44], closures [77], and constraints to drive our association analysis over sequences of events. Representative association rules together with closures have not been applied to event sequences, as performed in this dissertation. As in [16], we first discretize and cluster the data into sequences of events. These sequences of events
|
3413
|
C4.5: Programs for Machine Learning
– Quinlan
- 1993
|
|
2634
|
Classification and Regression Trees
– Breiman, Friedman, et al.
- 1984
|
|
2131
|
Neural Networks: A Comprehensive Foundation
– Haykin
- 1994
|
|
1734
|
Fast algorithms for mining association rules
– Agrawal, Srikant
- 1994
|
|
771
|
Learning logical definitions from relations
– Quinlan
- 1990
|
|
710
|
Mining sequential patterns
– Agrawal, Srikant
- 1995
|
|
332
|
Mining Sequential Patterns: Generalizations and Performance Improvements
– Srikant, Agrawal
- 1996
|
|
330
|
Efficient Similarity Search in Sequence Databases
– Agrawal, Faloutsos, et al.
- 1993
|
|
328
|
Fast subsequence matching in time-series databases
– Faloutsos, Ranganathan, et al.
- 1994
|
|
318
|
Restructuring lattice theory: An approach based on hierarchies of concepts
– Wille
- 1982
|
|
276
|
Detecting Strange Attractors in Turbulence
– Takens
- 1981
|
|
270
|
Knowledge Discovery in Databases: An Overview
– FRAWLEY, PIATETSKY-SHAPIRO, et al.
- 1991
|
|
231
|
Integrating classification and association rule mining
– Liu, Hsu, et al.
- 1998
|
|
225
|
Exploratory mining and pruning optimizations of constrained associations rules
– Ng, Lakshmanan, et al.
- 1998
|
|
202
|
Discovering Frequent Episodes in Sequences
– Mannila, Toivonen, et al.
- 1995
|
|
194
|
Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery
– Mannila, Toivonen, et al.
- 1997
|
|
188
|
Mining association rules with item constraints
– SRIKANT, VU, et al.
- 1997
|
|
179
|
Forecasting, Structural Time Series Models and the Kalman Filter
– Harvey
- 1989
|
|
150
|
Data preparation for data mining
– Pyle
- 1999
|
|
130
|
Generating non-redundant association rules
– Zaki
- 2000
|
|
113
|
Similarity-based queries for time series data
– Rafiei, Mendelzon
- 1997
|
|
107
|
Constraint-based rule mining in large, dense databases
– Bayardo, Agrawal, et al.
- 1999
|
|
105
|
An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback
– Keogh, Pazzani
- 1998
|
|
100
|
Fast Similarity Search
– Agrawal, Faloutsos, et al.
- 1995
|
|
99
|
Rule Discovery from Time Series
– Das, Lin, et al.
- 1998
|
|
96
|
Finding patterns in time series: A dynamic programming approach
– Berndt, Clifford
- 1996
|
|
95
|
Discovering generalized episodes using minimal occurrences
– Mannila, Toivonen
- 1996
|
|
92
|
Querying shapes of histories
– Agrawal, Psaila, et al.
- 1995
|
|
85
|
On similarity queries for time-series data: Constraint specification and implementation
– Goldin, Kanellakis
- 1995
|
|
80
|
A probabilistic approach to fast pattern matching in time series databases
– Keogh, Smyth
- 1997
|
|
77
|
Combinatorial Pattern Discovery for Scientific Data: Some Preliminary Results
– Wang, Chirn, et al.
- 1994
|
|
74
|
Finding Similar Time Series
– Das, Gunopulos, et al.
- 1997
|
|
56
|
Situation recognition: Representation and algorithms
– Dousson, Gaborit, et al.
- 1993
|
|
52
|
Can we push more constraints into frequent pattern mining
– Pei, Han
- 2000
|
|
51
|
Efficient enumeration of frequent sequences
– Zaki
- 1998
|
|
45
|
Mining surprising patterns using temporal description length
– Chakrabarti, Sarawagi, et al.
- 1998
|
|
45
|
Online data mining for co-evolving time sequences
– Yi, Sidiropoulos, et al.
- 2000
|
|
44
|
Incremental and interactive sequence mining
– Parthasarathy, Zaki, et al.
- 1999
|
|
42
|
Scaling up dynamic time warping to massive datasets
– Keogh, Pazzani
- 1999
|
|
36
|
A practical guide to wavelet analysis
– Torrence, Compo
- 1998
|
|
36
|
Finite Impulse Response Neural Networks with Applications in Time Series Prediction
– Wan
- 1993
|
|
35
|
Discovering Frequent Event Patterns with Multiple Granularities in Time Sequences
– Bettini, Wang, et al.
- 1998
|
|
28
|
Supporting fast search in time series for movement patterns in multiple scales
– Qu, Wang, et al.
- 1998
|
|
27
|
Ecient mining of association rules using closed itemset lattices
– Pasquier, Bastide, et al.
- 1999
|
|
25
|
Time-Series Similarity Problems and Well-Separated Geometric Sets
– Bollobas, Das, et al.
- 1997
|
|
23
|
Breaking the barrier of transactions: Mining inter-transaction association rules
– Tung, Lu, et al.
- 1999
|
|
22
|
Continuous categories for a mobile robot
– Rosenstein, Cohen
- 1999
|
|
16
|
A knowledge discovery methodology for telecommunication network alarm databases
– Klemettinen
- 1999
|
|
16
|
A family of algorithms for finding temporal structure in data
– Oates, Schmill, et al.
- 1997
|
|
16
|
Time series data mining: Identifying temporal patterns for characterization and prediction of time series events
– Povinelli
- 1999
|