Download:
|
by Pusheng Zhang, Yan Huang, Shashi Shekhar, Vipin Kumar
In the Proc. of the 7th PAKDD
http://www-users.cs.umn.edu/~huangyan/papers/pakdd.ps
Add To MetaCart
Abstract:
A spatial time series dataset is a collection of time series, each referencing a location in a common spatial framework. Correlation analysis is often used to identify pairs of interacting elements from the cross product of two spatial time series datasets. However, the computational cost of correlation analysis is very high when the dimension of the time series and the number of locations in the spatial frameworks are large. The key contribution of this paper is the use of spatial autocorrelation among spatial neighboring time series to reduce the computational cost. A lter-and-rene algorithm based on coning, i.e. group of locations, is proposed to reduce the cost of correlation analysis over a pair of spatial time series datasets. Cone-level correlation computation can be used to eliminate ( lter out) a large number of element pairs whose correlation is clearly below (or above) a given threshold. Element pair correlation needs to be computed for remaining pairs. Using algebraic cost models and experimental studies with Earth science datasets, we show that the lter-andre ne approach can save a large fraction of the computational cost, particularly when the minimal correlation threshold is high.
Citations
|
735
|
Data Mining: Concepts and Techniques
– Han, Kamber
- 2000
|
|
696
|
Time Series Analysis: Forecasting and Control”, third edition
– Box, Jenkins, et al.
- 1994
|
|
493
|
Statistics for spatial data
– Cressie
- 1993
|
|
311
|
Efcient similarity search in sequence databases
– Agrawal, Faloutsos, et al.
- 1993
|
|
127
|
Efficient time-series matching by wavelets
– Chan, Fu
- 1999
|
|
116
|
GIS — a computing perspective
– Worboys
- 1995
|
|
96
|
Searching Multimedia Databases by Content
– Faloutsos
- 1996
|
|
56
|
Spatial Databases: A Tour
– Shekhar
- 2003
|
|
16
|
Spatial databases: Accomplishments and research needs
– Shekhar, Chawla, et al.
- 1999
|
|
15
|
R.: Data Mining for Scientific and Engineering Applications
– Grossman, Karnath, et al.
- 2001
|
|
15
|
An indexing scheme for fast similarity search in large time series databases
– Keogh, Pazzani
- 1999
|
|
15
|
Interannual variability in terrestrial net primary production: Exploration of trends and controls on regional to global scales, Ecosystems
– Potter, Klooster, et al.
- 1999
|
|
12
|
Data mining for the discovery of ocean climate indicies
– Steinbach, Tan, et al.
- 2002
|
|
5
|
Statistical Theory (Fourth Edition
– Lindgren
- 1998
|
|
4
|
A Subsequence Matching Method in TimeSeries Databases Based on Generalized Windows
– Moon, Whang, et al.
- 2002
|
|
4
|
Spatio-Temporal Data Mining
– Temporal
- 2000
|
|
4
|
Impacts of the El Nio/Southern Oscillation on the Pacific Northwest. http://www.ocs.orst.edu/reports/enso pnw.html
– Taylor
|