Abstract. Information is one of an organisation's most valuable assets and, when used properly, can support intelligent decision making that significantly improves how the organisation functions. Data warehousing is a recent technology that makes information easy and efficient to access for decision-making activities by collecting data from many operational, legacy and possibly heterogeneous data sources. On-Line Analytical Processing (OLAP) tools are well suited to complex data analysis, such as multi-dimensional data analysis, and to decision support activities, while data mining tools take the process one step further and actively search the data held in the warehouse for patterns and hidden knowledge. Many organisations are building, or planning to develop, a data warehouse for their operational and decision support needs. In this paper, we present an overview of data warehousing, multi-dimensional databases, OLAP and data mining technology and discuss the directions of current research in the area. We also discuss recent developments in data warehouse modelling, view selection and maintenance, indexing schemes, parallel query processing and data mining issues. A number of technical issues for exploratory research are presented, and possible solutions are discussed.