Download:
|
by Suhail Ansari, Ron Kohavi, Llew Mason, Zijian Zheng
Tech. rep., Blue Martini Software. Available
http://robotics.stanford.edu/~ronnyk/WEBKDD2000/papers/ronny.ps
Add To MetaCart
Abstract:
We show that the e-commerce domain can provide all the right ingredients for successful data mining and claim that it is a killer domain for data mining. We describe an integrated architecture, based on our experience at Blue Martini Software, for supporting this integration. The architecture can dramatically reduce the pre-processing, cleaning, and data understanding effort often documented to take 80 % of the time in knowledge discovery projects. We emphasize the need for data collection at the application server layer (not the web server) in order to support logging of data and metadata that is essential to the discovery process. We describe the data transformation bridges required from the transaction processing systems and customer event streams (e.g., clickstreams) to the data warehouse. We detail the mining workbench, which needs to provide multiple views of the data through reporting, data mining algorithms, visualization, and OLAP. We conclude with a set of challenges. 1
Citations
|
135
|
Data Mining Techniques for
– Berry, Linoff
- 1997
|
|
118
|
Discovering web access patterns and trends by applying OLAP and data mining technology on web logs
– Zaiane, Xin, et al.
- 1998
|
|
59
|
The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses
– Kimball
- 1996
|
|
49
|
The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing, and Deploying Data Warehouses
– Kimball
- 1998
|
|
29
|
Visualizing the simple bayesian classifier
– Becker, Kohavi, et al.
- 2001
|
|
8
|
search of reliable usage data
– Pitkow, In
- 1997
|
|
7
|
The identification and satisfaction of consumer analysis-driven information needs
– Sen, Padmanabhan, et al.
- 1998
|
|
3
|
Measuring Web Success, Forrester Report
– Schmitt, Manning, et al.
- 1999
|
|
3
|
Characterizing browsing behaviors on
– Catledge, Pitkow
- 1995
|
|
3
|
Einat Neumann, Yizhak Idan, and Gadi Pinkas, Discovery of Fraud Rules for Telecommunications: Challenges and
– Rosset, Murad
- 1999
|
|
2
|
Evangelos Simoudis, An Overview of Issues
– Piatetsky-Shapiro, Brachman, et al.
- 1996
|
|
2
|
Bamshad Mobashar, and Jaideep Shrivastava, Data Preparation for Mining World
– Cooley
- 1999
|
|
2
|
Yasuhiro Akiba, and Shigeo Kaneda, On Handling Tree-Structured Attributes
– Almuallim
- 1995
|