• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Data preparation for mining World Wide Web browsing patterns”, Knowledge and Information Systems (1999)

by R Cooley, B Mobasher, J Srivastava
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 567
Next 10 →

Automatic Personalization Based on Web Usage Mining

by Bamshad Mobasher, Robert Cooley, Jaideep Srivastava , 1999
"... ..."
Abstract - Cited by 409 (22 self) - Add to MetaCart
Abstract not found

Survey of clustering data mining techniques

by Pavel Berkhin , 2002
"... Accrue Software, Inc. Clustering is a division of data into groups of similar objects. Representing the data by fewer clusters necessarily loses certain fine details, but achieves simplification. It models data by its clusters. Data modeling puts clustering in a historical perspective rooted in math ..."
Abstract - Cited by 408 (0 self) - Add to MetaCart
Accrue Software, Inc. Clustering is a division of data into groups of similar objects. Representing the data by fewer clusters necessarily loses certain fine details, but achieves simplification. It models data by its clusters. Data modeling puts clustering in a historical perspective rooted in mathematics, statistics, and numerical analysis. From a machine learning perspective clusters correspond to hidden patterns, the search for clusters is unsupervised learning, and the resulting system represents a data concept. From a practical perspective clustering plays an outstanding role in data mining applications such as scientific data exploration, information retrieval and text mining, spatial database applications, Web analysis, CRM, marketing, medical diagnostics, computational biology, and many others. Clustering is the subject of active research in several fields such as statistics, pattern recognition, and machine learning. This survey focuses on clustering in data mining. Data mining adds to clustering the complications of very large datasets with very many attributes of different types. This imposes unique
(Show Context)

Citation Context

...se applications, for example GIS, (Xu et al. [XEKS98], Sander et al. [SEKX98], Ester et al. [EFKS00]), sequence and heterogeneous data analysis (Cadez et al. [CSM01]), Web applications (Cooley et al. =-=[CMS99]-=-, Heer & Chi [HC01], Foss et al. [FWZ01]), DNA analysis in computational biology (Ben-Dor & Yakhini [BY99]), and many others. They resulted in a large amount of applicationspecific ideas that are beyo...

Web mining research: A survey

by Raymond Kosala - SIGKDD Explorations , 2000
"... ..."
Abstract - Cited by 386 (1 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...mong unique users, server sessions, episodes, etc. in the presence of caching and proxy servers [87; 113]. For the details and comparison of some pre-processing methods for Web usage data we refer to =-=[31-=-]. In general, typical data mining methods (see for example in [113]) could be used to mine the usage data after the data have been pre-processed to the desired form. However, modications of the typic...

Web mining for web personalization

by Magdalini Eirinaki, Michalis Vazirgiannis - ACM Transactions on Internet Technology , 2003
"... Web personalization is the process of customizing a Web site to the needs of specific users, taking advantage of the knowledge acquired from the analysis of the user’s navigational behavior (usage data) in correlation with other information collected in the Web context, namely, structure, content an ..."
Abstract - Cited by 217 (6 self) - Add to MetaCart
Web personalization is the process of customizing a Web site to the needs of specific users, taking advantage of the knowledge acquired from the analysis of the user’s navigational behavior (usage data) in correlation with other information collected in the Web context, namely, structure, content and user profile data. Due to the explosive growth of the Web, the domain of Web personalization has gained great momentum both in the research and commercial areas. In this article we present a survey of the use of Web mining for Web personalization. More specifically, we introduce the modules that comprise a Web personalization system, emphasizing the Web usage mining module. A review of the most common methods that are used as well as technical issues that occur is given, along with a brief overview of the most popular tools and applications available from software vendors. Moreover, the most important research initiatives in the Web usage mining and personalization areas are presented.

Mining Access Patterns Efficiently from Web Logs

by Jian Pei, Jiawei Han, Behzad Mortazavi-asl, Hua Zhu - Proc. 2000 Paci c-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD'00 , 2000
"... With the explosive growth of data available on the World Wide Web, discovery and analysis of useful information from the World Wide Web becomes a practical necessity.Web access pattern, which is the sequence of accesses pursued by users frequently, is a kind of interesting and useful knowledge in pr ..."
Abstract - Cited by 155 (3 self) - Add to MetaCart
With the explosive growth of data available on the World Wide Web, discovery and analysis of useful information from the World Wide Web becomes a practical necessity.Web access pattern, which is the sequence of accesses pursued by users frequently, is a kind of interesting and useful knowledge in practice. In this paper, we study the problem of mining access patterns from Web logs efficiently. A novel data structure, called Web access pattern tree, or WAP-tree in short, is developed for efficient mining of access patterns from pieces of logs. The Web access pattern tree stores highly compressed, critical information for access pattern mining and facilitates the developmentofnovel algorithms for mining access patterns in large set of log pieces. Our algorithm can find access patterns from Web logs quite efficiently. The experimental and performance studies show that our method is in general an order of magnitude faster than conventional methods.

Data Mining of User Navigation Patterns

by Jose Borges, Mark Levene , 2000
"... We propose a data mining model that captures the user navigation behaviour patterns. The user navigation sessions are modelled as ahypertext probabilistic grammar whose higher probability strings correspond to the user's preferred trails. An algorithm to efficiently mine suchtrailsisgiven. ..."
Abstract - Cited by 151 (19 self) - Add to MetaCart
We propose a data mining model that captures the user navigation behaviour patterns. The user navigation sessions are modelled as ahypertext probabilistic grammar whose higher probability strings correspond to the user's preferred trails. An algorithm to efficiently mine suchtrailsisgiven. Wemake use of the Ngram model which assumes that the last N pages browsed affect the probability of the next page to be visited. The model is based on the theory of probabilistic grammars providing it with a sound theoretical foundation for future enhancements. Moreover, we propose the use of entropy as an estimator of the grammar's statistical properties. Extensive experiments were conducted and the results show that the algorithm runs in linear time, the grammar's entropy is a good estimator of the number of mined trails and the real data rules confirm the effectiveness of the model.

Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization

by Bamshad Mobasher, Honghua Dai, Tao Luo, Miki Nakagawa - Data Mining and Knowledge Discovery , 2002
"... Web usage mining, possibly used in conjunction with standard approaches to personalization such as collaborative filtering, can help address some of the shortcomings of these techniques, including reliance on subjective user ratings, lack of scalability, and poor performance in the face of high-dime ..."
Abstract - Cited by 142 (15 self) - Add to MetaCart
Web usage mining, possibly used in conjunction with standard approaches to personalization such as collaborative filtering, can help address some of the shortcomings of these techniques, including reliance on subjective user ratings, lack of scalability, and poor performance in the face of high-dimensional and sparse data. However, the discovery of patterns from usage data by itself is not sufficient for performing the personalization tasks. The critical step is the effective derivation of good quality and useful (i.e., actionable) "aggregate usage profiles" from these patterns. In this paper we present and experimentally evaluate two techniques, based on clustering of user transactions and clustering of pageviews, in order to discover overlapping aggregate profiles that can be effectively used by recommender systems for real-time Web personalization. We evaluate these techniques both in terms of the quality of the individual profiles generated, as well as in the context of providing recommendations as an integrated part of a personalization engine. In particular, our results indicate that using the generated aggregate profiles, we can achieve effective personalization at early stages of users' visits to a site, based only on anonymous clickstream data and without the benefit of explicit input by these users or deeper knowledge about them.
(Show Context)

Citation Context

...content. The personalized content can take the form of recommended links and products, or targeted advertisements and textual content. In the data preparation stage, we use the heuristics proposed in =-=[CMS99]-=- to identify unique user sessions form anonymous usage data and to infer cached references (path completion). In this stage, the data cleaning tasks involve the removal of erroneous or redundant refer...

Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining

by Alex G. Büchner, et al.
"... This article describes a novel way of combining data mining techniques on Internet data in order to discover actionable marketing intelligence in electronic commerce scenarios. The data that is considered not only covers various types of server and web meta information, but also marketing data and k ..."
Abstract - Cited by 131 (2 self) - Add to MetaCart
This article describes a novel way of combining data mining techniques on Internet data in order to discover actionable marketing intelligence in electronic commerce scenarios. The data that is considered not only covers various types of server and web meta information, but also marketing data and knowledge. Furthermore, heterogeneity resolution thereof and Internet- and electronic commerce-specific preprocessing activities are embedded. A generic web log data hypercube is formally defined and schematic designs for analytical and predictive activities are given. From these materialised views, various online analytical web usage data mining techniques are shown, which include marketing expertise as domain knowledge and are specifically designed for electronic commerce purposes.

Effective personalization based on association rule discovery from web usage data. In:

by B Mobasher, H Dai, T Luo, M Nakagawa - Proceedings of the 3rd International Workshop on Web Information and Data Management, , 2001
"... ..."
Abstract - Cited by 125 (11 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...Web sites without the benefit of additional information for user and session identification must rely on heuristics methods. These heuristics and details of usage preprocessing tasks are explained in =-=[4] and w-=-e do not discuss them further in this paper. The above preprocessing tasks ultimately result in a set of n pageviews, P = {p1,p2, ··· ,pn}, andasetofmuser transactions, T = {t1,t2, ··· ,tm}, whe...

Creating adaptive web sites through usage-based clustering of urls

by Bamshad Mobasher, Robert Cooley, Jaideep Srivastava - In IEEE Knowledge and Data Engineering Workshop (KDEX'99 , 1999
"... ..."
Abstract - Cited by 109 (14 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...hter et al [SKS98] have developed techniques for using path profiles of users to predict future HTTP requests, which can be used for network and proxy caching. Spiliopoulou et al [SF99], Cooley et al =-=[CMS99]-=-, and Buchner and Mulvenna [BM99] have applied data mining techniques to extract usage patterns from Web logs, for the purpose of deriving marketing intelligence. Shahabi et al [SZA97], Yan et al [YJG...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University