This directory is created automatically and some papers may be mislabeled. Only document within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher level categories.
Three Perspectives of Data Mining - Zhou (2003)(Correct)
This paper reviews three recent books on data mining written from three different perspectives, i.e. databases, machine learning, and statistics. Although the exploration in this paper is suggestive i... / of data stored in databases data warehouses or other kinds of data
Multidimensional Modeling with UML Package Diagrams - Luján-Mora, Trujillo, Song (2002)(Correct)
The Unified Modeling Language (UML) has become the de
facto standard for object-oriented analysis and design, providing different
diagrams for modeling different aspects of a system. In this paper, ... / MD models for data warehouses DW using UML package br st Intl. Workshop on Data Warehouse and Data Mining DWDM' Volume
ComponentXchange: An e-exchange for software components - Sharma (2002)(Correct)
A key challenge in Component Based Software Engineering (CBSE) approaches to build software systems
using pre-existing software components is searching and selecting software appropriate software
comp... / of the technology of Data Warehousing and Data Mining DW DM br of the technology of Data Warehousing and Data Mining DW DM There are
Versus: A Web Repository - Gomes, Campos, Silva (2002)(Correct)
this paper we consider a Web application (or simply an application), as
a Versus client with the ability of executing a task through parallel data
processing. Therefore each application should be comp... /
Approximate Frequency Counts over Data Streams - Manku, Motwani (2002)(Correct)
We present algorithms for computing frequency
counts exceeding a user-specified threshold over
data streams. Our algorithms are simple and have
provably small memory footprints. Although the
outpu... / frequent itemsets in a data warehouse setting. Our algorithm br . Offline Streams In a data warehousing environment bulk updates
Effective Change Detection Using Sampling - Cho, Ntoulas (2002)(Correct)
For a large-scale data-intensive environment,
such as the World-Wide Web or data warehousing,
we often make local copies of remote data
sources. Due to limited network and computational
resources,... / data sources. For instance a data warehouse may copy remote sales and br such as the World-Wide Web or data warehousing we often make local copies
Algebraic Rewritings for Optimizing Regular Path Queries - Grahne, Thomo (2001)(Correct)
Rewriting queries using views is a powerful technique that has applications in query optimization,
data integration, data warehousing etc. Query rewriting in relational databases
is by now rather well... / data integration data warehousing etc. Query rewriting in
Minimizing View Sets without Losing Query-Answering Power - Li, Bawa, Ullman (2001)(Correct)
The problem of answering queries using views has been studied
extensively due to its relevance in a wide variety of data-management
applications. In these applications, we often need to select a s... / computing the views. In a data warehouse views can preclude costly br as information integration data warehousing and query optimization. The
Generating Efficient Plans for Queries Using Views - Afrati, Li, Ullman (2001)(Correct)
We study the problem of generating efficient, equivalent
rewritings using views to compute the answer to a query.
We take the closed-world assumption, in which views are
materialized from base relatio... / D. Theodoratos and T. Sellis. Data warehouse configuration. In Proc. of br data warehousing web-site designs
Proxy-Server Architectures for OLAP - Kalnis, Papadias (2001)(Correct)
Data warehouses have been successfully employed for assisting
decision making by offering a global view of the enterprise data
and providing mechanisms for On-Line Analytical processing.
Traditionally... / ABSTRACT Data warehouses have been successfully br include the maintenance of the warehouse data consistency given a set of
InterBase-KB: Integrating a Knowledge Base System with a.. - Bassiliades, Vlahavas (2001)(Correct)
This paper describes the integration of a multidatabase system and a knowledge-base system to support unknown InterBase
: Integrating a Knowledge Base System with a
Multidatabase System for Data War... / data-integration component of a Data Warehouse. The multidatabase system br Self-Maintainable in a Data Warehouse Data Knowledge Engineering
Selecting and materializing horizontally partitioned warehouse views - Ezeife (2001)(Correct)
Data warehouse views typically store large aggregate tables based on a subset of dimension attributes of the main
data warehouse fact table. Aggregate views can be stored as 2
n
subviews of a data c... / April Abstract Data warehouse views typically store large br which fragments of a warehouse data cube view best answers any
Software Tools - Grundy, Hosking (2001)(Correct)
Software is growing ever-more complex and new software processes, methods and products put greater demands
on software engineers than ever before. The support of appropriate software tools is essentia... /
Adaptive-FP: An Efficient And Effective Method For Multi-Level.. - Mao (2001)(Correct)
Real life transaction databases usually contain both item information and dimension information. Moreover, taxonomies about items likely exist. Knowledge about multilevel and multi-dimensional frequen... / transactional databases and data warehouses. A comprehensive data mining br interests in data mining and data warehousing for his responsiveness
Data Mining in Soft Computing Framework: A Survey - Mitra, Pal, Mitra (2001)(Correct)
The present article provides a survey of the available literature on data mining using soft computing. A categorization has been provided based on the di#erent soft computing tools and their hybridiza... / amounts of data stored in data warehouses or other information br W. H. Inmon The data warehouse and data mining Communications of
Specifying OLAP Cubes on XML Data - Jensen, Møller, Pedersen (2001)(Correct)
On-Line Analytical Processing (OLAP) enables analysts to gain insight into data through fast and interactive
access to a variety of possible views on information, organized in a dimensional model. Th... / great effort in keeping the data warehouse up to date e.g. when data br in modern enterprises. In the data warehousing approach selected
Discovering And Mining User Web-Page Traversal Patterns - Mortazavi-Asl (2001)(Correct)
As the popularity of WWW explodes, a massive amount of data is
gathered by Web servers in the form of Web access logs. This is a rich
source of information for understanding Web user surfing behavior.... / Data Data Integration Data Warehouse Task-relevant Data br transformation Data warehouse Data Cube OLAP Server PivotTable
From Databases to Information Systems - Information Quality Makes the .. - Naumann (2001)(Correct)
Research and business is currently moving from centralized databases towards in-formation systems integrating distributed and autonomous data sources. Simultane-ously, it is a well acknowledged fact t... / from a set of data such as a data warehouse. Data mining techniques are br a set of data such as a data warehouse. Data mining techniques are
Analysis and Optimisation of Event-Condition-Action Rules on XML - Bailey, Poulovassilis, Wood (2001)(Correct)
XML is a now a dominant standard for storing and exchanging information. With its
increasing use in areas such as data warehousing and e-commerce, there is a rapidly growing
need for rule-based tech... / materialised views in the XML data warehouse for validating and cleansing br use in areas such as data warehousing and e-commerce there is a
A Case for Dynamic View Management - Kotidis, Roussopoulos (2001)(Correct)
this
paper, we present DynaMat, a system that manages dynamic collections of materialized aggregate
views in a data warehouse. At query time DynaMat utilizes a dedicated disk space for storing
compute... / set of redundant entities in a data warehouse that are frequently used to
Reverse Engineering Meets Data Analysis - Andritsos, Miller (2001)(Correct)
We demonstrate how the data management techniques
known as On--Line Analytical Processing, or OLAP, can
be used to enhance the sophistication and range of software
reverse engineering tools. This is t... / of a business or enterprise. Data warehouses came into existence to meet br Processing OLAP and Data warehousing An increasing number
Web-Document Prediction And Presending Using Association Rule.. - Li (2001)(Correct)
An important data source for data mining is the web-log data that traces the user's web
browsing actions. From the web logs, one can build prediction models that predict with
high accuracy the user's ... / can be regarded as the largest data warehouse in the world. The most common
Improving Min/Max Aggregation over Spatial Objects - Zhang, Tsotras (2001)(Correct)
We examine the problem of computing MIN/MAX aggregate queries over a collection of spatial objects. Each spatial object is associated with a weight (value), for example, the average temperature or rai... / spatial dimension in a spatial data warehouse environment but can be used
CUBIST: A New Approach to Speeding Up OLAP Queries in Data Cubes - Fu, Hammer (2001)(Correct)
We report on a new, efficient encoding for the data cube, which results in a drastic speed-up
of OLAP queries that aggregate along any combination of dimensions over numerical and
categorical attrib... / queries on top of a relational data warehouse. We are focusing on a class of br are often complex and the data warehouse database is often very large
TxnWrap: A Transactional Approach to Data Warehouse Maintenance - Chen, Chen, Rundensteiner (2001)(Correct)
A Data Warehouse Management System (DWMS) maintains materialized views derived from one or more
information sources (ISs) under source changes. Much recent research has developed maintenance algorith... / A Transactional Approach to Data Warehouse Maintenance by Jun Chen br IS and commits when the data warehouse database has been successfully
Discovery and Application of Check Constraints in DB2 - Gryz, Schiefer, Zheng, Zuzarte (2001)(Correct)
The traditional role of integrity constraints is to protect
the integrity of data. But integrity constraints can and do
play other roles in databases; for example, they can be used
for query optimizat... / In some environments such as data warehousing data loading is strictly br environments such as data warehousing data loading is strictly
An Evolutionary Approach to Materialized Views Selection in a Data.. - Zhang, Yao, Yang (2001)(Correct)
A data warehouse contains multiple views accessed by queries. One of the most important decisions
in designing a data warehouse is selecting views to materialize for the purpose of eciently supporting... / Views Selection in a Data Warehouse Environment Chuan Zhang br view selection Data warehousing Data mining. I. Introduction
Incremental Maintenance of Multi-Source Views - Moro, Sartori (2001)(Correct)
In recent years, numerous algorithms have been proposed for
incremental view maintenance of data warehouses. As a matter of
fact, all of them follow almost the same general approach, namely
they compu... / incremental view maintenance of data warehouses. As a matter of fact all of
Potter's Wheel: An Interactive Data Cleaning System - Vijayshankar Raman And (2001)(Correct)
Cleaning data of errors in structure and content is important
for data warehousing and integration. Current
solutions for data cleaning involve many iterations of
data "auditing" to find errors, an... / and content is important for data warehousing and integration. Current br many contexts such as data warehousing and data integration. The current
Data Integration Services - Convey, Karpenko, Tatbul (2001)(Correct)
Introduction
With the prevalence of the network technology and the Internet, access to data
independent of its physical storage location has become highly facilitated. This
further has enabled users ... /
UML and the Semantic Web - Cranefield (2001)(Correct)
This paper discusses technology to support the use of UML for representing ontologies and domain knowledge in the Semantic Web. Two mappings have been defined and implemented using XSLT to produce J... / Warehouse Metamodel for data warehousing business intelligence
Knowledge Management in Heterogeneous Data Warehouse Environments - Kerschberg (2001)(Correct)
This paper addresses issues related to Knowledge Management in the
context of heterogeneous data warehouse environments. The traditional notion
of data warehouse is evolving into a federated warehou... / Management in Heterogeneous Data Warehouse Environments Larry br and placed into the data warehouse or data mart according to a schema
An Experimental Performance Evaluation of Incremental Materialized.. - Akhtar Ali Norman (2001)(Correct)
The development of techniques for supporting incremental unknown An Experimental Performance Evaluation of
Incremental Materialized View Maintenance in
Object Databases
M. Akhtar Ali
, Norman W. P... / Materialized View Selection in Data Warehouse Environments. In Proc. br of Materialized Views in a Data Warehousing Environment A Case Study. In
Mining E-Commerce Data: The Good, the Bad, and the Ugly - Kohavi (2001)(Correct)
Organizations conducting Electronic Commerce (e-commerce) can greatly benefit from the insight that data mining of transactional and clickstream data provides. Such insight helps not only to improve t... / The discoveries made in the data warehouse rarely affected the
Content Integration for E-Business - Michael Stonebraker Joseph (2001)(Correct)
We define the problem of content integration for EBusiness,
and show how it differs in fundamental ways
from traditional issues surrounding data integration, application
integration, data warehousing ... / One simple issue is that data warehouses are built on parallel br application integration data warehousing and OLTP. Content integration
Lineage Tracing for General Data Warehouse Transformations - Cui, Widom (2001)(Correct)
Data warehousing systems integrate information
from operational data sources into a central repository to enable
analysis and mining of the integrated information. During the
integration process, sour... / Lineage Tracing for General Data Warehouse Transformations Yingwei br problem is that of tracing warehouse data items back to the original
Updating <=, <-Chains - Delgrande, Gupta (2001)(Correct)
We address the problem of very efficient reasoning and update in ; !-chains, where
a ; !-chain is a directed acyclic graph such that there is a directed path between every
pair of vertices, and edge... / For example in the data warehousing community GAA
B-trees: Bearing Fruits of All Kinds - Ooi, Tan (2001)(Correct)
Index structures are often used to support search operations
in large databases. Many advanced database application domains
such as spatial databases, multimedia databases, temporal
databases, and obj... / temporal databases data warehousing high-dimensional databases
The Nimble Integration Engine - Draper, Halevy, Weld (2001)(Correct)
The consensus that XML has become the de facto standard
for data interchange will spur demand for technology that
allows users to integrate data from a variety of applications,
repositories, and legac... / of creating a new uni ed data warehouse that stores all the
The Clio Project: Managing Heterogeneity - Miller, Hernández, Haas, Yan, .. (2001)(Correct)
Clio is a system for managing and facilitating the
complex tasks of heterogeneous data transformation
and integration. In Clio, we have collected together
a powerful set of data management techniques
... / For instance before a data warehouse can be loaded DBAs and br modern data applications in data warehousing and electronic commerce
Versus: A Temporal Web Repository - Joao Campos Mario (2001)(Correct)
Web data warehouses are useful for applications that need to process
large amounts of Web data in a short time. This paper presents
Versus, a Web repository model supporting object versioning and di... / Abstract Web data warehouses are useful for applications
Privacy Preserving Distributed Data Mining - Clifton (2001)(Correct)
em, there is
a simple distributed solution that provides a degree of privacy to the individual sites. An example
association rule could be:
Received F lu shot and age > 50 implies hospital admission,... /
Data Mining for Intelligent Web Caching - Bonchi, Giannotti, Manco, Nanni.. (2001)(Correct)
The paper presents a vertical application of data warehousing and data mining technology:
intelligent web caching. We introduce several ways to construct intelligent web
caching algorithms that empl... / mining application based on data warehouse technology the development br a vertical application of data warehousing and data mining technology
ISCO: A Practical Language for Logic-Based Construction of.. - Abreu (2001)(Correct)
Evora's Integrated Information System
(SIIUE) aims at representing the entire universe of concepts
useful for the management and day-to-day operation
of the Organization, as seen from the point of vie... / is one of the issues in data warehousing some approaches taken to
HYSSOP: Natural Language Generation Meets Knowledge Discovery in.. - Robin, Favero (2001)(Correct)
In this paper, we present HYSSOP, a system that generates natural language hypertext summaries of insights resulting
from a knowledge discovery process. We discuss the synergy between the two technolo... / in a multidimensional data warehouse in a more intuitive and br that seamlessly integrates data warehousing OLAP data mining automated
Knowledge Processes and Ontologies - Staab, Studer, Schnurr (2001)(Correct)
this article, we present an approach for ontology
-based KM that includes a suite of ontologybased
tools as well as a methodology for developing
ontology-based KM systems. Our approach, shown
in F... / can liken the situation to data warehousing except that the input
Administering Permissions for Distributed Data: Factoring and.. - Rosenthal, Sciore (2001)(Correct)
We extend SQL's grant/revoke model to handle all administration of permissions in a
distributed database. The key idea is to "factor" permissions into simpler decisions that
can be administered separa... / small transactions. T' is in a data warehouse used for large data-mining br all read-only operations on warehouse database DW or all
Generic Schema Matching with Cupid - Madhavan, Bernstein, Rahm (2001)(Correct)
Schema matching is a critical step in many applications, such as XML message mapping, data warehouse loading, and schema integration. In this paper, we investigate algorithms for generic schema matchi... / such as XML message mapping data warehouse loading and schema
A Model for a Temporal Data Warehouse - Eder, Koncilia, Morzy (2001)(Correct)
Data warehouses are a primary means for a consolidated
view on the data within an enterprise and frequently
a rst step in integrating enterprise information
systems. Above all, data warehouses are us... / A Model for a Temporal Data Warehouse Johann Eder University br W. Martin editor. Data Warehousing -Data Mining -OLAP. Thomson
IEEE 66 Computer - Designing Data Warehouses (2001)(Correct)
ions of
our work. We believe that our innovative approach
provides a theoretical foundation for the use of OO
databases and object-relational databases in data
warehouses, MDB, and OLAP applicatio... / Computer Designing Data Warehouses with OO Conceptual Models br st Int'l Workshop on Data Warehousing and Data Mining DWDM vol.
A Data Warehousing Architecture for Enabling Service Provisioning.. - Kotidis (2001)(Correct)
In this paper we focus on the following problem
in information management: given a large collection
of recorded information and some knowledge
of the process that is generating this data we want
t... / we load new records in the data warehouse. In fact many service br A Data Warehousing Architecture for Enabling
Business Process Coordination: State of the Art, Trends, and Open.. - Dayal, Hsu, Ladin (2001)(Correct)
Over the past decade, there has been a lot of
work in developing middleware for integrating
and automating enterprise business processes.
Today, with the growth in e-commerce and the
blurring of e... / is to build a business process data warehouse which can be loaded with the br intelligence aims to apply data warehousing data analysis and data
Mining Mart: Metadata-Driven Preprocessing - Zücker, Kietz, Vaduva (2001)(Correct)
The Mining Mart project (Enabling End-User Data Warehouse Mining) proposes a case-based reasoning system for maximum support of end users during data preprocessing. Our approach 1) uses a case base fo... / amounts of data i.e. of a data warehouse. In the Mining Mart project
Improving Business Process Quality through Exception Understanding.. - Daniela Grigori Loria (2001)(Correct)
Business process automation technologies are
being increasingly used by many companies to
improve the efficiency of both internal processes
as well as of e-services offered to customers. In
order ... / the availability of a process data warehouse. The design population and br data mining algorithms on the warehouse data in order to Understand
Warlock: A Data Allocation Tool for Parallel Warehouses. - Stöhr, Rahm (2001)(Correct)
a set of fragmentation attributes from the dimensional
attributes, at most one per dimension. All fact table rows
corresponding to a single value combination of the fragmentation
attributes are assign... / determine a parallel data warehouse's allocation to disk.
Warehousing Workflow Data: Challenges and Opportunities - Angela Bonifati Politecnico (2001)(Correct)
Workflow management systems (WfMSs) are
software platforms that allow the definition,
execution, monitoring, and management of
business processes. WfMSs log every event that
occurs during process ... / execution data called Workflow Data Warehouse or WDW in the following br by executing the scripts on warehouse data and by storing the results in
Generalized Affinity-Based Association Rule Mining for Multimedia.. - Shyu, Chen, Kashyap (2001)(Correct)
The recent progress in high-speed communication networks and largecapacity
storage devices has led to a tremendous increase in the number of databases
and the volume of data in them. This has create... / the huge amounts of data in data warehouses for decision-support br database systems data warehousing data mining and distributed
Xyro: The Xyleme Robot Architecture - Mignet, Aguilera, Ailleret, Veltri (2001)(Correct)
In this paper we address the problem of loading data from the web. We present the architecture
of Xyro, a crawler designed to fetch data from the Web, and particularly XML data.
We describe our experi... / aim was build a dynamic XML data warehouse to provide high level and
A Review of Data Mining Techniques - Lee, Slau (2001)(Correct)
Terabytes of data are generated
everyday in many organizations .
To extract hidden predictive
information from large volumes of
data, data mining (DM)
techniques are needed.
Organizations are starting... /
Quality of Very Large Databases - William Winkler Bureau (2001)(Correct)
Analyses and data mining of large computer files are affected by the quality of the information in
the files. For large population registers and for files that are created by merging two or more
files... / that might be used in creating data warehouses merging lists and
Data Mining: Concepts and - Techniques Slides For (2001)(Correct)
Rule: Basic Concepts
n Given: (1) database of transactions, (2) each transaction is
a list of items (purchased by a customer in a visit)
n Find: all rules that correlate the presence of one set of
i... /
View-based Query Processing and Constraint Satisfaction - Calvanese, De Giacomo, Lenzerini.. (2000)(Correct)
View-based query processing requires to answer a query posed to a database only on the basis of the information on a set of views, which are again queries over the same database. This problem is relev... / to answer a query. A data warehouse can be seen as a set of br including query optimization data warehousing data integration and query
A Vision for Management of Complex Models - Bernstein, Levy, Pottinger (2000)(Correct)
Many problems encountered when building applications of database systems involve the manipulation
of models. By "model," we mean a complex structure that represents a design artifact,
such as a relati... / mapping data sources into data warehouse tables to generate programs
A Logic Based Language for Parametric Inheritance - Jamil (2000)(Correct)
Though overriding as a single and default
mode of inheritance is adequate for most
knowledge bases, a large class of applications
naturally requires several inheritance
modes and types. We propose... / What can hierarchies do for data warehouses In Proc. of the VLDB br secure databases data warehousing data mining etc.
Approximating multi-dimensional aggregate range queries over real.. - Gunopulos, Kollios, Tsotras.. (2000)(Correct)
Finding approximate answers to multi-dimensional range queries over real valued attributes has significant
applications in data exploration and database query optimization. In this paper we consider ... / optimization data mining and data warehousing. The query optimizer requires br task interactive. In data warehousing datasets can be very large.
Temporal Statement Modifiers - Böhlen, Jensen, Snodgrass (2000)(Correct)
this paper we advocate a dierent approach, of articulating a set of requirements, or
desiderata, that directly imply the syntactic structure and core semantics of a temporal extension
of an (arbitrar... / such as decision support and data warehousing old versions of data are
What is View-Based Query Rewriting? - Calvanese, De Giacomo, Lenzerini, al. (2000)(Correct)
View-based query processing requires to answer a query posed to a database
only on the basis of the information on a set of views, which are again queries
over the same database. This problem is rel... / accessible to answer a query. A data warehouse can be seen as a set of br including query optimization data warehousing data integration and query
Data Cleansing: Beyond Integrity Analysis - Maletic, Marcus (2000)(Correct)
The paper analyzes the problem of data cleansing and automatically identifying potential errors in data sets. An overview of the diminutive amount of existing literature concerning data cleansing is g... / their defining processes are data warehousing knowledge discovery in