This directory is created automatically and some papers may be mislabeled. Only document within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher level categories.
188 Implementing Data Cubes Efficiently - Harinarayan, Rajaraman, Ullman (1996)(Correct)
Decision support applications involve complex queries on very large databases. Since response
times should be small, query optimization is critical. Users typically view the data as multidimensional
d... / databases called data warehouses on which users can carry out
128 Research Problems in Data Warehousing - Widom (1995)(Correct)
The topic of data warehousing encompasses architectures,
algorithms, and tools for bringing together selected
data from multiple databases or other information
sources into a single repository, called... / a single repository called a data warehouse suitable for direct br Wrapper Monitor Integrator Warehouse Data Figure Basic
114 View Maintenance in a Warehousing Environment - Zhuge, Garcia-Molina, Hammer, Widom (1995)(Correct)
A warehouse is a repository of integrated information drawn
from remote data sources. Since a warehouse effectively implements
materialized views, we must maintain the views as
the data sources are up... / information sources. A data warehouse is a repository of br consistent meaning that the warehouse data always corresponds to a mean-
85 Infomaster: An Information Integration System - Genesereth, Keller, Duschka (1997)(Correct)
Infomaster is an information integration system that provides
integrated access to multiple distributed heterogeneous
information sources on the Internet, thus giving the illusion
of a centralized, ho... / Infomaster creates a virtual data warehouse. The core of Infomaster is a
80 Data Mining: An Overview from a Database Perspective - Chen, Han, Yu (1996)(Correct)
Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an i... / providing services such as data warehousing and on-line services over the br of the techniques of data warehousing and data mining in the near future.
74 Storing Semistructured Data with STORED - Deutsch, Fernandez, Suciu (1999)(Correct)
this paper, we describe a technique for using relational databases to store and manage semistructured
data. Our purpose is to use high-performance RDBM systems to store, query, and manage semistructur... / of query plans Ull or data warehouse design TS we must
73 Data Mining: An Overview from Database Perspective - Chen, Han, Yu (1997)(Correct)
Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an i... / providing services such as data warehousing and on-line services over the br of the techniques of data warehousing and data mining in the near future.
68 Index Selection for OLAP - Gupta, Harinarayan, Rajaraman, Ullman (1997)(Correct)
On-line analytical processing (OLAP) is a recent
and important application of database systems. Typically,
OLAP data is presented as a multidimensional
"data cube." OLAP queries are complex and can ta... / database commonly called a data warehouse.Analysts use the data
68 Complexity of Answering Queries Using Materialized Views - Abiteboul, Duschka (1998)(Correct)
We study the complexity of the problem of answering queries using
materialized views. This problem has attracted a lot of attention recently
because of its relevance in data integration. Previous work... / with the popularity of data warehouses The problem of br problems which arise in data warehousing. Introduction The
68 Complexity of Answering Queries Using Materialized Views (Extended.. - Abiteboul, Duschka (1998)(Correct)
We study the complexity of the problem of answering queries using materialized views. This problem has attracted a lot of attention recently because of its relevance in data integration. Previous work... / with the popularity of data warehouses LZW The problem br problem which arise in data warehousing. Part of the work
67 Aggregate-Query Processing in Data Warehousing Environments - Gupta, Harinarayan, Quass (1995)(Correct)
In this paper we introduce generalized projections
(GP s), an extension of duplicateeliminating
projections, that capture aggregations,
groupbys, duplicate-eliminating projections
(distinct), and du... / the growing number of large data warehouses for decision support br Aggregate-Query Processing in Data Warehousing Environments Ashish
65 Selection of Views to Materialize in a Data Warehouse - Gupta (1997)(Correct)
A data warehouse stores materialized views of data from one
or more sources, with the purpose of efficiently implementing decisionsupport
or OLAP queries. One of the most important decisions in desi... / of Views to Materialize in a Data Warehouse Himanshu Gupta
58 Scaling Clustering Algorithms to Large Databases - Bradley, Fayyad, Reina (1998)(Correct)
Practical clustering algorithms require multiple data scans to
achieve convergence. For large databases, these scans become
prohibitively expensive. We present a scalable clustering
framework applicab... / over a potentially distributed data warehouse with much processing
56 Making Views Self-Maintainable for Data Warehousing - Dallan Quass (1996)(Correct)
A data warehouse stores materialized views over
data from one or more sources in order to provide fast
access to the integrated data, regardless of the availability
of the data sources. Warehouse view... / Abstract A data warehouse stores materialized views over br Views Self-Maintainable for Data Warehousing Dallan Quass Ashish
56 Making Views Self-Maintainable for Data Warehousing (Extended.. - Quass (1996)(Correct)
Dallan Quass
Stanford University
quass@cs.stanford.edu
Ashish Gupta
Oracle Corporation
ashgupta.us.oracle.com
Inderpal Singh Mumick
AT&T Bell Laboratories
mumick@research.att.com
Jennife... / Abstract A data warehouse stores materialized views over br Views Self-Maintainable for Data Warehousing Extended Abstract
56 Managing Semantic Heterogeneity in Databases: A Theoretical.. - Hull (1997)(Correct)
In Proc. of Intl. Conf. on Very Large Data Bases, pages 455--468, 1990. [HZ96] R. Hull and G. Zhou. A framework for supporting data integration using the materialized and virtual approaches. In Pro... / Rdb VMS Developing the Data Warehouse. QED Publishing Group
51 Answering Queries with Aggregation Using Views - Srivastava, Dar, H.V.Jagadish, Levy (1996)(Correct)
We present novel algorithms for the problem of using materialized views to compute answers to SQL queries with grouping and aggregation, in the presence of multiset tables. In addition to its obvious ... / Example . Consider a data warehouse that holds information useful br in many applications such as data warehousing very large transaction
48 Materialized View Selection in a Multidimensional Database - Baralis (1997)(Correct)
A multidimensional database is a data repository
that supports the efficient execution of
complex business decision queries. Query response
can be significantly improved by storing
an appropriate set ... / data. An MDDB is a relational data warehouse in which the information is
45 Including Group-By in Query Optimization - Chaudhuri (1994)(Correct)
In existing relational database systems, processing of group-by and computation of aggregate functions are always postponed until all joins are performed. In this paper, we present transformations tha... / are of great importance in data warehouse applications. These queries
45 Adapting Materialized Views After Redefinitions: . . . - Gupta, al. (1995)(Correct)
this article, we consider the problem of keeping a materialized view up-to-date in response to changes made to the view definition, that is, in response to redefinition of the view. We call this probl... / is a reasonable assumption in data warehouse environments where data br related to decision support data warehousing and data integration. A
44 Description Logics for Conceptual Data Modeling - Calvanese, al. (1998)(Correct)
The article aims at establishing a logical approach to class-based
data modeling. After a discussion on class-based formalisms for data modeling,
weintroduce a family of logics, called Description... / D W Q Foundations of Data Warehouse Quality DESCRIPTION
42 Recursive Plans for Information Gathering - Duschka, Levy (1997)(Correct)
Generating query-answering plans for information
gathering agents requires to translate a
user query, formulated in terms of a set of virtual
relations, to a query that uses relations
that are actuall... / for query optimization and data warehousing Yang and Larson
42 Efficient View Maintenance at Data Warehouses - Agrawal, Abbadi, Singh, Yurek (1997)(Correct)
We present incremental view maintenance algorithms for a data warehouse derived from multiple distributed autonomous data sources. We begin with a detailed framework for analyzing view maintenance alg... / Efficient View Maintenance at Data Warehouses D. Agrawal A. El Abbadi
42 On Similarity-Based Queries for Time Series Data - Davood Rafiei Department (1997)(Correct)
We study similarity queries for time series data where
similarity is defined in terms of a set of linear transformations
on the Fourier series representation of a sequence. We
have shown in an earlier... / such as data mining or data warehousing. A time series is a sequence
41 Algorithms for Deferred View Maintenance - Colby, Griffin, Libkin, Mumick.. (1997)(Correct)
Materialized views and view maintenance are important for
data warehouses, retailing, banking, and billing applications.
We consider two related view maintenance problems: 1) how
to maintain views aft... / maintenance are important for data warehouses retailing banking and
41 The Strobe Algorithms for Multi-Source Warehouse Consistency - Zhuge, Garcia-Molina, Wiener (1996)(Correct)
A warehouse is a data repository containing integrated
information for efficient querying and analysis.
Maintaining the consistency of warehouse data is challenging,
especially if the data sources are... / Introduction A data warehouse is a repository of integrated br Maintaining the consistency of warehouse data is challenging especially if
41 Description Logic Framework for Information Integration - Calvanese, De Giacomo, Lenzerini.. (1998)(Correct)
Information Integration is one of the core
problems in distributed databases, cooperative
information systems, and data warehousing,
which are key areas in the software development
industry. Two criti... / Project DWQ Foundations of Data Warehouse Quality Calvanese De br information systems and data warehousing which are key areas in the
37 New Sampling-Based Summary Statistics for Improving Approximate Query .. - Gibbons, Matias (1998)(Correct)
In large data recording and warehousing environments, it is often
advantageous to provide fast, approximate answers to queries,
whenever possible. Before DBMSs providing highly-accurate approximate
an... / of ongoing insertions to the data warehouse. Introduction In large br Figure A traditional data warehouse. Data Warehouse New Data
36 Methods and problems in data mining - Mannila (1997)(Correct)
Knowledge discovery in databases and data mining aim at semiautomatic tools for the analysis of large data sets. We consider some methods used in data mining, concentrating on levelwise search for all... / the rise of the concepts of data warehousing and on-line analytical
32 Discovering Web Access Patterns and Trends by Applying OLAP and Data.. - Zaïane, Xin, Han (1998)(Correct)
As a confluence of data mining and WWW technologies, it is now possible to perform data mining on web log records collected from the Internet web page access history. The behaviour of the web page rea... / of relational database and data warehouse-based data mining system br of data mining and data warehousing has made available powerful
30 Multiple-View Self-Maintenance in Data Warehousing Environments - Huyn (1997)(Correct)
A data warehouse is a collection of materialized views derived from relations that may not reside at the
warehouse. Using these stored views, user queries can often be evaluated much more cheaply than... / Abstract A data warehouse is a collection of br Self-Maintenance in Data Warehousing Environments Technical
30 Rewriting Aggregate Queries Using Views - Cohen, Nutt, Serebrenik (1999)(Correct)
We investigate the problem of rewriting queries with aggregate operators using views that may or may not contain aggregate operators. A rewriting of a query is a second query that uses view predicates... / value. In fact most existing data warehouses make use of this idea in br recently by the surge of data warehousing and decision support
30 Selection of Views to Materialize Under a Maintenance Cost Constraint - Gupta (1999)(Correct)
A data warehouse stores materialized views derived from
one or more sources for the purpose of efficiently implementing decisionsupport
or OLAP queries. One of the most important decisions in design... / Summit NJ Abstract. A data warehouse stores materialized views br source s for execution. Also warehouse data is available for queries even
27 Change Detection in Hierarchically Structured Information - Sudarshan Chawathe (1996)(Correct)
Detecting and representing changes to data is important
for active databases, data warehousing, view maintenance,
and version and configuration management. Most previous
work in change management has ... / Rdb VMS Developing the Data Warehouse. QED Publishing Group br for active databases data warehousing view maintenance and
27 Supporting Multiple View Maintenance Policies - Colby (1997)(Correct)
Materialized views and view maintenance are becoming increasingly
important in practice. In order to satisfy different
data currency and performance requirements, a number of
view maintenance policies... / retailing decision support data warehousing and data inte- The br decision support data warehousing and data inte- The work of L.
27 On-Line Warehouse View Maintenance - Quass, Widom (1997)(Correct)
Data warehouses store materialized views over base data
from external sources. Clients typically perform complex
read-only queries on the views. The views are refreshed periodically
by maintenance tra... / Abstract Data warehouses store materialized views over
27 Bitmap Index Design and Evaluation - Chan, Ioannidis (1998)(Correct)
Bitmap indexing has been touted as a promising approach for processing
complex adhoc queries in read-mostly environments, like
those of decision support systems. Nevertheless, only few possible
bitmap... / the disk space requirement of data warehouse applications. Understanding br specifically designed for data warehousing applications which supports
26 Maintenance of Materialized Views: Problems, Techniques, and.. - Gupta, Mumick (1995)(Correct)
In this paper we motivate and describe materialized views, their applications, and the problems and techniques for their maintenance. We present a taxonomy of view maintenance problems based upon the ... / is often described as a data warehouse. Materialized views provide a br in new applications such as data warehousing replication servers
25 A Logical Approach to Multidimensional Databases - Cabibbo, Torlone (1998)(Correct)
In this paper we present MD, a logical model for OLAP
systems, and show how it can be used in the design of multidimensional
databases. Unlike other models for multidimensional databases, MD is
i... / production needs. A data warehouse is an integrated collection
24 The Stanford Data Warehousing Project - Hammer, Garcia-Molina, Widom, Labio, .. (1995)(Correct)
The goal of the data warehousing project at Stanford (the WHIPS project) is to develop algorithms and tools for the efficient collection and integration of information from heterogeneous and autonomou... / project. Introduction A data warehouse is a repository of integrated br already resolved. Furthermore warehouse data can be accessed without tying
24 Efficient Time Series Matching by Wavelets - Chan, Fu (1999)(Correct)
Time series stored as feature vectors can be indexed by multidimensional
index trees like R-Trees for fast retrieval. Due to
the dimensionality curse problem, transformations are applied to
time serie... / database applications such as data warehousing and data mining br applications such as data warehousing and data mining A
22 Range Queries in OLAP Data Cubes - Ho, Agrawal, Megiddo, Srikant (1997)(Correct)
A range query applies an aggregation operation over all selected cells of an OLAP data cube where the selection is specified by providing ranges of values for numeric dimensions. We present fast algor... / databases built from their data warehouses. An increasingly popular data
22 Algorithms for Materialized View Design in Data Warehousing.. - Yang, Karlapalem, Li (1997)(Correct)
Selecting views to materialize is one of the
most important decisions in designing a data
warehouse. In this paper, we present a framework
for analyzing the issues in selecting views
to materialize so... / decisions in designing a data warehouse. In this paper we present a br Materialized View Design in Data Warehousing Environment Jian Yang
22 A Data Model for Supporting On-Line Analytical Processing - Li (1996)(Correct)
A database application, called "on-line analytical processing" (or
OLAP) and aimed at providing business intelligence through on-line
multidimensional data analysis, has become increasingly important
... / are based on the concept of a data warehouse storing materialized views
22 Incremental Maintenance of Externally Materialized Views - Staudt, Jarke (1996)(Correct)
With the advent of the Internet, access to
database servers from autonomous clients will
become more and more popular. In this paper,
we propose a monitoring service that could be
offered by such data... / multi-databases and data warehouses Incremental br is change propagation in data warehousing Traditional
22 Fast Incremental Maintenance of Approximate Histograms - Phillip Gibbons Yossi (1997)(Correct)
Many commercial database systems maintain histograms to summarize the contents of large relations
and permit efficient estimation of query result sizes for use in query optimizers. Delaying the propa... / This pattern is common in data warehouses keeping transactional br environments or in data warehousing environments that house
21 A Case for Delay-Conscious Caching of Web Documents - Scheuermann, Shim, Vingralek (1997)(Correct)
Caching at proxy servers plays an important role in reducing the latency of the user response, the network delays and the load on Web servers. The cache performance depends critically on the design of... / R. Vingralek WATCHMAN A Data Warehouse Intelligent Cache Manager br for caching query results in a data warehousing environment We
21 Computing Iceberg Queries Efficiently - Min Fang (1998)(Correct)
Many applications compute aggregate functions
over an attribute (or set of attributes)
to find aggregate values above some specified
threshold. We call such queries iceberg
queries, because the numbe... / market basket queries on large data warehouses that store customer sales br many applications including data warehousing information-retrieval
21 Querying Multidimensional Databases - Cabibbo, Torlone (1997)(Correct)
Multidimensional databases are large collections of data, often
historical, used for sophisticated analysis oriented to decision making.
This activity is supported by an emerging category of softw... / large historical databases data warehouses oriented to decision making.
20 Workflow Handbook - Lawrence (1997)(Correct)
This article is a position paper on the nature of the data
warehouse refreshment which is often defined as a view
maintenance problem or as a loading process. We will
show that the refreshment proc... / - Modeling Data Warehouse Refreshment Process as a br systems used for the data warehouse and data marts wrappers and
19 A Survey of Methods for Scaling Up Inductive Algorithms - Provost, Kolluri (1999)(Correct)
One of the defining challenges for the KDD research community is to enable inductive
learning algorithms to mine very large databases. By collecting, categorizing, and summarizing
existing work on s... / unlikely that all the data in a data warehouse would be mined simultaneously.
19 Source Integration in Data Warehousing - Calvanese, De Giacomo, Lenzerini.. (1997)(Correct)
Source Integration is one of the core problems in Data Warehousing. Two critical factors for the design and maintenance of applications requiring Source Integration, and in particular Data Warehouse a... / Integration and in particular Data Warehouse applications are conceptual br Source Integration in Data Warehousing Diego Calvanese Giuseppe
19 Synchronizing a database to Improve Freshness - Cho, Garcia-Molina (2000)(Correct)
In this paper we study how to refresh a local copy of an autonomous data source to maintain the copy up-to-date. As the size of the data grows, it becomes more difficult to maintain the copy "fresh," ... / availability. For instance a data warehouse may copy remote sales and
18 Data Mining and Database Systems: Where is the Intersection? - Chaudhuri (1998)(Correct)
this paper). This raises the question as
to what role, if any, database systems research may contribute to area of data mining. In this article, I will try
to present my biased view on this issue and ... / issues. However even after a data warehouse has been set up it is often br systems. Data is in the warehouse Data warehouses are deploying
18 Multiple View Consistency for Data Warehousing - Zhuge, Wiener, Garcia-Molina (1997)(Correct)
A data warehouse stores integrated information from
multiple distributed data sources. In effect, the warehouse
stores materialized views over the source data. The problem
of ensuring data consistency... / Abstract A data warehouse stores integrated information br generates transactions for the warehouse database system. We make no
18 Data Warehouse Configuration - Theodoratos, Sellis (1997)(Correct)
In the data warehousing approach to the integration
of data from multiple information
sources, selected information is extracted in
advance and stored in a repository. A data
warehouse (DW) can th... / Data Warehouse Configuration Dimitri
18 Querying Aggregate Data - Grumbach, Rafanelli, Tininini (1999)(Correct)
We introduce a first-order language with real polynomial arithmetic and aggregation operators (count, iterated sum and multiply), which is well suited for the definition of aggregate queries involving... / such as for instance data warehousing. In such applications
17 Physical Database Design for Data Warehouses - Labio, Quass, Adelberg (1997)(Correct)
Data warehouses collect copies of information from
remote sources into a single database. Since the remote
data is cached at the warehouse, it appears as local relations
to the users of the warehouse.... / Physical Database Design for Data Warehouses Wilburt Juan
17 Using Schematically Heterogeneous Structures - Miller (1998)(Correct)
Schematic heterogeneity arises when information that is represented as data under one schema, is represented within the schema (as metadata) in another. Schematic heterogeneity is an important class o... / has been collected in a data warehouse and consider a decision br legacy data in federated or data warehousing applications. Traditional
17 Static Versus Dynamic Sampling for Data Mining - John (1996)(Correct)
As data warehouses grow to the point where one
hundred gigabytes is considered small, the computational
efficiency of data-mining algorithms on large
databases becomes increasingly important. Using a
... / Abstract As data warehouses grow to the point where one
17 Efficient Mining of Association Rules in Distributed Databases - Cheung, Ng, Fu, Fu (1996)(Correct)
Many sequential algorithms have been proposed for mining of association rules. However, very
little work has been done in mining association rules in distributed databases. A direct application
of seq... / data mining together with data warehousing and data repositories are br mining together with data warehousing and data repositories are three new
17 Recursive Query Plans for Data Integration - Duschka, Genesereth, Levy (1999)(Correct)
Generating query-answering plans for data integration systems requires to translate
a user query, formulated in terms of a mediated schema, to a query that uses
relations that are actually stored in d... / for query optimization and data warehousing Most
17 FaCT and iFaCT - Horrocks(Correct)
I
), consisting of a set
I
, called the domain of I, and a function
I
which
maps every concept to a subset of
I
and every role to
a subset of
I
I
such that the properties... / schema assertions from a data warehousing application Calvanese et
16 A System Prototype for Warehouse View Maintenance - Wiener, Gupta, Labio, Zhuge.. (1996)(Correct)
A data warehouse collects and integrates data from multiple, autonomous, heterogeneous, sources. The warehouse effectively maintains one or more materialized views over the source data. In this paper ... / Abstract A data warehouse collects and integrates data br the basic architecture of a warehouse data is collected from each
16 Data Integration using Self-Maintainable Views - Gupta (1996)(Correct)
In this paper we define the concept of self-maintainable views
-- these are views that can be maintained using only the contents of
the view and the database modifications, without accessing any of ... / of such an environment is data warehousing wherein views are used for
16 Cubetree: Organization of and Bulk Incremental Updates on the Data.. - Roussopoulos (1997)(Correct)
The data cube is an aggregate operator which has been shown to be very powerful for On Line Analytical Processing (OLAP) in the context of data warehousing. It is, however, very expensive to compute, ... / the most critical issue in data warehouse environments is the time to br OLAP in the context of data warehousing. It is however very
16 Answering Queries Using Views: A Survey - Levy(Correct)
The problem of answering queries using views is to find e#cient methods of answering a
query using a set of previously materialized views over the database, rather than accessing
the database relati... / data integration and data warehouse design. Informally speaking
15 GeoMiner: A System Prototype for Spatial Data Mining - Han, Koperski, Stefanovic (1997)(Correct)
Spatial data mining is to mine high-level spatial information and knowledge from large spatial databases. A spatial data mining system prototype, GeoMiner, has been designed and developed based on our... / in relational databases and data warehouses. Spatial data mining is a br research into data mining and data warehousing in recent years many
15 Expiring Data in a Warehouse - Hector Garcia-Molina (1998)(Correct)
Data warehouses collect data into materialized views for analysis. After some time, some
of the data may no longer be needed or may not be of interest. In this paper, we handle
this by expiring or rem... / Abstract Data warehouses collect data into br views are often used to store warehouse data. The amount of data copied
15 Join Synopses for Approximate Query Answering - Acharya (1999)(Correct)
In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we de... / histograms on the data in the data warehouse. A key feature of Aqua is br Abstract In large data warehousing environments it is often
15 Encoded Bitmap Indexing for Data Warehouses - Wu, Buchmann (1998)(Correct)
We present a new indexing technique, encoded bitmap indexing, for data warehouses
(DW). Three critical factors, complex query types, huge data volumes and very high
read/update ratios, make the indexi... / Encoded Bitmap Indexing for Data Warehouses Ming-Chuan Wu Alejandro P. br Hierarchy Encoding The Warehouse Data Is Usually Modeled As A Star
14 Multidimensional Data Modeling for Complex Data - Pedersen, Jensen (1998)(Correct)
Systems for On-Line Analytical Processing (OLAP) considerably ease the process of analyzing business
data and have become widely used in industry. OLAP systems primarily employ multidimensional
data m... / . R. Kimball. The Data Warehouse Toolkit. Wiley Computer br and the recent focus on data warehousing the notion of On-Line
14 Concurrency Control Theory for Deferred Materialized Views - Kawaguchi, Lieuwen, Mumick, Ross (1997)(Correct)
We consider concurrency control problems that arise in the presence of materialized views. Consider a database system supporting materialized views to speed up queries. For a range of important appl... / relations. considers a data warehouse where a view is materialized br in domains such as data warehousing mobile systems data
14 Measurement and Analysis of IP Network Usage and Behavior - Caceres Duffield Feldmann (2000)(Correct)
Traffic, usage, and performance measurements are
crucial to the design, operation and control of Internet
Protocol (IP) networks. This paper describes a
prototype infrastructure for the measurement, s... / repository we call the WorldNet Data Warehouse. We have used the data both br Measurement Infrastructure and Warehouse Data Sources server's considerable
14 A Survey on Logical Models for OLAP Databases - Vassiliadis, Sellis (1999)(Correct)
this paper we provided a categorization of the
work in the area of OLAP logical models by
surveying some major efforts, from commercial
tools, benchmarks and standards, and academic
efforts. We have a... / not powerful enough for data warehouse applications and that data
13 Metarule-Guided Mining of Multi-Dimensional Association Rules Using.. - Kamber, Jenny, Chiang (1997)(Correct)
In this paper, we employ a novel approach to
metarule-guided, multi-dimensional association
rule mining which explores a data cube structure.
We propose algorithms for metarule-guided mining:
give... / tasks will be performed on data warehouses. With efficient techniques br With recent progress on data warehousing and OLAP technology
13 Active Disks - Remote Execution for Network-Attached Storage - Riedel (1999)(Correct)
Today's commodity disk drives, the basic unit of storage for computer systems large and small, are actually small computers, with a processor, memory, and `network' connection, along with the spinning... / second system i.e. to use a data warehouse separate from the production br Sun storage arrays . TB data warehousing Table - Large storage
12 A Strategy for Database Interoperation - Karp (1995)(Correct)
To realize the full potential of biological databases (DBs) requires more than the interactive, hypertext
flavor of database interoperation that is now so popular in the bioinformatics community. Inte... / Net. . Approach A Data Warehouse In this approach a set of
12 OLAP Mining: An Integration of OLAP with Data Mining - Han (1997)(Correct)
OLAP mining is a mechanism which integrates on-line analytical processing (OLAP) with data mining so that mining can be performed in different portions of databases or data warehouses and at different... / portions of databases or data warehouses and at different levels of br to develop powerful data warehousing and data mining tools for analysis
12 Research Issues in Large Workflow Management Systems - Alonso, Schek (1996)(Correct)
In this position paper we describe what we believe are fundamental weaknesses of existing commercial
workflow products and how database technology can be used to address these issues. By exporting
da... / schema integration and data warehousing are all relevant topics in
12 Integrating Keyword Search into XML Query Processing - Florescu, Kossmann (2000)(Correct)
Due to the popularity of the XML data format, several query languagesfor XML have been proposed,
specially devised to handle data whose structure is unknown, loose, or absent. While these languages ar... / how an RDBMS can be used as a data warehouse for XML data. Unfortunately
11 Graph Structured Views and Their Incremental Maintenance - Zhuge (1998)(Correct)
We study the problem of maintaining materialized views of graph structured data. The base data
consists of records containing identifiers of other records. The data could represent traditional objects... / these algorithms when only a data warehouse and not the data sources br basic architecture of a data warehouse. Data Warehouse Wrapper
11 On-Line Warehouse View Maintenance for Batch Updates - Quass, Widom (1997)(Correct)
Data warehouses store materialized views over base data from external sources. Clients typically
perform complex read-only queries on the views. The views are refreshed periodically by maintenance
tra... / Abstract Data warehouses store materialized views over
11 Replication and Consistency: Being Lazy Helps Sometimes - Breitbart (1997)(Correct)
The issue of data replication is considered in the context of
a restricted system model motivated by certain distributed
data-warehousing applications. A new replica management
protocol is defined for... / with the advent of distributed data warehouses and data marts at the high br by certain distributed data-warehousing applications. A new replica
10 Rewriting of Regular Expressions and Regular Path Queries - Calvanese, De Giacomo, Lenzerini.. (1999)(Correct)
Recent work on semi-structured data has revitalized the
interest in path queries, i.e., queries that ask for all
pairs of objects in the database that are connected by
a path conforming to a certain s... / No. DWQ Foundations of Data Warehouse Quality and by the Italian br well as in data integration data warehousing and query optimization the
10 MultiMediaMiner: A System Prototype for MultiMedia Data Mining - Zaiane, Han, Li, Chiang (1998)(Correct)
Multimedia data mining is the mining of high-level multimedia information and knowledge from large multimedia databases. A multimedia data mining system prototype, MultiMediaMiner, has been designed a... / in relational databases and data warehouses Multimedia has been the br the field of data mining and data warehousing research but nothing
10 Synopsis Data Structures for Massive Data Sets - Matias (1998)(Correct)
Massive data sets with terabytes of data are becoming commonplace. There is an increasing
demand for algorithms and data structures that provide fast response times to queries on such
data sets. In ... / for ad hoc queries of large data warehouses GM In large data br as a cache for the disks. In a data warehousing environment for example
9 Information Retrieval from an Incomplete Data Cube - Curtis Dyreson (1996)(Correct)
A complete data cube is a data cube in which
every aggregate value in the multidimensional
space is stored or can be computed. An incomplete
data cube is a data cube in which
points in the multidimen... / overnight cron job. ffl A data warehouse collects data from a variety br For instance when warehousing data from different stores one
9 Materialized Views and Data Warehouses - Roussopoulos (1997)(Correct)
A data warehouse is a redundant collection of data replicated from several possibly distributed and loosely coupled source databases, organized to answer OLAP queries. Relational views are used both a... / Materialized Views and Data Warehouses Nick Roussopoulos br plan for the derivation of the warehouse data. In this position paper we
9 Maintaining Data Cubes under Dimension Updates - Carlos Hurtado (1999)(Correct)
OLAP systems support data analysis through a multidimensional
data model, according to which data facts
are viewed as points in a space of application-related
"dimensions", organized into levels which... / the dynamic aspect of the data warehouse while dimensions are
9 Towards On-Line Analytical Mining in Large Databases - Han (1998)(Correct)
Great efforts have been paid in the Intelligent Database Systems Research Lab for the research and development of efficient data mining methods and construction of on-line analytical data mining syste... / large relational databases and data warehouses. The system implements a wide br mining relational data data warehouse data spatial data data formed
9 Unbundling Active Functionality - Gatziu, Koschel, Bültzingsloewen.. (1998)(Correct)
New application areas or new technical innovations
expect from database management systems more and
more new functionality. However, adding functions to the
DBMS as an integral part of them, tends to ... / new application areas like data warehousing new architectural forms
9 An Alternative Storage Organization for ROLAP Aggregate Views Based.. - Kotidis, Roussopoulos (1998)(Correct)
The Relational On-Line Analytical Processing (ROLAP) is emerging
as the dominant approach in data warehousing with decision support
applications. In order to enhance query performance, the ROLAP
appro... / warehouse. However in large data warehouses indexing alone is often not br as the dominant approach in data warehousing with decision support
9 Intelligent Agents for Intrusion Detection - Helmer, Wong, Honavar, Miller (1998)(Correct)
This paper focuses on intrusion detection and
countermeasures with respect to widely-used
operating systems and networks. The design and
architecture of an intrusion detection system built
from distri... / agents maintain the data warehouse by combining knowledge and
9 WebOQL: Restructuring Documents, Databases and Webs - Arocena, Mendelzon (1998)(Correct)
The widespread use of the Web has originated several new
data management problems, such as extracting data from
Web pages and making databases accessible from Web
browsers, and has renewed the interes... / with certain features Web-data warehousing i.e.extracting
9 Discovering Structural Association of Semistructured Data - Wang, Liu (1999)(Correct)
Many semistructured objects are similarly, though not identically, structured. We study the
problem of discovering "typical" substructures of a collection of semistructured objects. The
discovered s... / sources such as data warehousing Unlike unstructured raw
8 Query Optimization for Selections using Bitmaps - Wu (1998)(Correct)
Bitmaps are popular indexes for Data Warehouse (DW) applications and most database
management systems (DBMSs) offer them today. This paper analyzes query optimization
issues for selection operations u... / Bitmaps are popular indexes for Data Warehouse DW applications and most
8 Views for Semistructured Data - Serge Abiteboul (1977)(Correct)
Defining a view over a semistructured database introduces many new problems. In this paper
we propose a view specification language and consider the problem of answering queries posed
over views. The ... / For example consider a large data warehouse stored in Lore that
8 Generalized Projections: A Powerful Approach to Aggregation - Gupta, Harinarayan, Quass (1995)(Correct)
In this paper we introduce generalized projections (GPs), an extension of duplicate-eliminating projections, that capture aggregations, groupbys, conventional projection with duplicate elimination (di... / the growing number of large data warehouses for decision support br and important problem in data warehousing how to answer an aggregate
8 Supporting Data Integration and Warehousing Using H2O - Zhou, Hull, King, Franchitti(Correct)
This paper presents a broad framework for data integration, that supports both data materialization
and virtual view capabilities, and that can be used with legacy as well as modern database
systems. ... / component of the mediator is a data warehouse that holds a materialized br addressing this problem data warehousing i.e.materializing
8 A Framework for Designing Materialized Views in Data Warehousing.. - Yang, Karlapalem, Li (1996)(Correct)
Data warehouses may contain multiple views with different query frequencies. When these views are related to each other and defined over overlapping portions of the base data, then it may be more effi... / Hong Kong Abstract Data warehouses may contain multiple views br Materialized Views in Data Warehousing Environment J. Yang K.