Databases: Data Warehousing

Ordered by the number of citations

This directory is created automatically and some papers may be mislabeled. Only documents within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher-level categories.

188   Implementing Data Cubes Efficiently - Harinarayan, Rajaraman, Ullman (1996)   (Correct)
Decision support applications involve complex queries on very large databases. Since response times should be small, query optimization is critical. Users typically view the data as multidimensional d... / databases called data warehouses on which users can carry out

128   Research Problems in Data Warehousing - Widom (1995)   (Correct)
The topic of data warehousing encompasses architectures, algorithms, and tools for bringing together selected data from multiple databases or other information sources into a single repository, called... / a single repository called a data warehouse suitable for direct br Wrapper Monitor Integrator Warehouse Data Figure Basic

114   View Maintenance in a Warehousing Environment - Zhuge, Garcia-Molina, Hammer, Widom (1995)   (Correct)
A warehouse is a repository of integrated information drawn from remote data sources. Since a warehouse effectively implements materialized views, we must maintain the views as the data sources are up... / information sources. A data warehouse is a repository of br consistent meaning that the warehouse data always corresponds to a mean-

94   Scaling Heterogeneous Databases and the Design of DISCO - Tomasic, Raschid, Valduriez (1996)   (Correct)
Access to large numbers of data sources introduces new problems for users of heterogeneous distributed databases. End users and application programmers must deal with unavailable data sources. Databas... / A mediator may as in data warehousing also keep state or summary

85   Infomaster: An Information Integration System - Genesereth, Keller, Duschka (1997)   (Correct)
Infomaster is an information integration system that provides integrated access to multiple distributed heterogeneous information sources on the Internet, thus giving the illusion of a centralized, ho... / Infomaster creates a virtual data warehouse. The core of Infomaster is a

80   Data Mining: An Overview from a Database Perspective - Chen, Han, Yu (1996)   (Correct)
Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an i... / providing services such as data warehousing and on-line services over the br of the techniques of data warehousing and data mining in the near future.

74   Storing Semistructured Data with STORED - Deutsch, Fernandez, Suciu (1999)   (Correct)
In this paper, we describe a technique for using relational databases to store and manage semistructured data. Our purpose is to use high-performance RDBM systems to store, query, and manage semistructur... / of query plans Ull or data warehouse design TS we must

73   Data Mining: An Overview from Database Perspective - Chen, Han, Yu (1997)   (Correct)
Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an i... / providing services such as data warehousing and on-line services over the br of the techniques of data warehousing and data mining in the near future.

68   Index Selection for OLAP - Gupta, Harinarayan, Rajaraman, Ullman (1997)   (Correct)
On-line analytical processing (OLAP) is a recent and important application of database systems. Typically, OLAP data is presented as a multidimensional "data cube." OLAP queries are complex and can ta... / database commonly called a data warehouse. Analysts use the data

68   Complexity of Answering Queries Using Materialized Views - Abiteboul, Duschka (1998)   (Correct)
We study the complexity of the problem of answering queries using materialized views. This problem has attracted a lot of attention recently because of its relevance in data integration. Previous work... / with the popularity of data warehouses The problem of br problems which arise in data warehousing. Introduction The

68   Complexity of Answering Queries Using Materialized Views (Extended.. - Abiteboul, Duschka (1998)   (Correct)
We study the complexity of the problem of answering queries using materialized views. This problem has attracted a lot of attention recently because of its relevance in data integration. Previous work... / with the popularity of data warehouses LZW The problem br problem which arise in data warehousing. Part of the work

67   Aggregate-Query Processing in Data Warehousing Environments - Gupta, Harinarayan, Quass (1995)   (Correct)
In this paper we introduce generalized projections (GPs), an extension of duplicate-eliminating projections, that capture aggregations, groupbys, duplicate-eliminating projections (distinct), and du... / the growing number of large data warehouses for decision support br Aggregate-Query Processing in Data Warehousing Environments Ashish

65   Selection of Views to Materialize in a Data Warehouse - Gupta (1997)   (Correct)
A data warehouse stores materialized views of data from one or more sources, with the purpose of efficiently implementing decision-support or OLAP queries. One of the most important decisions in desi... / of Views to Materialize in a Data Warehouse Himanshu Gupta

58   Scaling Clustering Algorithms to Large Databases - Bradley, Fayyad, Reina (1998)   (Correct)
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clustering framework applicab... / over a potentially distributed data warehouse with much processing

56   Making Views Self-Maintainable for Data Warehousing - Dallan Quass (1996)   (Correct)
A data warehouse stores materialized views over data from one or more sources in order to provide fast access to the integrated data, regardless of the availability of the data sources. Warehouse view... / Abstract A data warehouse stores materialized views over br Views Self-Maintainable for Data Warehousing Dallan Quass Ashish

56   Making Views Self-Maintainable for Data Warehousing (Extended.. - Quass (1996)   (Correct)
Dallan Quass Stanford University quass@cs.stanford.edu Ashish Gupta Oracle Corporation ashgupta.us.oracle.com Inderpal Singh Mumick AT&T Bell Laboratories mumick@research.att.com Jennife... / Abstract A data warehouse stores materialized views over br Views Self-Maintainable for Data Warehousing Extended Abstract

56   Managing Semantic Heterogeneity in Databases: A Theoretical.. - Hull (1997)   (Correct)
In Proc. of Intl. Conf. on Very Large Data Bases, pages 455--468, 1990. [HZ96] R. Hull and G. Zhou. A framework for supporting data integration using the materialized and virtual approaches. In Pro... / Rdb VMS Developing the Data Warehouse. QED Publishing Group

51   Answering Queries with Aggregation Using Views - Srivastava, Dar, H.V.Jagadish, Levy (1996)   (Correct)
We present novel algorithms for the problem of using materialized views to compute answers to SQL queries with grouping and aggregation, in the presence of multiset tables. In addition to its obvious ... / Example . Consider a data warehouse that holds information useful br in many applications such as data warehousing very large transaction

48   Materialized View Selection in a Multidimensional Database - Baralis (1997)   (Correct)
A multidimensional database is a data repository that supports the efficient execution of complex business decision queries. Query response can be significantly improved by storing an appropriate set ... / data. An MDDB is a relational data warehouse in which the information is

45   Including Group-By in Query Optimization - Chaudhuri (1994)   (Correct)
In existing relational database systems, processing of group-by and computation of aggregate functions are always postponed until all joins are performed. In this paper, we present transformations tha... / are of great importance in data warehouse applications. These queries

45   Adapting Materialized Views After Redefinitions: . . . - Gupta et al. (1995)   (Correct)
In this article, we consider the problem of keeping a materialized view up-to-date in response to changes made to the view definition, that is, in response to redefinition of the view. We call this probl... / is a reasonable assumption in data warehouse environments where data br related to decision support data warehousing and data integration. A

44   Description Logics for Conceptual Data Modeling - Calvanese et al. (1998)   (Correct)
The article aims at establishing a logical approach to class-based data modeling. After a discussion on class-based formalisms for data modeling, we introduce a family of logics, called Description... / DWQ Foundations of Data Warehouse Quality DESCRIPTION

42   Recursive Plans for Information Gathering - Duschka, Levy (1997)   (Correct)
Generating query-answering plans for information gathering agents requires to translate a user query, formulated in terms of a set of virtual relations, to a query that uses relations that are actuall... / for query optimization and data warehousing Yang and Larson

42   Efficient View Maintenance at Data Warehouses - Agrawal, Abbadi, Singh, Yurek (1997)   (Correct)
We present incremental view maintenance algorithms for a data warehouse derived from multiple distributed autonomous data sources. We begin with a detailed framework for analyzing view maintenance alg... / Efficient View Maintenance at Data Warehouses D. Agrawal A. El Abbadi

42   On Similarity-Based Queries for Time Series Data - Davood Rafiei (1997)   (Correct)
We study similarity queries for time series data where similarity is defined in terms of a set of linear transformations on the Fourier series representation of a sequence. We have shown in an earlier... / such as data mining or data warehousing. A time series is a sequence

41   Algorithms for Deferred View Maintenance - Colby, Griffin, Libkin, Mumick.. (1997)   (Correct)
Materialized views and view maintenance are important for data warehouses, retailing, banking, and billing applications. We consider two related view maintenance problems: 1) how to maintain views aft... / maintenance are important for data warehouses retailing banking and

41   The Strobe Algorithms for Multi-Source Warehouse Consistency - Zhuge, Garcia-Molina, Wiener (1996)   (Correct)
A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are... / Introduction A data warehouse is a repository of integrated br Maintaining the consistency of warehouse data is challenging especially if

41   Description Logic Framework for Information Integration - Calvanese, De Giacomo, Lenzerini.. (1998)   (Correct)
Information Integration is one of the core problems in distributed databases, cooperative information systems, and data warehousing, which are key areas in the software development industry. Two criti... / Project DWQ Foundations of Data Warehouse Quality Calvanese De br information systems and data warehousing which are key areas in the

38   Knowledge Discovery and Data Mining: Towards a Unifying Framework - Fayyad, Piatetsky-Shapiro, Smyth (1996)   (Correct)
This paper presents a first step towards a unifying framework for Knowledge Discovery in Databases. We describe links between data mining, knowledge discovery, and other related fields. We then define... / approach for analysis of data warehouses has been called OLAP br another related area is data warehousing which refers to the popular

37   On the Decidability of Query Containment under Constraints - Calvanese, De Giacomo, Lenzerini (1998)   (Correct)
Query containment under constraints is the problem of checking whether for every database satisfying a given set of constraints, the result of one query is a subset of the result of another query. Rec... / No. DWQ Foundations of Data Warehouse Quality the Italian br view maintenance data warehousing and constraint

37   New Sampling-Based Summary Statistics for Improving Approximate Query .. - Gibbons, Matias (1998)   (Correct)
In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries, whenever possible. Before DBMSs providing highly-accurate approximate an... / of ongoing insertions to the data warehouse. Introduction In large br Figure A traditional data warehouse. Data Warehouse New Data

36   Methods and problems in data mining - Mannila (1997)   (Correct)
Knowledge discovery in databases and data mining aim at semiautomatic tools for the analysis of large data sets. We consider some methods used in data mining, concentrating on levelwise search for all... / the rise of the concepts of data warehousing and on-line analytical

35   Part-Whole Relations in Object-Centered Systems: An Overview - Artale, Franconi, Guarino, Pazzi (1996)   (Correct)
Knowledge bases, data bases and object-oriented systems (referred to in the paper as Object-Centered systems) all rely on attributes as the main construct used to associate properties to objects; amon... / DWQ Foundations of Data Warehouse Quality Part-Whole

34   SchemaSQL - A Language for Interoperability in Relational.. - Lakshmanan, Sadri, Subramanian (1996)   (Correct)
We provide a principled extension of SQL, called SchemaSQL , that offers the capability of uniform manipulation of data and meta-data in relational multi-database systems. We develop a precise syntax ... /

32   Discovering Web Access Patterns and Trends by Applying OLAP and Data.. - Zaïane, Xin, Han (1998)   (Correct)
As a confluence of data mining and WWW technologies, it is now possible to perform data mining on web log records collected from the Internet web page access history. The behaviour of the web page rea... / of relational database and data warehouse-based data mining system br of data mining and data warehousing has made available powerful

31   WATCHMAN: A Data Warehouse Intelligent Cache Manager - Scheuermann, Shim, Vingralek (1996)   (Correct)
Data warehouses store large volumes of data which are used frequently by decision support applications. Such applications involve complex queries. Query performance in such an environment is critical ... / WATCHMAN A Data Warehouse Intelligent Cache Manager

31   Approximate Computation of Multidimensional Aggregates of Sparse Data .. - Vitter, Wang (1999)   (Correct)
Computing multidimensional aggregates in high dimensions is a performance bottleneck for many OLAP applications. Obtaining the exact answer to an aggregation query can be prohibitively expensive in te... / time and or storage space in a data warehouse environment. It is

30   Multiple-View Self-Maintenance in Data Warehousing Environments - Huyn (1997)   (Correct)
A data warehouse is a collection of materialized views derived from relations that may not reside at the warehouse. Using these stored views, user queries can often be evaluated much more cheaply than... / Abstract A data warehouse is a collection of br Self-Maintenance in Data Warehousing Environments Technical

30   Maintenance of Data Cubes and Summary Tables in a Warehouse - Mumick, Quass, Mumick (1997)   (Correct)
Data warehouses contain large amounts of information, often collected from a variety of independent sources. Decision-support functions in a warehouse, such as on-line analytical processing (OLAP), inv... / Abstract Data warehouses contain large amounts of

30   Rewriting Aggregate Queries Using Views - Cohen, Nutt, Serebrenik (1999)   (Correct)
We investigate the problem of rewriting queries with aggregate operators using views that may or may not contain aggregate operators. A rewriting of a query is a second query that uses view predicates... / value. In fact most existing data warehouses make use of this idea in br recently by the surge of data warehousing and decision support

30   Selection of Views to Materialize Under a Maintenance Cost Constraint - Gupta (1999)   (Correct)
A data warehouse stores materialized views derived from one or more sources for the purpose of efficiently implementing decision-support or OLAP queries. One of the most important decisions in design... / Summit NJ Abstract. A data warehouse stores materialized views br source s for execution. Also warehouse data is available for queries even

29   A Description Logic with Transitive and Inverse Roles and Role.. - Horrocks (1998)   (Correct)
base could contain the following entries defining two different parts of the brain, namely the gyrus and the cerebellum. In contrast to a gyrus, a cerebellum is an integral organ and, furthermore, a f... / DWQ Foundations of Data Warehouse Quality A Description

27   Change Detection in Hierarchically Structured Information - Sudarshan Chawathe (1996)   (Correct)
Detecting and representing changes to data is important for active databases, data warehousing, view maintenance, and version and configuration management. Most previous work in change management has ... / Rdb VMS Developing the Data Warehouse. QED Publishing Group br for active databases data warehousing view maintenance and

27   Supporting Multiple View Maintenance Policies - Colby (1997)   (Correct)
Materialized views and view maintenance are becoming increasingly important in practice. In order to satisfy different data currency and performance requirements, a number of view maintenance policies... / retailing decision support data warehousing and data inte- The br decision support data warehousing and data inte- The work of L.

27   On-Line Warehouse View Maintenance - Quass, Widom (1997)   (Correct)
Data warehouses store materialized views over base data from external sources. Clients typically perform complex read-only queries on the views. The views are refreshed periodically by maintenance tra... / Abstract Data warehouses store materialized views over

27   Bitmap Index Design and Evaluation - Chan, Ioannidis (1998)   (Correct)
Bitmap indexing has been touted as a promising approach for processing complex ad hoc queries in read-mostly environments, like those of decision support systems. Nevertheless, only few possible bitmap... / the disk space requirement of data warehouse applications. Understanding br specifically designed for data warehousing applications which supports

26   Maintenance of Materialized Views: Problems, Techniques, and.. - Gupta, Mumick (1995)   (Correct)
In this paper we motivate and describe materialized views, their applications, and the problems and techniques for their maintenance. We present a taxonomy of view maintenance problems based upon the ... / is often described as a data warehouse. Materialized views provide a br in new applications such as data warehousing replication servers

25   Data Cube Approximation and Histograms via Wavelets (Extended.. - Vitter et al. (1998)   (Correct)
Jeffrey Scott Vitter Center for Geometric Computing and Department of Computer Science Duke University Durham, NC 27708--0129 USA jsv@cs.duke.edu Min Wang y Center for Geometric Computing and De... / in the analysis of data in data warehouses in the field of On-Line

25   A Logical Approach to Multidimensional Databases - Cabibbo, Torlone (1998)   (Correct)
In this paper we present MD, a logical model for OLAP systems, and show how it can be used in the design of multidimensional databases. Unlike other models for multidimensional databases, MD is i... / production needs. A data warehouse is an integrated collection

24   The Stanford Data Warehousing Project - Hammer, Garcia-Molina, Widom, Labio, .. (1995)   (Correct)
The goal of the data warehousing project at Stanford (the WHIPS project) is to develop algorithms and tools for the efficient collection and integration of information from heterogeneous and autonomou... / project. Introduction A data warehouse is a repository of integrated br already resolved. Furthermore warehouse data can be accessed without tying

24   Efficient Time Series Matching by Wavelets - Chan, Fu (1999)   (Correct)
Time series stored as feature vectors can be indexed by multidimensional index trees like R-Trees for fast retrieval. Due to the dimensionality curse problem, transformations are applied to time serie... / database applications such as data warehousing and data mining br applications such as data warehousing and data mining A

23   Information Integration: Conceptual Modeling and Reasoning Support - Calvanese, De Giacomo, Lenzerini.. (1998)   (Correct)
Information Integration is one of the core problems in cooperative information systems. We argue that two critical factors for the design and maintenance of applications requiring Information Integrat... / An example of Query Model in a Data Warehouse application is a conceptual br information systems and data warehousing which are key areas in the

22   Range Queries in OLAP Data Cubes - Ho, Agrawal, Megiddo, Srikant (1997)   (Correct)
A range query applies an aggregation operation over all selected cells of an OLAP data cube where the selection is specified by providing ranges of values for numeric dimensions. We present fast algor... / databases built from their data warehouses. An increasingly popular data

22   Algorithms for Materialized View Design in Data Warehousing.. - Yang, Karlapalem, Li (1997)   (Correct)
Selecting views to materialize is one of the most important decisions in designing a data warehouse. In this paper, we present a framework for analyzing the issues in selecting views to materialize so... / decisions in designing a data warehouse. In this paper we present a br Materialized View Design in Data Warehousing Environment Jian Yang

22   A Data Model for Supporting On-Line Analytical Processing - Li (1996)   (Correct)
A database application, called "on-line analytical processing" (or OLAP) and aimed at providing business intelligence through on-line multidimensional data analysis, has become increasingly important ... / are based on the concept of a data warehouse storing materialized views

22   Incremental Maintenance of Externally Materialized Views - Staudt, Jarke (1996)   (Correct)
With the advent of the Internet, access to database servers from autonomous clients will become more and more popular. In this paper, we propose a monitoring service that could be offered by such data... / multi-databases and data warehouses Incremental br is change propagation in data warehousing Traditional

22   Fast Incremental Maintenance of Approximate Histograms - Phillip Gibbons Yossi (1997)   (Correct)
Many commercial database systems maintain histograms to summarize the contents of large relations and permit efficient estimation of query result sizes for use in query optimizers. Delaying the propa... / This pattern is common in data warehouses keeping transactional br environments or in data warehousing environments that house

21   A Case for Delay-Conscious Caching of Web Documents - Scheuermann, Shim, Vingralek (1997)   (Correct)
Caching at proxy servers plays an important role in reducing the latency of the user response, the network delays and the load on Web servers. The cache performance depends critically on the design of... / R. Vingralek WATCHMAN A Data Warehouse Intelligent Cache Manager br for caching query results in a data warehousing environment We

21   Computing Iceberg Queries Efficiently - Min Fang (1998)   (Correct)
Many applications compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. We call such queries iceberg queries, because the numbe... / market basket queries on large data warehouses that store customer sales br many applications including data warehousing information-retrieval

21   Querying Multidimensional Databases - Cabibbo, Torlone (1997)   (Correct)
Multidimensional databases are large collections of data, often historical, used for sophisticated analysis oriented to decision making. This activity is supported by an emerging category of softw... / large historical databases data warehouses oriented to decision making.

20   Workflow Handbook - Lawrence (1997)   (Correct)
This article is a position paper on the nature of the data warehouse refreshment which is often defined as a view maintenance problem or as a loading process. We will show that the refreshment proc... / - Modeling Data Warehouse Refreshment Process as a br systems used for the data warehouse and data marts wrappers and

19   Recovering Information from Summary Data - Faloutsos, Jagadish, Sidiropoulos (1997)   (Correct)
Data is often stored in summarized form, as a histogram of aggregates (COUNTs, SUMs, or AVeraGes) over specified ranges. We study how to estimate the original detail data from the stored summary. We f... / information such as OLAP data warehousing and histograms in query

19   A Survey of Methods for Scaling Up Inductive Algorithms - Provost, Kolluri (1999)   (Correct)
One of the defining challenges for the KDD research community is to enable inductive learning algorithms to mine very large databases. By collecting, categorizing, and summarizing existing work on s... / unlikely that all the data in a data warehouse would be mined simultaneously.

19   Source Integration in Data Warehousing - Calvanese, De Giacomo, Lenzerini.. (1997)   (Correct)
Source Integration is one of the core problems in Data Warehousing. Two critical factors for the design and maintenance of applications requiring Source Integration, and in particular Data Warehouse a... / Integration and in particular Data Warehouse applications are conceptual br Source Integration in Data Warehousing Diego Calvanese Giuseppe

19   Answering Queries Using Views in Description Logics - Calvanese, De Giacomo, Lenzerini (1999)   (Correct)
Answering queries using views amounts to computing the answer to a query having information only on the extension of a set of views. This problem is relevant in several fields, such as information inte... / as information integration data warehousing query optimization etc. In

19   Synchronizing a database to Improve Freshness - Cho, Garcia-Molina (2000)   (Correct)
In this paper we study how to refresh a local copy of an autonomous data source to maintain the copy up-to-date. As the size of the data grows, it becomes more difficult to maintain the copy "fresh," ... / availability. For instance a data warehouse may copy remote sales and

19   A Scalable Algorithm for Answering Queries Using Views - Pottinger, Levy (2000)   (Correct)
The problem of answering queries using views is to find efficient methods of answering a query using a set of previously materialized views over the database, rather than accessing the database rel... / and data warehouse and web-site design

18   Data Mining and Database Systems: Where is the Intersection? - Chaudhuri (1998)   (Correct)
this paper). This raises the question as to what role, if any, database systems research may contribute to area of data mining. In this article, I will try to present my biased view on this issue and ... / issues. However even after a data warehouse has been set up it is often br systems. Data is in the warehouse Data warehouses are deploying

18   Multiple View Consistency for Data Warehousing - Zhuge, Wiener, Garcia-Molina (1997)   (Correct)
A data warehouse stores integrated information from multiple distributed data sources. In effect, the warehouse stores materialized views over the source data. The problem of ensuring data consistency... / Abstract A data warehouse stores integrated information br generates transactions for the warehouse database system. We make no

18   Data Warehouse Configuration - Theodoratos, Sellis (1997)   (Correct)
In the data warehousing approach to the integration of data from multiple information sources, selected information is extracted in advance and stored in a repository. A data warehouse (DW) can th... / Data Warehouse Configuration Dimitri

18   What can Knowledge Representation do for Semi-Structured Data? - Calvanese (1998)   (Correct)
The problem of modeling semi-structured data is important in many application areas such as multimedia data management, biological databases, digital libraries, and data integration. Graph schemas ... / DWQ Foundations of Data Warehouse Quality What can

18   Querying Aggregate Data - Grumbach, Rafanelli, Tininini (1999)   (Correct)
We introduce a first-order language with real polynomial arithmetic and aggregation operators (count, iterated sum and multiply), which is well suited for the definition of aggregate queries involving... / such as for instance data warehousing. In such applications

17   Physical Database Design for Data Warehouses - Labio, Quass, Adelberg (1997)   (Correct)
Data warehouses collect copies of information from remote sources into a single database. Since the remote data is cached at the warehouse, it appears as local relations to the users of the warehouse.... / Physical Database Design for Data Warehouses Wilburt Juan

17   Using Schematically Heterogeneous Structures - Miller (1998)   (Correct)
Schematic heterogeneity arises when information that is represented as data under one schema, is represented within the schema (as metadata) in another. Schematic heterogeneity is an important class o... / has been collected in a data warehouse and consider a decision br legacy data in federated or data warehousing applications. Traditional

17   Static Versus Dynamic Sampling for Data Mining - John (1996)   (Correct)
As data warehouses grow to the point where one hundred gigabytes is considered small, the computational efficiency of data-mining algorithms on large databases becomes increasingly important. Using a ... / Abstract As data warehouses grow to the point where one

17   Efficient Mining of Association Rules in Distributed Databases - Cheung, Ng, Fu, Fu (1996)   (Correct)
Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of seq... / data mining together with data warehousing and data repositories are br mining together with data warehousing and data repositories are three new

17   Recursive Query Plans for Data Integration - Duschka, Genesereth, Levy (1999)   (Correct)
Generating query-answering plans for data integration systems requires to translate a user query, formulated in terms of a mediated schema, to a query that uses relations that are actually stored in d... / for query optimization and data warehousing Most

17   FaCT and iFaCT - Horrocks   (Correct)
I), consisting of a set Δ^I, called the domain of I, and a function ·^I which maps every concept to a subset of Δ^I and every role to a subset of Δ^I × Δ^I such that the properties... / schema assertions from a data warehousing application Calvanese et

16   A System Prototype for Warehouse View Maintenance - Wiener, Gupta, Labio, Zhuge.. (1996)   (Correct)
A data warehouse collects and integrates data from multiple, autonomous, heterogeneous, sources. The warehouse effectively maintains one or more materialized views over the source data. In this paper ... / Abstract A data warehouse collects and integrates data br the basic architecture of a warehouse data is collected from each

16   Data Integration using Self-Maintainable Views - Gupta (1996)   (Correct)
In this paper we define the concept of self-maintainable views -- these are views that can be maintained using only the contents of the view and the database modifications, without accessing any of ... / of such an environment is data warehousing wherein views are used for

16   Mining surprising patterns using temporal description length - Chakrabarti, Sarawagi, Dom (1998)   (Correct)
We propose a new notion of surprising temporal patterns in market basket data, and algorithms to find such patterns. This is distinct from finding frequent patterns as addressed in the common mining l... / roles. Introduction Data warehousing technology has enabled

16   BOAT - Optimistic Decision Tree Construction - Gehrke, Ganti, Ramakrishnan, Loh (1999)   (Correct)
Classification is an important data mining problem. Given a training database of records, each tagged with a class label, the goal of classification is to build a concise model that can be used to pre... / dynamic environments such as data warehouses in which the training

16   Incremental Clustering for Mining in a Data Warehousing Environment - Ester, Kriegel, Sander, Wimmer, Xu (1998)   (Correct)
Data warehouses provide a great deal of opportunities for performing data mining tasks such as classification and clustering. Typically, updates are collected and applied to the data warehouse periodi... / Abstract Data warehouses provide a great deal of / Clustering for Mining in a Data Warehousing Environment Martin Ester

16   Cubetree: Organization of and Bulk Incremental Updates on the Data.. - Roussopoulos (1997)   (Correct)
The data cube is an aggregate operator which has been shown to be very powerful for On Line Analytical Processing (OLAP) in the context of data warehousing. It is, however, very expensive to compute, ... / the most critical issue in data warehouse environments is the time to / OLAP in the context of data warehousing. It is however very

16   Answering Queries Using Views: A Survey - Levy   (Correct)
The problem of answering queries using views is to find efficient methods of answering a query using a set of previously materialized views over the database, rather than accessing the database relati... / data integration and data warehouse design. Informally speaking

16   Change-Centric Management of Versions in an XML Warehouse - Marian, Abiteboul, Mignet (2000)   (Correct)
We consider the management of changes in a Web Warehouse of XML data. Our approach is change-centric in that it focuses on deltas, i.e., the changes themselves vs. other approaches based on snapshots ... / project. Keywords XML Datawarehouse Versions Deltas Temporal

15   GeoMiner: A System Prototype for Spatial Data Mining - Han, Koperski, Stefanovic (1997)   (Correct)
Spatial data mining is to mine high-level spatial information and knowledge from large spatial databases. A spatial data mining system prototype, GeoMiner, has been designed and developed based on our... / in relational databases and data warehouses. Spatial data mining is a / research into data mining and data warehousing in recent years many

15   Expiring Data in a Warehouse - Hector Garcia-Molina (1998)   (Correct)
Data warehouses collect data into materialized views for analysis. After some time, some of the data may no longer be needed or may not be of interest. In this paper, we handle this by expiring or rem... / Abstract Data warehouses collect data into / views are often used to store warehouse data. The amount of data copied

15   Join Synopses for Approximate Query Answering - Acharya (1999)   (Correct)
In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we de... / histograms on the data in the data warehouse. A key feature of Aqua is / Abstract In large data warehousing environments it is often

15   Encoded Bitmap Indexing for Data Warehouses - Wu, Buchmann (1998)   (Correct)
We present a new indexing technique, encoded bitmap indexing, for data warehouses (DW). Three critical factors, complex query types, huge data volumes and very high read/update ratios, make the indexi... / Encoded Bitmap Indexing for Data Warehouses Ming-Chuan Wu Alejandro P. / Hierarchy Encoding The Warehouse Data Is Usually Modeled As A Star

15   An Extensible Framework for Data Cleaning - Galhardas, Florescu, Shasha, Simon (2000)   (Correct)
Data integration solutions dealing with large amounts of data have been strongly required in the last few years. Besides the traditional data integration problems (e.g. schema integration, local to g... / ed view of them. Following the data warehouse terminology we shall call

15   DynaMat: A Dynamic View Management System for Data Warehouses - Kotidis, Roussopoulos (1999)   (Correct)
Pre-computation and materialization of views with aggregate functions is a common technique in Data Warehouses. Due to the complex structure of the warehouse and the different profiles of the users wh... / View Management System for Data Warehouses Yannis Kotidis

15   Can a Shared-Memory Model Serve as a Bridging Model for Parallel.. - Gibbons, Matias, Ramachandran (1998)   (Correct)
There has been a great deal of interest recently in the development of general-purpose bridging models for parallel computation. Models such as the BSP and LogP have been proposed as more realistic ...

14   Multidimensional Data Modeling for Complex Data - Pedersen, Jensen (1998)   (Correct)
Systems for On-Line Analytical Processing (OLAP) considerably ease the process of analyzing business data and have become widely used in industry. OLAP systems primarily employ multidimensional data m... / . R. Kimball. The Data Warehouse Toolkit. Wiley Computer / and the recent focus on data warehousing the notion of On-Line

14   Concurrency Control Theory for Deferred Materialized Views - Kawaguchi, Lieuwen, Mumick, Ross (1997)   (Correct)
We consider concurrency control problems that arise in the presence of materialized views. Consider a database system supporting materialized views to speed up queries. For a range of important appl... / relations. considers a data warehouse where a view is materialized / in domains such as data warehousing mobile systems data

14   Measurement and Analysis of IP Network Usage and Behavior - Caceres, Duffield, Feldmann (2000)   (Correct)
Traffic, usage, and performance measurements are crucial to the design, operation and control of Internet Protocol (IP) networks. This paper describes a prototype infrastructure for the measurement, s... / repository we call the WorldNet Data Warehouse. We have used the data both / Measurement Infrastructure and Warehouse Data Sources server's considerable

14   A Survey on Logical Models for OLAP Databases - Vassiliadis, Sellis (1999)   (Correct)
In this paper we provided a categorization of the work in the area of OLAP logical models by surveying some major efforts, from commercial tools, benchmarks and standards, and academic efforts. We have a... / not powerful enough for data warehouse applications and that data

13   High-Performance Cluster Computing Using SCI - Ibel, Schauser, Scheiman, Weis (1997)   (Correct)
The Scalable Coherent Interface (SCI) is a recent communication standard for cluster interconnects. We study the use of SCI in a high-performance parallel computing setting, using a cluster of UltraSp... / dia data mining and data warehousing have created additional

13   Conceptual Design of Data Warehouses from E/R Schemes - Golfarelli, Maio, Rizzi (1998)   (Correct)
Data warehousing systems enable enterprise managers to acquire and integrate information from heterogeneous sources and to query very large databases efficiently. Building a data warehouse requires ad... / Hawaii. Conceptual Design of Data Warehouses from E R Schemes Matteo

13   Metarule-Guided Mining of Multi-Dimensional Association Rules Using.. - Kamber, Jenny, Chiang (1997)   (Correct)
In this paper, we employ a novel approach to metarule-guided, multi-dimensional association rule mining which explores a data cube structure. We propose algorithms for metarule-guided mining: give... / tasks will be performed on data warehouses. With efficient techniques / With recent progress on data warehousing and OLAP technology

13   Active Disks - Remote Execution for Network-Attached Storage - Riedel (1999)   (Correct)
Today's commodity disk drives, the basic unit of storage for computer systems large and small, are actually small computers, with a processor, memory, and `network' connection, along with the spinning... / second system i.e. to use a data warehouse separate from the production / Sun storage arrays . TB data warehousing Table - Large storage

12   A Strategy for Database Interoperation - Karp (1995)   (Correct)
To realize the full potential of biological databases (DBs) requires more than the interactive, hypertext flavor of database interoperation that is now so popular in the bioinformatics community. Inte... / Net. . Approach A Data Warehouse In this approach a set of

12   OLAP Mining: An Integration of OLAP with Data Mining - Han (1997)   (Correct)
OLAP mining is a mechanism which integrates on-line analytical processing (OLAP) with data mining so that mining can be performed in different portions of databases or data warehouses and at different... / portions of databases or data warehouses and at different levels of / to develop powerful data warehousing and data mining tools for analysis

12   Research Issues in Large Workflow Management Systems - Alonso, Schek (1996)   (Correct)
In this position paper we describe what we believe are fundamental weaknesses of existing commercial workflow products and how database technology can be used to address these issues. By exporting da... / schema integration and data warehousing are all relevant topics in

12   Integrating Keyword Search into XML Query Processing - Florescu, Kossmann (2000)   (Correct)
Due to the popularity of the XML data format, several query languages for XML have been proposed, specially devised to handle data whose structure is unknown, loose, or absent. While these languages ar... / how an RDBMS can be used as a data warehouse for XML data. Unfortunately

12   Incremental Computation and Maintenance of Temporal Aggregates - Jun Yang.. (2001)   (Correct)
We consider the problems of computing aggregation queries in temporal databases, and of maintaining materialized temporal aggregate views efficiently. The latter problem is particularly challenging si... / the rapidly increasing use of data warehouses to collect historical

12   Description Logics with Concrete Domains and Aggregation - Baader, Sattler (1998)   (Correct)
We extend different Description Logics by concrete domains (such as integers and reals) and by aggregation functions over these domains (such as min; max; count; sum), which are usually available in d... / DWQ Foundations of Data Warehouse Quality Description

11   Graph Structured Views and Their Incremental Maintenance - Zhuge (1998)   (Correct)
We study the problem of maintaining materialized views of graph structured data. The base data consists of records containing identifiers of other records. The data could represent traditional objects... / these algorithms when only a data warehouse and not the data sources / basic architecture of a data warehouse. Data Warehouse Wrapper

11   On-Line Warehouse View Maintenance for Batch Updates - Quass, Widom (1997)   (Correct)
Data warehouses store materialized views over base data from external sources. Clients typically perform complex read-only queries on the views. The views are refreshed periodically by maintenance tra... / Abstract Data warehouses store materialized views over

11   Fast Approximate Answers to Aggregate Queries on a Data Cube - Poosala, Ganti (1999)   (Correct)
Modern decision support systems require very quick (interactive) responses from the DBMS, but pose complex queries on large volumes of data. In this paper, we present a novel solution to this problem:... / analyze the data in a data warehouse to glean interesting trend

11   Replication and Consistency: Being Lazy Helps Sometimes - Breitbart (1997)   (Correct)
The issue of data replication is considered in the context of a restricted system model motivated by certain distributed data-warehousing applications. A new replica management protocol is defined for... / with the advent of distributed data warehouses and data marts at the high / by certain distributed data-warehousing applications. A new replica

10   Rewriting of Regular Expressions and Regular Path Queries - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
Recent work on semi-structured data has revitalized the interest in path queries, i.e., queries that ask for all pairs of objects in the database that are connected by a path conforming to a certain s... / No. DWQ Foundations of Data Warehouse Quality and by the Italian / well as in data integration data warehousing and query optimization the

10   A Survey of Methods for Scaling Up Inductive Learning Algorithms - Provost, Kolluri (1997)   (Correct)
Each year, one of the explicit challenges for the KDD research community is to develop methods that facilitate the use of inductive learning algorithms for mining very large databases. By collecting... / unlikely that all the data in a data warehouse would be mined simultaneously.

10   MultiMediaMiner: A System Prototype for MultiMedia Data Mining - Zaiane, Han, Li, Chiang (1998)   (Correct)
Multimedia data mining is the mining of high-level multimedia information and knowledge from large multimedia databases. A multimedia data mining system prototype, MultiMediaMiner, has been designed a... / in relational databases and data warehouses Multimedia has been the / the field of data mining and data warehousing research but nothing

10   The Dimensional Fact Model: A Conceptual Model For Data Warehouses - Golfarelli, Maio, Rizzi (1998)   (Correct)
In this paper we formalize a graphical conceptual model for data warehouses, called Dimensional Fact model, and propose a semi-automated methodology to build it from the pre-existing (conce... / Model A Conceptual Model For Data Warehouses Matteo Golfarelli

10   Synopsis Data Structures for Massive Data Sets - Matias (1998)   (Correct)
Massive data sets with terabytes of data are becoming commonplace. There is an increasing demand for algorithms and data structures that provide fast response times to queries on such data sets. In ... / for ad hoc queries of large data warehouses GM In large data / as a cache for the disks. In a data warehousing environment for example

10   starER: A Conceptual Model for Data Warehouse Design - Tryfona, Busborg, Christiansen (1999)   (Correct)
Modeling data warehouses is a complex task focusing, very often, into internal structures and implementation issues. In this paper we argue that, in order to accurately reflect the users requirement... / starER A Conceptual Model for Data Warehouse Design Nectaria Tryfona

9   Information Retrieval from an Incomplete Data Cube - Curtis Dyreson (1996)   (Correct)
A complete data cube is a data cube in which every aggregate value in the multidimensional space is stored or can be computed. An incomplete data cube is a data cube in which points in the multidimen... / overnight cron job. • A data warehouse collects data from a variety / For instance when warehousing data from different stores one

9   Quality-driven Integration of Heterogeneous Information Systems - Naumann, Leser, Freytag (1999)   (Correct)
Integrated access to information that is spread over multiple, distributed, and heterogeneous sources is an important problem in many scientific and commercial domains. Typically there are many ways t... / integrated to their data warehouse was unusable due to the poor

9   Materialized Views and Data Warehouses - Roussopoulos (1997)   (Correct)
A data warehouse is a redundant collection of data replicated from several possibly distributed and loosely coupled source databases, organized to answer OLAP queries. Relational views are used both a... / Materialized Views and Data Warehouses Nick Roussopoulos / plan for the derivation of the warehouse data. In this position paper we

9   Maintaining Data Cubes under Dimension Updates - Carlos Hurtado (1999)   (Correct)
OLAP systems support data analysis through a multidimensional data model, according to which data facts are viewed as points in a space of application-related "dimensions", organized into levels which... / the dynamic aspect of the data warehouse while dimensions are

9   Towards On-Line Analytical Mining in Large Databases - Han (1998)   (Correct)
Great efforts have been paid in the Intelligent Database Systems Research Lab for the research and development of efficient data mining methods and construction of on-line analytical data mining syste... / large relational databases and data warehouses. The system implements a wide / mining relational data data warehouse data spatial data data formed

9   Unbundling Active Functionality - Gatziu, Koschel, Bültzingsloewen.. (1998)   (Correct)
New application areas or new technical innovations expect from database management systems more and more new functionality. However, adding functions to the DBMS as an integral part of them, tends to ... / new application areas like data warehousing new architectural forms

9   An Alternative Storage Organization for ROLAP Aggregate Views Based.. - Kotidis, Roussopoulos (1998)   (Correct)
The Relational On-Line Analytical Processing (ROLAP) is emerging as the dominant approach in data warehousing with decision support applications. In order to enhance query performance, the ROLAP appro... / warehouse. However in large data warehouses indexing alone is often not / as the dominant approach in data warehousing with decision support

9   Intelligent Agents for Intrusion Detection - Helmer, Wong, Honavar, Miller (1998)   (Correct)
This paper focuses on intrusion detection and countermeasures with respect to widely-used operating systems and networks. The design and architecture of an intrusion detection system built from distri... / agents maintain the data warehouse by combining knowledge and

9   WebOQL: Restructuring Documents, Databases and Webs - Arocena, Mendelzon (1998)   (Correct)
The widespread use of the Web has originated several new data management problems, such as extracting data from Web pages and making databases accessible from Web browsers, and has renewed the interes... / with certain features Web-data warehousing i.e. extracting

9   Density-Based Indexing for Approximate Nearest-Neighbor Queries - Bennett, Fayyad, Geiger (1999)   (Correct)
We consider the problem of performing nearest-neighbor queries efficiently over large high-dimensional databases. Assuming that a full database scan to determine the nearest neighbor entries is not a... / databases. With the growth of Data Warehousing nearest-neighbor queries are

9   Design and Analysis of Quality Information for Data Warehouses - Jeusfeld, Quix, Jarke (1998)   (Correct)
Data warehouses are complex systems that have to deliver highly-aggregated, high quality data from heterogeneous sources to decision makers. Due to the dynamic change in the requirements and the e... / of Quality Information for Data Warehouses Manfred A. Jeusfeld

9   Discovering Structural Association of Semistructured Data - Wang, Liu (1999)   (Correct)
Many semistructured objects are similarly, though not identically, structured. We study the problem of discovering "typical" substructures of a collection of semistructured objects. The discovered s... / sources such as data warehousing Unlike unstructured raw

9   Answering Regular Path Queries Using Views - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
Query answering using views amounts to computing the answer to a query having information only on the extension of a set of views. This problem is relevant in several fields, such as information integ... / as information integration data warehousing query optimization mobile

9   Selective Materialization: An Efficient Method for Spatial Data Cube.. - Han, Stefanovic, Koperski (1998)   (Correct)
On-line analytical processing (OLAP) has gained its popularity in database industry. With a huge amount of data stored in spatial databases and the introduction of spatial components to many relat... / techniques. A spatial data warehouse model which consists of both / techniques. Keywords Data warehouse data mining on-line analytical

9   Authentic Third-party Data Publication - Devanbu, Gertz, Martel, Stubblebine (1999)   (Correct)
Integrity critical databases, such as financial information, which are used in high-value decisions, are frequently published over the internet. Publishers of such data must satisfy the integrity, a...

8   Query Optimization for Selections using Bitmaps - Wu (1998)   (Correct)
Bitmaps are popular indexes for Data Warehouse (DW) applications and most database management systems (DBMSs) offer them today. This paper analyzes query optimization issues for selection operations u... / Bitmaps are popular indexes for Data Warehouse DW applications and most

8   Views for Semistructured Data - Serge Abiteboul (1997)   (Correct)
Defining a view over a semistructured database introduces many new problems. In this paper we propose a view specification language and consider the problem of answering queries posed over views. The ... / For example consider a large data warehouse stored in Lore that

8   Generalized Projections: A Powerful Approach to Aggregation - Gupta, Harinarayan, Quass (1995)   (Correct)
In this paper we introduce generalized projections (GPs), an extension of duplicate-eliminating projections, that capture aggregations, groupbys, conventional projection with duplicate elimination (di... / the growing number of large data warehouses for decision support / and important problem in data warehousing how to answer an aggregate

8   Supporting Data Integration and Warehousing Using H2O - Zhou, Hull, King, Franchitti   (Correct)
This paper presents a broad framework for data integration, that supports both data materialization and virtual view capabilities, and that can be used with legacy as well as modern database systems. ... / component of the mediator is a data warehouse that holds a materialized / addressing this problem data warehousing i.e. materializing

8   A Framework for Designing Materialized Views in Data Warehousing.. - Yang, Karlapalem, Li (1996)   (Correct)
Data warehouses may contain multiple views with different query frequencies. When these views are related to each other and defined over overlapping portions of the base data, then it may be more effi... / Hong Kong Abstract Data warehouses may contain multiple views / Materialized Views in Data Warehousing Environment J. Yang K.

8   Incremental Updates for Materialized OQL Views - Gluche, Grust, Mainberger, Scholl (1997)   (Correct)
This work discusses the CROQUE approach to the maintenance problem for materialized views. In a CROQUE database, application-specified collections (type extents or classes) themselves need not be m... / is currently investigated in data warehousing applications for example. In

8   Dynamic Load Balancing in Hierarchical Parallel Database Systems - Bouganim, Florescu, Valduriez (1996)   (Correct)
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-nothing system whose nodes are shared-memory multiprocessors. In this context, the problem of load bal... / for decision support e.g. data warehousing The objective of parallel

CiteSeer - citeseer.org - Copyright © 1997-2002 NEC Research Institute