Databases: Data Warehousing

Tutorials/surveys/introductory articles (ordered by the degree of citation of authoritative articles)

This directory is created automatically and some papers may be mislabeled. Only documents within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher-level categories.

11458.7   Active Disks - Remote Execution for Network-Attached Storage - Riedel (1999)   (Correct)
Today's commodity disk drives, the basic unit of storage for computer systems large and small, are actually small computers, with a processor, memory, and `network' connection, along with the spinning... / second system i.e. to use a data warehouse separate from the production br Sun storage arrays . TB data warehousing Table - Large storage

10777.0   Prefetching Techniques for Client/Server, Object-Oriented Database.. - Knafla   (Correct)
The performance of many object-oriented database applications suffers from the page fetch latency which is determined by the expense of disk access. In this work we suggest several prefetching techniq... / was performed with a concrete data warehouse application. One result is

7274.1   Mathematical Programming for Data Mining: Formulations and Challenges - Bradley, Fayyad, Mangasarian (1998)   (Correct)
This paper is intended to serve as an overview of a rapidly emerging research and applications area. In addition to providing a general overview, motivating the importance of data mining problems with... / of nonhomogeneous databases. A data warehouse represents a large collection br the database field called data warehousing has been concerned with

6773.3   A Survey of Methods for Scaling Up Inductive Algorithms - Provost, Kolluri (1999)   (Correct)
One of the defining challenges for the KDD research community is to enable inductive learning algorithms to mine very large databases. By collecting, categorizing, and summarizing existing work on s... / unlikely that all the data in a data warehouse would be mined simultaneously.

6189.6   Neural Networks for Combinatorial Optimization: A Review of More Than .. - Smith (1999)   (Correct)
This article briefly summarizes the work that has been done and presents the current standing of neural networks for combinatorial optimization by considering each of the major classes of combinatoria... /

5614.7   Data Mining: An Overview from a Database Perspective - Chen, Han, Yu (1996)   (Correct)
Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an i... / providing services such as data warehousing and on-line services over the br of the techniques of data warehousing and data mining in the near future.

4899.8   Nearest Neighbor Search in Multidimensional Spaces - Tsaparas (1999)   (Correct)
The Nearest Neighbor Search problem is defined as follows: given a set P of n points, preprocess the points so as to efficiently answer queries that require finding the closest point in P to a query ... / can be given in the area of data warehousing and data mining. It seems br be given in the area of data warehousing and data mining. It seems likely

4433.7   Can a Shared-Memory Model Serve as a Bridging Model for Parallel.. - Gibbons, Matias, Ramachandran (1997)   (Correct)
There has been a great deal of interest recently in the development of general-purpose bridging models for parallel computation. Models such as the bsp and logp have been proposed as more realistic al... /

4369.9   A Survey of Methods for Scaling Up Inductive Learning Algorithms - Provost, Kolluri (1997)   (Correct)
Each year, one of the explicit challenges for the KDD research community is to develop methods that facilitate the use of inductive learning algorithms for mining very large databases. By collecting... / unlikely that all the data in a data warehouse would be mined simultaneously.

4342.0   HERALD: Hybrid Environment for Robust Analysis of Language Data - Ballim, Coray, Pallotta (1999)   (Correct)
This project addresses the problem of performing structural and semantic analysis of data where the syntactic and semantic models of the domain are inadequate, and robust methods must be employed to ... / Information retrieval data warehousing and knowledge management are

4341.1   Null Values in Object Bases: Pulling Out the Head from the Sand - Subieta, Kambayashi, Leszczylowski.. (1996)   (Correct)
We address the problem of null values and other forms of irregular data in object-oriented databases. This problem is important because of such aspects of modern database technologies as conceptual mo... / and inconsistencies. Data warehouses Poe assume collecting br from heterogeneous sources data warehousing as well as reliability

4276.1   Data Mining: Trends In Research And Development - Deogun, Raghavan, Sarkar, Sever (1996)   (Correct)
Data mining is an interdisciplinary research area spanning several disciplines such as database systems, machine learning, intelligent information systems, statistics, and expert systems. Data mining ... / systems statistics data warehousing and knowledge acquisition in

4243.8   Design and Implementation of On-Line Analytical Processing (OLAP) of.. - Stefanovic (1997)   (Correct)
On-line analytical processing (OLAP) has gained its popularity in database industry. With a huge amount of data stored in spatial databases and the introduction of spatial components to many relationa... / techniques. A spatial data warehouse model which consists of both

4206.6   Data Mining: Research Trends, Challenges, and Applications - Deogun, Raghavan, Sarkar, Sever (1997)   (Correct)
Data mining is an interdisciplinary research area spanning several disciplines such as database systems, machine learning, intelligent information systems, statistics, and expert systems. Data mining... / systems statistics data warehousing and knowledge acquisition in

3925.4   Processing Semi-Structured Data in Object Bases - Subieta, Leszczylowski, Ulidowski   (Correct)
We address the problem of null values and other forms of semi-structured data in object-oriented databases. Various aspects and issues concerning semi-structured data that are currently presented in t... / active agents Data warehouses which assume collecting

3859.5   Datacube: Its Implementation and Application in OLAP Mining - Tam (1998)   (Correct)
With huge amounts of data collected in various kinds of applications, data warehouse is becoming a mainstream information repository for decision support and data analysis mainly because a data wareho... / various kinds of applications data warehouse is becoming a mainstream br . . What is a Data Warehouse Data warehouses have been defined

3775.4   "Have your Data and Index it, too". Efficient Storage and Indexing.. - Datta, Moon, Ramamritham, Thomas.. (1998)   (Correct)
Two possible strategies may be utilized to enhance the efficiency of processing OLAP queries: (a) precomputation strategies (e.g., view materialization, realizing data cubes), and (b) ad-hoc strategie... / Storage and Indexing for Data Warehouses Anindya Datta br large sizes KB of warehouse data blocks this assumption is

3743.4   Semantics Of Behavioral Inheritance In Deductive Object-Oriented.. - Jamil (1995)   (Correct)
Semantics of Behavioral Inheritance in Deductive Object-Oriented Databases Hasan M. Jamil, Ph.D. Concordia University, 1996 We argue that powerful models for supporting next generation database an... /

3322.2   Robust Text Analysis: an Overview - Ballim, Pallotta, Lieske (1999)   (Correct)
Short abstract. Contents: 1 Introduction; 1.1 Motivations and Goals; 1.2 Empiric evidence for Brittleness... /

3297.2   Maintenance of Materialized Views: Problems, Techniques, and.. - Gupta, Mumick (1995)   (Correct)
In this paper we motivate and describe materialized views, their applications, and the problems and techniques for their maintenance. We present a taxonomy of view maintenance problems based upon the ... / is often described as a data warehouse. Materialized views provide a br in new applications such as data warehousing replication servers

3287.9   Statistical Approaches to Predictive Modeling in Large Databases - Cheng (1998)   (Correct)
Prediction, i.e., predicting the potential values or value distributions of certain attributes for objects in a database or data warehouse, is an attractive goal in data mining. To predict future even... / for objects in a database or data warehouse is an attractive goal in data br practice in data warehousing and data mining fields. Usually

3267.6   What Good are Shared-Memory Models? - Gibbons (1996)   (Correct)
Shared memory models have been criticized for years for failing to model essential realities of parallel machines. Given the current wave of popular message-passing and distributed memory models (e.g.... / support data mining OLAP data warehousing implies that the most

3057.9   Advances and Research Directions in Data Warehousing Technology - Mohania, Samtani, Roddick, Kambayashi   (Correct)
Information is one of the most valuable assets of an organization and when used properly can assist in intelligent decision making that can significantly improve the functioning of an organization. Da... / separate repository called a data warehouse that selects data from many

3002.8   GA-MINER: Parallel Data Mining with Hierarchical Genetic Algorithms - .. - Flockhart (1995)   (Correct)
Many organisations now routinely gather vast and ever-increasing amounts of data in the ordinary course of their business. While much of this information is collected for day-to-day operational reason... / systems often based around a data warehouse. Such systems may then be br commercial market for data warehousing and data mining tools is expanding

2943.1   Fourier-Transform Based Techniques in Efficient Retrieval of Similar.. - Rafiei (1999)   (Correct)
Fourier-Transform Based Techniques in Efficient Retrieval of Similar Time Sequences Davood Rafiei Doctor of Philosophy Graduate Department of Computer Science University of Toronto 1999 The idea... / such as data mining or data warehousing. In this dissertation the

2937.6   Methods and problems in data mining - Mannila (1997)   (Correct)
Knowledge discovery in databases and data mining aim at semiautomatic tools for the analysis of large data sets. We consider some methods used in data mining, concentrating on levelwise search for all... / the rise of the concepts of data warehousing and on-line analytical

2929.5   Complexity of Answering Queries Using Materialized Views - Abiteboul, Duschka (1998)   (Correct)
We study the complexity of the problem of answering queries using materialized views. This problem has attracted a lot of attention recently because of its relevance in data integration. Previous work... / with the popularity of data warehouses The problem of br problems which arise in data warehousing. Introduction The

2712.9   Knowledge Representation Approach to Information Integration - Calvanese, De Giacomo, Lenzerini.. (1998)   (Correct)
In recent years there has been a growing interest in accessing, relating, and combining data from multiple sources. Indeed, Information Integration is one of the core problems in distributed databases... / and the Enterprise Model. Data warehouses In this setting views br information systems and data warehousing which are key areas in the

2697.9   Periodic Pattern Search on Time-Related Data Sets - Gong (1997)   (Correct)
For many applications such as accounting, banking, business transaction processing systems, geographical information systems, medical record book keeping, etc., the changes made on their databases ove... / and OLAP Techniques A data warehouse is a subject-oriented br in knowledge discovery and data warehousing become important. Knowledge

2645.3   Efficient Cost Models for Spatial Queries Using R-trees - Theodoridis, Stefanakis, Sellis (1998)   (Correct)
Selection and join queries are the fundamental operations in Data Base Management Systems (DBMS). Support for non-traditional data, including spatial objects, in an efficient manner is a new directi... / Fal to data mining and data warehousing FJS RKR An example of

2641.9   Information Integration: Conceptual Modeling and Reasoning Support - Calvanese, De Giacomo, Lenzerini.. (1998)   (Correct)
Information Integration is one of the core problems in cooperative information systems. We argue that two critical factors for the design and maintenance of applications requiring Information Integrat... / An example of Query Model in a Data Warehouse application is a conceptual br information systems and data warehousing which are key areas in the

2604.1   On-Line Analytical Mining of Association Rules - Zhu (1998)   (Correct)
With wide applications of computers and automated data collection tools, massive amounts of data have been continuously collected and stored in databases, which creates an imminent need and great oppo... / association rule mining data warehouse. Acknowledgement I would br several sections. . Data warehouse Data warehouse is a semantically

2595.6   Constructing Classification Trees with Exception Annotations for.. - Li (1999)   (Correct)
Classification is an important problem in data mining, which identifies essential features of different classes based on a set of training data and then classifies new instances into appropriate class... / our method is integrated with data warehouse functionalities thus br that enables users to explore warehouse data interactively. Basic OLAP

2591.9   ILA-2: An Inductive Learning Algorithm over uncertain data - Tolun, Sever, al.   (Correct)
In this paper we describe the ILA-2 rule induction algorithm f... / data set from a database or a data warehouse and to possibly sample the

2581.7   Data Currency Quality Factors in Data Warehouse Design - Theodoratos, Bouzeghoub (1999)   (Correct)
A Data Warehouse (DW) is a large collection of data integrated from multiple distributed autonomous databases and other information sources. A DW can be seen as a set of materialized views defined... / Currency Quality Factors in Data Warehouse Design Dimitri Theodoratos

2573.6   Efficient Storage of Temporal Data in a Data Warehousing Context - Musicant, Dabu (1997)   (Correct)
A data warehouse provides strong capabilities for answering complex decision support queries. In particular, it is often desirable to query relations as they existed at some point in the past. Much of... / Abstract A data warehouse provides strong capabilities br prior to the inception of the warehouse. Data that is extremely old may

2545.7   Rewriting of Regular Expressions and Regular Path Queries - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
Recent work on semi-structured data has revitalized the interest in path queries, i.e., queries that ask for all pairs of objects in the database that are connected by a path conforming to a certain s... / No. DWQ Foundations of Data Warehouse Quality and by the Italian br well as in data integration data warehousing and query optimization the

2511.8   Similarity Search in Time Series Data Sets - Xia (1997)   (Correct)
Similarity search on time-series data sets is of growing importance in data mining. With the increasing amount of data of time-series in many applications, from financial to scientific, it is importan... / time-related database data warehouse data mining and wavelet br time-related database data warehouse data mining and wavelet analysis.

2502.6   Towards an Integrated Query/Programming Language for Object Bases: a.. - Subieta, Kambayashi, Leszczylowski   (Correct)
We present general principles of modern programming languages and discuss requirements which have to be observed during the development of integrated object-oriented query/programming languages and th... /

2472.7   Scalability Analysis of Declustering Methods for Cartesian Product.. - Moon, Saltz (1996)   (Correct)
Efficient storage and retrieval of multi-attribute datasets has become one of the essential requirements for many data-intensive applications. The Cartesian product file has been known as an effective... / SCSI disks. And a . terabyte data warehouse was recently reported to be

2464.9   The State of Rough Sets for Database Mining Applications - Raghavan, Sever (1995)   (Correct)
The database mining problem is often cited as one of the most promising research topics in the fields of database systems and machine learning. Although many available machine learning algorithms are ... /

2459.7   Using Schematically Heterogeneous Structures - Miller (1998)   (Correct)
Schematic heterogeneity arises when information that is represented as data under one schema, is represented within the schema (as metadata) in another. Schematic heterogeneity is an important class o... / has been collected in a data warehouse and consider a decision br legacy data in federated or data warehousing applications. Traditional

2449.9   View-based Query Processing and Constraint Satisfaction - Calvanese, De Giacomo, Lenzerini.. (2000)   (Correct)
View-based query processing requires to answer a query posed to a database only on the basis of the information on a set of views, which are again queries over the same database. This problem is relev... / to answer a query. A data warehouse can be seen as a set of br including query optimization data warehousing data integration and query

2442.7   Very Large Databases - Garofalakis, Miller   (Correct)
Data Types Traditional DBMS support a fixed set of simple data types (for example, integers and dates). Extensible DBMS can be extended dynamically with user-defined types and functions. These types c... / DBMS support technologies. Data Warehouses Data warehouses provide br Package Web Browser Data Warehouse Data Access Integrated Data

2428.5   Scalability Analysis of Declustering Methods for Multidimensional.. - Moon, Saltz (1998)   (Correct)
Efficient storage and retrieval of multi-attribute datasets have become one of the essential requirements for many data-intensive applications. The Cartesian product file has been known as an effectiv... / SCSI disks. A . terabyte data warehouse was recently reported to be

2407.8   Synopsis Data Structures for Massive Data Sets - Matias (1998)   (Correct)
Massive data sets with terabytes of data are becoming commonplace. There is an increasing demand for algorithms and data structures that provide fast response times to queries on such data sets. In ... / for ad hoc queries of large data warehouses GM In large data br as a cache for the disks. In a data warehousing environment for example

2384.6   Mining Exceptions And Quantitative Association Rules In Olap Data Cube - Chen (1999)   (Correct)
People nowadays are relying more and more on OLAP data to find business solutions. A typical OLAP data cube usually contains four to eight dimensions, with two to six hierarchical levels and tens to h... / The Multidimensional Model for Data Warehouse . br that enables users to examine warehouse data interactively. Typical OLAP

2376.2   Aqua Project White Paper - Gibbons, Matias, Poosala (1997)   (Correct)
In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries, whenever possible. The goal is to provide an estimated response in ord... / Figure depicts a traditional data warehouse set-up in which the base data

2322.0   Range Queries in OLAP Data Cubes - Ho, Agrawal, Megiddo, Srikant (1997)   (Correct)
A range query applies an aggregation operation over all selected cells of an OLAP data cube where the selection is specified by providing ranges of values for numeric dimensions. We present fast algor... / databases built from their data warehouses. An increasingly popular data

2285.9   On the Decidability of Query Containment under Constraints - Calvanese, De Giacomo, Lenzerini (1998)   (Correct)
Query containment under constraints is the problem of checking whether for every database satisfying a given set of constraints, the result of one query is a subset of the result of another query. Rec... / No. DWQ Foundations of Data Warehouse Quality the Italian br view maintenance data warehousing and constraint

2283.2   A Vision for Management of Complex Models - Bernstein, Levy, Pottinger (2000)   (Correct)
Many problems encountered when building applications of database systems involve the manipulation of models. By "model," we mean a complex structure that represents a design artifact, such as a relati... / mapping data sources into data warehouse tables to generate programs

2273.8   Temporal Statement Modifiers - Böhlen, Jensen   (Correct)
A wide range of database applications manage time-varying data. Although temporal database technology has reached a level of maturity and sophistication where it is evident that these applications may... / such as decision support and data warehousing old versions of data are

2272.4   Evaluation of Main Memory Join Algorithms for Joins with Subset Join.. - Helmer, Moerkotte (1997)   (Correct)
Current data models like the NF² model and object-oriented models support set-valued attributes. Hence, it becomes possible to have join predicates based on set comparison. This paper introduces an... /

2242.6   Project Summary - The Goal Of   (Correct)
sion assume that the learning samples are drawn from identical and independent distributions (i.i.d.). In other words, the data about a cholera patient is independent (spatially) of the data describing ... / over a potentially distributed data warehouse. Thus the algorithm must

2228.1   Evaluation of Main Memory Join Algorithms for Joins with Set.. - Helmer, Moerkotte (1996)   (Correct)
Current data models like the NF² model and object-oriented models support set-valued attributes. Hence, it becomes possible to have join predicates based on set comparison. This paper introduces and... /

2218.9   Scaling EM (Expectation-Maximization) Clustering to Large Databases - Bradley, Fayyad, Reina (1999)   (Correct)
Practical statistical data clustering algorithms require multiple data scans to converge. For large databases, these scans become prohibitively expensive. We present a scalable clustering framework... / over a potentially distributed data warehouse. Hence a second scan may

2216.9   Recursive Query Plans for Data Integration - Duschka, Genesereth, Levy (1999)   (Correct)
Generating query-answering plans for data integration systems requires to translate a user query, formulated in terms of a mediated schema, to a query that uses relations that are actually stored in d... / for query optimization and data warehousing Most

2200.0   WebOQL: Exploiting Document Structure in Web Queries - Arocena (1997)   (Correct)
The widespread use of the Web has given rise to several new data management problems, such as extracting data from Web pages and making databases accessible from browsers, and has renewed the interest... / with certain features Web-data warehousing HG i.e.extracting

2170.8   Object Views and Database Restructuring - Lacroix, Delobel, Brèche (1997)   (Correct)
We present a formal data model for views in Object DataBase Systems (ODBS) as a transformation mechanism for databases. Our model relaxes the usual constraint where an object belongs to a single cla... / several sources as in data warehouse We propose RQL Referent

2152.6   SchemaSQL - An Extension to SQL for Multi-database Interoperability - Lakshmanan, Sadri, Subramanian   (Correct)
We provide a principled extension of SQL, called SchemaSQL, that offers the capability of uniform manipulation of data and schema in relational multi-database systems. We develop a precise syntax an... / query optimization in a data warehouse SV and scalable

2143.1   Integrating Domain Specific Language Design in the Software Life Cycle - Kutter, Schweizer, Thiele   (Correct)
Domain Specific Languages help to split the software life cycle in different independent cycles. While the use of the newly created language is just an additional tool in the established cycle, the ... / complexity. For example in a data-warehouse scenario there is the need of

2122.9   Towards Guaranteed Quality and Dependability of Information Services - Weikum (1999)   (Correct)
The impressive advances in global networking and information technology provide great opportunities for all kinds of ubiquitous information services, ranging from digital libraries and information dis... / of service quality. In a data warehouse environment for br of the query result in a data warehouse or data mining system we need to

2111.2   The MyView Project: a Data Warehousing Approach to Personalized.. - Wolff, Cremers (1999)   (Correct)
The MyView project aims at the integration of both structured and unstructured bibliographic information from a diversity of heterogeneous Internet repositories like electronic journals and traditio... / a single repository called a data warehouse suitable for direct br to explore the personal local warehouse data. Our goal is to include for

2107.5   Diag-Join: An Opportunistic Join Algorithm for 1:N Relationships - Helmer, Westmann, Moerkotte (1997)   (Correct)
Time of creation is one of the predominant (often implicit) clustering strategies found not only in Data Warehouse systems: line items are created together with their corresponding order, objects are ... / strategies found not only in Data Warehouse systems line items are

2105.0   Collective Data Mining: A New Perspective Toward Distributed Data.. - Kargupta, al (1999)   (Correct)
This paper introduces the collective data mining (CDM), a new approach toward distributed data mining (DDM) from heterogeneous sites. It points out that naive approaches to distributed data analysis i... / all the data to a centralized data warehouse followed by analysis using

2093.0   Towards On-Line Analytical Mining in Large Databases - Han (1998)   (Correct)
Great efforts have been paid in the Intelligent Database Systems Research Lab for the research and development of efficient data mining methods and construction of on-line analytical data mining syste... / large relational databases and data warehouses. The system implements a wide br mining relational data data warehouse data spatial data data formed

2087.2   Answering Queries Using Views: A Survey - Levy   (Correct)
The problem of answering queries using views is to find efficient methods of answering a query using a set of previously materialized views over the database, rather than accessing the database relati... / data integration and data warehouse design. Informally speaking

2086.1   Overview of Strudel - A Web-Site Management System - Fernandez, Florescu, Kang, Levy.. (1998)   (Correct)
The Strudel system applies concepts from database management systems to the process of building Web sites. Strudel's key idea is separating the management of the site's data, the creation and manage... / two problems whether to warehouse data from external sources or to br problems similar to those of data warehousing such as highly prohibitive

2084.2   Selecting and Maintaining Materialized Views for Message Management - Gupta, Srivastava   (Correct)
Electronic messaging has become one of the primary means for the dissemination, exchange and sharing of information. This is facilitated, especially within an organization, by the use of shared folder... / the messaging system as a data warehouse where each message is a tuple

2056.6   Data Warehouse Configuration - Theodoratos, Sellis (1997)   (Correct)
In the data warehousing approach to the integration of data from multiple information sources, selected information is extracted in advance and stored in a repository. A data warehouse (DW) can th... / Data Warehouse Configuration Dimitri

2027.1   The Rejuvenation of Materialized Views (Extended Abstract) - Mumick (1995)   (Correct)
Inderpal Singh Mumick AT&T Bell Laboratories 600 Mountain Avenue Murray Hill, NJ 07974, USA. mumick@research.att.com Abstract. This is a short summary of a talk presented at the sixth International... / is often described as a data warehouse. The warehouse usually br are emerging. These include data warehousing mobile systems data

2023.6   Visual Data Mining: Framework and Algorithm Development - Ganesh, Han, Kumar, Shekhar.. (1996)   (Correct)
Visual data mining is the use of visualization techniques to allow data miners and analysts to evaluate, monitor, and guide the inputs, products and process of data mining. It can help introduce user ... / unknown patterns in a data warehouse which is a historical br different source databases the warehouse data is usually cleaned or

2022.8   Answering Regular Path Queries Using Views - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
Query answering using views amounts to computing the answer to a query having information only on the extension of a set of views. This problem is relevant in several fields, such as information integr... / as information integration data warehousing query optimization mobile

2012.1   Consistency Algorithms for Multi-Source Warehouse View Maintenance - Yue Zhuge (1998)   (Correct)
A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources a... / update scenarios. Keywords data warehouse data consistency view br Maintaining the consistency of warehouse data is challenging especially if

1996.6   Graph Structured Views and Their Incremental Maintenance - Zhuge (1998)   (Correct)
We study the problem of maintaining materialized views of graph structured data. The base data consists of records containing identifiers of other records. The data could represent traditional objects... / these algorithms when only a data warehouse and not the data sources br basic architecture of a data warehouse. Data Warehouse Wrapper

1981.0   Space Efficiency of Propositional Knowledge Representation Formalisms - Cadoli, al. (2000)   (Correct)
We investigate the space efficiency of a Propositional Knowledge Representation (PKR) formalism. Intuitively, the space efficiency of a formalism F in representing a certain piece of knowledge #, is the... /

1973.0   Data mining: machine learning, statistics, and databases - Mannila (1996)   (Correct)
Knowledge discovery in databases and data mining aim at semiautomatic tools for the analysis of large data sets. We give an overview of the area and present some of the research issues, especially fr... / to the rise of the concepts of data warehousing and on-line analytical

1972.3   Integrating Light-Weight Workflow Management Systems within Existing.. - Muth, Weissenfels, Gillmann, Weikum (1999)   (Correct)
Workflow management systems support the efficient, largely automated execution of business processes. However, using a workflow management system typically requires implementing the application's ... /

1971.5   Recovering Information from Summary Data - Faloutsos, Jagadish, Sidiropoulos (1997)   (Correct)
Data is often stored in summarized form, as a histogram of aggregates (COUNTs, SUMs, or AVeraGes) over specified ranges. We study how to estimate the original detail data from the stored summary. We f... / information such as OLAP data warehousing and histograms in query

1970.2   Making Aggregate Views Self-Maintainable - Mohania, Kambayashi (1999)   (Correct)
Data in the warehouse can be seen as materialized views generated from the underlying multiple data sources. Materialized views are used to speed up query processing on large amounts of data. These vi... / Introduction Data in the data warehouse can be seen as materialized br n Self-Maintainable Warehouse Data Data Sources Figure An

1965.2   Workflow History Management in Virtual Enterprises using a.. - Muth, Weissenfels, Gillmann, Weikum   (Correct)
Enterprise-spanning workflows require workflow management systems that can be tailored to specific application needs, as well as enhanced support for interoperability between different workflow manage... / to deploy a large-footprint data warehouse in conjunction with the br cially in the areas of data warehousing and online analytical

1955.9   Minimizing View Sets without Losing Query-Answering Power - Li, Bawa, Ullman (2000)   (Correct)
The problem of answering queries using views has been studied extensively, due to its relevance in a wide variety of data-management applications. In these applications, we often need to select a sub... / computing the views. In a data warehouse views can preclude costly br as information integration data warehousing and query optimization. The

1952.8   View-based Query Processing for Regular Path Queries with Inverse - Calvanese, De Giacomo, Lenzerini, al. (2000)   (Correct)
View-based query processing is the problem of computing the answer to a query based on a set of materialized views, rather than on the raw data in the database. The problem comes in two different form... / No. DWQ Foundations of Data Warehouse Quality and by the Italian br with incomplete information data warehousing and data integration.

1947.7   Maintenance of Data Cubes and Summary Tables in a Warehouse - Mumick, Quass, Mumick (1997)   (Correct)
Data warehouses contain large amounts of information, often collected from a variety of independent sources. Decision-support functions in a warehouse, such as on-line analytical processing (OLAP), in... / Paper Number Abstract Data warehouses contain large amounts of

1945.9   Storing Semistructured Data with STORED - Deutsch, Fernandez, Suciu (1999)   (Correct)
In this paper, we describe a technique for using relational databases to store and manage semistructured data. Our purpose is to use high-performance RDBM systems to store, query, and manage semistructur... / of query plans Ull or data warehouse design TS we must

1941.3   Repository Support For Multi-Perspective Requirements Engineering - Nissen, Jarke (1999)   (Correct)
Relationships among different modeling perspectives have been systematically investigated focusing either on given notations (e.g. UML) or on domain reference models (e.g. ARIS/SAP). In contrast, ma... / of business analysis and data warehouse design. Key words

1922.7   Scaling Heterogeneous Databases and the Design of DISCO - Tomasic, Raschid, Valduriez (1996)   (Correct)
Access to large numbers of data sources introduces new problems for users of heterogeneous distributed databases. End users and application programmers must deal with unavailable data sources. Databas... / A mediator may as in data warehousing also keep state or summary

1921.9   Designing View Maintenance Algorithm in Data Warehousing Environment - Mukesh Mohania (1999)   (Correct)
A data warehouse stores materialized views generated from the underlying source data. Materialized views are used to speed up query processing on large amounts of data. These views need to be maintain... / Vincent x Abstract A data warehouse stores materialized views br View Maintenance Algorithm in Data Warehousing Environment Mukesh Mohania

1916.4   Protocols for Integrity Constraint Checking in Federated Databases - Paul Grefen (1996)   (Correct)
A federated database is comprised of multiple interconnected database systems that primarily operate independently but cooperate to a certain extent. Global integrity constraints can be very useful in... / in ZGH in the context of data warehousing. In ZGH algorithms are

1862.1   The Data Warehouse of Newsgroups - Gupta, Srivastava (1999)   (Correct)
Electronic newsgroups are one of the primary means for the dissemination, exchange and sharing of information. We argue that the current newsgroup model is unsatisfactory, especially when posted art... / The Data Warehouse of Newsgroups Himanshu

1855.9   The ADMS Project: Views "R" Us - Roussopoulos, Chen, Kelley (1995)   (Correct)
The goal of the ADMS project is to create a framework for caching materialized views, access paths, and experience obtained during query execution. The rationale behind this project is to amortize dat... / is necessary in any large data warehouse in which the catalog cannot br their role in performance data warehousing management and control of

1853.9   Scaling Clustering Algorithms to Large Databases - Bradley, Fayyad, Reina (1998)   (Correct)
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clustering framework applicab... / over a potentially distributed data warehouse with much processing

1843.4   Object-Oriented View Technology: Challenges and Promises - Rundensteiner   (Correct)
With object-oriented database (OODB) systems reaching the status of a maturing technology, increased attention has been given in recent years to the realization of view mechanisms in OODBs. Objectorie... / and for analyzing data in a data warehouse Given the rising br schema evolution data warehousing and workflow management to

1837.2   Supporting Multiple View Maintenance Policies - Colby (1997)   (Correct)
Materialized views and view maintenance are becoming increasingly important in practice. In order to satisfy different data currency and performance requirements, a number of view maintenance policies... / retailing decision support data warehousing and data inte- The br decision support data warehousing and data inte- The work of L.

1820.3   State of the Art: Data Warehouse Refreshment - Fabret, Matulovic, Simon (1997)   (Correct)
A Data Warehouse architecture exhibits various layers of data in which data from one layer is derived from data of the lower layer. Data s... / DWQ Foundations of Data Warehouse Quality DWQ v. .

1814.2   WATCHMAN: A Data Warehouse Intelligent Cache Manager - Scheuermann, Shim, Vingralek (1996)   (Correct)
Data warehouses store large volumes of data which are used frequently by decision support applications. Such applications involve complex queries. Query performance in such an environment is critical ... / WATCHMAN A Data Warehouse Intelligent Cache Manager

1809.4   Preprocessing of Intractable Problems - Cadoli, Donini, Liberatore, Schaerf (1999)   (Correct)
Some computationally hard problems --e.g., deduction in logical knowledge bases-- are such that part of an instance is known well before the rest of it, and remains the same for several subsequent i... / fast answers. Maintaining the data-warehouse e.g.with respect to br with the so-called data-warehousing. The results usually called

1806.6   Architecture and Quality in Data Warehouses: an Extended Repository.. - Jarke, Jeusfeld, Quix, Vassiliadis (1999)   (Correct)
This paper makes two unknown ARCHITECTURE AND QUALITY IN DATA WAREHOUSES: AN EXTENDED REPOSITORY APPROACH MATTHIAS JARKE MANFRED JEUSFELD CHRISTOPH PANOS VASSILIADIS (1) Informatik V, RWTH Aachen, 52056 A... / researchers have studied data warehouses DW in their role as buffers br views in the data warehouse data transport agents that ship

1801.6   New Sampling-Based Summary Statistics for Improving Approximate Query .. - Gibbons, Matias (1998)   (Correct)
In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries, whenever possible. Before DBMSs providing highly-accurate approximate an... / of ongoing insertions to the data warehouse. Introduction In large br Figure A traditional data warehouse. Data Warehouse New Data

1774.5   The Strobe Algorithms for Multi-Source Warehouse Consistency - Yue Zhuge (1996)   (Correct)
A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are... / Introduction A data warehouse is a repository of integrated br Maintaining the consistency of warehouse data is challenging especially if

1769.4   The MyView Project: from Heterogeneous Bibliographic Information.. - Wolff, Cremers (1998)   (Correct)
The MyView project aims at the integration of both structured and unstructured bibliographic information from a diversity of heterogeneous Internet repositories like electronic journals and traditiona... / a single repository called a data warehouse suitable for direct querying br to explore the personal local warehouse data. Our goal is to include for

1761.3   Non-Equivalent Query Rewritings - Lee, Koeller, Nica, Rundensteiner (1999)   (Correct)
In this paper, we apply this concept of non-equivalent query rewriting to the problem of maintaining view definitions (data warehouses), where it now allows us to handle a much larger class of changes of... / maintaining view definitions data warehouses where it now allows us to

1758.1   Efficient Mining of Association Rules in Distributed Databases - Cheung, Ng, Fu, Fu (1996)   (Correct)
Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of seq... / data mining together with data warehousing and data repositories are br mining together with data warehousing and data repositories are three new

1745.7   SchemaSQL - A Language for Interoperability in Relational.. - Lakshmanan, Sadri, Subramanian (1996)   (Correct)
We provide a principled extension of SQL, called SchemaSQL , that offers the capability of uniform manipulation of data and meta-data in relational multi-database systems. We develop a precise syntax ... /

1740.4   Rewriting of Regular Path Queries - Calvanese, De Giacomo, Lenzerini..   (Correct)
Recent work on semi-structured data has revitalized the interest in path queries, i.e. queries that ask for all pairs of objects in the database that are connected by a path conforming to a certain ... / well as in data integration data warehousing and query optimization the

1737.4   Ratio Rules: A New Paradigm for Fast, Quantifiable Data Mining - Korn, Labrinidis, Kotidis, Faloutsos (1998)   (Correct)
Association Rule Mining algorithms operate on a data matrix (e.g., customers × products) to derive association rules [2, 23]. We propose a new paradigm, namely, Ratio Rules, which are quantifiab... / sources for use in a data warehouse • Forecasting If a

1737.0   A Logic Based Language for Parametric Inheritance - Jamil (2000)   (Correct)
Though overriding as a single and default mode of inheritance is adequate for most knowledge bases, a large class of applications naturally requires several inheritance modes and types. We propose... / What can hierarchies do for data warehouses In Proc. of the VLDB br secure databases data warehousing data mining etc.

1733.8   Quality-driven Integration of Heterogeneous Information Systems - Naumann, Leser, Freytag (1999)   (Correct)
Integrated access to information that is spread over multiple, distributed, and heterogeneous sources is an important problem in many scientific and commercial domains. Typically there are many ways t... / integrated to their data warehouse was unusable due to the poor

1727.2   Information Brokering: Design, Search and Transformation - Jeusfeld, Papazoglou (1996)   (Correct)
In large human-computer networks, information brokers provide links among syntactically and semantically heterogeneous information sources with information users who are equally diverse in their int... / LTR project Foundations of Data Warehouse Quality No. DWQ

1720.3   Layered Implementation of Temporal DBMSs - Concepts and Techniques - Torp, Jensen, Böhlen (1997)   (Correct)
A wide range of database applications manage time-varying data. Examples include, e.g., accounting, personnel, schedule, and data warehousing applications. At the same time, it is well-known that quer... / personnel schedule and data warehousing applications. At the same

1715.2   High-Performance Cluster Computing Using SCI - Ibel, Schauser, Scheiman, Weis (1997)   (Correct)
The Scalable Coherent Interface (SCI) is a recent communication standard for cluster interconnects. We study the use of SCI in a high-performance parallel computing setting, using a cluster of UltraSp... / dia data mining and data warehousing have created additional

1698.8   Collective Data Mining: A New Perspective Toward Distributed Data.. - Kargupta, Byung-Hoon, al (1999)   (Correct)
This paper introduces the collective data mining (CDM) framework, a new approach toward distributed data mining (DDM) from heterogeneous sites. It points out that naive approaches to distributed data ... / all the data to a centralized data warehouse followed by analysis using

1691.2   Answering Queries by Semantic Caches - Godfrey, Gryz (1998)   (Correct)
There has been growing interest in semantic query caches to aid in query evaluation. Semantic caches are simply the results of previously asked queries, or selected relational information chosen by an... / heterogeneous databases and data warehousing the use of semantic caches

1690.4   Estimating Alphanumeric Selectivity in the Presence of Wildcards.. - Krishnan, Vitter, Iyer (1996)   (Correct)
Success of commercial query optimizers and database management systems (object-oriented or relational) depends on accurate cost estimation of various query reorderings [BGI]. Estimating predicate selec... / by practitioners in data warehousing that one half of the records

1687.7   Views for Semistructured Data - Serge Abiteboul (1997)   (Correct)
Defining a view over a semistructured database introduces many new problems. In this paper we propose a view specification language and consider the problem of answering queries posed over views. The ... / For example consider a large data warehouse stored in Lore that

1687.0   On Efficiently Implementing SchemaSQL on a SQL Database System - Lakshmanan (1999)   (Correct)
SchemaSQL is a recently proposed extension to SQL for enabling multi-database interoperability. Several recently identified applications for SchemaSQL, however, mainly rely on its ability to treat dat... / Query Optimization in a Data Warehouse Optimization using br Introduction Data warehousing is a technology motivated by

1686.8   Supporting Data Integration and Warehousing Using H2O - Zhou, Hull, King, Franchitti   (Correct)
This paper presents a broad framework for data integration, that supports both data materialization and virtual view capabilities, and that can be used with legacy as well as modern database systems. ... / component of the mediator is a data warehouse that holds a materialized br addressing this problem data warehousing i.e.materializing

1683.8   Prefetching in Segmented Disk Cache for Multi-Disk Systems - Valery Soloviev (1996)   (Correct)
This paper investigates the performance of a multi-disk storage system equipped with a segmented disk cache processing a workload of multiple relational scans. Prefetching is a popular method of impro... / for decision support and data warehouse applications for multimedia

1665.6   Algorithms for Deferred View Maintenance - Colby, Griffin, Libkin, Mumick.. (1997)   (Correct)
Materialized views and view maintenance are important for data warehouses, retailing, banking, and billing applications. We consider two related view maintenance problems: 1) how to maintain views aft... / maintenance are important for data warehouses retailing banking and

1664.0   Utilizing Materialized Views: Methods for Creating Database.. - Lauzac (1999)   (Correct)
Data caching and hoarding provide the only means to support disconnected mobile operations. In the context of mobile database applications, data cached can take the form of a materialized view. In th... / View Maintenance for a Data Warehouse . br Fixed Mobile Network Data Warehouse Data Warehouse Maintenance

1653.7   Expiring Data in a Warehouse - Hector Garcia-Molina (1998)   (Correct)
Data warehouses collect data into materialized views for analysis. After some time, some of the data may no longer be needed or may not be of interest. In this paper, we handle this by expiring or rem... / Abstract Data warehouses collect data into br views are often used to store warehouse data. The amount of data copied

1651.2   Data Cube Approximation and Histograms via Wavelets (Extended.. - Vitter, al. (1998)   (Correct)
Jeffrey Scott Vitter Center for Geometric Computing and Department of Computer Science Duke University Durham, NC 27708-0129 USA jsv@cs.duke.edu Min Wang Center for Geometric Computing and De... / in the analysis of data in data warehouses in the field of On-Line

1648.0   Data Warehouse Evolution: Trade-offs between Quality and Cost of.. - Lee, Koeller, Nica, Rundensteiner (1998)   (Correct)
The problem of rewriting queries has been heavily explored in recent years, including in work on query processing and optimization, semantic query refinement in decentralized environments, the rewriti... / WPI-CS-TR- - January Data Warehouse Evolution Trade-offs between

1646.4   Warehousing Structured and Unstructured Data for Data Mining - Miller, Honavar, Barta (1997)   (Correct)
More data, especially unstructured data, is available to users than ever. There is so much data available that it is difficult for users to make use of their data in its raw form. To handle the divers... / in either the multidatabase or data warehouse environment. The view system

1636.1   Query Answering Using Views for Data Integration over the Web - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
In this paper we address the problem of query answering using views in the context of integrating data over the web. According to the above observation, we concentrate on the common core of any query lan... / approach generally adopted in data warehousing the precomputed views

1628.4   A Scalable Algorithm for Answering Queries Using Views - Pottinger, Levy (2000)   (Correct)
The problem of answering queries using views is to find efficient methods of answering a query using a set of previously materialized views over the database, rather than accessing the database rel... / and data warehouse and web-site design

1625.5   The Dynamics of Database Views - Arenas, Bertossi (1998)   (Correct)
The dynamics of relational databases can be specified by means of Reiter's formalism based on the situation calculus. The specification of transaction based database updates is given in terms of Succ... / for certain purposes e.g. in data warehousing CD They are an important

1615.0   Capturing and Querying Multiple Aspects of Semistructured Data - Dyreson, Böhlen, Jensen (1998)   (Correct)
Motivated to a large extent by the substantial and growing prominence of the World-Wide Web and the potential benefits that may be obtained by applying database concepts and techniques to web data man... / analytical processing and data warehousing the framework makes it

1614.6   R-Tree Based Indexing of Now-Relative Bitemporal Data - Bliujute, Jensen, Saltenis.. (1998)   (Correct)
The databases of a wide range of applications, e.g., in data warehousing, store multiple states of time-evolving data. These databases contain a substantial part of now-relative data: data that became... / of applications e.g.in data warehousing store multiple states of

1606.2   Density-Based Indexing for Approximate Nearest-Neighbor Queries - Bennett, Fayyad, Geiger (1999)   (Correct)
We consider the problem of performing nearest-neighbor queries efficiently over large high-dimensional databases. Assuming that a full database scan to determine the nearest neighbor entries is not a... / databases. With the growth of Data Warehousing nearest-neighbor queries are

1587.6   Data Mining of Machine Learning Performance Data - Ibrahim (1999)   (Correct)
With the development and penetration of data mining within different fields and industries, many data mining algorithms have emerged. The selection of a good data mining algorithm to obtain the best re... / ever. Using technologies like data warehousing data is being stored in br Using technologies like data warehousing data is being stored in large

1578.1   The 3W Model and Algebra for Unified Data Mining - Johnson, Lakshmanan, Ng (2000)   (Correct)
Real data mining/analysis applications call for a framework which adequately supports knowledge discovery as a multi-step process, where the input of one mining operation can be the output of anoth... / Regions Consider a sales data warehouse with measures like

1573.6   nD-SQL: A Multi-dimensional Language for Interoperability and OLAP - Frédéric Gingras (1998)   (Correct)
We propose a multi-dimensional language called nD-SQL with the following features: (i) nD-SQL supports queries that interoperate amongst multiple relational sources with heterogeneous schemas, inclu... / eventually the creation of a data warehouse storing integrated summaries

1572.6   The Multi-Operator Method: Integrating Algorithms for the Efficient.. - Jaedicke, Mitschang (1999)   (Correct)
There has been a long record of research for efficient join algorithms in RDBMS, but user-defined join predicates in ORDBMS are typically evaluated using a restriction after forming the complete Cartes... /

1570.1   Automated Discovery of Concise Predictive Rules for Intrusion.. - Helmer, Wong, Honavar, Miller (1999)   (Correct)
We examine the effectiveness of rule learning to detect intrusions against privileged programs, using a feature vector representation to describe the system calls executed by each process. We then use... / pursuit of attackers. The data warehouse knowledge provided by the top br and developing the data warehouse and data mining capabilities of our

1548.6   Update Propagation Strategies to Improve Freshness in Lazy Master.. - Pacitti, Simon (1997)   (Correct)
Many distributed database applications need to replicate data to improve data availability and query response time. The two-phase commit protocol guarantees mutual consistency of replicated data but d... / EP DWQ Foundations for Data Warehouse Quality Introduction

1528.4   Neural networks in business: techniques and applications for the.. - Smith, Gupta (2000)   (Correct)
This paper presents an overview of the different types of neural network models which are applicable when solving business problems. The history of neural networks in business is outlined, leading to a... / is believed to be hidden in the data warehouse. Neural networks form the

1528.0   Answering Queries with Aggregation Using Views - Srivastava, Dar, H.V.Jagadish, Levy (1996)   (Correct)
We present novel algorithms for the problem of using materialized views to compute answers to SQL queries with grouping and aggregation, in the presence of multiset tables. In addition to its obvious ... / Example . Consider a data warehouse that holds information useful br in many applications such as data warehousing very large transaction

1527.4   Complexity of Answering Queries Using Materialized Views (Extended.. - Abiteboul, Duschka (1998)   (Correct)
We study the complexity of the problem of answering queries using materialized views. This problem has attracted a lot of attention recently because of its relevance in data integration. Previous work... / with the popularity of data warehouses LZW The problem br problem which arise in data warehousing. Part of the work

1525.8   Approximate Computation of Multidimensional Aggregates of Sparse Data .. - Vitter, Wang (1999)   (Correct)
Computing multidimensional aggregates in high dimensions is a performance bottleneck for many OLAP applications. Obtaining the exact answer to an aggregation query can be prohibitively expensive in te... / time and or storage space in a data warehouse environment. It is

1524.5   AQUA: System and Techniques for Approximate Query Answering - Gibbons, Poosala, Acharya, Bartal.. (1998)   (Correct)
In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries. The goal is to provide an estimated response in orders of magnitude le... / a problem with joins and Data Warehouse New Data Queries br class of queries that arise in data warehousing scenarios. Finally the area

1523.3   Foundations Of Time-Critical Scalable Computing - McColl   (Correct)
The BSP model provides a unified framework for the design and programming of scalable servers for time-critical computing. This paper describes some of the recent developments in the area of BSP compu... / processing data mining data warehousing decision support enterprise

1520.1   A Model for Data Warehouse Operational Processes - Vassiliadis, Quix, Vassiliou, Jarke (2000)   (Correct)
Previous research has provided metadata models that enable the capturing of the static components of a Data Warehouse (DW) architecture, along with information on different quality factors over th... / Vassiliou Jarke. A Model for Data Warehouse Processes. Page of A

1506.1   BOAT - Optimistic Decision Tree Construction - Gehrke, Ganti, Ramakrishnan, Loh (1999)   (Correct)
Classification is an important data mining problem. Given a training database of records, each tagged with a class label, the goal of classification is to build a concise model that can be used to pre... / dynamic environments such as data warehouses in which the training

1504.2   An Inductive Learning Algorithm for Production Rule Discovery - Mehmet Tolun   (Correct)
Data mining is the search for relationships and global patterns that exist in large databases. One of the main problems for data mining is that the number of possible relationships is very large, thu... /

1500.8   FLORID - Ein Prototyp für F-Logik - Frohn, Himmeröder, Kandzia, Lausen.. (1997)   (Correct)
it combines: -- The deductive approach attempts to close, or at least narrow, the gap between databases and applications (impedance mismatch), in that the expressive power of the database... / from the problem area of data warehousing and integration

1494.7   The MyView System: Tackling the Interface Problem - Wolff, Kalinski (1997)   (Correct)
The MyView system aims at gathering bibliographic information from a diversity of heterogeneous distributed Internet repositories like electronic journals, text archives, and traditional libraries. It... / to explore the personal warehouse data. This includes for instance br recently become popular as data warehousing see Efficient

1490.2   Minimizing Detail Data in Data Warehouses - Akinde, Jensen, Böhlen (1998)   (Correct)
Data warehouses collect and maintain large amounts of data from several distributed and heterogeneous data sources. Because of security reasons, operational requirements, and technical feasibility it ... / Minimizing Detail Data in Data Warehouses M. O. Akinde O. G. Jensen

1488.9   Rewriting Regular Expressions in Semi-Structured Data - Calvanese, De Giacomo, Lenzerini.. (1999)   (Correct)
In this paper we address the problem of query rewriting in the context of semi-structured data. We present a method for computing the rewriting of a regular expression E in terms of other regular expr... / well as in data integration data warehousing and query optimization the

1482.8   Join Synopses for Approximate Query Answering - Acharya, Gibbons, Poosala, Ramaswamy (1999)   (Correct)
In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we d... / for data reduction in large data warehouses. The three major classes of br Abstract In large data warehousing environments it is often

1479.7   Approximating multi-dimensional aggregate range queries over real.. - Gunopulos, Kollios, Tsotras.. (2000)   (Correct)
Finding approximate answers to multi-dimensional range queries over real valued attributes has significant applications in data exploration and database query optimization. In this paper we consider ... / optimization data mining and data warehousing. The query optimizer requires br task interactive. In data warehousing datasets can be very large.

1477.5   Shrinking the Warehouse Update Window - Labio, Yerneni, Garcia-Molina (1998)   (Correct)
Warehouse views need to be updated when source data changes. Due to the constantly increasing size of warehouses and the rapid rates of change, there is increasing pressure to reduce the time taken fo... / update. Introduction Data warehouses derive data from remote br that multiple versions of the warehouse data are maintained. V

1471.0   Reasoning with Aggregation Constraints in Views - Dar, Jagadish, Levy, Srivastava   (Correct)
We investigate the problem of using materialized views to compute answers to SQL queries with grouping and aggregation, in the presence of multiset tables. This problem is important in many applicat... / Example Consider a data warehouse that holds information useful br in many applications such as data warehousing mobile computing global

1470.3   Using Complex Substitution Strategies for View Synchronization - Nica, Rundensteiner (1998)   (Correct)
Large-scale information systems typically contain autonomous information sources (ISs) that dynamically modify their content, interfaces as well as their query services regardless of the dat... / services regardless of the data warehouses views that are built on

1462.9   A Foundation for Conventional and Temporal Query Optimization.. - Slivinskas, Jensen, Snodgrass (2000)   (Correct)
Most real-world databases contain substantial amounts of time-referenced, or temporal, data. Recent advances in temporal query languages show that such database applications may benefit substantially... / defining properties of a data warehouse Inm Recent advances in

1461.8   Practical Lineage Tracing in Data Warehouses - Cui (1999)   (Correct)
We consider the view data lineage problem in a warehousing environment: For a given data item in a materialized warehouse view, we want to identify the set of source data items that produced the view ... / Practical Lineage Tracing in Data Warehouses Yingwei Cui and

1460.9   Maintaining Temporal Views Over Non-Temporal Information Sources For.. - Yang, Widom (1998)   (Correct)
An important use of data warehousing is to provide temporal views over the history of source data that may itself be non-temporal. While recent work in view maintenance is applicable to data warehou... / architecture for the temporal data warehouse that automatically maintains br system-managed expiration of warehouse data which can be used to vacuum

1459.8   Recursive Plans for Information Gathering - Duschka, Levy (1997)   (Correct)
Generating query-answering plans for information gathering agents requires translating a user query, formulated in terms of a set of virtual relations, to a query that uses relations that are actuall... / for query optimization and data warehousing Yang and Larson

1457.4   Incremental Maintenance of Externally Materialized Views - Staudt, Jarke (1996)   (Correct)
With the advent of the Internet, access to database servers from autonomous clients will become more and more popular. In this paper, we propose a monitoring service that could be offered by such data... / multi-databases and data warehouses Incremental br is change propagation in data warehousing Traditional

1441.9   Efficient I/O for Very Large Multimedia Applications - Prabhakar, Agrawal, Abbadi
The ever growing needs of large multimedia systems cannot be met by magnetic disks due to their high cost and low storage density. Consequently, cheaper and denser tertiary storage systems are being i...

1441.4   WebOQL: Restructuring Documents, Databases and Webs - Arocena, Mendelzon (1998)
The widespread use of the Web has originated several new data management problems, such as extracting data from Web pages and making databases accessible from Web browsers, and has renewed the interes... / with certain features Web-data warehousing i.e. extracting

1434.0   Dynamic Load Balancing in Hierarchical Parallel Database Systems - Bouganim, Florescu, Valduriez (1996)
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-nothing system whose nodes are shared-memory multiprocessors. In this context, the problem of load bal... / for decision support e.g. data warehousing The objective of parallel

1425.9   Mathematical Programming Approaches To Machine Learning And Data.. - Bradley
Machine learning problems of supervised classification, unsupervised clustering and parsimonious approximation are formulated as mathematical programs. The feature selection problem arising in the sup...

1416.0   Multiple View Consistency for Data Warehousing - Zhuge, Wiener, Garcia-Molina (1997)
A data warehouse stores integrated information from multiple distributed data sources. In effect, the warehouse stores materialized views over the source data. The problem of ensuring data consistency... / Abstract A data warehouse stores integrated information ... generates transactions for the warehouse database system. We make no

1412.7   Mining surprising patterns using temporal description length - Chakrabarti, Sarawagi, Dom (1998)
We propose a new notion of surprising temporal patterns in market basket data, and algorithms to find such patterns. This is distinct from finding frequent patterns as addressed in the common mining l... / roles. Introduction Data warehousing technology has enabled

1412.7   Spatial Databases: Accomplishments and Research Needs - Shekhar, Chawla, Ravada, Fetterer.. (1997)
Spatial databases have been an active area of research for over two decades, addressing the growing data management and analysis needs of spatial applications such as Geographic Information Systems.... / to newer applications such as data warehouses and multimedia information

1411.5   Data Integration and Reconciliation in Data Warehousing: Conceptual.. - Calvanese, De Giacomo, Rosati (1999)
Integration is one of the most important aspects of a Data Warehouse. When data passes from the sources of the application-oriented operational environment to the Data Warehouse, possible inconsistenc... / the most important aspects of a Data Warehouse. When data passes from the ... Thus in the context of a Data Warehouse data integration and

1409.6   Efficient Rule-Based Attribute-Oriented Induction for Data Mining - Cheung, Hwang, Fu, Han
Data mining has become an important technique which has tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique and has been ... / have started to use data warehouse to store valuable data from ... they are coupled with the data warehousing technology to become an

1407.8   Issues for On-Line Analytical Mining of Data Warehouses (Extended.. - Han et al.
Jiawei Han, Sonny H.S. Chee and Jenny Y. Chiang Intelligent Database Systems Research Laboratory School of Computing Science, Simon Fraser University, British Columbia, Canada V5A 1S6 f han, schee... / On-Line Analytical Mining of Data Warehouses Extended Abstract ... mining tasks dynamically. Warehouse Data Data Base Data Meta

1403.7   Accessing the Web: Exploiting the Data Base Paradigm - Catarci, Iocchi, Nardi, Santucci
The open growth of the Internet, the amount of available information, and the typical access modality (i.e., browsing) cause the puzzled user to search for the information of interest in a labyrinth o... / to build a fully materialized data warehouse of the information in the Web.

1397.8   Concept Hierarchy in Data Mining: Specification, Generation and.. - Lu (1997)
Data mining is the nontrivial extraction of implicit, previously unknown, and potentially useful information from data. As one of the most important background knowledge, concept hierarchy plays a fun... / maintain state information data warehouses typically maintain ... Concept Hierarchy in Data Warehousing

1391.7   DBMiner: A System for Data Mining in Relational Databases and Data.. - Han (1997)
A data mining system, DBMiner, has been developed for interactive mining of multiple-level knowledge in large relational databases and data warehouses. The system implements a wide spectrum of data mi... / in Relational Databases and Data Warehouses Jiawei Han Jenny Y. ... to develop powerful data warehousing and data mining tools for analysis

1390.8   Rewriting Aggregate Queries Using Views - Cohen, Nutt, Serebrenik (1999)
We investigate the problem of rewriting queries with aggregate operators using views that may or may not contain aggregate operators. A rewriting of a query is a second query that uses view predicates... / value. In fact most existing data warehouses make use of this idea in ... recently by the surge of data warehousing and decision support

1386.0   Improving OLAP Performance by Multidimensional Hierarchical Clustering - Markl, Ramsak, Bayer (1999)
Data-warehousing applications cope with enormous data sets in the range of Gigabytes and Terabytes. Queries usually either select a very small set of this data or perform aggregations on a fairly larg... / scheme for the fact table of a data warehouse according to multiple ... bayer in.tum.de Abstract Data-warehousing applications cope with

1385.2   Incremental Maintenance of Aggregate and Outerjoin Expressions - Gupta, Mumick (1999)
Views stored in a data warehouse need to be kept current. As recomputing the views is very expensive, incremental maintenance algorithms are required. Over recent years, several incremental maintenanc... / Abstract Views stored in a data warehouse need to be kept current. As

1380.7   Offering a Precision-Performance Tradeoff for Aggregation Queries.. - Olston, Widom (2000)
Strict consistency of replicated data is infeasible or not required by many distributed applications, so current systems often permit stale replication, in which cached copies of data values are allo... / a stale replication system is a data warehouse where we can view the data

1379.9   Physical Database Design for Data Warehouses - Labio, Quass, Adelberg (1997)
Data warehouses collect copies of information from remote sources into a single database. Since the remote data is cached at the warehouse, it appears as local relations to the users of the warehouse.... / Physical Database Design for Data Warehouses Wilburt Juan Labio y

1377.1   Distributed Multivariate Regression Using Wavelet-based Collective.. - Hershberger, Kargupta (1999)
This paper presents a method for distributed multivariate regression using wavelet-based Collective Data Mining (CDM). The method seamlessly blends machine learning and information theory with the sta...

CiteSeer - citeseer.org - Copyright © 1997-2002 NEC Research Institute