• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Principles of distributed database systems. (2011)

by M T Ozsu, P Valduriez
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 588
Next 10 →

The Cougar Approach to In-Network Query Processing in Sensor Networks

by Yong Yao, Johannes Gehrke - SIGMOD Record , 2002
"... The widespread distribution and availability of smallscale sensors, actuators, and embedded processors is transforming the physical world into a computing platform. One such example is a sensor network consisting of a large number of sensor nodes that combine physical sensing capabilities such as te ..."
Abstract - Cited by 498 (1 self) - Add to MetaCart
The widespread distribution and availability of smallscale sensors, actuators, and embedded processors is transforming the physical world into a computing platform. One such example is a sensor network consisting of a large number of sensor nodes that combine physical sensing capabilities such as temperature, light, or seismic sensors with networking and computation capabilities. Applications range from environmental control, warehouse inventory, and health care to military environments. Existing sensor networks assume that the sensors are preprogrammed and send data to a central frontend where the data is aggregated and stored for offline querying and analysis. This approach has two major drawbacks. First, the user cannot change the behavior of the system on the fly. Second, conservation of battery power is a major design factor, but a central system cannot make use of in-network programming, which trades costly communication for cheap local computation.
(Show Context)

Citation Context

...xisting techniques can be applied directly. There are several excellent surveys and books on distributed query processing, such as work by Yu and Chang [51], Ceri and Pelagatti [7], zsu and Valduriez =-=[33]-=-, Yu and Meng [50], and Kossmann [26]. 4.4 Adaptive Query Processing. We believe that techniques for adaptive query processing will be very relevant for data management in sensor networks. Chen and Ro...

Query Processing for Sensor Networks

by Yong Yao, Johannes Gehrke , 2003
"... Hardware for sensor nodes that combine physical sensors, actuators, embedded processors, and communication components has advanced significantly over the last decade, and made the large-scale deployment of such sensors a reality. Applications range from monitoring applications such as inventory main ..."
Abstract - Cited by 447 (4 self) - Add to MetaCart
Hardware for sensor nodes that combine physical sensors, actuators, embedded processors, and communication components has advanced significantly over the last decade, and made the large-scale deployment of such sensors a reality. Applications range from monitoring applications such as inventory maintenance over health care to military applications.

The state of the art in distributed query processing

by Donald Kossmann - ACM Computing Surveys , 2000
"... Distributed data processing is fast becoming a reality. Businesses want to have it for many reasons, and they often must have it in order to stay competitive. While much of the infrastructure for distributed data processing is already in place (e.g., modern network technology), there are a number of ..."
Abstract - Cited by 320 (3 self) - Add to MetaCart
Distributed data processing is fast becoming a reality. Businesses want to have it for many reasons, and they often must have it in order to stay competitive. While much of the infrastructure for distributed data processing is already in place (e.g., modern network technology), there are a number of issues which still make distributed data processing a complex undertaking: (1) distributed systems can become very large involving thousands of heterogeneous sites including PCs and mainframe server machines � (2) the state of a distributed system changes rapidly because the load of sites varies over time and new sites are added to the system� (3) legacy systems need to be integrated|such legacy systems usually have not been designed for distributed data processing and now need to interact with other (modern) systems in a distributed environment. This paper presents the state of the art of query processing for distributed database and information systems. The paper presents the \textbook " architecture for distributed query processing and a series of techniques that are particularly useful for distributed database systems. These techniques include special join techniques, techniques to exploit intra-query parallelism, techniques to reduce communication costs, and techniques to exploit caching and replication of data. Furthermore, the paper discusses di erent kinds of distributed systems such as client-server, middleware (multi-tier), and heterogeneous database systems and shows how query processing works in these systems. Categories and subject descriptors: E.5 [Data]:Files � H.2.4 [Database Management Systems]: distributed databases, query processing � H.2.5 [Heterogeneous Databases]: data translation General terms: algorithms � performance Additional key words and phrases: query optimization � query execution � client-server databases � middleware � multi-tier architectures � database application systems � wrappers� replication � caching � economic models for query processing � dissemination-based information systems 1
(Show Context)

Citation Context

...In terms of related work, there have been several surveys on distributed query processing� e.g., a paper by Yu and Chang [YC84] and parts of the books by Ceri and Pelagatti [CP84], Ozsu and Valduriez =-=[OV91]-=-, and Yu and Meng [YM97] are devoted to distributed query processing. These surveys, however, are mostly focussed on the presentation of the techniques used in the early prototypes of the seventies an...

An Adaptive Query Execution System for Data Integration

by Zachary Ives, Daniela Florescu, Inria Roquencourt, Marc Friedman, Alon Levy, Daniel Weld , 1999
"... Query processing in data integration occurs over networkbound, autonomous data sources. This requires extensions to traditional optimization and execution techniques for three reasons: there is an absence of quality statistics about the data, data transfer rates are unpredictable and bursty, and slo ..."
Abstract - Cited by 226 (21 self) - Add to MetaCart
Query processing in data integration occurs over networkbound, autonomous data sources. This requires extensions to traditional optimization and execution techniques for three reasons: there is an absence of quality statistics about the data, data transfer rates are unpredictable and bursty, and slow or unavailable data sources can often be replaced by overlapping or mirrored sources. This paper presents the Tukwila data integration system, designed to support adaptivity at its core using a two-pronged approach. Interleaved planning and execution with partial optimization allows Tukwila to quickly recover from decisions based on inaccurate estimates. During execution, Tukwila uses adaptive query operators such as the double pipelined hash join, which produces answers quickly, and the dynamic collector, which robustly and efficiently computes unions across overlapping data sources. We demonstrate that the Tukwila architecture extends previous innovations in adaptive execution (such as...
(Show Context)

Citation Context

... The key contribution is that adaptivity is designed into its core to facilitate interleaving of planning and execution. Furthermore, Tukwilasprovides a platform for incorporating hybrid optimization =-=[19, p181]-=- and important query optimization techniques that have been developed previously in isolation (e.g., query scrambling [22], choose nodes [12], runtime re-optimization [15], optimization of fusion quer...

An Adaptive Data Replication Algorithm

by Ouri Wolfson, Sushil Jajodia, Yixiu Huang - ACM Transactions on Database Systems , 1997
"... This paper addresses the performance of distributed database systems. Specifically, we present an algorithm for dynamic replication of an object in distributed systems. The algorithm is adaptive in the sense that it changes the replication scheme of the object (i.e. the set of processors at which th ..."
Abstract - Cited by 185 (0 self) - Add to MetaCart
This paper addresses the performance of distributed database systems. Specifically, we present an algorithm for dynamic replication of an object in distributed systems. The algorithm is adaptive in the sense that it changes the replication scheme of the object (i.e. the set of processors at which the object is replicated), as changes occur in the read-write pattern of the object (i.e. the number of reads and writes issued by each processor). The algorithm continuously moves the replication scheme towards an optimal one. We show that the algorithm can be combined with the concurrency control and recovery mechanisms of a distributed database management system. The performance of the algorithm is analyzed theoretically and experimentally. On the way we provide a lower bound on the performance of any dynamic replication algorithm.
(Show Context)

Citation Context

...ad is lower because they eliminate the extra messages required in the centralized case. The ADR algorithm works in the read-one-write-all context (see [Bernstein et al. 1987; Ceri and Pelagatti 1984; =-=Ozsu and Valduriez 1991-=-]), and may be combined with two-phase-locking or another concurrency control algorithm in order to ensure one-copy-serializability (see [Bernstein et al. 1987]). Read-one-write-all implies that write...

Parallel crawlers

by Junghoo Cho, Hector Garcia-molina - In Proceedings of the 11th international conference on World Wide Web , 2002
"... In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish downloading pages in a reasonable amount of time. We first propose multiple architectures for a parallel crawler and ident ..."
Abstract - Cited by 133 (3 self) - Add to MetaCart
In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish downloading pages in a reasonable amount of time. We first propose multiple architectures for a parallel crawler and identify fundamental issues related to parallel crawling. Based on this understanding, we then propose metrics to evaluate a parallel crawler, and compare the proposed architectures using 40 million pages collected from the Web. Our results clarify the relative merits of each architecture and provide a good guideline on when to adopt which architecture. 1

Methods and Tools for Ontology Evolution

by Ljiljana Stojanovic , 2004
"... ..."
Abstract - Cited by 115 (3 self) - Add to MetaCart
Abstract not found

Locating Objects in Mobile Computing

by Evaggelia Pitoura, George Samaras , 2001
"... In current distributed systems, the notion of mobility is emerging in many forms and applications. ..."
Abstract - Cited by 105 (7 self) - Add to MetaCart
In current distributed systems, the notion of mobility is emerging in many forms and applications.
(Show Context)

Citation Context

...se of a higher lookup resolution cost. Choosing the network sites at which to maintain replicas of the current location of a mobile user resembles the file allocation [15] and the database allocation =-=[46]-=- problem. These classical problems are concerned with the selection of sites at which to maintain replicas of files or database partitions. The selection of sites is based on the read/write pattern of...

A Performance Study of Monitoring and Information Services . . .

by Xuehai Zhang, Jeffrey L. Freschl, Jennifer M. Schopf , 2003
"... Monitoring and information services form a key component of a distributed system, or Grid. A quantitative study of such services can aid in understanding the performance limitations, advise in the deployment of the monitoring system, and help evaluate future development work. To this end, we study t ..."
Abstract - Cited by 104 (10 self) - Add to MetaCart
Monitoring and information services form a key component of a distributed system, or Grid. A quantitative study of such services can aid in understanding the performance limitations, advise in the deployment of the monitoring system, and help evaluate future development work. To this end, we study the performance of three monitoring and information services for distributed

A Methodological Framework for Data Warehouse Design

by Matteo Golfarelli, Stefano Rizzi - In Proc. DOLAP , 1998
"... Though designing a data warehouse requires techniques completely different from those adopted for operational systems, no significant effort has been made so far to develop a complete and consistent design methodology for data warehouses. In this paper we outline a general methodological framework f ..."
Abstract - Cited by 81 (6 self) - Add to MetaCart
Though designing a data warehouse requires techniques completely different from those adopted for operational systems, no significant effort has been made so far to develop a complete and consistent design methodology for data warehouses. In this paper we outline a general methodological framework for data warehouse design, based on our Dimensional Fact Model (DFM). After analyzing the existing information system and collecting the user requirements, conceptual design is carried out semi-automatically starting from the operational database scheme. A workload is then characterized in terms of data volumes and expected queries, to be used as the input of the logical and physical design phases whose output is the final scheme for the data warehouse. Keywords Data warehouse, design methodology, conceptual model. 1. INTRODUCTION The database community is devoting increasing attention ...
(Show Context)

Citation Context

...one of the measure subsets. The problem of determining the optimal partitioning given a workload has been widely investigated within the context of centralized as well as distributed database systems =-=[13]-=-. Unfortunately, the results reported in the literature cannot be applied to the DW case since the redundancy introduced by materializing views binds the partitioning problem to that of deciding on wh...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University