37 citations found. Retrieving documents...
Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents

On Approximation Algorithms for Data Mining Applications - Afrati (2002)   (Correct)

....numbers of exact precision. Thus, random generators are used in [53]to tackle the problem of how to reduce the number of required random bits. The problem of approximating L p distances in one pass algorithms has a variety of other applications too, e.g. in estimating the size of self join [7, 45] and in the estimation of statistics of network ow data [42] Computing sketches is a technique used with success to get summaries of the data. It has enabled compression of data and speeding up computation for various data mining tasks [54, 32, 23, 24, 33] See also in section 5 for a ....

P. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In SODA, pages S909{S910, 1999.


Testing Properties of Directed Graphs: Acyclicity and.. - Suny Stony Brook   (Correct)

....rise to a family of approximation algorithms) it may give more insight into the problem in question. With the current trend of increasing memory and storage sizes, the problem of examining large structures in sublinear time has been studied in other contexts. For example, Gibbons and Matias [15, 16] develop a variety of data structures that glean information from large databases so they can be examined in sublinear time. Organization. This paper is organized as follows: In section 2 we present de nitions and terminology that will be used in the rest of the paper. In Section 3 we preset a ....

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


A Framework for the Physical Design Problem for Data Synopses - König, Weikum (2002)   (Correct)

.... of the information quality of Web information sources (e.g. number of reviews per book in an e commerce site) as a basis for planning global queries in a Web mediator [27] 28] This broad importance of statistics management has led to a plethora of approximation techniques, for which [15] have coined the general term data synopses : advanced forms of histograms [30, 16, 20] spline synopses [22, 23] sampling [6, 17, 14] and parametric curve fitting techniques [34, 9] all the way to highly sophisticated methods based on kernel estimators [2] or Wavelets and other transforms [26, ....

P. B. Gibbons and Y. Matias. Synopsis Data Structures for Massive Data Sets. In Symposium on Discrete Algorithms, 1999.


Online Algorithms for Mining Semi-structured Data Stream - Asai, al. (2002)   (2 citations)  (Correct)

.... data have attracted wide atten tion of networks, e commerce, information retrieval and databases [2, 19] In contrast, there have not been many studies on semi structured data min ing [1, 4, 7, 9, 13, 15, 16, 20, 22] There are a body of researches on online data processing and mining [10, 14, 18]. Most related work is Hidher [11] who proposed a model of continuous pattern discovery from unbounded data stream, and presented adaptive online algorithm for mining association rules. Parthasarathy et al. 17] and Mannila et al. 14] studied mining of sequential patterns and episode patterns. ....

P. B. Gibbons and Y. Matias, Synopsis Data Structures for Massive Data Sets, In External Memory Algorithms, DIMACS Series in Discr. Math. and Theor. Cornpt. Sci., Vol. 50, AMS, 39 70, 2000.


Recent Progress on Selected Topics in Database Research - Chen, Li, al.   (Correct)

....and performance of applications in a wide area network. Used as index structures, it provides fast alternative access paths to the base data. Used as materialized views [85] in databases or data warehouses [82] it improves the performance of complex queries over the base data. Used as synopses [84], it provides fast, approximate answer to queries or statistics necessary for cost based query optimization. Derived data varies in complexity: It can be a simple copy of the base data, in the cases of caching and replication, or it can be the result of complex structural or content transformation ....

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. DIMACS Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A:39--70, 1999.


Efficient Approximation of Correlated Sums On Data Streams - Ananthakrishna, Das, Gehrke (2003)   (Correct)

....to monitor network tra#c behavior. The large volume of stream data, and the online nature of the various applications that operate on such data, make it imperative for the applications to compute a variety of summary information in an online fashion using a bounded amount of space (see, e.g. [1, 2, 4, 5, 6, 7, 8], and the references therein) Correlated aggregates (see, e.g. 3, 6] which provide a natural mechanism for the flexible composition of basic aggregates, are desirable since they are more descriptive than the basic aggregates for understanding relationships between variables in the stream ....

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In Proceedings of SODA, 1999. 7


An Evaluation of Multi-Resolution Search and.. - Ganesan.. (2003)   (5 citations)  (Correct)

....DQT implementation. Data Mining. Geographic Information Systems (GIS) deal with data that exhibits spatio temporal correlations, but the processing is centralized, and algorithms are driven by the need to reduce search cost, typically by optimizing disk access latency. Some of these approaches ([23, 24, 26]) propose the construction of wavelet synopses, for fast processing of range sum queries. Many of the techniques proposed for approximate querying and data mining have informed the coding techniques that we choose in our system. An interesting example of a distributed infrastructure for wide area ....

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


Synopsis Diffusion for Robust Aggregation in Sensor.. - Nath, Gibbons, Seshan.. (2003)   (11 citations)  Self-citation (Gibbons)   (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Math. and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, 1999.


Distributed Streams Algorithms for Sliding Windows - Gibbons, Tirthapura (2002)   (8 citations)  Self-citation (Gibbons)   (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In J. M. Abello and J. S. Vitter, editors, External Memory Algorithms, pages 39--70. AMS, 1999. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science, Vol. 50. A two page summary appeared as a short paper in SODA'99.


Synopsis Diffusion for Robust Aggregation in Sensor Networks - Nath, Gibbons (2003)   (11 citations)  Self-citation (Gibbons)   (Correct)

No context found.

GIBBONS, P. B., AND MATIAS, Y. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization (1999).


Distributed Streams Algorithms for Sliding Windows - Gibbons, Tirthapura (2002)   (8 citations)  Self-citation (Gibbons)   (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In J. M. Abello and J. S. Vitter, editors, External Memory Algorithms, pages 39-70. AMS, 1999. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science, Vol. 50. A two page summary appeared as a short paper in SODA'99.


Tracking Join and Self-Join Sizes in Limited Storage - Alon, Gibbons, Matias, Szegedy (2002)   (27 citations)  Self-citation (Gibbons Matias)   (Correct)

....whenever the self join sizes are smaller than n B. The performance and accuracy bounds of the algorithms in this paper are valid for any data distributions. Related work. Tracking algorithms and other general data reduction techniques have a long history; see [BDF 97] for a recent survey. GM99] presented a formal framework for evaluating such sublinear space synopsis data structures, and a survey of some of the results in this area. There has been a flurry of recent work in approximate query answering (e.g. VL93, Olk93, BDF 97, HHW97, GM98, AGPR99, HH99, VW99, IP99, AGP00, ....

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In J. M. Abello and J. S. Vitter, editors, External Memory Algorithms, pages 39--70. AMS, 1999. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science, Vol. 50. A two page summary appeared as a short paper in SODA'99.


Analysis and design of approximate queries over XML - Documents Using Statistical   (Correct)

No context found.

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


An Evaluation of Multi-resolution Storage for Sensor.. - Ganesan, Greenstein.. (2003)   (6 citations)  (Correct)

No context found.

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization,A, 1999.


Efficient URL Caching for World Wide Web Crawling - Broder, Najork, Wiener (2003)   (1 citation)  (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In Proceedings of the 10th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 909--910. ACM Press, 1999.


GenIc: A Single Pass Generalized Incremental Algorithm for.. - Gupta, Grossman (2004)   (1 citation)  (Correct)

No context found.

P. B. Gibbons, and Y. Matias, "Synopsis Data Structures for Massive Data Sets," Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


Testing Properties of Directed Graphs: Acyclicity and Connectivity - Bender, Ron (2000)   (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


Accuracy vs Lifetime: Linear Sketches for Appoximate.. - Kalpakis.. (2004)   (Correct)

No context found.

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


ForNet: A Distributed Forensics Network - Shanmugasundaram, Memon, Savant, .. (2003)   (Correct)

No context found.

P. Gibbons and Y. Matias. Synopsis data structures for massive data sets. In DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, 1999.


An Evaluation of Multi-resolution Storage for Sensor.. - Ganesan, Greenstein.. (2003)   (6 citations)  (Correct)

No context found.

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


Finding Frequent Items in Data Streams - Charikar, Chen, Farach-Colton (2002)   (42 citations)  (Correct)

No context found.

Phillip Gibbons and Yossi Matias. Synopsis data structures for massive data sets. In Proc. 10th Annual ACM-SIAM Symposium on Discrete Algorithms, pages 909--910, 1999.


Clustering Data Streams: Theory and Practice - Guha, Meyerson, Mishra.. (2003)   (3 citations)  (Correct)

No context found.

P. Gibbons and Y. Matias. Synopsis data structures for massive data sets. Proc. SODA, pages S909-- S910, 1999.


Analysis and design of approximate queries over XML documents.. - Marrara   (Correct)

No context found.

Phillip B. Gibbons and Yossi Matias. Synopsis data structures for massive data sets. DIMACS: Series in Discrete Mathematics and Theoretical Computer Science: Special Issue on External Memory Algorithms and Visualization, A, 1999.


Online Algorithms for Mining Semi-structured Data Stream - Tatsuya Asai Hiroki (2002)   (2 citations)  (Correct)

No context found.

P. B. Gibbons and Y. Matias, Synopsis Data Structures for Massive Data Sets, In External Memory Algorithms, DIMACS Series in Discr. Math. and Theor. Compt. Sci., Vol. 50, AMS, 39--70, 2000.


RHist: Adaptive Summarization over Continuous Data Streams - Qiao, Agrawal, Abbadi (2002)   (3 citations)  (Correct)

No context found.

P. B. Gibbons and Y. Matias. Synopsis Data Structures for Massive Data Sets. Technical report, Bell Labs, 1998.

First 50 documents

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC