Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling (2005)
Graham Cormode, S. Muthukrishnan, Irina Rozenbaum



View or download:
vldb2005.org/program/...p25cormode.pdf

From: vldb2005.org/pr...ull_program.php

Abstract: Emerging data stream management systems approach the challenge of massive data distributions which arrive at high speeds while there is only small storage by summarizing and mining the distributions using samples or sketches. However, data distributions can be "viewed" in different ways. A data stream of integer values can be viewed either as the forward distribution f(x), i.e., the number of occurrences of x in the stream, or as its inverse, f^{-1}(i), which is the number of items that ...

Active bibliography (related documents):
0.6:   Automated Modeling and Nonlinear Axis Scaling - Leejay Wu (2005)
0.5:   Streamline: A Scheduling Heuristic for Streaming.. - Agarwalla, Ahmed, .. (2005)
0.4:   Streams, Security and Scalability - Theodore Johnson Muthukrishnan (2005)


BibTeX entry:

@misc{ cormode-summarizing,
  author = "Graham Cormode and S. Muthukrishnan and Irina Rozenbaum",
  title = "Summarizing and Mining Inverse Distributions on Data Streams via Dynamic
    Inverse Sampling",
  url = "citeseer.ist.psu.edu/cormode05summarizing.html" }
Citations (may not include all citations):
387   The space complexity of approximating the frequency moments - Alon, Matias et al. - 1996
358   Universal classes of hash functions - Carter, Wegman - 1979
197   Communication Complexity, Cambridge University Press - Kushilevitz, Nisan - 1997
132   Randomized Algorithms - Motwani, Raghavan - 1995
130   Models and issues in data stream systems - Babcock, Babu et al. - 2002
111   New sampling-based summary statistics for improving approxim.. - Gibbons, Matias - 1998
83   New directions in traffic measurement and accounting - Estan, Varghese - 2002
67   Approximate frequency counts over data streams - Manku, Motwani - 2002
67   Maintaining stream statistics over sliding windows - Datar, Gionis et al. - 2002
60   Fast incremental maintenance of approximate histograms - Gibbons, Matias et al. - 1997
45   Gigascope: A stream database for network applications - Cranor, Johnson et al. - 2003
38   Random Sampling from Databases - Olken - 1997
34   Random sampling for histogram construction: How much is enou.. - Chaudhuri, Motwani et al. - 1998
31   On random sampling over joins - Chaudhuri, Motwani et al. - 1999
31   Estimating simple functions on the union of data streams - Gibbons, Tirthapura - 2001
26   Probabilistic counting algorithms for database applications - Flajolet, Martin - 1985
21   Distinct sampling for highly-accurate answers to distinct va.. - Gibbons - 2001
18   Data streams: Algorithms and applications - Muthukrishnan - 2003
17   Random sampling with a reservoir, ACM Transactions on Mathematical Software - Vitter - 1985
17   Towards estimation error guarantees for distinct values - Charikar, Chaudhuri et al. - 2000
16   What's hot and what's not: Tracking most frequent items dyna.. - Cormode, Muthukrishnan - 2003
14   Querying and mining data streams: You only get one look - Garofalakis, Gehrke et al. - 2002
8   Estimating rarity and similarity over data stream windows - Datar, Muthukrishnan - 2002
7   Processing set expressions over continuous update streams - Ganguly, Garofalakis et al. - 2003
6   Aurora: a data stream management system - Abadi, Carney et al. - 2003
4   Holistic UDAFs at streaming speeds - Cormode, Korn et al. - 2004
2   TelegraphCQ: continuous dataflow processing - Chandrasekaran, Cooper et al. - 2003
1   DIMACS working group on monitoring message streams - Madigan - 2003
1   Coresets in dynamic geometric data streams - Frahling, Sohler - 2005
http://ita.ee.lbl.gov/

Documents on the same site (http://www.vldb2005.org/program/full_program.php):
Getting Priorities Straight: Improving Linux Support for.. - Hall, Bonnet (2005)
Improving Database Performance on Simultaneous.. - Zhou, Cieslewicz.. (2005)
Tree-Pattern Queries on a Lightweight XML Processor - Moro, Vagena, Tsotras (2005)

CiteSeer.IST - Copyright Penn State and NEC