90 citations found. Retrieving documents...
S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS'00), Redondo Beach, CA, 2000.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Optimal Time Bounds for Approximate Clustering - Mettu, Plaxton (2002)   (3 citations)  (Correct)

....factor of optimal. Specifically, for the case of uniform weights, our successive sampling algorithm yields a (k log (n k) O(1) configuration with high probability in O(nmax k, log n ) time. In addition to this sampling result, our algorithms rely on an extraction technique due to Guha et al. [8] that uses a black box O(1) approximate k median algorithm to compute a (k, O(1) configuration from any (m, O(1) assignment. The black box algorithm that we use is the linear time deterministic online median algorithm of Mettu and Plaxton [15] In developing our randomized algorithm for the ....

....Thorup gives a sampling technique that also consists of a series of sampling steps but produces an (O( k log n) #) 2 #) configuration for any positive real # with 0 # 0.4, but is only guaranteed to succeed with probability 1 2. For the data stream model of computation, Guha et al. [8] give a single pass O(1) approximate algorithm for the k median problem that runs in O(nk) time and requires O(n ) space for a positive constant #. They also establish a lower bound of # nk) for deterministic O(1) approximate k median algorithms. Mishra et al. 16] show that in order to ....

[Article contains additional citation context not shown here]

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proceedings of the 41st Annual IEEE Symposium on Foundations of Computer Science, pages 359--366, November 2000.


On Approximation Algorithms for Data Mining Applications - Afrati (2002)   (Correct)

....distance from data points to their closest cluster centers. The assumptions taken by the classical k median approach are: 17 1)each cluster can be e ectively modeled by a spherical Gaussian distribution, 2) each data item is assigned to one cluster and 3) the weights are assumed equal. In [50], a constant factor approximation algorithm for the k median problem is proposed which uses a single pass on the data stream model and requires workspace (n ) A related problem is the k center problem (minimize the maximum radius of a cluster) which is investigated in [30] where a single ....

....which uses a single pass on the data stream model and requires workspace (n ) A related problem is the k center problem (minimize the maximum radius of a cluster) which is investigated in [30] where a single pass algorithm which requires workspace O(k) is presented. In [75] results from [50] are used to develop an algorithm that achieves dramatically better clustering quality than BIRCH although it takes longer to run. In [16] a single pass algorithm is presented for points in the Euclidean space. The method used is based in identifying regions of the data that are compressible ....

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In FOCS, 2000.


Optimal Time Bounds for Approximate Clustering - Ramgopal Mettu Greg (2002)   (3 citations)  (Correct)

....factor of optimal. Specifically, for the case of uniform weights, our successive sampling algorithm yields a (k log (n=k) O(1) configuration with high probability in O(nmaxfk; log ng) time. In addition to this sampling result, our algorithms rely on an extraction technique due to Guha et al. [6] that uses a black box O(1) approximate k median algorithm to compute a (k, O(1) configuration from any (m, O(1) assignment. The black box algorithm that we use is the linear time deterministic online median algorithm of Mettu and Plaxton [12] In developing our randomized algorithm for the ....

....Thorup gives a sampling technique that also consists of a series of sampling steps but produces an (O( k log n) 2 ) configuration for any positive real with 0 0:4, but is only guaranteed to succeed with probability 1=2. For the data stream model of computation, Guha et al. [6] give a single pass O(1) approximate algorithm for the k median problem that runs in O(nk) time and requires O(n ) space for a positive constant . They also establish a lower bound of nk) for deterministic O(1) approximate k median algorithms. Mishra et al. 13] show that in order to ....

[Article contains additional citation context not shown here]

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proceedings of the 41st Annual IEEE Symposium on Foundations of Computer Science, pages 359--366, November 2000.


StatStream: Statistical Monitoring of Thousands of Data Streams .. - Zhu, Shasha (2002)   (25 citations)  (Correct)

....A domain specific lan Figure 6: The precision and pruning power using dif guage, Hancock[6] has been designed at AT T to ferent numbers of coefficients, thresholds and datasets extract signatures from massive transaction streams. Algorithms for constructing decision trees[8] and clustering [15] for data streams have been proposed. ReTable 2: Precision after post processing cent work of Manku et al. 20] Greenwald et al. 14] Dataset R0.85 R0.85 R0.9 R0.9 S0.85 Tolerance 0.001 0.0005 0.001 0.0005 0.0005 Precision 0.9933 0.9947 0.9765 0.9865 0.9931 Recall 1.0 0.9995 1.0 0.9987 1.0 have ....

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In the Annual Symposium on Foundations of Computer Science, IEEE, 2000.


Clustering Data Streams: Theory and Practice - Guha, Meyerson, Mishra.. (2003)   (3 citations)  Self-citation (Guha Mishra Motwani O'callaghan)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. FOCS, pages 359--366, 2000.


Mining Evolving Web Clickstreams with Explicit Retrieval .. - Nasraoui, Cardona, Rojas (2004)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS'00), Redondo Beach, CA, 2000.


Issues in Data Stream Management - Lukasz Golab And (2003)   (20 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, L. O'Callaghan. Clustering Data Streams. In Proc. IEEE Symp. on Foundations of Computer Science, pp. 359--366.


Towards NIC-based Intrusion Detection - Otey Parthasarathy Ghoting (2003)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In proceeding of the Annual Symp. on Foundations of Computer Science, 2000.


Holistic UDAFs at Streaming Speeds - Cormode, Johnson, Korn.. (2004)   (2 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. FOCS, pages 359--366, 2000.


Online Algorithms for Mining Inter-Stream Associations.. - Loo, Tong, Kao, Cheung (2005)   (Correct)

No context found.

Sudipto Guha, Nina Mishra, Rajeev Motwani, and Liadan O'Callaghan. Clustering data streams. In FOCS, pages 359--366, 2000.


Resource-aware Knowledge Discovery in Data Streams - Gaber, Zaslavsky, Krishnaswamy (2004)   (Correct)

No context found.

Guha S., Mishra N., Motwani R., and O'Callaghan L.: Clustering data streams. In Proceedings of the Annual Symposium on Foundations of Computer Science. IEEE, November (2000).


A Framework for Mining Instant Messaging Services - Resig, Teredesai (2004)   (Correct)

No context found.

Guha, S., Mishra, N., Motwani, R., and O'Callaghan, L. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (2000), pp. 359--366.


Data Streams: Algorithms and Applications - Muthukrishnan (2003)   (12 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani and L. O'Callaghan. Clustering data streams. IEEE FOCS, 2000, 359--366.


Online Mining of Changes from Data Streams: - Research Problems And (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In FOCS'00, pages 359--366, Redondo Beach, CA, 2000.


TECNO-STREAMS: Tracking Evolving Clusters in Noisy.. - Nasraoui, Uribe.. (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS'00), Redondo Beach, CA, 2000.


M-Kernel Merging: Towards Density Estimation over Data Streams - Zhou, Cai, Wei, Qian (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proceedings of Symposium on Foundations of Computer Science (FOCS), pages 359--366, 2000.


Temporal and Spatio-Temporal Aggregations over Data.. - Zhang, Gunopulos.. (2003)   (2 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani and L. O'Callaghan, "Clustering Data Streams", Proc. of FOCS, 2000.


A Framework for Projected Clustering of High - Dimensional Data Streams   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, L. O'Callaghan. Clustering Data Streams. IEEE FOCS Conference, 2000.


MAIDS: Mining Alarming Incidents from Data Streams - Cai, Clutter, Pape, Han..   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. IEEE Symposium on Foundations of Computer Science (FOCS'00), pages 359--366, Redondo Beach, CA, 2000.


Facilitating Interactive Distributed Data Stream Processing.. - Amol Ghoting And   (Correct)

No context found.

S. Guha et al. Clustering data streams. In ACM FOCS, 2000.


TECNO-STREAMS: Tracking Evolving Clusters in Noisy.. - Nasraoui, Cardona, .. (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS'00), Redondo Beach, CA, 2000.


Mining Evolving Web Clickstreams with Explicit Retrieval .. - Nasraoui, Cardona, Rojas (2004)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS'00), Redondo Beach, CA, 2000.


Temporal Aggregation over Data Streams using Multiple .. - Zhang, Gunopulos.. (2002)   (3 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani and L. O'Callaghan, "Clustering Data Streams", Proc. of FOCS, 2000.


A Framework for Projected Clustering of High - Dimensional Data Streams (2004)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, L. O'Callaghan. Clustering Data Streams. IEEE FOCS Conference, 2000.


StreamMiner: A Classifier Ensemble-based Engine to Mine.. - Fan (2004)   (1 citation)  (Correct)

No context found.

Guha, S., Milshra, N., Motwani, R., and O'Callaghan, L. (2000). Clustering data streams. In IEEE Symposium on Foundations of Computer Science (FOCS), pages 359--366.


GenIc: A Single Pass Generalized Incremental Algorithm for.. - Gupta, Grossman (2004)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan, "Clustering Data Streams," In the Annual Symposium on Foundations of Computer Science, IEEE, 2000.


Finding Longest Increasing and Common Subsequences in.. - Liben-Nowell, Vee, Zhu (2003)   (Correct)

No context found.

Sudipto Guha, Nina Mishra, Rajeev Motwani, and Liadan O'Callaghan. Clustering data streams. In Proceedings of the IEEE Symposium on Foundations of Computer Science (FOCS), pages 359--366, 2000.


High-performance GRID Database Manager for Scientific Data - Risch, Koparanova, Thide (2002)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering Data Streams. In Proc. of the 2000.


Merging Multiple Data Streams on Common Keys over.. - Mazzucco.. (2002)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan, Clustering Data Streams, to appear.


Online Algorithms for Network Design - Meyerson (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. Proceedings of the 41st Symposium on Foundations of Computer Science, 2000.


Mining Data Streams - Jin   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. of the Annual Symp. on Foundations of Computer Science (FOCS 2000.


Ubiquitous Data Stream Mining - Gaber, Krishnaswamy, Zaslavsky   (Correct)

No context found.

Guha S., Mishra N., Motwani R., and O'Callaghan L.: Clustering data streams. In Proceedings of the Annual Symposium on Foundations of Computer Science. IEEE, November (2000).


Distributed Streams Algorithms for Sliding Windows - Gibbons, Tirthapura (2002)   (8 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. 41st IEEE Symp. on Foundations of Computer Science, pages 359--366, Nov. 2000.


Prototype--based Mining of Numeric Data Streams - Francisco Ferrer--Troyano..   (Correct)

No context found.

Sudipto Guha, Nina Mishra, Rajeev Motwani, and Liadan O'Callaghan. Clustering data streams. In IEEE Symposium on Foundations of Computer Science, pages 359--366, 2000.


An Algorithm for In-Core Frequent Itemset Mining on Streaming.. - Jin, Agrawal (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering Data Streams. In Proceedings of 2000.


Data Streams: Algorithms and Applications - Muthukrishnan (2003)   (12 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani and L. O'Callaghan. Clustering data streams. IEEE FOCS, 2000, 359--366.


One-Pass Wavelet Decompositions of Data Streams - Gilbert, Kotidis.. (2003)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan, "Clustering Data Streams," Proc. IEEE Symp. Foundations of Computer Science (FOCS), pp. 359-366, 2000.


Approximating Extent Measures of Points - Agarwal, Har-Peled, Varadarajan (2003)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. 41th Annu. IEEE Sympos. Found. Comput. Sci., pages 359-366, 2000. 28


Coresets for k-Means and k-Median Clustering and their.. - Har-Peled, Mazumdar (2003)   (Correct)

No context found.

S. Guha, R. Motwani N. Mishra, and L. O'Callaghan. Clustering data streams. In Proc. 41th Annu. IEEE Sympos. Found. Comput. Sci., pages 359-366, 2000.


Finding Longest Increasing and Common Subsequences in.. - Liben-Nowell, Vee, Zhu (2003)   (Correct)

No context found.

Sudipto Guha, Nina Mishra, Rajeev Motwani, and Liadan O'Callaghan. Clustering data streams. In Proceedings of the IEEE Symposium on Foundations of Computer Science (FOCS), pages 359--366, 2000.


Incremental Support Vector Machine Construction - Domeniconi, Gunopulos (2001)   (1 citation)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, L. O'Callaghan, "Clustering Data Stream", IEEE Foundations of Computer Science, 2000.


Finding Frequent Items in Data Streams - Charikar, Chen, Farach-Colton (2002)   (42 citations)  (Correct)

No context found.

Sudipto Guha, Nina Mishra, Rajeev Motwani, and Liadan O'Callaghan. Clustering data streams. In Proc. 41st IEEE Symposium on Foundations of Computer Science, pages 359--366, 2000.


Adaptive Mining Techniques for Data Streams Using.. - Gaber, Krishnaswamy, .. (2003)   (Correct)

No context found.

Guha S., Mishra N., Motwani R., and O'Callaghan L.: Clustering data streams. In Proceedings of the Annual Symposium on Foundations of Computer Science. IEEE, November (2000).


A Framework for Projected Clustering of High Dimensional.. - Aggarwal, Han, Wang, Yu (2004)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, L. O'Callaghan. Clustering Data Streams. IEEE FOCS Conference, 2000.


Efficient Decision Tree Construction on Streaming Data - Jin, Agrawal (2003)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering Data Streams. In Proceedings of 2000.


Clustering Binary Data Streams with K-means - Carlos Ordonez Carlos (2003)   (5 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In FOCS, 2000.


Thesis Proposal - Ruoming Jin Department   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering Data Streams. In Proceedings of 2000.


Thesis Proposal - Ruoming Jin Department   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. of the Annual Symp. on Foundations of Computer Science (FOCS 2000.


Distributed Streams Algorithms for Sliding Windows - Gibbons, Tirthapura (2002)   (8 citations)  (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proc. 41st IEEE Symp. on Foundations of Computer Science, pages 359-366, Nov. 2000.


Tracking Moving Clutches in Streaming Graphs - Chawathe (2002)   (Correct)

No context found.

S. Guha, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams. In Proceedings of the Annual Symposium on Foundations of Computer Science, November 2000.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC