See this document in CiteSeerX!

Bursty and Hierarchical Structure in Streams (2002)  (Make Corrections)  (19 citations)
Jon Kleinberg



  Home/Search   Context   Related

Links:   ACM   DBLP

 
View or download:
cornell.edu/home/kleinber/bhs.ps


From:  cornell.edu/home/kleinber/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the... (Update)

Cited by:   More
Temporal Dynamics of On-Line Information - Streams Jon Kleinberg   (Correct)
Data Association for Topic Intensity Tracking - Krause, Leskovec, Guestrin (2006)   (Correct)
The Lowlands' TREC Experiments 2005 - Henning Rode Georgina   (Correct)

Similar documents (at the sentence level):
71.4%:   Bursty and Hierarchical Structure in Streams - Kleinberg (2002)   (Correct)

Active bibliography (related documents):   More   All
0.6:   Discovery of implicit and explicit connections between .. - Robert McArthur.. (2003)   (Correct)
0.4:   Parameter Free Bursty Events Detection in Text Streams - Fung, Yu, Yu, Lu (2005)   (Correct)
0.3:   Automatic Hierarchical E-Mail Classification Using Association.. - Itskevitch (2001)   (Correct)

Similar documents based on text:   More   All
0.1:   Text Mining: The state of the art and the challenges - Tan (1999)   (Correct)
0.1:   Text Mining: Promises And Challenges - Tan   (Correct)
0.1:   Analysis of One-Way Reservation Algorithms - Cidon, Rom, Shavitt (1996)   (Correct)

Related documents from co-citation:   More   All
7:   Models and Issues in Data Stream Systems (context) - Babcock, Babu et al. - 2002
5:   Monitoring Streams - A New Class of Data Management Applications (context) - CARNEY, CETINTEMEL et al. - 2002
5:   Mining time-changing data streams - Hulten, Spencer et al. - 2001

BibTeX entry:   (Update)

J. Kleinberg. Bursty and hierarchical structure in streams. In the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,July 23 - 26, 2002. http://citeseer.ist.psu.edu/article/kleinberg02bursty.html   More

@misc{ rg-bursty,
  author = "Jon Kleinberg",
  title = "Bursty and Hierarchical Structure in Streams",
  url = "citeseer.ist.psu.edu/article/kleinberg02bursty.html" }
Citations (may not include all citations):
1362   A tutorial on hidden Markov models and selected applications.. (context) - Rabiner - 1989  ACM
404   Agents that reduce work and information overload (context) - Maes - 1994  ACM   DBLP
340   Mining sequential patterns - Agrawal, Srikant - 1995  ACM   DBLP
286   Nearest neighbor pattern classification (context) - Cover, Hart - 1967
189   Discovering frequent episodes in sequences (context) - Mannila, Toivonen et al. - 1995  DBLP
116   Topic Detection and Tracking Pilot Study: Final Report - Allan, Carbonell et al. - 1998
107   Principles of Data Mining (context) - Hand, Mannila et al. - 2001  ACM   DBLP
76   A Bayesian approach to filtering junk email - Sahami, Dumais et al. - 1998
71   Principles of Mixed-Initiative User Interfaces - Horvitz - 1999  ACM   DBLP
67   Attention, intentions, and the structure of discourse - Grosz, Sidner - 1986  ACM   DBLP
66   Learning rules that classify e-mail - Cohen - 1996
63   Threading electronic mail: A preliminary study - Lewis, Knowles - 1997
60   Statistical Models for Text Segmentation - Beeferman, Berger et al. - 1999  ACM   DBLP
53   The Analysis of Time Series: An Introduction (context) - Chatfield - 1996
49   A probabilistic approach to fast pattern matching in time se.. - Keogh, Smyth - 1997  DBLP
44   E-mail overload: Exploring personal information management o.. (context) - Whittaker, Sidner - 1996
38   A Study on Retrospective and On-line Event Detection - Yang, Pierce et al. - 1998
37   line new event detection and tracking - Allan, Papka et al. - 1998
31   Interface agents that learn: An investigation of learning is.. - Payne, Edwards - 1997
27   Event detection from time series data (context) - Guralnik, Srivastava - 1999
23   MailCat: An intelligent assistant for organizing e-mail - Segal, Kephart - 1999
23   Mining Segment-Wise Periodic Patterns in Time-Related Databa.. - Han, Gong et al. - 1998  DBLP
21   A rule-based message filtering system (context) - Pollock - 1988
17   Automatic generation of overview timelines (context) - Swan, Allan - 2000
16   Improving text categorization methods for event tracking - Yang, Ault et al. - 2000  ACM   DBLP
15   ifile: An application of machine learning to e-mail filterin.. - Rennie - 2000
14   Extracting significant time-varying features from text - Swan, Allan - 1999
14   Mining of Concurrent Text and Time-Series - Lavrenko, Schmill et al.
14   Story and Discourse: Narrative Structure in Fiction and Film (context) - Chatman
12   Knowledge Discovery in Time Series Databases - Last, Klein et al. - 2001
11   Point estimation of the parameters of piecewise regression m.. (context) - Hawkins - 1976
10   Visualizing sequential patterns for text mining - Wong, Cowley et al. - 2000
10   Incremental Learning in SwiftFile - Segal, Kephart - 2000  ACM   DBLP
10   Structural Processing of Waveforms as Trees (context) - Shaw, DeFigueiredo - 1990
10   TimeMines: Constructing Timelines with Statistical Models of.. - Swan, Jensen
8   Ishmail: Immediate identification of important information - Helfman, Isbell - 1995
8   Topic Islands: A Wavelet-Based Text Visualization System - Miller, Wong et al. - 1998
8   Data mining for unusual movements in temporal data (context) - Martin, Yohai - 2001
7   Fitting segmented curves whose join points have to be estima.. (context) - Hudson - 1966
7   Representation of Random Waveforms by Relational Trees (context) - Ehrich, Foith - 1976  DBLP
6   Narrative Discourse: An Essay in Method (context) - Genette - 1980
6   Narrative Discourse Revisited (context) - Genette
6   Collection-Based Persistent Digital Archives -- Part 2 (context) - Moore, Baru et al. - 2000
6   Concept features in Re:Agent, an intelligent e-mail agent - Boone - 1998
6   ThemeRiver: Visualizing Theme Changes over Time (context) - Havre, Hetzler et al. - 2000  DBLP
6   Notes on e#ective bandwidths (context) - Kelly - 1996
6   Aspects of the Novel (context) - Forster - 1927
5   E-mail and potential loss to future archives and scholarship.. (context) - Lukesh - 1999
5   AlterEgo e-mail filtering agent (context) - Redmond, Adelson - 1998
5   E-mail: The good, the bad, and the ugly (context) - Berghel - 1997
5   Enterprise Integration Technologies (context) - Gruber
5   Facing Flood of E-Mail, Archives Seeks Help From Supercomput.. (context) - Olsen - 1999
5   White House E-mail (context) - Blanton - 1995  ACM
5   Unsupervised identification of sequential patterns under a M.. (context) - Chudova, Smyth - 2001
5   Mail-by-Example: A visual query interface for managing large.. (context) - Becker, Cardoso - 2000
5   Applied Cryptography Wiley (context) - Schneier - 1996
5   mail System (context) - Birrell, Perl et al. - 1997
5   EmVis -- A Visual e-Mail Analysis Tool (context) - Heckel, Hamann - 1997
3   Finding a Happy Medium: Explaining the Negative E#ects of El.. (context) - Markus - 1994
3   one, M. Grace-Martin, H. Hembrooke, "The e#ect of wireless c.. (context) - Gay, Stefan
3   s' Memorandum in Support of Proposed Final Judgment (context) - Klein - 2000



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.cornell.edu/home/kleinber/):   More
On Invariants of Sets of Points or Line Segments Under.. - Huttenlocher, Kleinberg   (Correct)
Allocating Bandwidth for Bursty Connections - Kleinberg, Rabani, Tardos (1997)   (Correct)
The Localization Problem for Mobile Robots - Kleinberg (1994)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC