(Enter summary)
Abstract: We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the Agglomerative IB algorithm, the new sequential (sIB) approach is guaranteed to converge to a local maximum of the information with time and space complexity linear in the data size. We apply this algorithm to unsupervised document classification. In all our evaluation on small and medium size corpra, the sIB is found to be consistently superior to all the other... (Update)
Similar documents based on text: More All
2.0: Unsupervised Document Classification Using Sequential.. - Slonim, Friedman, Tishby (2002)
(Correct)
0.7: Agglomerative Multivariate Information Bottleneck - Slonim, Friedman, Tishby (2001)
(Correct)
0.5: Document Clustering using Word Clusters via the Information.. - Slonim, Tishby (2000)
(Correct)
Related documents from co-citation: More All
11: The information bottleneck method
- Tishby, Pereira et al. - 1999
8: Multivariate information bottleneck
- Friedman, Mosenzon et al. - 2001
7: Elements of Information Theory (context) - Cover, Thomas - 1991
BibTeX entry: (Update)
N. Slonim, N. Friedman, and N. Tishby. Unsupervised document classification using sequential information maximization. In Proceeding of SIGIR'02, 25th ACM intermational Conference on Research and Development of Information Retireval, Tampere, Finland, 2002. ACM Press, New York, USA. http://citeseer.ist.psu.edu/article/slonim02unsupervised.html More
@misc{ slonim02unsupervised,
author = "N. Slonim and N. Friedman and N. Tishby",
title = "Unsupervised document classification using sequential information maximization",
text = "N. Slonim, N. Friedman, and N. Tishby. Unsupervised document classification
using sequential information maximization. In Proceeding of SIGIR'02, 25th
ACM intermational Conference on Research and Development of Information
Retireval, Tampere, Finland, 2002. ACM Press, New York, USA.",
year = "2002",
url = "citeseer.ist.psu.edu/article/slonim02unsupervised.html" }
Citations (may not include all citations):
2319
Elements of Information Theory (context) - Cover, Thomas - 1991
416
Information Retrieval
- van Rijsbergen - 1979
168
Distributional clustering of English words
- Pereira, Tishby et al. - 1993
106
The Information Bottleneck method
- Tishby, Pereira et al. - 1999
81
Developments in Automatic Text Retrieval (context) - Salton - 1990
79
Web Document Clustering: A Feasibility Demonstration
- Zamir, Etzioni - 1998
72
Bow: A toolkit for statistical language modeling (context) - Andrew - 1996
65
Divergence Measures Based on the Shannon Entropy (context) - Lin - 1991
56
Reexamining Cluster Hypothesi ScatterGather Retrieval Result
- Pedersen, Cluster et al. - 1996
49
Agglomerative Information Bottleneck
- Slonim, Tishby - 1999
40
Document Clustering using Word Clusters via the Information ..
- Slonim, Tishby - 2000
15
On Feature Distributional Clustering for Text Categorization
- Bekkerman, El-Yaniv et al. - 2001
10
Agnostic classi cation of Markovian sequences (context) - El-Yaniv, Fine et al. - 1997
10
The power of word clusters for text classi cation (context) - Slonim, Tishby - 2001
6
Iterative Double Clustering for Unsupervised and Semi-superv.. (context) - El-Yaniv, Souroujon - 2002
4
Learning to lter netnews (context) - Lang - 1995
3
Adaptive Cluster-based Browsing Using Incrementally Expanded.. (context) - Eguchi - 1999
1
Multivariate Agglomerative Information Bottleneck (context) - Slonim, Friedman et al. - 2002
1
the two-sample problem and the Jensen-Shannon divergence for.. (context) - Schriebman, El-Yaniv et al.
http://www.research.att.com/lewis
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://leibniz.cs.huji.ac.il/research/search.php?year=2002): More
New View Generation With a Bi-Centric Camera - Weinshall, Lee, Brodsky.. (2002)
(Correct)
The Impact of InfoCenters on E-Marketplaces - Yarom, Goldman (2002)
(Correct)
Hierarchical Bandwidth Sharing made Simple - Anker, Bergman, Dolev, Gelbourt (2002)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC