See this document in CiteSeerX!

Unsupervised Document Classification Using Sequential Information Maximization (2002)  (Make Corrections)  (17 citations)
Noam Slonim, Nir Friedman, Naftali Tishby



  Home/Search   Context   Related

 
View or download:
leibniz.cs.huji.ac...2_submission.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  leibniz.cs.huji...h.php?year=2002 (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the Agglomerative IB algorithm, the new sequential (sIB) approach is guaranteed to converge to a local maximum of the information with time and space complexity linear in the data size. We apply this algorithm to unsupervised document classification. In all our evaluation on small and medium size corpra, the sIB is found to be consistently superior to all the other... (Update)

Similar documents based on text:   More   All
2.0:   Unsupervised Document Classification Using Sequential.. - Slonim, Friedman, Tishby (2002)   (Correct)
0.7:   Agglomerative Multivariate Information Bottleneck - Slonim, Friedman, Tishby (2001)   (Correct)
0.5:   Document Clustering using Word Clusters via the Information.. - Slonim, Tishby (2000)   (Correct)

Related documents from co-citation:   More   All
11:   The information bottleneck method - Tishby, Pereira et al. - 1999
8:   Multivariate information bottleneck - Friedman, Mosenzon et al. - 2001
7:   Elements of Information Theory (context) - Cover, Thomas - 1991

BibTeX entry:   (Update)

N. Slonim, N. Friedman, and N. Tishby. Unsupervised document classification using sequential information maximization. In Proceeding of SIGIR'02, 25th ACM intermational Conference on Research and Development of Information Retireval, Tampere, Finland, 2002. ACM Press, New York, USA. http://citeseer.ist.psu.edu/article/slonim02unsupervised.html   More

@misc{ slonim02unsupervised,
  author = "N. Slonim and N. Friedman and N. Tishby",
  title = "Unsupervised document classification using sequential information maximization",
  text = "N. Slonim, N. Friedman, and N. Tishby. Unsupervised document classification
    using sequential information maximization. In Proceeding of SIGIR'02, 25th
    ACM intermational Conference on Research and Development of Information
    Retireval, Tampere, Finland, 2002. ACM Press, New York, USA.",
  year = "2002",
  url = "citeseer.ist.psu.edu/article/slonim02unsupervised.html" }
Citations (may not include all citations):
2319   Elements of Information Theory (context) - Cover, Thomas - 1991
416   Information Retrieval - van Rijsbergen - 1979
168   Distributional clustering of English words - Pereira, Tishby et al. - 1993
106   The Information Bottleneck method - Tishby, Pereira et al. - 1999
81   Developments in Automatic Text Retrieval (context) - Salton - 1990
79   Web Document Clustering: A Feasibility Demonstration - Zamir, Etzioni - 1998
72   Bow: A toolkit for statistical language modeling (context) - Andrew - 1996
65   Divergence Measures Based on the Shannon Entropy (context) - Lin - 1991
56   Reexamining Cluster Hypothesi ScatterGather Retrieval Result - Pedersen, Cluster et al. - 1996
49   Agglomerative Information Bottleneck - Slonim, Tishby - 1999
40   Document Clustering using Word Clusters via the Information .. - Slonim, Tishby - 2000
15   On Feature Distributional Clustering for Text Categorization - Bekkerman, El-Yaniv et al. - 2001
10   Agnostic classi cation of Markovian sequences (context) - El-Yaniv, Fine et al. - 1997
10   The power of word clusters for text classi cation (context) - Slonim, Tishby - 2001
6   Iterative Double Clustering for Unsupervised and Semi-superv.. (context) - El-Yaniv, Souroujon - 2002
4   Learning to lter netnews (context) - Lang - 1995
3   Adaptive Cluster-based Browsing Using Incrementally Expanded.. (context) - Eguchi - 1999
1   Multivariate Agglomerative Information Bottleneck (context) - Slonim, Friedman et al. - 2002
1   the two-sample problem and the Jensen-Shannon divergence for.. (context) - Schriebman, El-Yaniv et al.
http://www.research.att.com/lewis



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://leibniz.cs.huji.ac.il/research/search.php?year=2002):   More
New View Generation With a Bi-Centric Camera - Weinshall, Lee, Brodsky.. (2002)   (Correct)
The Impact of InfoCenters on E-Marketplaces - Yarom, Goldman (2002)   (Correct)
Hierarchical Bandwidth Sharing made Simple - Anker, Bergman, Dolev, Gelbourt (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC