
Topic Detection and Tracking using idf-Weighted Cosine Coefficient (1999)  (7 citations)
J. Michael Schultz, Mark Liberman University of Pennsylvania Philadelphia,...



View or download:
upenn.edu/jms/bnews.ps
nist.gov/speech/publication...tdt310.ps
nist.gov/speech/publicatio...tdt310.pdf

Abstract: The goal of Topic Detection and Tracking (TDT) is to develop automatic methods of identifying topically related stories within a stream of news media. We describe approaches for both detection and tracking based on the well-known idf-weighted cosine coefficient similarity metric. The surprising outcome of this research is that we achieved very competitive results for tracking using a very simple method of feature selection, without word stemming and without a score normalization scheme. The...
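
For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of one common formulation of an idf-weighted cosine coefficient. It is not taken from the paper: this record does not give the authors' exact weighting, so the idf formula log(N / df), the choice to apply idf to both vectors, and the function names idf_weights and idf_cosine are all assumptions made for illustration. Stories are plain lists of tokens, with no stemming.

```python
import math
from collections import Counter


def idf_weights(corpus):
    """Map each term to log(N / df); this idf variant is an assumption."""
    n_docs = len(corpus)
    # Document frequency: number of stories in which each term appears.
    df = Counter(term for doc in corpus for term in set(doc))
    return {term: math.log(n_docs / count) for term, count in df.items()}


def idf_cosine(doc_a, doc_b, idf):
    """Cosine of two term-frequency vectors, each scaled by idf."""
    # Terms missing from the idf table get weight 0 (an assumption).
    vec_a = {t: tf * idf.get(t, 0.0) for t, tf in Counter(doc_a).items()}
    vec_b = {t: tf * idf.get(t, 0.0) for t, tf in Counter(doc_b).items()}
    dot = sum(w * vec_b.get(t, 0.0) for t, w in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

In a tracking setting, a score like this would be compared against a threshold to decide whether an incoming story is on topic; the threshold value itself would have to be tuned on held-out data.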

Context of citations to this paper:

...Here a small number of on-topic training stories were used to identify stories of the same topic from a stream of news media. In [1] we showed that a simple method of feature selection together with the well-known idf-weighted cosine coefficient performed as well or better...

...and development test portion of the TDT2 corpus to determine a threshold for English and one for Mandarin using the method described in [8], this time optimizing only the topic-weighted cost. 2.8 Word Segmentation in Mandarin Another aspect of the translingual task, this...

Cited by:
Using Information Retrieval Methods For Language Model.. - Chen, Gauvain, Lamel.. (2001)
Language Model Adaptation for Broadcast News Transcription - Chen, Gauvain, Lamel.. (2001)
On-line New Event Detection and Tracking in a Multi-Resource.. - Kurt (2001)

Similar documents (at the sentence level):
25.0%:   Towards a "Universal Dictionary" for Multi-Language.. - Schultz, Liberman (2000)

Active bibliography (related documents):
0.8:   Cross-Lingual Topic Tracking using idf-Weighted Cosine.. - Michael Schultz Mark
0.0:   WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)
0.0:   Mining of Concurrent Text and Time Series - Lavrenko, Schmill, Lawrie..

Similar documents based on text:
0.2:   Evaluating Lexicon Coverage for Cross-Language Information.. - Levow, Oard (1999)
0.1:   HDL Code Restructuring Using Timed Decision Tables - Li, Gupta
0.1:   Large, Multilingual, Broadcast News Corpora For.. - Cieri, Graff.. (2000)

Related documents from co-citation:
4:   The DET curve in assessment of detection task performance - Martin, Doddington et al. - 1997
3:   Topic Detection in Broadcast news - Walls, Jin et al. - 1999
3:   An evaluation of statistical approaches to text categorization - Yang - 1999

BibTeX entry:

J.M. Schultz and M. Liberman, "Topic Detection and Tracking using idf-Weighted Cosine Coefficient," Proceedings of the DARPA Broadcast News Workshop, 189-192, 1999. http://citeseer.ist.psu.edu/schultz99topic.html

@misc{ schultz99topic,
  author = "J. Schultz and M. Liberman",
  title = "Topic Detection and Tracking using idf-Weighted Cosine Coefficient",
  text = "J.M. Schultz and M. Liberman, Topic Detection and Tracking using idf-Weighted
    Cosine Coefficient, Proceedings of the DARPA Broadcast News Workshop, 189-192,
    1999.",
  year = "1999",
  url = "citeseer.ist.psu.edu/schultz99topic.html" }

Citations (may not include all citations):
1256   Introduction to Modern Information Retrieval - Salton, McGill - 1983
201   Cluster Analysis - Everitt - 1993
90   The DET Curve in Assessment of Detection Task Performance - Martin, Doddington et al. - 1997
3   The TDT Pilot Study Corpus Documentation - Doddington - 1997
2   The Topic Detection and Tracking Phase 2 (TDT2) Evaluation P.. - Doddington - 1998





Documents on the same site (http://www.ldc.upenn.edu/jms/):
Cross-Lingual Topic Tracking using idf-Weighted Cosine.. - Michael Schultz Mark
Towards a "Universal Dictionary" for Multi-Language.. - Schultz, Liberman (2000)


CiteSeer.IST - Copyright Penn State and NEC