Abstract: The goal of Topic Detection and Tracking (TDT) is to develop automatic
methods of identifying topically related stories within a stream
of news media. We describe approaches for both detection and tracking
based on the well-known idf-weighted cosine coefficient similarity
metric. The surprising outcome of this research is that we
achieved very competitive results for tracking using a very simple
method of feature selection, without word stemming and without a
score normalization scheme. The...
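To make the similarity metric named in the abstract concrete, here is a minimal Python sketch of one common formulation of an idf-weighted cosine coefficient. It is illustrative only: the abstract does not give the exact weighting, so the idf definition (log of N over document frequency), the raw term-frequency weighting, and the function names `idf_weights` and `idf_cosine` are all assumptions, not the authors' implementation. In line with the abstract, the sketch applies no stemming and no score normalization.

```python
import math
from collections import Counter


def idf_weights(corpus):
    """Assumed idf definition: idf(w) = log(N / df(w)) over tokenized documents."""
    n_docs = len(corpus)
    doc_freq = Counter()
    for tokens in corpus:
        doc_freq.update(set(tokens))  # count each word once per document
    return {word: math.log(n_docs / df) for word, df in doc_freq.items()}


def idf_cosine(tokens_a, tokens_b, idf):
    """Cosine of the angle between two tf * idf weighted term vectors."""
    vec_a = {w: tf * idf.get(w, 0.0) for w, tf in Counter(tokens_a).items()}
    vec_b = {w: tf * idf.get(w, 0.0) for w, tf in Counter(tokens_b).items()}
    dot = sum(weight * vec_b.get(w, 0.0) for w, weight in vec_a.items())
    norm_a = math.sqrt(sum(v * v for v in vec_a.values()))
    norm_b = math.sqrt(sum(v * v for v in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a story with no weighted terms matches nothing
    return dot / (norm_a * norm_b)


# Toy stories (made up for illustration, not taken from the TDT corpora).
corpus = [
    ["earthquake", "hits", "city"],
    ["earthquake", "damages", "city"],
    ["election", "results", "announced"],
]
idf = idf_weights(corpus)
same_topic = idf_cosine(corpus[0], corpus[1], idf)
other_topic = idf_cosine(corpus[0], corpus[2], idf)
print(same_topic > other_topic)
```

In a tracking setting, a score like this would be compared against a decision threshold to accept or reject each incoming story. How that threshold is chosen is outside the scope of this sketch.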
Context of citations to this paper:
...Here a small number of on-topic training stories were used to identify stories of the same topic from a stream of news media. In [1] we showed that a simple method of feature selection together with the well-known idf-weighted cosine coefficient performed as well or better...
.... and development test portion of the TDT2 corpus to determine a threshold for English and one for Mandarin using the method described in [8] this time optimizing only the topic weighted cost. 2.8 Word Segmentation in Mandarin Another aspect of the translingual task, this...
Cited by:
Using Information Retrieval Methods For Language Model.. - Chen, Gauvain, Lamel.. (2001)
Language Model Adaptation for Broadcast News Transcription - Chen, Gauvain, Lamel.. (2001)
On-line New Event Detection and Tracking in a Multi-Resource.. - Kurt (2001)
Similar documents (at the sentence level):
25.0%: Towards a "Universal Dictionary" for Multi-Language.. - Schultz, Liberman (2000)
Active bibliography (related documents):
0.8: Cross-Lingual Topic Tracking using idf-Weighted Cosine.. - Michael Schultz Mark
0.0: WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)
0.0: Mining of Concurrent Text and Time Series - Lavrenko, Schmill, Lawrie..
Similar documents based on text:
0.2: Evaluating Lexicon Coverage for Cross-Language Information.. - Levow, Oard (1999)
0.1: HDL Code Restructuring Using Timed Decision Tables - Li, Gupta
0.1: Large, Multilingual, Broadcast News Corpora For.. - Cieri, Graff.. (2000)
Related documents from co-citation:
4: The DET curve in assessment of detection task performance - Martin, Doddington et al. - 1997
3: Topic Detection in Broadcast news - Walls, Jin et al. - 1999
3: An evaluation of statistical approaches to text categorization - Yang - 1999
BibTeX entry:
J.M. Schultz and M. Liberman, "Topic Detection and Tracking using idf-Weighted Cosine Coefficient," Proceedings of the DARPA Broadcast News Workshop, 189-192, 1999. http://citeseer.ist.psu.edu/schultz99topic.html
@misc{ schultz99topic,
author = "J. Schultz and M. Liberman",
title = "Topic Detection and Tracking using idf-Weighted Cosine Coefficient",
text = "J.M. Schultz and M. Liberman, Topic Detection and Tracking using idf-Weighted
Cosine Coefficient, Proceedings of the DARPA Broadcast News Workshop, 189-192,
1999.",
year = "1999",
url = "citeseer.ist.psu.edu/schultz99topic.html" }
Citations (may not include all citations):
1256: Introduction to Modern Information Retrieval - Salton, McGill - 1983
201: Cluster Analysis - Everitt - 1993
90: The DET Curve in Assessment of Detection Task Performance - Martin, Doddington et al. - 1997
3: The TDT Pilot Study Corpus Documentation - Doddington - 1997
2: The Topic Detection and Tracking Phase 2 (TDT2) Evaluation P.. - Doddington - 1998
Documents on the same site (http://www.ldc.upenn.edu/jms/):
Cross-Lingual Topic Tracking using idf-Weighted Cosine.. - Michael Schultz Mark
Towards a "Universal Dictionary" for Multi-Language.. - Schultz, Liberman (2000)