Automating the Measurement of Linguistic Features to Help Classify Texts as Technical (2000)
Terry Copeck, Ken Barker, Sylvain Delisle, Stan Szpakowicz



View or download:
uqtr.uquebec.ca/~delisle/...TALN2K.pdf

Abstract: Text classification plays a central role in software systems which perform automatic information classification and retrieval. Occurrences of linguistic feature values must be counted by any mechanism that classifies or characterizes natural language text by topic, style, genre or, in our case, by the degree to which a text is technical. We discuss the methodology and key details of the feature value extraction process, paying attention to fast and reliable implementation. Our results are mixed ...

Active bibliography (related documents):
0.5:   User Assessment of a Visual Web Genre Classifier - Dimitrova, Kushmerick.. (2003)
0.2:   UMASS Approaches to Detection and Tracking at TDT2 - Ron Papka James (1999)
0.2:   A Clustering Method for Information Retrieval - Bellot, El-Bèze (1999)

Similar documents based on text:
1.5:   What is Technical Text? - Copeck, Barker, Delisle, Szpakowicz, .. (1997)
0.6:   Text Classification and Multilinguism: Getting at Words via.. - Biskri, Delisle (2002)
0.6:   Realistic Parsing: Practical Solutions of Difficult Problems - Delisle, Szpakowicz (1995)

BibTeX entry:

@misc{ copeck-automating,
  author = "Terry Copeck and Ken Barker and Sylvain Delisle and Stan Szpakowicz",
  title = "Automating the Measurement of Linguistic Features to Help Classify Texts
    as Technical",
  url = "citeseer.ist.psu.edu/copeck00automating.html" }

Citations (may not include all citations):
265   A Simple Rule-based Part of Speech Tagger - BRILL - 1992
181   WordNet: An On-line Lexical Database - MILLER - 1990
174   Nonparametric Statistics for the Behavioral Sciences - SIEGEL, CASTELLAN - 1988
149   An Evaluation of Statistical Approaches to Text Categorizati.. - YANG - 1999
110   Context-sensitive Learning Methods for Text Categorization - COHEN - 1999
97   Assessing Agreement on Classification Tasks: The Kappa Stati.. - CARLETTA - 1996
82   Bayesian Analysis of Binary and Polychotomous Response Data - ALBERT, CHIB - 1993
67   Frequency Analysis of English Usage - FRANCIS, KUCERA - 1982
45   Variation Across Speech and Writing - BIBER - 1988
43   Combining Classifiers in Text Categorization - LARKEY, CROFT - 1996
41   Automated Learning of Decision Rules for Text Categorization - APTÉ, DAMERAU et al. - 1994
26   Recognizing Text Genres with Simple Metrics Using Discrimina.. - KARLGREN, CUTTING - 1994
18   Text Classification Using WordNet Hypernyms - SCOTT, MATWIN - 1998
9   An Analysis of Statistical Term Strength and its Use in the .. - WILBUR - 1996
9   Using Syntactic Information in Document Filtering: A Compara.. - CHANDRASEKAR, SRINIVAS - 1997
8   Exploiting Thesaurus Knowledge in Rule Induction for Text Cl.. - JUNKER, ABECKER - 1997
5   Document Classification Using Multiword Features - PAPKA, ALLAN - 1998
2   What is Technical Text - COPECK, BARKER et al. - 1997
1   More Alike than not---An Analysis of Word Frequencies in Fou.. - COPECK, BARKER et al. - 1999
1   Review of Dimensions of Register Variation: A Cross-Linguist.. - KILGARRIFF - 1995
1   Text Typology and Translation - TROSBORG - 1999
1   Text Filtering in MUC-3 and MUC-4 - LEWIS, TONG - 1992
1   Preliminary Recommendations on Text Typology - SINCLAIR, BALL - 1996
1   A Surface-Based Approach to Identifying Discourse Markers an.. - MARCU - 1998
1   Le document technique : unicité et pluralité - FROISSART, LALLICH-BOIDIN - 1999
1   Transductive Inference for Text Classification Using Support .. - JOACHIMS - 1999
1   Text Linguistic Models for the Study of Simultaneous Interpr.. - NISKE - 1998

Documents on the same site (http://www.uqtr.uquebec.ca/~delisle/Recherche/publications.html):
From Text to Horn Clauses: Combining Linguistic.. - Delisle, Barker.. (1994)
Extraction of Predicate-Argument Structures from Texts - Delisle, Szpakowicz (1997)
More Alike Than Not - An Analysis Of WORD.. - COPECK, BARKER.. (1999)
