
Integrating Lexical Knowledge in Learning-Based Text Categorization
Jose María Gomez Hidalgo, Manuel de Buenaga Rodríguez, Luis Alfonso Urena Lopez, María Teresa Martín Valdivia, Manuel García Vega



View or download:
esi.uem.es/~jmgomez/pap...report01b.pdf

From: esi.uem.es/~jmgomez/papers/


Abstract: Automatic Text Categorization (ATC) is an important task in the field of Information Access. The prevailing approach to ATC uses a collection of prelabeled texts to induce a document classifier through learning methods. With the increasing availability of lexical resources in electronic form (including Lexical Databases (LDBs), Machine Readable Dictionaries, etc.), there is an...

Footnote: This work has been partially supported by the Spanish Ministry of Science and Technology...


BibTeX entry:

@misc{ hidalgo-integrating,
  author = "Jose María Gomez Hidalgo and Manuel de Buenaga Rodríguez
    and Luis Alfonso Urena Lopez and María Teresa Martín Valdivia
    and Manuel García Vega",
  title = "Integrating Lexical Knowledge in Learning-Based Text Categorization",
  url = "citeseer.ist.psu.edu/596915.html" }
