| G. Escudero, L. M`arquez, and G. Rigau. On the Portability and Tuning of Supervised Word Sense Disambiguation Systems. Research Report LSI-00-30-R, Software Department (LSI). Technical University of Catalonia (UPC), Barcelona, Catalonia, 2000. |
....learning. They have applied it successfully to a broad spectrum of natural language disambiguation tasks, including context sensitive spelling correction [89] PoS tagging [189] PP attachment disambiguation [107] shallow parsing [130] text categorization [62] and word sense disambiguation [74], achieving state of the art accuracies and surpassing several alternative algorithms. Other methods based on linear separators have been applied to the text categorization task. Cohen and Singer [51] presented Experts (based on the weighted majority algorithm [124] Lewis et al. 118, 64] ....
....[196] and text filtering [198] where an adapted version of the popular AdaBoost algorithm [82, 83, 197] is presented for information retrieval tasks. Other applications of AdaBoost variants to NLP tasks include: PoS tagging [2] PP attachment disambiguation [2] word sense disambiguation [72, 74], and full parsing [93] 13] is another relevant example of the combination of classifiers applied to information retrieval, in which the combination of classifiers allows the use of a big set of unlabelled examples (semi supervised approach) to iteratively improve the classification accuracy in ....
[Article contains additional citation context not shown here]
G. Escudero, L. M`arquez, and G. Rigau. On the Portability and Tuning of Supervised Word Sense Disambiguation Systems. Research Report LSI-00-30-R, Software Department (LSI). Technical University of Catalonia (UPC), Barcelona, Catalonia, 2000.
....handtagged data, the precision is limited to around 70 when tagging all words in a running text. In the course of extending available data, the efforts to use corpora tagged by independent teams of researchers have been shown to fail (Ng et al. 1999) as have failed some tuning experiments (Escudero et al. 2000), and an attempt to use examples automatically acquired from the Internet (Agirre Martinez, 2000) All these studies obviated the fact that the examples come from different genre and topics. Future work that takes into account the conclusions drawn in this paper will perhaps be able to ....
....when training and testing on the same corpus. No significance tests could be found for our comparison, as training and test sets differ. Because of the large amount of experiments involved, we focused on 21 verbs and nouns (cf. Table 2) selected from previous works (Agirre Martinez, 2000; Escudero et al. 2000). 3 Collocations considered For the sake of this work we take a broad definition of collocations, which were classified in three subsets: local content word collocations, local part of speech and function word collocations, and global content word collocations. If a more strict linguistic ....
[Article contains additional citation context not shown here]
Escudero, G. , L. Mrquez and G. Rigau. On the Portability and Tuning of Supervised Word Sense Disambiguation Systems. In Proceedings of the Joint Sigdat Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Hong Kong. 2000.
.... Many standard ML algorithms for supervised learning have been applied, such as: Decision Lists (Yarowsky, 1994; Agirre and Martinez, 2000) Neural Networks (Towell and Voorhees, 1998) Bayesian learning (Bruce and Wiebe, 1999) Exemplar Based learning (Ng, 1997a; Fujii et al. 1998) Boosting (Escudero et al. 2000a) etc. Unfortunately, there have been very few direct comparisons between alternative methods for WSD. In general, supervised learning presumes that the training examples are somehow re ective of the task that will be performed by the trainee on other data. Consequently, the performance of ....
....example. In the experiments explained in section 4, the EB algorithm is run several times using di erent number of nearest neighbours (1, 3, 3 Although the use of MVDM metric (Cost and Salzberg, 1993) could lead to better results, current implementations have prohivitive computational overheads (Escudero et al. 2000b) 5, 7, 10, 15, 20 and 25) and the results corresponding to the best choice are reported 4 . Exemplar based learning is said to be the best option for WSD (Ng, 1997a) Other authors (Daelemans et al. 1999) point out that exemplar based methods tend to be superior in language learning ....
[Article contains additional citation context not shown here]
G. Escudero, L. Marquez, and G. Rigau. 2000c. On the Portability and Tuning of Supervised Word Sense Disambiguation Systems. Research Report LSI-00-30-R, Software Department (LSI). Technical University of Catalonia (UPC).
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC