| P. Pedersen and R. Bruce. Distinguishing word senses in untagged text. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997. |
....and, to date, annotated text collections are only available for a small number of selected words with predefined senses. The lack of resources has led several researchers to explore the use of unannotated, raw corpora to perform unsupervised learning. One example is the work of Pedersen and Bruce [78] who compared three different unsupervised learning algorithms on 13 different words. Each algorithm has access to the number of senses for each word and split the instances of each word into the appropriate number of clusters. These clusters were then mapped onto the closest sense from the ....
T. Pedersen and R. Bruce. Distinguishing word senses in untagged text. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997.
No context found.
Pedersen, Ted, and Rebecca Bruce. Distinguishing word senses in untagged text. In Proceedings of the 2nd Conference on Empirical Methods in NLP (EMNLP-2), Providence, August 1997.
No context found.
Pedersen, Ted, and Rebecca Bruce. Distinguishing word senses in untagged text. In Proceedings of the 2nd Conference on Empirical Methods in NLP (EMNLP-2), Providence, August 1997.
No context found.
Pedersen, T. & Bruce, R. (1997). Distinguishing Word Senses in Untagged Text. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing (EMNLP-97), August 1997.
....probable latent category given the tags assigned by the judges. The EM algorithm takes as input the number of latent categories hypothesized, i.e. the number of values of L, and produces estimates of the parameters. For a description of this process, see Goodman (1974) Dawid Skene (1979) or Pedersen Bruce (1997). Three different versions of the latent class model, each specifying a different number of latent categories, are considered: the two category, the three category and the four category latent class models. The models are applied to all three data configurations. In all cases, the model contains ....
Pedersen, T. & Bruce, R. (1997). Distinguishing Word Senses in Untagged Text. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing (EMNLP-97), pp. 197--207.
....words. This result demonstrates the effectiveness of a small number of representative collocations as seeds in an iterative bootstrapping approach. A comparison of the EM algorithm and two agglomerative clustering algorithms as applied to unsupervised word sense disambiguation is discussed in (Pedersen Bruce 1997). Using the same data used in this study, Pedersen Bruce 1997) found that McQuitty s agglomerative algorithm is significantly more accurate for adjectives and verbs while the EM algorithm is significantly more accurate for nouns. These results indicate that McQuitty s analysis, which is based ....
....number of representative collocations as seeds in an iterative bootstrapping approach. A comparison of the EM algorithm and two agglomerative clustering algorithms as applied to unsupervised word sense disambiguation is discussed in (Pedersen Bruce 1997) Using the same data used in this study, (Pedersen Bruce 1997) found that McQuitty s agglomerative algorithm is significantly more accurate for adjectives and verbs while the EM algorithm is significantly more accurate for nouns. These results indicate that McQuitty s analysis, which is based on counts of dissimilar features, is most appropriate for highly ....
Pedersen, T., and Bruce, R. 1997. Distinguishing word senses in untagged text. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 197--207.
No context found.
Pedersen, T., and Bruce, R. 1997a. Distinguishing word senses in untagged text. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 197--207.
No context found.
P. Pedersen and R. Bruce. Distinguishing word senses in untagged text. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997.
No context found.
Pedersen, T. and Bruce, R. (1997) Distinguishing Word Senses in Untagged Text, Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, pp. 197-207, Providence, RI.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC