Fisher, D. and Riloff, E. "Applying Statistical Techniques to Small Corpora: Benefiting from a Limited Domain." Proceedings of the AAAI Fall Symposium on Probabilistic Approaches to Natural Language, 1992.
....are correctly assigned. Be that as it may, the performance is still unsatisfying, especially compared to Yarowsky's [10] average of 92%. One of the problems with this algorithm is that it requires a large training corpus from which to collect the association frequencies. Although I have shown in [2] that a corpus as small as 500,000 words can be used for a similar frequency-based technique, that was in the context of a limited domain. The AP corpus used here totals 1.7 million words, which is small compared to the 10 million words used by Yarowsky [10] and the 8.7 million words used by ....