Russian tagger trained on Czech, obtaining emissions from morphological analysis. Russifications, tagger combination.
Abstract: In this paper, we describe a resource-light system for the automatic morphological analysis and tagging of Russian. We eschew the use of extensive resources (particularly, large annotated corpora and lexicons), exploiting instead (i) pre-existing annotated corpora of Czech; (ii) an unannotated corpus of Russian. We show that our approach has benefits, and present what we believe to be one of the first full evaluations of a Russian tagger in the openly available literature.
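The summary line above mentions a tagger trained on Czech whose emissions come from morphological analysis. The following is a minimal sketch of one way such a setup could look, not the authors' implementation. It assumes a bigram HMM, a tag set shared by both languages, transition probabilities estimated from Czech tag sequences, and emission mass spread uniformly over the tags a Russian morphological analyzer allows for each word. The toy tag set (`N`, `V`, `A`), the `ANALYZER` dictionary, the sample data, and all function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def train_transitions(tag_sequences, smoothing=1.0):
    """Estimate log P(tag | previous tag) from tag sequences, with add-k smoothing."""
    tags = sorted({t for seq in tag_sequences for t in seq})
    counts = defaultdict(Counter)
    for seq in tag_sequences:
        prev = "<s>"  # sentence-start pseudo-tag
        for tag in seq:
            counts[prev][tag] += 1
            prev = tag
    logp = {}
    for prev in ["<s>"] + tags:
        total = sum(counts[prev].values()) + smoothing * len(tags)
        logp[prev] = {t: math.log((counts[prev][t] + smoothing) / total) for t in tags}
    return tags, logp

def emission_logp(word, analyzer, tags):
    """Assumption: uniform emission mass over the tags the analyzer allows.
    Words the analyzer does not know may take any tag."""
    allowed = [t for t in analyzer.get(word, []) if t in tags] or tags
    return {t: -math.log(len(allowed)) for t in allowed}

def viterbi(words, tags, trans, analyzer):
    """Return the highest-scoring tag sequence under the bigram model."""
    best = {"<s>": (0.0, [])}  # tag -> (log score, best path ending in that tag)
    for word in words:
        nxt = {}
        for tag, e in emission_logp(word, analyzer, tags).items():
            score, path = max(
                ((s + trans[prev][tag] + e, p) for prev, (s, p) in best.items()),
                key=lambda x: x[0],
            )
            nxt[tag] = (score, path + [tag])
        best = nxt
    return max(best.values(), key=lambda x: x[0])[1]

# Hypothetical "Czech" training data: tag sequences only, since just the
# transitions are taken from the source language in this sketch.
CZECH_TAG_SEQUENCES = [["N", "V", "N"], ["A", "N", "V"], ["N", "V", "A", "N"]]
TAGS, TRANS = train_transitions(CZECH_TAG_SEQUENCES)

# Hypothetical Russian analyzer output: word -> possible tags.
ANALYZER = {"мама": ["N"], "мыла": ["N", "V"], "раму": ["N"]}

print(viterbi(["мама", "мыла", "раму"], TAGS, TRANS, ANALYZER))  # ['N', 'V', 'N']
```

In this toy setting the analyzer leaves "мыла" ambiguous between noun and verb, and the Czech-derived transitions decide in favour of the verb reading. A uniform emission only restricts which tags are possible for a word; a real system would need a better-informed estimate.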
Cited by:
Tagging Portuguese with a Spanish Tagger Using Cognates - Hana, Feldman, Brew, Amaral (2006)
Portable Language Technology: Russian via Czech - Hana, Feldman (2004)
POS Tagging of Dialectal Arabic: A Minimally Supervised Approach - Kevin Duh And
Active bibliography (related documents):
0.5: American Russian: An Endangered Language? - Polinsky
0.3: Morphosyntactic Tagging of Slovene: Evaluating Taggers and .. - Dzeroski, Erjavec, Zavrel (2000)
0.3: Compiling and Using the IJS-ELAN Parallel Corpus - Erjavec (2002)
Related documents from co-citation:
3: TnT -- a statistical part-of-speech tagger - Brants - 2000
2: Guide to collapsing Arabic tagset - Bies - 2003
2: Automatic tagging of Arabic text: from raw text to base phrase chunks - Diab, Hacioglu et al. - 2004
BibTeX entry:
J. Hana, A. Feldman, and C. Brew. 2004. A resource-light approach to Russian morphology: Tagging Russian using Czech resources. In Proc. of EMNLP 2004, July. http://citeseer.ist.psu.edu/hana04resourcelight.html
@inproceedings{hana:feldman:brew:2004,
  author    = {Jiri Hana and Anna Feldman and Chris Brew},
  title     = {{A Resource-light Approach to Russian Morphology: Tagging Russian using Czech resources}},
  booktitle = {{Proceedings of EMNLP 2004}},
  year      = {2004},
  address   = {Barcelona, Spain},
  url       = {citeseer.ist.psu.edu/hana04resourcelight.html}
}
Citations (may not include all citations):
475: Building a large annotated corpus of English: The Penn Treeb.. - Marcus, Santorini et al. - 1993
35: Unsupervised Learning of the Morphology of a Natural Languag.. - Goldsmith - 2001
11: TnT - A Statistical Part-of-Speech Tagger - Brants - 2000
11: Minimally supervised morphological analysis by multimodal al.. - Yarowsky, Wicentowski - 2000
5: Tagset design and inflected languages - Elworthy - 1995
5: Identifying cognates by phonetic and semantic similarity - Kondrak - 2001
3: Morphological and syntactic tagging of the Prague dependency.. - emov, Jan et al. - 1999
3: Morphological Tagging: Data vs - Hajič - 2000
2: Russian Morphology: An Engineering Approach - Mikheev, Liubushkina - 1995
2: Dictionary-based Russian morphological analysis and synthesi.. - Segalovich, Maslov - 1989
2: Automatic morphological annotation MYSTEM - Segalovich, Titov - 2000
2: A fast morphological algorithm with unknown word guessing in.. - Segalovich - 2003
2: A Probabilistic Morphological Analyzer for Russian and Ukran.. - Kovalev - 2002
2: A Comprehensive Russian Grammar - Wade - 1992
2: Russian Morphological Analysis - Yablonsky - 1999
1: frprojectmultext east - MULTEXTEAST, www et al. - 1996
1: Portable Language Technology: The case of Czech and Russian - Hana, Feldman - 2004
1: Serial Combination of Rules and Statistics: A Case Study in .. - Hajič, Pavel et al. - 2001
1: Morphosyntactic Tagging of Slovene: Evaluating Taggers and Ta.. - Dzeroski et al. - 2000
Documents on the same site (http://www.ling.ohio-state.edu/~hana/bibliography.html):
Portable Language Technology: Russian via Czech - Hana, Feldman (2004)
Tagging Portuguese with a Spanish Tagger Using Cognates - Hana, Feldman, Brew, Amaral (2006)
Czech Clitics In Higher Order Grammar - Hana (2004)