2 citations found. Retrieving documents...
A. Mikheev. 2002. Periods, capitalized words, etc. Computational Linguistics, 28(3):289--318.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Tokenization of Portuguese: resolving thr hard cases - Branco, Silva (2003)   (Correct)

....as being of an even lower level than tokenization in the spectrum of the language processing stages. The task of sentence chunking turns out not to be as straightforward as it might be assumed at first glance because one of the most frequent terminator characters, the period, is ambiguous [1]. It marks the end of sentences, as in the example Este um exemplo. it marks the end of an abbreviation as in the example Estive seg. em Lisboa. and it marks both the end of a sentence and the end of an abbreviation, as in the example Cheguei na seg. Lisboa estava calma. The interesting point ....

Mikheev, Andrei: Periods, Capitalized Words, etc. Computational Linguistics 28(3). (2002) 289-318.


Text Preprocessing for Speech Synthesis - Uwe Reichel Hartmut   (Correct)

No context found.

A. Mikheev. 2002. Periods, capitalized words, etc. Computational Linguistics, 28(3):289--318.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC