| A. Mikheev. 2002. Periods, capitalized words, etc. Computational Linguistics, 28(3):289--318. |
....as being of an even lower level than tokenization in the spectrum of the language processing stages. The task of sentence chunking turns out not to be as straightforward as it might be assumed at first glance because one of the most frequent terminator characters, the period, is ambiguous [1]. It marks the end of sentences, as in the example Este um exemplo. it marks the end of an abbreviation as in the example Estive seg. em Lisboa. and it marks both the end of a sentence and the end of an abbreviation, as in the example Cheguei na seg. Lisboa estava calma. The interesting point ....
Mikheev, Andrei: Periods, Capitalized Words, etc. Computational Linguistics 28(3). (2002) 289-318.
No context found.
A. Mikheev. 2002. Periods, capitalized words, etc. Computational Linguistics, 28(3):289--318.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC