Results 1 -
4 of
4
O.: Unsupervised Morpheme Discovery with Allomorfessor
- In Cross Language Evaluation Forum (CLEF
, 2009
"... We describe Allomorfessor, which extends the unsupervised morpheme segmentation method Morfessor to account for the linguistic phenomenon of allomorphy, where one morpheme has several different surface forms. The method discovers common base forms for allomorphs from an unannotated corpus by finding ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
(Show Context)
We describe Allomorfessor, which extends the unsupervised morpheme segmentation method Morfessor to account for the linguistic phenomenon of allomorphy, where one morpheme has several different surface forms. The method discovers common base forms for allomorphs from an unannotated corpus by finding small modifications, called mutations, for them. Using Maximum a Posteriori estimation, the model is able to decide the amount and types of the mutations needed for the particular language. The method is evaluated in Morpho Challenge 2009.
Enriching Morphological Lexica through Unsupervised Derivational Rule Acquisition
- in "WoLeR 2011 at ESSLLI (International Workshop on Lexical Resources
, 2011
"... Abstract In a morphological lexicon, each entry combines a lemma with a specific inflection class, often defined by a set of inflection rules. Therefore, such lexica usually give a satisfying account of inflectional operations. Derivational information, however, is usually badly covered. In this pa ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
(Show Context)
Abstract In a morphological lexicon, each entry combines a lemma with a specific inflection class, often defined by a set of inflection rules. Therefore, such lexica usually give a satisfying account of inflectional operations. Derivational information, however, is usually badly covered. In this paper we introduce a novel approach for enriching morphological lexica with derivational links between entries and with new entries derived from existing ones and attested in large-scale corpora, without relying on prior knowledge of possible derivational processes. To achieve this goal, we adapt the unsupervised morphological rule acquisition tool MorphAcq (Nicolas et al., 2010) in a way allowing it to take into account an existing morphological lexicon developed in the Alexina framework (Sagot, 2010), such as the Lefff for French and the Leffe for Spanish. We apply this tool on large corpora, thus uncovering morphological rules that model derivational operations in these two lexica. We use these rules for generating derivation links between existing entries, as well as for deriving new entries from existing ones and adding those which are best attested in a large corpus. In addition to lexicon development and NLP applications that benefit from rich lexical data, such derivational information will be particularly valuable to linguists who rely on vast amounts of data to describe and analyse these specific morphological phenomena.
MORPHEME SEGMENTATION BY OPTIMIZING TWO-PART MDL CODES
"... In many real-world NLP applications, a compact yet representative vocabulary is a necessary ingredient. Words are often thought of as basic units of representation. In highly-inflecting and compounding languages, words can ..."
Abstract
- Add to MetaCart
(Show Context)
In many real-world NLP applications, a compact yet representative vocabulary is a necessary ingredient. Words are often thought of as basic units of representation. In highly-inflecting and compounding languages, words can
DATE OF APPROVAL: 29/04/2011 Acknowledgements
, 2011
"... I am honored to present my special thanks and deepest gratitude to my supervisor Assist. Prof. Olcay Taner YILDIZ for his guidance in this thesis. Without his endless patience and support I would not finish this work. I am feeling lucky to share his vision and knowledge throughout this thesis. I am ..."
Abstract
- Add to MetaCart
(Show Context)
I am honored to present my special thanks and deepest gratitude to my supervisor Assist. Prof. Olcay Taner YILDIZ for his guidance in this thesis. Without his endless patience and support I would not finish this work. I am feeling lucky to share his vision and knowledge throughout this thesis. I am also grateful to my family for their support. They have always been by my side whenever I needed.