Download:
|
by P. Zweigenbaum, N. Grabar, Pierre Zweigenbaum, Service D'informatique Mdicale
http://www.biomath.jussieu.fr/~pz/FTPapiers/./Zweigenbaum:MIM2000SUB.ps.gz
Add To MetaCart
Abstract:
Morphological knowledge, especially derivation and compounding, is extremely useful both for natural language processing and information retrieval. Whereas large morphological knowledge bases are available for some languages, this is not the case for French. In order to fill this gap, we aim at setting up a method that can acquire automatically various kinds of morphological knowledge for a given language and domain. This method relies on the synonym terms present in a thesaurus of the domain and a list of words that can be drawn from the same thesaurus. This paper presents a series of experiments whose goal is to learn morphological knowledge from this initial data and without a priori linguistic knowledge. It shows that one can obtain instantaneously a massive, gross description of word morphology in the domain addressed.
Citations
|
291
|
Twolevel morphology: A general computational model for word-form recognition and production
– Koskenniemi
- 1983
|
|
58
|
Corpus-based stemming using cooccurrence of word variants
– Xu, Croft
- 1998
|
|
50
|
Lexical methods for managing variation in biomedical terminologies
– McCray, Srinivasan, et al.
|
|
30
|
CELEX: a guide for users
– Burnage
- 1990
|
|
28
|
Guessing morphology from terms and corpora
– Jacquemin
- 1997
|
|
23
|
Morphological Analysis as Classification: an Inductive-Learning Approach
– Bosch, Daelemans, et al.
- 1996
|
|
11
|
Dictionnaires lectroniques et analyse automatique de textes : le systme INTEX
– Silberztein
- 1993
|
|
9
|
Morphosemantic analysis and translation of medical compound terms
– Dujols, Aubas, et al.
- 1991
|
|
8
|
Medical dictionaries for patient encoding systems: a methodology. Artif Intell Med 1998;14:201--14
– Lovis, Baud, et al.
- 1998
|
|
8
|
Acquisition Automatique de connaissances morphologiques sur le vocabulaire médical
– Grabar, Zweigenbaum
- 1999
|
|
7
|
Automated indexing into SNOMED and ICD
– Wingert, Rothwell, et al.
- 1989
|
|
7
|
Automatic acquisition of domain-specific morphological resources from thesauri
– Grabar, Zweigenbaum
- 2000
|
|
6
|
Automatic coding of medical vocabulary
– Wolff
- 1987
|
|
6
|
Towards a multilingual morpheme thesaurus for medical free-text retrieval
– Schulz, Romacker, et al.
- 1999
|
|
6
|
Morphemes as necessary concept for structures discovery from untagged corpora
– Djean
- 1998
|
|
6
|
Rpertoire d'anatomopathologie de la SNOMED internationale, v3.4. Universit de
– RA
- 1996
|
|
5
|
Extracting linguistic knowledge from an international classification
– RH, Lovis, et al.
- 1997
|
|
4
|
Morphosemantic analysis of-ITIS forms in medical language. Methods Inf Med 1980;19:99--105
– MG, LM, et al.
|
|
4
|
Language-independent automatic acquisition of morphological knowledge from synonym pairs
– Grabar, Zweigenbaum
- 1999
|
|
3
|
Towards consistent, minimal terminologies
– Webber, Markert, et al.
- 1999
|
|
3
|
Construire un lexique drivationnel : thorie et ralisations
– Dal, Namer, et al.
- 1999
|
|
2
|
Daille B, et al. Une approche linguistique et statistique pour l'analyse de l'information en corpus
– Toussaint, Namer
- 1998
|
|
2
|
PC-KIMMO: a two-level processor for morphological analysis. Number 16
– EL
- 1990
|
|
1
|
Perov YL, and Rykov VV. A Russian version of SNOMEDInternational
– R
- 1995
|