Results 1 -
6 of
6
A Method of Adding New Entries to a Valency Dictionary by Exploiting Existing Lexical Resources
- in Proc. of the 9th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2002), (this volume
, 2002
"... Information on subcategorization and selectional restrictions in a valency dictionary is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. ..."
Abstract
-
Cited by 5 (3 self)
- Add to MetaCart
Information on subcategorization and selectional restrictions in a valency dictionary is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization.
Semantic annotation of a Japanese speech corpus
- In COLING Workshop on Semantic Annotation and Intelligent Content
, 2000
"... This paper describes the semantic annotations we are performing on the CallHome Japanese corpus of spontaneous, unscripted telephone conversations (LDC, 1996). Our annotations include (i) semantic classes for all nouns and verbs; (ii) verb senses for all main verbs; and (iii) relations between main ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
This paper describes the semantic annotations we are performing on the CallHome Japanese corpus of spontaneous, unscripted telephone conversations (LDC, 1996). Our annotations include (i) semantic classes for all nouns and verbs; (ii) verb senses for all main verbs; and (iii) relations between main verbs and their complements in the same utterance. Our semantic tagset is taken from NTT's Goi-Taikei semantic lexicon and ontology (Ikehara et al., 1997). A pilot study demonstrates that the verb sense tagging can be e#ciently performed by native Japanese speakers using computergenerated HTML forms, and that good interannotator reliability can be obtained in the right conditions. 1 Introduction Semantic annotations have proved valuable for a variety of NLP tasks, including parsing, word sense disambiguation, coreference resolution, summarization, and information retrieval and extraction. The most challenging domain for all these tasks is spontaneous spoken language, which tends to be more...
Verb Alternations and Japanese - How, What and Where?
- in Proc. of the 14th Pacific Asia Conference on Language, Information and Computation (PACLIC
, 2000
"... We set out to empirically identify the range and frequency of basic verb alternation types in Japanese, through analysis of the Goi-Taikei Japanese pattern-based valency dictionary. This is achieved through comparison of the selectional preference annotation on corresponding case slots, based on ..."
Abstract
- Add to MetaCart
We set out to empirically identify the range and frequency of basic verb alternation types in Japanese, through analysis of the Goi-Taikei Japanese pattern-based valency dictionary. This is achieved through comparison of the selectional preference annotation on corresponding case slots, based on the assumption that selectional preferences are preserved under alternation. Three separate extraction methods are considered, founded around: (1) simple match of selectional restrictions; (2) selectional restriction matching, with recourse to penalised backing-o#; and (3) semantic density, again with recourse to backing-o#.
Preliminary analysis of the range and frequency of Japanese verb alternations
"... We set out to empirically identify the range and frequency of basic verb alternation types in Japanese, through analysis of the Goi-Taikei Japanese patternbased valency dictionary. This is achieved through comparison of the selectional preference annotation on corresponding case slots, based on t ..."
Abstract
- Add to MetaCart
We set out to empirically identify the range and frequency of basic verb alternation types in Japanese, through analysis of the Goi-Taikei Japanese patternbased valency dictionary. This is achieved through comparison of the selectional preference annotation on corresponding case slots, based on the assumption that selectional preferences are preserved under alternation. Three separate extraction methods are considered, founded around: (1) simple match of selectional restrictions; (2) selectional restriction matching, with recourse to penalised backing-off; and (3) semantic density, again with recourse to backingoff. 1 Introduction This is an attempt to document the full range of verb alternations in Japanese from analysis of structural regularities between entries in a valency dictionary, and determine the degree of permeation of each individual alternation type. Various techniques are analysed for extracting alternations, focusing on full or partial preservation of the select...
Japanese-English Paraphrase Corpus
, 2001
"... This paper introduces an attempt at collecting a corpus of various usages of Japanese predicates and synonymous expressions in English. We have learned that an effective consideration to exhaus- tively collect such various usages is to continue to create new sentences until no more sentences ..."
Abstract
- Add to MetaCart
This paper introduces an attempt at collecting a corpus of various usages of Japanese predicates and synonymous expressions in English. We have learned that an effective consideration to exhaus- tively collect such various usages is to continue to create new sentences until no more sentences can be conceived within one language. We have found that an effective way of collecting synonymous expressions of predicates in JapaneseEnglish or English-Japanese translation, is to create translations of the synonymous expressions and expand them to example sets of multiple pairs.
The 20th International Conference on Computational Linguistics Post-Conference Workshop Proceeding of the Workshop on
, 2004
"... In an ever expanding information society, most information systems are now facing the “multilingual challenge”. Multilingual language resources play an essential role in modern information systems. Such resources need to provide information on many languages in a common framework and should be (re)u ..."
Abstract
- Add to MetaCart
In an ever expanding information society, most information systems are now facing the “multilingual challenge”. Multilingual language resources play an essential role in modern information systems. Such resources need to provide information on many languages in a common framework and should be (re)usable in many applications (for automatic or human use). Many centres have been involved in national and international projects dedicated to building harmonised language resources and creating expertise in the maintenance and further development of standardised linguistic data. These resources include dictionaries, lexicons, thesauri, word-nets, and annotated corpora developed along the lines of best practices and recommendations. However, since the late 90’s, most efforts in scaling up these resources remain the responsibility of the local authorities, usually, with very low funding (if any) and few opportunities for academic recognition of this work. Hence, it is not surprising that many of the resource holders and developers have become reluctant to give free access to the latest versions of their resources, and their actual status is therefore currently rather unclear. The goal of this workshop is to study problems involved in the development, management and reuse of lexical resources in a multilingual context. Moreover, this workshop provides a forum for

