Results 11 - 20
of
8,249
Improved Alignment Models for Statistical Machine Translation
, 1999
"... In this paper, we describe improved alignment modelsforstatisticalmachinetranslation. The statisticaltranslationapproachusestwotypes of information: a translationmodel and a language model. Thelanguagemodelusedisa bigramorgeneral m-gram model. The translation model is decomposed into a lexical and a ..."
Abstract
-
Cited by 353 (52 self)
- Add to MetaCart
betweenphrasesandawordlevelalignment between single words. Wepresentresultsus-ingtheVerbmobiltask (German-English, 6000-word vocabulary) whichisalimited-domain spoken-language task. Theexperimentaltests wereperformedonboththetext transcription and thespeechrecognizeroutput.
The temporal structure of spoken language understanding
, 1980
"... The word-by-word time-course of spoken language understanding was investigated in two experiments, focussing simultaneously on word-recognition (local) processes and on structural and interpretative (global) processes. Both experiments used three word-monitoring tasks, which varied the description u ..."
Abstract
-
Cited by 203 (16 self)
- Add to MetaCart
The word-by-word time-course of spoken language understanding was investigated in two experiments, focussing simultaneously on word-recognition (local) processes and on structural and interpretative (global) processes. Both experiments used three word-monitoring tasks, which varied the description
A treebank study of clausal coordinate ellipsis in spoken and written language
"... One of the benefits of incremental sentence production is reduction of the working memory capacity needed for advance planning: The planning units can be considerably smaller (measured in terms of word length) than in case of non-incremental production. The same advantage has been claimed for the va ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
for the various forms of ellipsis, which preempt the need to plan the detailed shape of one or more constituents and thereby reduce the size of planning units. Because working memory load tends to be higher in spoken than in written language, one expects that speakers, in comparison with writers, will more
Inside-outside reestimation from partially bracketed corpora
- In Proceedings of the 30th Annual Meeting of the ACL
, 1992
"... The inside-outside algorithm for inferring the parameters of a stochastic context-free grammar is extended to take advantage of constituent information (constituent bracketing) in a partially parsed corpus. Experiments on formal and natural language parsed corpora show that the new algorithm can ach ..."
Abstract
-
Cited by 306 (3 self)
- Add to MetaCart
System spoken language corpus. Finally, the new algorithm has better time complexity than the original one when sufficient bracketing is provided. 1
Gemini: A Natural Language System For Spoken-Language Understanding
- In Proceedings of the Thirty-First Annual Meeting of the Association for Computational Linguistics
, 1993
"... This paper describes the details of the system, and includes relevant measurements of size, efficiency, and performance of each of its components ..."
Abstract
-
Cited by 174 (39 self)
- Add to MetaCart
This paper describes the details of the system, and includes relevant measurements of size, efficiency, and performance of each of its components
Towards detecting emotions in spoken dialogs
- IEEE Transactions on Speech and Audio Processing
, 2005
"... Abstract—The importance of automatically recognizing emotions from human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. This paper explores the detection of domain-specific emotions using language and discourse information in conju ..."
Abstract
-
Cited by 178 (22 self)
- Add to MetaCart
Abstract—The importance of automatically recognizing emotions from human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. This paper explores the detection of domain-specific emotions using language and discourse information
The Penn Chinese treebank: Phrase structure annotation of a large corpus. Natural Language Engineering
, 2005
"... With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, part-of-speech taggers, and parsers) for Chinese have been developed all over the world. However, since no large-scale bracketed corpora are available to the public, these tools are trained on corpora wi ..."
Abstract
-
Cited by 170 (23 self)
- Add to MetaCart
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, part-of-speech taggers, and parsers) for Chinese have been developed all over the world. However, since no large-scale bracketed corpora are available to the public, these tools are trained on corpora
Improved Inference for Unlexicalized Parsing
, 2007
"... We present several improvements to unlexicalized parsing with hierarchically state-split PCFGs. First, we present a novel coarse-to-fine method in which a grammar’s own hierarchical projections are used for incremental pruning, including a method for efficiently computing projections of a grammar wi ..."
Abstract
-
Cited by 255 (29 self)
- Add to MetaCart
without a treebank. In our experiments, hierarchical pruning greatly accelerates parsing with no loss in empirical accuracy. Second, we compare various inference procedures for state-split PCFGs from the standpoint of risk minimization, paying particular attention to their practical tradeoffs. Finally, we
The Design for the Wall Street Journal-based CSR Corpus
- DARPA Speech and Language Workshop
, 1992
"... The DARPA Spoken Language System (SLS) community has long taken a leadership position in designing, implementing, and globally distributing significant speech corpora widely used for advancing speech recognition research. The Wall Street Journal (WSJ) CSR Corpus described here is the newest addition ..."
Abstract
-
Cited by 235 (2 self)
- Add to MetaCart
The DARPA Spoken Language System (SLS) community has long taken a leadership position in designing, implementing, and globally distributing significant speech corpora widely used for advancing speech recognition research. The Wall Street Journal (WSJ) CSR Corpus described here is the newest
Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition
- Journal of Memory and Language
, 1999
"... nvestigated the implications of this information for the representation and processing of spoken language. Research on phonotactics in linguistics has examined the representations of various types of sequential constraints and segmental co-occurrence relations in syllables and words (Frisch, Broe, ..."
Abstract
-
Cited by 159 (4 self)
- Add to MetaCart
nvestigated the implications of this information for the representation and processing of spoken language. Research on phonotactics in linguistics has examined the representations of various types of sequential constraints and segmental co-occurrence relations in syllables and words (Frisch, Broe
Results 11 - 20
of
8,249