Results 1 - 10 of 4,731
to New Language Pairs
, 1996
"... There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines that are consistent with the mat ..."
There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines that are consistent
Europarl: A Parallel Corpus for Statistical Machine Translation
"... We collected a corpus of parallel text in 11 languages from the proceedings of the European Parliament, which are published on the web 1. This corpus has found widespread use in the NLP community. Here, we focus on its acquisition and its application as training data for statistical machine translat ..."
Cited by 519 (1 self)
translation (SMT). We trained SMT systems for 110 language pairs, which reveal interesting clues into the challenges ahead.
Porting SIMR to new language pairs
, 1996
"... University of Pennsylvania. There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines that are ..."
Cited by 3 (2 self)
University of Pennsylvania. There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines
Statistical phrase-based translation
, 2003
"... We propose a new phrase-based translation model and decoding algorithm that enables us to evaluate and compare several, previously proposed phrase-based translation models. Within our framework, we carry out a large number of experiments to understand better and explain why phrase-based models outpe ..."
Cited by 944 (11 self)
outperform word-based models. Our empirical results, which hold for all examined language pairs, suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations
Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language
, 1999
"... This article presents a measure of semantic similarity in an is-a taxonomy based on the notion of shared information content. Experimental evaluation against a benchmark set of human similarity judgments demonstrates that the measure performs better than the traditional edge-counting approach. The a ..."
Cited by 609 (9 self)
and bicycles, but the latter pair are certainly more similar. Rada et al. #Rada, Mili, Bicknell, & Blett...
A Program for Aligning Sentences in Bilingual Corpora
, 1993
"... This paper will describe a method and a program (align) for aligning sentences based on a simple statistical model of character lengths. The program uses the fact that longer sentences in one language tend to be translated into longer sentences in the other language, and that shorter sentences tend ..."
Cited by 529 (5 self)
, showing that error rates will depend on the corpus considered; however, both were small enough to hope that the method will be useful for many language pairs. To further research on bilingual corpora, a much larger sample of Canadian Hansards (approximately 90 million words, half in English and half
New Language Pairs in TectoMT
"... The TectoMT tree-to-tree machine translation system has been updated this year to support easier retraining for more translation directions. We use multilingual standards for morphology and syntax annotation and language-independent base rules. We include a simple, non-parametric way of co ..."
The TectoMT tree-to-tree machine translation system has been updated this year to support easier retraining for more translation directions. We use multilingual standards for morphology and syntax annotation and language-independent base rules. We include a simple, non-parametric way
2007. Rapid development of new language pairs at SYSTRAN
- In Proceedings of the Eleventh MT Summit
"... One of the mottos of pure statistical MT system promoters is that it is possible to “build a new language pair” overnight, but the development of a new language pair for a proficient rule-based translator requires a great amount of effort in linguistic rules and resources description. Therefore, ..."
Cited by 1 (0 self)
One of the mottos of pure statistical MT system promoters is that it is possible to “build a new language pair” overnight, but the development of a new language pair for a proficient rule-based translator requires a great amount of effort in linguistic rules and resources description. Therefore
The faculty of language: what is it, who has it, and how did it evolve?
- Science,
, 2002
"... We argue that an understanding of the faculty of language requires substantial interdisciplinary cooperation. We suggest how current developments in linguistics can be profitably wedded to work in evolutionary biology, anthropology, psychology, and neuroscience. We submit that a distinction should ..."
Cited by 472 (7 self)
conserved developmental systems that read an (almost) universal language encoded in DNA base pairs. As such, life is arranged hierarchically with a foundation of discrete, unblendable units (codons, and, for the most part, genes) capable of combining to create increasingly complex and virtually limitless
Robust Language Pair-Independent Sub-Tree Alignment
"... Data-driven approaches to machine translation (MT) achieve state-of-the-art results. Many syntax-aware approaches, such as Example-Based MT and Data-Oriented Translation, make use of tree pairs aligned at sub-sentential level. Obtaining sub-sentential alignments manually is time-consuming and error- ..."
Cited by 13 (3 self)
-prone, and requires expert knowledge of both source and target languages. We propose a novel, language pair-independent algorithm which automatically induces alignments between phrase-structure trees. We evaluate the alignments themselves against a manually aligned gold standard, and perform an extrinsic evaluation