Results 1 - 10 of 4,731

Porting SIMR to New Language Pairs

by I. Dan Melamed, 1996
"... There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines that are consistent with the mat ..."

Europarl: A Parallel Corpus for Statistical Machine Translation

by Philipp Koehn
"... We collected a corpus of parallel text in 11 languages from the proceedings of the European Parliament, which are published on the web. This corpus has found widespread use in the NLP community. Here, we focus on its acquisition and its application as training data for statistical machine translation (SMT). We trained SMT systems for 110 language pairs, which reveal interesting clues into the challenges ahead."
Cited by 519 (1 self)

Porting SIMR to new language pairs

by I. Dan Melamed (University of Pennsylvania), 1996
"... There are three steps in porting SIMR to new language pairs. The first step is to choose an appropriate matching predicate, and collect any linguistic resources required by that matching predicate. The second step is to implement bitext space axis generation routines that are ..."
Cited by 3 (2 self)

Statistical phrase-based translation

by Philipp Koehn, Franz Josef Och, Daniel Marcu, 2003
"... We propose a new phrase-based translation model and decoding algorithm that enables us to evaluate and compare several, previously proposed phrase-based translation models. Within our framework, we carry out a large number of experiments to understand better and explain why phrase-based models outperform word-based models. Our empirical results, which hold for all examined language pairs, suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations ..."
Cited by 944 (11 self)

Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language

by Philip Resnik, 1999
"... This article presents a measure of semantic similarity in an is-a taxonomy based on the notion of shared information content. Experimental evaluation against a benchmark set of human similarity judgments demonstrates that the measure performs better than the traditional edge-counting approach. The a ..."
Cited by 609 (9 self)
and bicycles, but the latter pair are certainly more similar. Rada et al. #Rada, Mili, Bicknell, & Blett...

A Program for Aligning Sentences in Bilingual Corpora

by William A. Gale, Kenneth W. Church, 1993
"... This paper will describe a method and a program (align) for aligning sentences based on a simple statistical model of character lengths. The program uses the fact that longer sentences in one language tend to be translated into longer sentences in the other language, and that shorter sentences tend ..."
Cited by 529 (5 self)
... showing that error rates will depend on the corpus considered; however, both were small enough to hope that the method will be useful for many language pairs. To further research on bilingual corpora, a much larger sample of Canadian Hansards (approximately 90 million words, half in English and half ...

New Language Pairs in TectoMT

by Ondřej Dušek, Luís Gomes, Michal Novák, Martin Popel, Rudolf Rosa
"... The TectoMT tree-to-tree machine translation system has been updated this year to support easier retraining for more translation directions. We use multilingual standards for morphology and syntax annotation and language-independent base rules. We include a simple, non-parametric way of co ..."

Rapid development of new language pairs at SYSTRAN

by Sylvain Surcin, Elke Lange, Jean Senellart - In Proceedings of the Eleventh MT Summit, 2007
"... One of the mottos of pure statistical MT system promoters is that it is possible to “build a new language pair” overnight, but the development of a new language pair for a proficient rule-based translator requires a great amount of effort in linguistic rules and resources description. Therefore, ..."
Cited by 1 (0 self)

The faculty of language: what is it, who has it, and how did it evolve?

by Marc D. Hauser, Noam Chomsky, W. Tecumseh Fitch - Science, 2002
"... We argue that an understanding of the faculty of language requires substantial interdisciplinary cooperation. We suggest how current developments in linguistics can be profitably wedded to work in evolutionary biology, anthropology, psychology, and neuroscience. We submit that a distinction should ..."
Cited by 472 (7 self)
conserved developmental systems that read an (almost) universal language encoded in DNA base pairs. As such, life is arranged hierarchically with a foundation of discrete, unblendable units (codons, and, for the most part, genes) capable of combining to create increasingly complex and virtually limitless

Robust Language Pair-Independent Sub-Tree Alignment

by John Tinsley, Ventsislav Zhechev, Mary Hearne, Andy Way
"... Data-driven approaches to machine translation (MT) achieve state-of-the-art results. Many syntax-aware approaches, such as Example-Based MT and Data-Oriented Translation, make use of tree pairs aligned at sub-sentential level. Obtaining sub-sentential alignments manually is time-consuming and error-prone, and requires expert knowledge of both source and target languages. We propose a novel, language pair-independent algorithm which automatically induces alignments between phrase-structure trees. We evaluate the alignments themselves against a manually aligned gold standard, and perform an extrinsic evaluation ..."
Cited by 13 (3 self)
