• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 3,321
Next 10 →

A Program for Aligning Sentences in Bilingual Corpora

by William A. Gale , Kenneth W. Church , 1993
"... This paper will describe a method and a program (align) for aligning sentences based on a simple statistical model of character lengths. The program uses the fact that longer sentences in one language tend to be translated into longer sentences in the other language, and that shorter sentences tend ..."
Abstract - Cited by 529 (5 self) - Add to MetaCart
, showing that error rates will depend on the corpus considered; however, both were small enough to hope that the method will be useful for many language pairs. To further research on bilingual corpora, a much larger sample of Canadian Hansards (approximately 90 million words, half in English and and half

The Alignment Template Approach to Statistical Machine Translation

by Franz Josef Och, Hermann Ney , 2004
"... A phrase-based statistical machine translation approach — the alignment template approach — is described. This translation approach allows for general many-to-many relations between words. Thereby, the context of words is taken into account in the translation model, and local changes in word order f ..."
Abstract - Cited by 480 (26 self) - Add to MetaCart
in detail the process for learning phrasal translations, the feature functions used, and the search algorithm. The evaluation of this approach is performed on three different tasks. For the German–English speech Verbmobil task, we analyze the effect of various system components. On the French–English

Combining domain adaptation approaches for medical text translation

by Longyue Wang, Yi Lu, Derek F. Wong, Lidia S. Chao, Yiming Wang, Francisco Oliveira - In Proceedings of the ACL 2014 Ninth Workshop of Statistical Machine Translation , 2014
"... This paper explores a number of simple and effective techniques to adapt statisti-cal machine translation (SMT) systems in the medical domain. Comparative exper-iments are conducted on large corpora for six language pairs. We not only compare each adapted system with the baseline, but also combine t ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
them to further improve the domain-specific systems. Finally, we attend the WMT2014 medical summary sentence translation constrained task and our systems achieve the best BLEU scores for Czech-English, English-German, French-English language pairs and the second best BLEU scores for re-minding pairs. 1.

Mining the Web for Synonyms: PMI-IR Versus LSA on TOEFL

by Peter D. Turney , 2001
"... This paper presents a simple unsupervised learning algorithm for recognizing synonyms, based on statistical data acquired by querying a Web search engine. The algorithm, called PMI-IR, uses Pointwise Mutual Information (PMI) and Information Retrieval (IR) to measure the similarity of pairs of wo ..."
Abstract - Cited by 262 (13 self) - Add to MetaCart
of words. PMI-IR is empirically evaluated using 80 synonym test questions from the Test of English as a Foreign Language (TOEFL) and 50 synonym test questions from a collection of tests for students of English as a Second Language (ESL). On both tests, the algorithm obtains a score of 74%. PMI

The LIG machine translation system for WMT 2010

by Marion Potet, Laurent Besacier, Herve ́ Blanchon
"... This paper describes the system submit-ted by the Laboratory of Informatics of Grenoble (LIG) for the fifth Workshop on Statistical Machine Translation. We participated to the news shared transla-tion task for the French-English language pair. We investigated differents techniques to simply deal wit ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
This paper describes the system submit-ted by the Laboratory of Informatics of Grenoble (LIG) for the fifth Workshop on Statistical Machine Translation. We participated to the news shared transla-tion task for the French-English language pair. We investigated differents techniques to simply deal

Bike: Bilingual Keyphrase Experiments

by David Nadeau, Caroline Barrière, George Foster
"... Abstract: This paper presents a novel strategy for translating lists of keyphrases. Typical keyphrase lists appear in scientific articles, information retrieval systems and web page meta-data. Our system combines a statistical translation model trained on a bilingual corpus of scientific papers with ..."
Abstract - Add to MetaCart
to the French / English language pair.

The RWTH System Combination System for WMT 2011

by Gregor Leusch, Markus Freitag, Hermann Ney
"... RWTH participated in the System Combination task of the Sixth Workshop on Statistical Machine Translation (WMT 2011). For three language pairs, we combined 6 to 14 systems into a single consensus translation. A three-level metacombination scheme combining six different system combination setups with ..."
Abstract - Add to MetaCart
with three different engines was applied on the French–English language pair. Depending on the language pair, improvements versus the best single system are in the range of +1.9 % and +2.5 % abs. on BLEU, and between −1.8 % and −2.4% abs. on TER. Novel techniques compared with RWTH’s submission to WMT 2010

Aligning Sentences in Parallel Corpora

by Peter F. Brown, Jennifer C. Lai, Nd Robert L. Mercer - In Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL'91 , 1991
"... In this paper we describe a statistical tech-nique for aligning sentences with their translations in two parallel corpora. In addition to certain anchor points that are available in our da.ta, the only information about the sentences that we use for calculating alignments i the number of tokens that ..."
Abstract - Cited by 223 (3 self) - Add to MetaCart
that they contain. Because we make no use of the lexical details of the sentence, the alignment com-putation is fast and therefore practical for appli-cation to very large collections of text. We have used this technique to align several million sen-tences in the English-French Hans~trd corpora nd have achieved

FrameNet translation using bilingual dictionaries with evaluation on the English-French pair

by Claire Mouton, Gaël De Chalendar, Benoit Richert
"... Semantic Role Labeling cannot be performed without an associated linguistic resource. A key resource for such a task is the FrameNet resource based on Fillmore’s theory of frame semantics. Like many linguistic resources, FrameNet has been built by English native speakers for the English language. To ..."
Abstract - Add to MetaCart
Semantic Role Labeling cannot be performed without an associated linguistic resource. A key resource for such a task is the FrameNet resource based on Fillmore’s theory of frame semantics. Like many linguistic resources, FrameNet has been built by English native speakers for the English language

Language discrimination by newborns: Towards an understanding of the role of rhythm

by Thierry Nazzi, Josiane Bertoncini, Jacques Mehler - Journal of Experimental Psychology: Human Perception and Performance , 1998
"... Three experiments investigated the ability of French newborns to discriminate between sets of sentences in different foreign languages. The sentences were low-pass filtered to reduce segmental information while sparing prosodic information. Infants discriminated between stress-timed English and mora ..."
Abstract - Cited by 162 (16 self) - Add to MetaCart
Three experiments investigated the ability of French newborns to discriminate between sets of sentences in different foreign languages. The sentences were low-pass filtered to reduce segmental information while sparing prosodic information. Infants discriminated between stress-timed English
Next 10 →
Results 1 - 10 of 3,321
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University