• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 916
Next 10 →

bilingual corpus analysis

by Prof Rohini, Chandrica Widyalankara
"... term formation in ICT terminology in Sinhala: A ..."
Abstract - Add to MetaCart
term formation in ICT terminology in Sinhala: A

X-iota: An open xml framework for ir experimentation application on multiple weighting scheme tests in a bilingual corpus

by Jean-pierre Chevallet - in Proc. AIRS
"... in a bilingual corpus ..."
Abstract - Cited by 10 (9 self) - Add to MetaCart
in a bilingual corpus

Using RBMT Systems to Produce Bilingual Corpus for SMT

by Xiaoguang Hu, Haifeng Wang, Hua Wu
"... This paper proposes a method using the existing Rule-based Machine Translation (RBMT) system as a black box to produce synthetic bilingual corpus, which will be used as training data for the Statistical Machine Translation (SMT) system. We use the existing RBMT system to translate the monolingual co ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
This paper proposes a method using the existing Rule-based Machine Translation (RBMT) system as a black box to produce synthetic bilingual corpus, which will be used as training data for the Statistical Machine Translation (SMT) system. We use the existing RBMT system to translate the monolingual

E.: Bilingual corpus cleaning focusing on translation literality

by Kenji Imamura, Eiichiro Sumita - In: 7th International Conference on Spoken Language Processing (ICSLP-2002 , 2002
"... When we automatically acquire translation knowledge from a bilingual corpus, redundant rules are generated due to translation variety. To overcome this problem, we propose bilingual corpus cleaning based on translation literality. Word-level correspondence and phrase-level correspondence are applied ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
When we automatically acquire translation knowledge from a bilingual corpus, redundant rules are generated due to translation variety. To overcome this problem, we propose bilingual corpus cleaning based on translation literality. Word-level correspondence and phrase-level correspondence

Grammarless Bracketing in an Aligned Bilingual Corpus

by Jorge Kinoshita, Escola Politécnica, Paulo Brazil
"... We propose a simple grammarless procedure to extract phrasal translation examples from aligned parallel texts. It is based on the difference of word sequence in two languages. 1. ..."
Abstract - Add to MetaCart
We propose a simple grammarless procedure to extract phrasal translation examples from aligned parallel texts. It is based on the difference of word sequence in two languages. 1.

A Bilingual Corpus of Inter-linked Events

by Nancy Ide, Roberto Bartolini
"... This paper describes the creation of a bilingual corpus of inter-linked events for Italian and English. Linkage is accomplished through the Inter-Lingual Index (ILI) that links ItalWordNet with WordNet. The availability of this resource, on the one hand, enables contrastive analysis of the linguisti ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
This paper describes the creation of a bilingual corpus of inter-linked events for Italian and English. Linkage is accomplished through the Inter-Lingual Index (ILI) that links ItalWordNet with WordNet. The availability of this resource, on the one hand, enables contrastive analysis

Toward a Name Entity Aligned Bilingual Corpus

by Xiaoyi Ma
"... This paper describes a co-training framework in which, through named entity aligned bilingual text, named entity taggers can complement and improve each other via an iterative process. This co-training approach allows us to 1) apply our method to not only parallel but also comparable text, greatly e ..."
Abstract - Add to MetaCart
extending the applicability of the approach; and to 2) adapt named entity taggers to new domains; 3) create a named entity aligned bilingual corpus. Experiment results on Chinese-English data are shown and discussed. 1.

Primary Data Encoding of a Bilingual Corpus

by Johann Gamper, Paolo Dongilli , 1999
"... This paper discusses the building of a bilingual corpus of legal and administrative texts, focusing on the encoding of documentation and structural information according to the Corpus Encoding Standard. The corpus is one module in an ongoing research project about (semi-)automatic terminology acquis ..."
Abstract - Add to MetaCart
This paper discusses the building of a bilingual corpus of legal and administrative texts, focusing on the encoding of documentation and structural information according to the Corpus Encoding Standard. The corpus is one module in an ongoing research project about (semi-)automatic terminology

2.1. Translation Acquisition from Bilingual Corpus

by unknown authors
"... been published. We describe these previous research results in section 2. In this paper, we propose a disambiguation method for Japanese-English word translations that are extracted from Japanese and English bilingual comparable corpora. The proposed method is two-fold. �� In the first stage, simple ..."
Abstract - Add to MetaCart
-parallel bilingual corpus. We describe the background of automatic term recognition technology which is very essential in our method also in section 2. The principle of our disambiguation is described in section 3 and 4. More precisely we describe a term extraction method we use in section 3, and a disambiguation

Block Analysis of Bilingual Corpus for Chinese-English Statistical Machine Translation1

by Hairong Xia, Bo Xu, Taiyi Huang
"... In this paper, we describe a bilingual corpus processing strategy, block analysis, from a new point of view. By this analysis strategy, we want to extract more information from bilingual corpus for future statistical machine translation. At first, we define some block types and give some statistical ..."
Abstract - Add to MetaCart
In this paper, we describe a bilingual corpus processing strategy, block analysis, from a new point of view. By this analysis strategy, we want to extract more information from bilingual corpus for future statistical machine translation. At first, we define some block types and give some
Next 10 →
Results 1 - 10 of 916
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University