Results 1 - 10
of
916
X-iota: An open xml framework for ir experimentation application on multiple weighting scheme tests in a bilingual corpus
- in Proc. AIRS
"... in a bilingual corpus ..."
Using RBMT Systems to Produce Bilingual Corpus for SMT
"... This paper proposes a method using the existing Rule-based Machine Translation (RBMT) system as a black box to produce synthetic bilingual corpus, which will be used as training data for the Statistical Machine Translation (SMT) system. We use the existing RBMT system to translate the monolingual co ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
This paper proposes a method using the existing Rule-based Machine Translation (RBMT) system as a black box to produce synthetic bilingual corpus, which will be used as training data for the Statistical Machine Translation (SMT) system. We use the existing RBMT system to translate the monolingual
E.: Bilingual corpus cleaning focusing on translation literality
- In: 7th International Conference on Spoken Language Processing (ICSLP-2002
, 2002
"... When we automatically acquire translation knowledge from a bilingual corpus, redundant rules are generated due to translation variety. To overcome this problem, we propose bilingual corpus cleaning based on translation literality. Word-level correspondence and phrase-level correspondence are applied ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
When we automatically acquire translation knowledge from a bilingual corpus, redundant rules are generated due to translation variety. To overcome this problem, we propose bilingual corpus cleaning based on translation literality. Word-level correspondence and phrase-level correspondence
Grammarless Bracketing in an Aligned Bilingual Corpus
"... We propose a simple grammarless procedure to extract phrasal translation examples from aligned parallel texts. It is based on the difference of word sequence in two languages. 1. ..."
Abstract
- Add to MetaCart
We propose a simple grammarless procedure to extract phrasal translation examples from aligned parallel texts. It is based on the difference of word sequence in two languages. 1.
A Bilingual Corpus of Inter-linked Events
"... This paper describes the creation of a bilingual corpus of inter-linked events for Italian and English. Linkage is accomplished through the Inter-Lingual Index (ILI) that links ItalWordNet with WordNet. The availability of this resource, on the one hand, enables contrastive analysis of the linguisti ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
This paper describes the creation of a bilingual corpus of inter-linked events for Italian and English. Linkage is accomplished through the Inter-Lingual Index (ILI) that links ItalWordNet with WordNet. The availability of this resource, on the one hand, enables contrastive analysis
Toward a Name Entity Aligned Bilingual Corpus
"... This paper describes a co-training framework in which, through named entity aligned bilingual text, named entity taggers can complement and improve each other via an iterative process. This co-training approach allows us to 1) apply our method to not only parallel but also comparable text, greatly e ..."
Abstract
- Add to MetaCart
extending the applicability of the approach; and to 2) adapt named entity taggers to new domains; 3) create a named entity aligned bilingual corpus. Experiment results on Chinese-English data are shown and discussed. 1.
Primary Data Encoding of a Bilingual Corpus
, 1999
"... This paper discusses the building of a bilingual corpus of legal and administrative texts, focusing on the encoding of documentation and structural information according to the Corpus Encoding Standard. The corpus is one module in an ongoing research project about (semi-)automatic terminology acquis ..."
Abstract
- Add to MetaCart
This paper discusses the building of a bilingual corpus of legal and administrative texts, focusing on the encoding of documentation and structural information according to the Corpus Encoding Standard. The corpus is one module in an ongoing research project about (semi-)automatic terminology
2.1. Translation Acquisition from Bilingual Corpus
"... been published. We describe these previous research results in section 2. In this paper, we propose a disambiguation method for Japanese-English word translations that are extracted from Japanese and English bilingual comparable corpora. The proposed method is two-fold. �� In the first stage, simple ..."
Abstract
- Add to MetaCart
-parallel bilingual corpus. We describe the background of automatic term recognition technology which is very essential in our method also in section 2. The principle of our disambiguation is described in section 3 and 4. More precisely we describe a term extraction method we use in section 3, and a disambiguation
Block Analysis of Bilingual Corpus for Chinese-English Statistical Machine Translation1
"... In this paper, we describe a bilingual corpus processing strategy, block analysis, from a new point of view. By this analysis strategy, we want to extract more information from bilingual corpus for future statistical machine translation. At first, we define some block types and give some statistical ..."
Abstract
- Add to MetaCart
In this paper, we describe a bilingual corpus processing strategy, block analysis, from a new point of view. By this analysis strategy, we want to extract more information from bilingual corpus for future statistical machine translation. At first, we define some block types and give some
Results 1 - 10
of
916