Skip to Main content Skip to Navigation
Conference papers

Evaluating the impact of using a domain-specific bilingual lexicon on the performance of a hybrid machine translation approach

Abstract : This paper describes an Example-Based Machine Translation prototype and presents an evaluation of the impact of using a domain specific vocabulary on its performance. This prototype is based on a hybrid approach which needs only monolingual texts in the target language and consists to combine translation candidates returned by a cross-language search engine with translation hypotheses provided by a finite-state transducer. The results of this combination are evaluated against a statistical language model of the target language in order to obtain the n-best translations. To measure the performance of this hybrid approach, we achieved several experiments using corpora on two domains from the European Parliament proceedings (Europarl) and the European Medicines Agency documents (Emea). The obtained results show that the proposed approach out performs the state-of-the-art Statistical Machine Translation system Moses when texts to translate are related to the specialized domain.
Document type :
Conference papers
Complete list of metadatas

https://hal-cea.archives-ouvertes.fr/cea-01844051
Contributor : Léna Le Roy <>
Submitted on : Thursday, July 19, 2018 - 10:14:52 AM
Last modification on : Monday, February 10, 2020 - 6:13:48 PM

Identifiers

  • HAL Id : cea-01844051, version 1

Collections

Citation

N. Semmar, O. Zennaki, M. Laib. Evaluating the impact of using a domain-specific bilingual lexicon on the performance of a hybrid machine translation approach. 10th International Conference on Recent Advances in Natural Language Processing, RANLP 201, Sep 2015, Hissar, Bulgaria. pp.579-587. ⟨cea-01844051⟩

Share

Metrics

Record views

56