|
[9001733.] رقم البحث : 9001733 - |
|
Toward Building a Comprehensive Phrase-based English-Arabic Statistical Machine Translation System / ٍٍٍ |
| تخصص البحث : Machine Translation |
|
هندسة اللغة: / العدد 2 - مجلد (4) - سبتمبر 2017 |
|
Sara Ebrahim ( sara.elkafrawy@gmail.com - ) - مؤلف رئيسي |
|
Doaa Higazyy ( doaa.hegazy@cis.asu.edu.eg - ) |
|
Mostafa G. M. Mostafa ( mgmostafa@cis.asu.edu.eg - ) |
|
Samhaa R. El-Beltagy ( samhaa@computer.org - ) |
|
Machine Translation, Arabic Natural Language Processing, Phrase-based, Statistical Machine Translation. |
|
This paper explores a phrase-based statistical machine translation (PBSMT) pipeline for English-Arabic (En-Ar) language pair. The work surveys the most recent experiments conducted to enhance Arabic machine translation in the En-Ar direction. It also focuses on free datasets and linguistically motivated ideas that enhance phrase-based En-Ar statistical machine translation (SMT) as it is as aims to use those only in order to build a large scale En-Ar SMT system. In addition, the paper highlights Arabic linguistic challenges in Machine Translation (MT) in general. This paper can be considered a guide for building an En-Ar PBSMT system. Furthermore, the presented pipeline can be generalized to any language pairs. |
|
Download Paper |