Mansoura journal for computer and information sciences
 
Paper number: 9003016
A Hybrid Approach for Automatic Morphological Diacritization of Arabic Text
  Mansoura journal for computer and information sciences, Vol. 14, No. 2
  Hatem M Noaman (hatemnoaman@yahoo.com), principal author
  Shahenda S. Sarhan (shahenda_sarhan@yahoo.com)
  M. A. A. Rashwan (mrashwan@RDI-eg.com)
  Arabic Natural Language Processing; Automatic Morphological Diacritization; deep encode-decode recurrent neural networks.
  Modern Arabic texts are commonly written without diacritics. Restoring them (diacritization) is critical for other Arabic processing tasks such as word sense disambiguation, automatic speech recognition, and text-to-speech, where word meaning or pronunciation is decided by the diacritic signs assigned to each letter.
This paper presents a novel approach to automatic Arabic text diacritization that uses deep encode-decode recurrent neural networks followed by several text correction techniques to improve the overall accuracy of the system output. Experimental results of the proposed system on the Wikinews test set show superior performance, competitive with that of state-of-the-art diacritization methods. Specifically, our method achieves a morphological diacritization Word Error Rate (WER) of 3.85% and a Diacritic Error Rate (DER) of 1.12%.
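
The two metrics reported above can be illustrated with a minimal sketch. It assumes the common definitions: DER as the fraction of letters whose diacritic marks differ from the reference, and WER as the fraction of words containing at least one such letter. The function names `split_diacritics` and `error_rates` and the `skip_case_ending` flag are hypothetical, introduced only for illustration; the paper's own evaluation procedure may differ. Setting `skip_case_ending=True` ignores the last letter of each word, which is one plausible reading of "morphological" diacritization.

```python
# Arabic diacritic marks (tanween, short vowels, shadda, sukun): U+064B..U+0652.
DIACRITICS = {chr(code) for code in range(0x064B, 0x0653)}


def split_diacritics(word):
    """Return a list of (letter, marks) pairs for one word."""
    pairs = []
    for ch in word:
        if ch in DIACRITICS:
            if pairs:  # attach the mark to the preceding letter
                letter, marks = pairs[-1]
                pairs[-1] = (letter, marks + ch)
        else:
            pairs.append((ch, ""))
    return pairs


def error_rates(reference, hypothesis, skip_case_ending=False):
    """Return (DER, WER) as fractions, comparing two diacritized strings.

    Assumption: both strings contain the same words and letters and
    differ only in their diacritic marks.
    """
    ref_words, hyp_words = reference.split(), hypothesis.split()
    if len(ref_words) != len(hyp_words):
        raise ValueError("reference and hypothesis differ in word count")

    letters = letter_errors = word_errors = 0
    for ref_word, hyp_word in zip(ref_words, hyp_words):
        ref_pairs = split_diacritics(ref_word)
        hyp_pairs = split_diacritics(hyp_word)
        if [l for l, _ in ref_pairs] != [l for l, _ in hyp_pairs]:
            raise ValueError("reference and hypothesis differ in letters")
        if skip_case_ending:  # ignore the final letter (case ending)
            ref_pairs, hyp_pairs = ref_pairs[:-1], hyp_pairs[:-1]
        # Compare marks as sorted sequences so that mark order
        # (for example shadda before or after a vowel) does not matter.
        wrong = sum(
            sorted(ref_marks) != sorted(hyp_marks)
            for (_, ref_marks), (_, hyp_marks) in zip(ref_pairs, hyp_pairs)
        )
        letters += len(ref_pairs)
        letter_errors += wrong
        word_errors += wrong > 0

    der = letter_errors / letters if letters else 0.0
    wer = word_errors / len(ref_words) if ref_words else 0.0
    return der, wer
```

For a two-word sentence in which only the final mark of the second word is wrong, the sketch counts one erroneous letter and one erroneous word; with `skip_case_ending=True` that same error is ignored.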


 






