اختيار الموقع            تسجيل دخول
 

تسجيل دخول للنظام
  كود المستخدم
  كلمة السر
نسيت كلمة السر؟
دوريات النشر الإلكتروني



هندسة اللغة:
 هندسة اللغة:
  تفاصيل البحث
 
[9000518.] رقم البحث : 9000518 -
Automatic Speech Annotation Using HMM based on Best Tree Encoding (BTE) Feature /
تخصص البحث : Speech Processing, Recognition and Synthesis
  هندسة اللغة: / عدد(1) - مجلد (1) - يناير 2014
  Amr M. Gody ( amg00@fayoum.edu.eg - )
  Rania Ahmed Abul Seoud ( r-abulseoud@k-space.org - )
  Mohamed Hassan ( mh1323@fayoum.edu.eg - )
  BTE, MFCC, HTK, Gaussian Mixture, Speech Recognition
  Manual annotation for time-aligning a speech waveform against the corresponding phonetic sequence is a tedious and time consuming task. This paper aimed to introduce a completely automated phone recognition system based on Best Tree Encoding (BTE) 4-point speech feature. BTE is used to find phoneme boundaries along speech utterance. Comparison to Mel-frequency cepstral coefficients (MFCCs) speech feature in solving the same problem is provided. Hidden Markov Model (HMM) and Gaussian Mixtures are used for building the statistical models through this research. HTK software toolkit is utilized for implementation of the model. The System can identify spoken phone at 59.1% recognition rate based on MFCC and 22.92% recognition rate based on BTE. The current BTE vector is 4 components compared to 39 components of MFCC. This makes it very promising features vector, BTE with 4 components gives a comparable recognition success rate compared to the 39 components MFCC vector widely in the area of ASR.
  Download Paper


 







Powered by Future Library Software.All rights reserved © CITC - Mansoura University. Sponsored by Mansoura University Privacy Policy