Search In this Thesis
   Search In this Thesis  
العنوان
A Novel AlGorithm For Selection Of Bio-Bases For Amino Acid Sequence Analysis/
الناشر
Bassem Elsayed Saad,
المؤلف
Saad, Bassem Elsayed.
الموضوع
Amino Acid
تاريخ النشر
2009 .
عدد الصفحات
80 P.:
الفهرس
يوجد فقط 14 صفحة متاحة للعرض العام

from 80

from 80

المستخلص

This thesis introduces a new algorithm termed as Rough Fuzzy C-Medoids based on Randomized Search (RFCMRANS) algorithm, to select the most informative bio-bases; the minimum set of bio-bases with maximum information. As amino acids are nonnumeric variables, they need encoding before using them as inputs for any pattern recognition algorithm. Therefore, a bio-basis function has been used to map a nonnumeric sequence space to a numerical feature space. It is designed using an amino acid mutation matrix. The (RFCMRANS) algorithm is comprised of a judicious integration of the principles of rough sets, fuzzy sets, the C-Medoids based on randomized search algorithm, and the amino acid mutation matrix. While the membership function of fuzzy sets enables efficient handling of overlapping partitions, the concept of lower and upper bounds of rough sets deals with uncertainty, vagueness, and incompleteness in class definition. The concept of crisp lower bound and fuzzy boundary of class, introduced in RFCMRANS, enables efficient selection of the minimum set of the most informative bio-bases. Some quantitative indices are used for evaluating quantitatively the quality of selected bio-bases. The effectiveness of the proposed algorithm, along with a comparison with other algorithms, has been demonstrated on dfferent types ofprotein data sets.