Search In this Thesis
   Search In this Thesis  
العنوان
Mlrmud :
الناشر
Ahmed Karama Mahboab Alhebshi ,
المؤلف
Ahmed Karama Mahboab Alhebshi
هيئة الاعداد
باحث / Ahmed Karama Mahboab Alhebshi
مشرف / Samir I. Shaheen
مشرف / Amir F. Atiya
مشرف / Mona F. Ahmed
مناقش / Ihab Elsayed Talkhan
تاريخ النشر
2019
عدد الصفحات
75 P. :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
هندسة النظم والتحكم
الناشر
Ahmed Karama Mahboab Alhebshi ,
تاريخ الإجازة
5/11/2019
مكان الإجازة
جامعة القاهرة - كلية الهندسة - Computer Engineering
الفهرس
Only 14 pages are availabe for public view

from 96

from 96

Abstract

The missing value problem (MV) is the problem of predicting the missing value in the data set while achieving accurate values. An additional attribute has been imposed on the missing value problem which is an unknown dependent variable. In this work, a new approach, MLRMUD, based on multiple linear regression is used to predict missing values for a data set with an Unknown Dependent variable if complete rows are at least 20%. If they are less than that the mean method is used to fill some rows until the complete rows reach 20%, after that MLRMUD can be applied normally. This approach is composed of three algorithms; splitting algorithm, dependent variable selection algorithm and multi linear regression algorithm. MLRMUD is compared to other counterparts in the literature where it was proved that it outperforms them all in the accuracy of missing values computation determined in terms of the root mean square error (RMSE) and mean standard error (MSE). A method to determine the unknown dependent variable from the training set is proposed. The results show that the proposed method can successfully select the dependent variable with an accuracy of 83% overall the data sets examined