Search In this Thesis
   Search In this Thesis  
العنوان
Frequent Itemset Mining for Big Data\
المؤلف
El_Aziz,Ahmed Farouk Abd
هيئة الاعداد
باحث / أحمد فاروق عبد العزيز إبراهيم حسن
مشرف / إسلام احمد محمد المداح
مشرف / هدى قرشى محمد
مناقش / جمال الدين على
تاريخ النشر
2015.
عدد الصفحات
96p.:
اللغة
الإنجليزية
الدرجة
الدكتوراه
التخصص
الهندسة الكهربائية والالكترونية
تاريخ الإجازة
1/1/2015
مكان الإجازة
جامعة عين شمس - كلية الهندسة - كهرباء حاسبات
الفهرس
Only 14 pages are availabe for public view

from 32

from 32

Abstract

Discovering the frequent item sets requires a lot of computation power,
memory and input/output values, which is better to be provided by
parallel processing. The implementation of data mining ideas in high
performance parallel and distributed computing environments is
becoming crucial for ensuring system scalability and interactivity as data
continues to grow in size and complexity.
The aim of this work is to present a modified parallel algorithm that
makes an enhancement of parallel buddy prima algorithm. The modified
parallel algorithm doesn’t need to load all transaction in memory. It may
be inapplicable for large data sets with many distinct items. Modified
algorithm prevent full scan of all transaction set each time we calculate
the support count of a candidate set. The algorithm is implemented, and is
compared to its predecessor algorithms. The comparisons were made on
processing time and the accuracy of each algorithm. Results of applying
the proposed algorithm show faster performance than other algorithms
without scarifying the accuracy.