Search In this Thesis
   Search In this Thesis  
العنوان
Improvement of XML Data Clustering /
المؤلف
Rizk, Nermeen Gamal.
هيئة الاعداد
باحث / نرمين جمال رزق
مشرف / امانى محمود سرحان
مناقش / محمد طلعت فهيم سيد احمد
مناقش / رضا حسين ابو العز
الموضوع
Computers Engineering.
تاريخ النشر
2017.
عدد الصفحات
111 p. :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
هندسة النظم والتحكم
تاريخ الإجازة
25/7/2017
مكان الإجازة
جامعة طنطا - كلية الهندسه - Computer and Control Engineering
الفهرس
Only 14 pages are availabe for public view

from 132

from 132

Abstract

With the continuous growth of the EXtensible Markup Language (XML) data on the web, it becomes essential to effectively organize these XML data to retrieve useful information. The need for dealing with and processing these large amount of data bring complications to many applications such as: Information Retrieval, Data Integration, and many others. To deal with obtaining useful information from huge amount of data, large amount of XML data must be broken into smaller groups via clustering techniques. However, XML clustering is a complex task due to the nature of XML data which is semistructured. Most of the existing research on XML clustering focus on a subset of features and ignore the others due the scalability and complexity problems. When handling these data, it was found that the structure and the content of XML documents plays different role and has an importance on the use and the purpose of datasets.