Search In this Thesis
   Search In this Thesis  
العنوان
Co-Spatial Hadoop :
المؤلف
Nasr, Mariam Malak Fahmy.
هيئة الاعداد
باحث / مريم ملاك فهمى نصر
مشرف / مجدى حسين محمود راتب ناجى
مشرف / ايمان ابراهيم أحمد الغندور
ielghand@yahoo.com
مناقش / نجوى مصطفى إسماعيل المكى
مناقش / صالح عبد الشكورشهابى
الموضوع
Computer and Systems Sciences.
تاريخ النشر
2014.
عدد الصفحات
66 p. :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
الهندسة (متفرقات)
تاريخ الإجازة
1/12/2014
مكان الإجازة
جامعة الاسكندريه - كلية الهندسة - هندسة الحاسبات و النظم
الفهرس
Only 14 pages are availabe for public view

from 79

from 79

Abstract

The recent evolution of ubiquitous positioning technologies has led to the rapid growth of the usage of spatial data by various systems, such as geographic information systems (GIS) and location based services (LBS). The spatial queries of these systems are com-plex and computationally intensive, this caused the need of handling them using high performance platforms.
MapReduce and its open source implementation Hadoop, have been extensively used by large enterprises in processing large-scale data sets in parallel and therefore, they are suitable for performing the intensive computations of spatial data. However, Hadoop is not aware with the spatial properties of spatial data, which is why computation benefits of considering the spatial properties of the data when processing, may be wasted by this limitation. This introduces the need of extending Hadoop to be aware of spatial features of the data in order to obtain better performance.
Several research work have proposed approaches to extend Hadoop to support spatial data. For example, SpatialHadoop extends Hadoop to add support to spatial data and spatial queries. This is achieved through the following: (1) using spatial indexes to effi-ciently access the spatial data stored on the Hadoop distributed file system (HDFS) and (2) adding an operation layer that supports spatial operation.
This thesis introduces Co-SpatialHadoop; it is a Hadoop-based spatial approach and an extension for SpatialHadoop platform. It enhances the network usage of spatial com-plex queries such as “spatial join” by colocating related spatial files together on HDFS, which may be joined later. It enhances the saving operation of HDFS by introducing a new spatial placement policy which colocates data using spatial attributes of spatial records.
Co-SpatialHadoop also builds inverted indexes using non-spatial attributes of data records to enhance the response time of non-spatial queries. It adds non-spatial opera-tions to operation layer of SpatialHadoop. These modules are implemented using the functionality of MapReduce.