Deriving skyline points over dynamic and incomplete databases

The rapid growth of data is inevitable, and retrieving the best results that meet the user’s preferences is essential. To achieve this, skylines were introduced in which data items that are not dominated by the other data items in the database are retrieved as results (skylines). In most of the exis...

Full description

Bibliographic Details
Main Authors: Babanejad, Ghazaleh, Ibrahim, Hamidah, Udzir, Nur Izura, Sidi, Fatimah, Aljuboori, Ali A.Alwan
Format: Conference or Workshop Item
Language:English
English
Published: Universiti Utara Malaysia(UUM) 2017
Subjects:
Online Access:http://irep.iium.edu.my/57268/
http://irep.iium.edu.my/57268/
http://irep.iium.edu.my/57268/1/54268_Deriving%20skyline%20points.pdf
http://irep.iium.edu.my/57268/7/57268%20Deriving%20skyline%20points%20over%20dynamic%20and%20incomplete%20databases%20WOS.pdf
Description
Summary:The rapid growth of data is inevitable, and retrieving the best results that meet the user’s preferences is essential. To achieve this, skylines were introduced in which data items that are not dominated by the other data items in the database are retrieved as results (skylines). In most of the exist-ing skyline approaches, the databases are assumed to be static and complete. However, in real world scenario, databases are not complete especially in multidimensional databases in which some dimensions may have missing values. The databases might also be dynamic in which new data items are inserted while existing data items are deleted or updated. Blindly performing pairwise comparisons on the whole data items after the changes are made is inappropriate as not all data items need to be compared in identifying the skylines. Thus, a novel skyline algorithm, DInSkyline, is proposed in this study which finds the most relevant data items in dynamic and incomplete databases. Several experiments have been conducted and the results show that DInSkyline outperforms the previous works by reducing the number of pairwise comparisons in the range of 52% to 73%.