Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms

Exploring the dataset features through the application of clustering algorithms is a viable means by which the conceptual description of such data can be revealed for better understanding, grouping and decision making. Some clustering algorithms, especially those that are partitioned-based, clusters...

Full description

Bibliographic Details
Main Authors: Raheem, Ajiboye Adeleke, Hauwau, Isah-Kebbe, O., Oladele Tinuke
Format: Article
Language:English
Published: Foundation of Computer Science (FCS) 2014
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/6418/
http://umpir.ump.edu.my/id/eprint/6418/
http://umpir.ump.edu.my/id/eprint/6418/
http://umpir.ump.edu.my/id/eprint/6418/1/Cluster_Analysis_of_Data_Points_using_Partitioning_and_Probabilistic_Model-based_Algorithms.pdf
Description
Summary:Exploring the dataset features through the application of clustering algorithms is a viable means by which the conceptual description of such data can be revealed for better understanding, grouping and decision making. Some clustering algorithms, especially those that are partitioned-based, clusters any data presented to them even if similar features do not present. This study explores the performance accuracies of partitioning-based algorithms and probabilistic model-based algorithm. Experiments were conducted using k-means, k-medoids and EM-algorithm. The study implements each algorithm using RapidMiner Software and the results generated was validated for correctness in accordance to the concept of external criteria method. The clusters formed revealed the capability and drawbacks of each algorithm on the data points.