Cluster Analysis of Data Points using Partitioning and Probabilistic Model-based Algorithms
Exploring the dataset features through the application of clustering algorithms is a viable means by which the conceptual description of such data can be revealed for better understanding, grouping and decision making. Some clustering algorithms, especially those that are partitioned-based, clusters...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation of Computer Science (FCS)
2014
|
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/6418/ http://umpir.ump.edu.my/id/eprint/6418/ http://umpir.ump.edu.my/id/eprint/6418/ http://umpir.ump.edu.my/id/eprint/6418/1/Cluster_Analysis_of_Data_Points_using_Partitioning_and_Probabilistic_Model-based_Algorithms.pdf |
Summary: | Exploring the dataset features through the application of clustering algorithms is a viable means by which the conceptual description of such data can be revealed for better understanding, grouping and decision making. Some clustering algorithms, especially those that are partitioned-based, clusters any data presented to them even if similar features do not present. This study explores the performance accuracies of partitioning-based algorithms and probabilistic model-based algorithm. Experiments were conducted using k-means, k-medoids and EM-algorithm. The study implements each algorithm using RapidMiner Software and the results generated was validated for correctness in accordance to the concept of external criteria method. The clusters formed revealed the capability and drawbacks of each algorithm on the data points. |
---|