Utilizing hierarchical extreme learning machine based reinforcement learning for object sorting

Bibliographic Details
Main Authors: AlDahoul, Nouar, Htike@Muhammad Yusof, Zaw Zaw
Format: Article
Language: English
Published: Institute of Advanced Science Extension (IASE) 2019
Online Access: http://irep.iium.edu.my/69667/
http://irep.iium.edu.my/69667/1/69667_Utilizing%20hierarchical%20extreme%20learning%20machine.pdf
http://irep.iium.edu.my/69667/2/69667_Utilizing%20hierarchical%20extreme%20learning%20machine_WOS.pdf
Description
Summary: Automatic, intelligent object sorting uses a robot arm to carry each object from one location to another without human intervention. The objects vary in colour, shape, size and orientation. Many applications, such as fruit and vegetable grading, flower grading, and biopsy image grading, depend on sorting for a structured arrangement. Traditional machine learning methods for this task rely on handcrafted features, which are sometimes not discriminative because of environmental factors such as changes in lighting. In this study, a Hierarchical Extreme Learning Machine (HELM) is used as an unsupervised feature learner that learns directly from the object observations, and HELM was found to be robust against external changes. Reinforcement learning (RL) is used to find the optimal sorting policy, which maps each object image to the object’s location. RL is used because this automatic task lacks output labels. Learning proceeds sequentially over many episodes, and sorting accuracy increases across episodes until it reaches its maximum at the end of learning. The experimental results demonstrated that, after many episodes, the proposed HELM-RL sorting achieves the same accuracy as the supervised HELM method trained with labels.
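
The pipeline described in the summary (unsupervised features, then a policy learned from reward alone) can be illustrated with a small sketch. This is not the authors' implementation, and the summary does not say which RL algorithm or network sizes they used. Everything below is an assumption made for illustration: the function names (elm_autoencoder_features, train_sorting_policy, run_demo) are hypothetical, a single ridge-regularised ELM autoencoder layer stands in for the full HELM, an epsilon-greedy contextual bandit with linear action values stands in for the RL component, and synthetic feature vectors stand in for object images.

```python
import numpy as np

def elm_autoencoder_features(X, n_hidden=16, reg=1e-2, rng=None):
    """One ELM autoencoder layer (a simplified stand-in for HELM).

    The input weights are random and never trained; only the output
    weights beta are fitted, in closed form, so that H @ beta
    reconstructs X. No labels are involved.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    d = X.shape[1]
    W = rng.standard_normal((d, n_hidden)) / np.sqrt(d)  # random input weights
    b = rng.standard_normal(n_hidden)                    # random biases
    H = np.tanh(X @ W + b)                               # hidden activations
    # Ridge regression: beta has shape (n_hidden, d)
    beta = np.linalg.solve(H.T @ H + reg * np.eye(n_hidden), H.T @ X)
    F = X @ beta.T                                       # learned representation
    # Unit-normalise rows so the step size below stays stable (assumption)
    return F / (np.linalg.norm(F, axis=1, keepdims=True) + 1e-12)

def train_sorting_policy(F, reward_fn, n_bins, episodes=30, eps=0.2, lr=0.3, rng=None):
    """Epsilon-greedy learning of a linear action-value function.

    reward_fn(i, a) plays the role of the environment: it returns 1.0 if
    object i was dropped in the right bin and 0.0 otherwise. The learner
    never sees the correct bin directly.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    Q = np.zeros((F.shape[1], n_bins))          # one weight column per bin
    history = []
    for ep in range(episodes):
        eps_ep = eps * (1.0 - ep / episodes)    # explore less in later episodes
        total = 0.0
        for i in rng.permutation(len(F)):
            q = F[i] @ Q                        # estimated value of each bin
            if rng.random() < eps_ep:
                a = int(rng.integers(n_bins))   # explore a random bin
            else:
                a = int(np.argmax(q))           # exploit the current estimate
            r = reward_fn(i, a)
            Q[:, a] += lr * (r - q[a]) * F[i]   # move the estimate toward the reward
            total += r
        history.append(total / len(F))          # sorting accuracy in this episode
    return Q, history

def run_demo(seed=0, n_per_bin=100, n_bins=3, dim=8):
    """Synthetic 'objects': one Gaussian cluster per bin (toy data, not images)."""
    rng = np.random.default_rng(seed)
    centers = 2.0 * rng.standard_normal((n_bins, dim))
    hidden_bin = np.repeat(np.arange(n_bins), n_per_bin)  # known only to the environment
    X = centers[hidden_bin] + 0.2 * rng.standard_normal((len(hidden_bin), dim))
    F = elm_autoencoder_features(X, rng=rng)
    reward_fn = lambda i, a: 1.0 if a == hidden_bin[i] else 0.0
    Q, history = train_sorting_policy(F, reward_fn, n_bins, rng=rng)
    greedy_acc = float(np.mean(np.argmax(F @ Q, axis=1) == hidden_bin))
    return history, greedy_acc

if __name__ == "__main__":
    history, greedy_acc = run_demo()
    print("first episode accuracy:", history[0])
    print("last episode accuracy:", history[-1])
    print("greedy policy accuracy:", greedy_acc)
```

In this sketch the correct bin is used only inside reward_fn, which mirrors the summary's point that the learner has no output labels and must improve its sorting accuracy episode by episode from feedback alone.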