Image based static hand gesture recognition as user input using self organizing maps / Fakhzan Firdaus Abdullah
Gesture interaction is one of the aspects of HCI that has gathered a lot of attention in the recent years. Apart from the mouse, no significant HCI technologies have garnered attention in the consumer market. Gesture in humans for example are movements or symbols that are made by using any part o...
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2006
|
Subjects: | |
Online Access: | http://ir.uitm.edu.my/id/eprint/1435/ http://ir.uitm.edu.my/id/eprint/1435/1/TB_FAKHZAN%20FIRDAUS%20ABDULLAH%20CS%2006_5%20P01.pdf |
Summary: | Gesture interaction is one of the aspects of HCI that has gathered a lot of attention
in the recent years. Apart from the mouse, no significant HCI technologies have garnered
attention in the consumer market. Gesture in humans for example are movements or symbols that
are made by using any part of the human body that acts as way of communication. This project is
focusing on the development of a symbolic hand gesture recognition prototype based the shape
of the hand using Neural Network by means of the Kohonen's Self Organizing Map (SOM) to be
used as computer input. For this project, several sets of hand gestures are collected from different
people. The images are preprocessed by scaling, cleaning and converting into a format that is
easy to be input into the neural network, namely binary. In the neural network design, each of the
bits that represents the image will be input into the SOM network. The SOM method of
identification is by reducing the dimensions of inputs and grouping similar patterns together.
Therefore, by adjusting the weights of each neuron path, the images used to frain the network
will group into similar pattern groups. Testing is then done by inputting a different set of images
from the frained images and identify whether the correct gesture is identified. After the
identification, commands to the Windows console are executed according to the recognized
image that is associated with the gesture. In the preprocessing side, the system has proven to
normalize images quickly and accurately. However, in the SOM Network, with the lack of
trained images, the accuracy rate of the system is at most 25%, with variable results on each
training. Nonetheless, since the system is using a viable and tested Kohonen engine, the system
can be improved with a more extensive collection of images for training. Moreover, albeit the
lack of accuracy, each of the programs associated with the gestures are executed flawlessly |
---|