Development of language identification system using MFCC and vector quantization

This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and...

Full description

Bibliographic Details
Main Authors: Gunawan, Teddy Surya, Husain, Rashida, Kartiwi, Mira
Format: Conference or Workshop Item
Language:English
Published: 2017
Subjects:
Online Access:http://irep.iium.edu.my/60070/
http://irep.iium.edu.my/60070/
http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf
id iium-60070
recordtype eprints
spelling iium-600702017-12-14T06:43:49Z http://irep.iium.edu.my/60070/ Development of language identification system using MFCC and vector quantization Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira TK Electrical engineering. Electronics Nuclear engineering This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate. 2017 Conference or Workshop Item NonPeerReviewed application/pdf en http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf Gunawan, Teddy Surya and Husain, Rashida and Kartiwi, Mira (2017) Development of language identification system using MFCC and vector quantization. In: 4th IEEE International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA) 2017, 28th-30th November 2017, Putrajaya. (Unpublished) http://icsima.ieeemy-ims.org/17/
repository_type Digital Repository
institution_category Local University
institution International Islamic University Malaysia
building IIUM Repository
collection Online Access
language English
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Gunawan, Teddy Surya
Husain, Rashida
Kartiwi, Mira
Development of language identification system using MFCC and vector quantization
description This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate.
format Conference or Workshop Item
author Gunawan, Teddy Surya
Husain, Rashida
Kartiwi, Mira
author_facet Gunawan, Teddy Surya
Husain, Rashida
Kartiwi, Mira
author_sort Gunawan, Teddy Surya
title Development of language identification system using MFCC and vector quantization
title_short Development of language identification system using MFCC and vector quantization
title_full Development of language identification system using MFCC and vector quantization
title_fullStr Development of language identification system using MFCC and vector quantization
title_full_unstemmed Development of language identification system using MFCC and vector quantization
title_sort development of language identification system using mfcc and vector quantization
publishDate 2017
url http://irep.iium.edu.my/60070/
http://irep.iium.edu.my/60070/
http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf
first_indexed 2023-09-18T21:25:09Z
last_indexed 2023-09-18T21:25:09Z
_version_ 1777412141988446208