Development of language identification system using MFCC and vector quantization
This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and...
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf |
id |
iium-60070 |
---|---|
recordtype |
eprints |
spelling |
iium-600702017-12-14T06:43:49Z http://irep.iium.edu.my/60070/ Development of language identification system using MFCC and vector quantization Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira TK Electrical engineering. Electronics Nuclear engineering This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate. 2017 Conference or Workshop Item NonPeerReviewed application/pdf en http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf Gunawan, Teddy Surya and Husain, Rashida and Kartiwi, Mira (2017) Development of language identification system using MFCC and vector quantization. In: 4th IEEE International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA) 2017, 28th-30th November 2017, Putrajaya. (Unpublished) http://icsima.ieeemy-ims.org/17/ |
repository_type |
Digital Repository |
institution_category |
Local University |
institution |
International Islamic University Malaysia |
building |
IIUM Repository |
collection |
Online Access |
language |
English |
topic |
TK Electrical engineering. Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering. Electronics Nuclear engineering Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira Development of language identification system using MFCC and vector quantization |
description |
This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate. |
format |
Conference or Workshop Item |
author |
Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira |
author_facet |
Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira |
author_sort |
Gunawan, Teddy Surya |
title |
Development of language identification system using MFCC and vector quantization |
title_short |
Development of language identification system using MFCC and vector quantization |
title_full |
Development of language identification system using MFCC and vector quantization |
title_fullStr |
Development of language identification system using MFCC and vector quantization |
title_full_unstemmed |
Development of language identification system using MFCC and vector quantization |
title_sort |
development of language identification system using mfcc and vector quantization |
publishDate |
2017 |
url |
http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf |
first_indexed |
2023-09-18T21:25:09Z |
last_indexed |
2023-09-18T21:25:09Z |
_version_ |
1777412141988446208 |