Bangla speech-to-text conversion using SAPI

Speech is the most natural form of communication and interaction between humans; whereas, text and symbols are the most common form of transaction in computer systems. Therefore, interest regarding conversion between speech and text is increasing day by day for speech oriented human-computer inter...

Full description

Bibliographic Details
Main Authors: Sultana, Shaheena, Akhand, M. A. H, Das, Prodip Kumar, Rahman, M.M. Hafizur
Format: Conference or Workshop Item
Language:English
Published: 2012
Subjects:
Online Access:http://irep.iium.edu.my/24980/
http://irep.iium.edu.my/24980/1/1164C.pdf
id iium-24980
recordtype eprints
spelling iium-249802012-09-06T06:52:06Z http://irep.iium.edu.my/24980/ Bangla speech-to-text conversion using SAPI Sultana, Shaheena Akhand, M. A. H Das, Prodip Kumar Rahman, M.M. Hafizur TK7885 Computer engineering Speech is the most natural form of communication and interaction between humans; whereas, text and symbols are the most common form of transaction in computer systems. Therefore, interest regarding conversion between speech and text is increasing day by day for speech oriented human-computer interaction. Microsoft Corporation developed Speech Application Program Interface (SAPI) for speech related works in its Windows operating systems that includes features for only eight languages including English. So, the aim of this study is to investigate Speech-to-Text (STT) conversion using SAPI for Bangla language. Bangla is an important language with a rich heritage; 21st February is declared as the International Mother Language day by UNESCO to respect the language martyrs for the language in Bangladesh at the year of 1952. We managed SAPI to match pronunciation from continuous Bangla speech in precompiled grammar file of SAPI and SAPI returned Bangla words in English character if matches occur. The words are then used to fetch Bangla words from database and return words in true Bangla characters and to complete the sentences. Several English words for particular Bangla word in the grammar file of SAPI is found to overcome tone variation of persons as well as pronunciation variation in language communities and shown to improve overall performance of the system. Experimental study is carried out for the technique on an article from a news paper and the recognition rate was approximately 78% on an average. Although achieved performance is promising for STT related studies, we identified several elements to improve the performance and might give better accuracy. The theme of this study will also be helpful for other languages for Speech-to-Text conversion and similar tasks. 2012-07-03 Conference or Workshop Item PeerReviewed application/pdf en http://irep.iium.edu.my/24980/1/1164C.pdf Sultana, Shaheena and Akhand, M. A. H and Das, Prodip Kumar and Rahman, M.M. Hafizur (2012) Bangla speech-to-text conversion using SAPI. In: International Conference on Computer and Communication Engineering (ICCCE 2012), 3-5 July 2012, Seri Pacific Hotel Kuala Lumpur.
repository_type Digital Repository
institution_category Local University
institution International Islamic University Malaysia
building IIUM Repository
collection Online Access
language English
topic TK7885 Computer engineering
spellingShingle TK7885 Computer engineering
Sultana, Shaheena
Akhand, M. A. H
Das, Prodip Kumar
Rahman, M.M. Hafizur
Bangla speech-to-text conversion using SAPI
description Speech is the most natural form of communication and interaction between humans; whereas, text and symbols are the most common form of transaction in computer systems. Therefore, interest regarding conversion between speech and text is increasing day by day for speech oriented human-computer interaction. Microsoft Corporation developed Speech Application Program Interface (SAPI) for speech related works in its Windows operating systems that includes features for only eight languages including English. So, the aim of this study is to investigate Speech-to-Text (STT) conversion using SAPI for Bangla language. Bangla is an important language with a rich heritage; 21st February is declared as the International Mother Language day by UNESCO to respect the language martyrs for the language in Bangladesh at the year of 1952. We managed SAPI to match pronunciation from continuous Bangla speech in precompiled grammar file of SAPI and SAPI returned Bangla words in English character if matches occur. The words are then used to fetch Bangla words from database and return words in true Bangla characters and to complete the sentences. Several English words for particular Bangla word in the grammar file of SAPI is found to overcome tone variation of persons as well as pronunciation variation in language communities and shown to improve overall performance of the system. Experimental study is carried out for the technique on an article from a news paper and the recognition rate was approximately 78% on an average. Although achieved performance is promising for STT related studies, we identified several elements to improve the performance and might give better accuracy. The theme of this study will also be helpful for other languages for Speech-to-Text conversion and similar tasks.
format Conference or Workshop Item
author Sultana, Shaheena
Akhand, M. A. H
Das, Prodip Kumar
Rahman, M.M. Hafizur
author_facet Sultana, Shaheena
Akhand, M. A. H
Das, Prodip Kumar
Rahman, M.M. Hafizur
author_sort Sultana, Shaheena
title Bangla speech-to-text conversion using SAPI
title_short Bangla speech-to-text conversion using SAPI
title_full Bangla speech-to-text conversion using SAPI
title_fullStr Bangla speech-to-text conversion using SAPI
title_full_unstemmed Bangla speech-to-text conversion using SAPI
title_sort bangla speech-to-text conversion using sapi
publishDate 2012
url http://irep.iium.edu.my/24980/
http://irep.iium.edu.my/24980/1/1164C.pdf
first_indexed 2023-09-18T20:37:21Z
last_indexed 2023-09-18T20:37:21Z
_version_ 1777409134569717760