Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages

This research paper presents some statistical similarities and dissimilarities between the character frequencies of three languages: Malay, Indonesian and English. The reason for comparing them is that they all share a common symbol set (A-Z). It has been found through investigation that statis...
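As a rough illustration of the kind of measurement the abstract describes, the sketch below computes the percentage share of each letter A-Z in a text sample and lists the most frequent letters. It is not the authors' method or corpus: the function names `letter_frequencies` and `top_letters` and the sample string are hypothetical, chosen only for this example.

```python
from collections import Counter
import string


def letter_frequencies(text: str) -> dict[str, float]:
    """Return the percentage share of each letter A-Z in `text`."""
    # Fold case and keep only the 26 basic Latin letters (the shared symbol set).
    letters = [ch for ch in text.upper() if ch in string.ascii_uppercase]
    counts = Counter(letters)
    total = len(letters)
    if total == 0:
        # No letters at all: report zero for every symbol.
        return {ch: 0.0 for ch in string.ascii_uppercase}
    return {ch: 100.0 * counts[ch] / total for ch in string.ascii_uppercase}


def top_letters(text: str, n: int = 3) -> list[str]:
    """Return the `n` most frequent letters, ties broken alphabetically."""
    freqs = letter_frequencies(text)
    return sorted(freqs, key=lambda ch: (-freqs[ch], ch))[:n]


if __name__ == "__main__":
    # Hypothetical sample sentence, not taken from the paper's corpus.
    sample = "Saya suka membaca buku di perpustakaan"
    print(top_letters(sample))
    print(round(letter_frequencies(sample)["A"], 2))
```

Running the same functions over comparable Malay, Indonesian and English corpora would give per-letter percentages that can be compared side by side, which is the comparison the abstract reports.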


Bibliographic Details
Main Authors: Shah, Asadullah, Saidin, Aznan Zuhid, Alshaikhli, Imad Fakhri Taha, Zeki, Akram M., Bhatti, Zeeshan
Format: Conference or Workshop Item
Language: English
Published: 2013
Subjects:
Online Access: http://irep.iium.edu.my/37045/
http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf
id iium-37045
recordtype eprints
spelling iium-37045 2018-05-24T07:42:47Z http://irep.iium.edu.my/37045/ Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages Shah, Asadullah Saidin, Aznan Zuhid Alshaikhli, Imad Fakhri Taha Zeki, Akram M. Bhatti, Zeeshan T10.5 Communication of technical information This research paper presents some statistical similarities and dissimilarities between the character frequencies of three languages: Malay, Indonesian and English. The reason for comparing them is that they all share a common symbol set (A-Z). Investigation shows that the character frequencies of Malay and Indonesian are statistically very close to each other. For example, the characters "A", "N" and "E" in Malay and Indonesian have frequencies of (19%, 20.4%), (10%, 9.33%) and (9%, 8.28%), respectively. English is different: there, "E", "T" and "A" are the three most frequent letters, in that order. An interesting observation is that, despite these similarities and dissimilarities between individual characters, the frequency envelopes of all three languages follow the same rising and falling trend across all characters. Moreover, in all three languages the last four characters, "W", "X", "Y" and "Z", also exhibit the lowest usage. 2013 Conference or Workshop Item PeerReviewed application/pdf en http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf Shah, Asadullah and Saidin, Aznan Zuhid and Alshaikhli, Imad Fakhri Taha and Zeki, Akram M. and Bhatti, Zeeshan (2013) Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages. In: 2013 International Conference of Advanced Computer Science Applications and Technologies (ACSAT), 23-24 Dec. 2013, Kuching, Sarawak. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6836574&tag=1
repository_type Digital Repository
institution_category Local University
institution International Islamic University Malaysia
building IIUM Repository
collection Online Access
language English
topic T10.5 Communication of technical information
spellingShingle T10.5 Communication of technical information
Shah, Asadullah
Saidin, Aznan Zuhid
Alshaikhli, Imad Fakhri Taha
Zeki, Akram M.
Bhatti, Zeeshan
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
description This research paper presents some statistical similarities and dissimilarities between the character frequencies of three languages: Malay, Indonesian and English. The reason for comparing them is that they all share a common symbol set (A-Z). Investigation shows that the character frequencies of Malay and Indonesian are statistically very close to each other. For example, the characters "A", "N" and "E" in Malay and Indonesian have frequencies of (19%, 20.4%), (10%, 9.33%) and (9%, 8.28%), respectively. English is different: there, "E", "T" and "A" are the three most frequent letters, in that order. An interesting observation is that, despite these similarities and dissimilarities between individual characters, the frequency envelopes of all three languages follow the same rising and falling trend across all characters. Moreover, in all three languages the last four characters, "W", "X", "Y" and "Z", also exhibit the lowest usage.
format Conference or Workshop Item
author Shah, Asadullah
Saidin, Aznan Zuhid
Alshaikhli, Imad Fakhri Taha
Zeki, Akram M.
Bhatti, Zeeshan
author_facet Shah, Asadullah
Saidin, Aznan Zuhid
Alshaikhli, Imad Fakhri Taha
Zeki, Akram M.
Bhatti, Zeeshan
author_sort Shah, Asadullah
title Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
title_short Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
title_full Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
title_fullStr Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
title_full_unstemmed Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
title_sort similarities and dissimilarities between character frequencies of written text of melayu, english and indonesian languages
publishDate 2013
url http://irep.iium.edu.my/37045/
http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf
first_indexed 2023-09-18T20:53:08Z
last_indexed 2023-09-18T20:53:08Z
_version_ 1777410127337357312