Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages
This research paper present some statistical similarities and dissimilarities between the character frequencies of three languages, Malay, Indonesian and English. Thew reason for their comparison is that they all share a common symbol set ?(A-Z). It has been found, through investigations that statis...
Main Authors: | , , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/37045/ http://irep.iium.edu.my/37045/ http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf |
id |
iium-37045 |
---|---|
recordtype |
eprints |
spelling |
iium-370452018-05-24T07:42:47Z http://irep.iium.edu.my/37045/ Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages Shah, Asadullah Saidin, Aznan Zuhid Alshaikhli, Imad Fakhri Taha Zeki, Akram M. Bhatti, Zeeshan T10.5 Communication of technical information This research paper present some statistical similarities and dissimilarities between the character frequencies of three languages, Malay, Indonesian and English. Thew reason for their comparison is that they all share a common symbol set ?(A-Z). It has been found, through investigations that statistically Malay and Indonesian languages character frequencies are very close to each other. For example, character "A" "N", and "E" in both Malay and Indonesian languages have frequencies (19%, 20.4%), (Q10%, 9.33%) and (9%, 8.28%), respectively. However the case of English is different, where characters "E", "T" and "A" come with three highest frequencies occurring letters, respectively. An intresting observation is that in spite of some similarities and dissimilarities between the characters, all three languages follow envelop of the frequencies identically rising and falling trend for all characters. Moreover, for all three languages, last four characters, W, x,y,z" , also exhibit lowest usage in their respective languages. 2013 Conference or Workshop Item PeerReviewed application/pdf en http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf Shah, Asadullah and Saidin, Aznan Zuhid and Alshaikhli, Imad Fakhri Taha and Zeki, Akram M. and Bhatti, Zeeshan (2013) Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages. In: 2103 International Conference of Advanced Computer Science Applications and Technologies (ACSAT), 23-24 Dec. 2013, Kuching, Sarawak. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6836574&tag=1 |
repository_type |
Digital Repository |
institution_category |
Local University |
institution |
International Islamic University Malaysia |
building |
IIUM Repository |
collection |
Online Access |
language |
English |
topic |
T10.5 Communication of technical information |
spellingShingle |
T10.5 Communication of technical information Shah, Asadullah Saidin, Aznan Zuhid Alshaikhli, Imad Fakhri Taha Zeki, Akram M. Bhatti, Zeeshan Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
description |
This research paper present some statistical similarities and dissimilarities between the character frequencies of three languages, Malay, Indonesian and English. Thew reason for their comparison is that they all share a common symbol set ?(A-Z). It has been found, through investigations that statistically Malay and Indonesian languages character frequencies are very close to each other. For example, character "A" "N", and "E" in both Malay and Indonesian languages have frequencies (19%, 20.4%), (Q10%, 9.33%) and (9%, 8.28%), respectively. However the case of English is different, where characters "E", "T" and "A" come with three highest frequencies occurring letters, respectively. An intresting observation is that in spite of some similarities and dissimilarities between the characters, all three languages follow envelop of the frequencies identically rising and falling trend for all characters. Moreover, for all three languages, last four characters, W, x,y,z" , also exhibit lowest usage in their respective languages. |
format |
Conference or Workshop Item |
author |
Shah, Asadullah Saidin, Aznan Zuhid Alshaikhli, Imad Fakhri Taha Zeki, Akram M. Bhatti, Zeeshan |
author_facet |
Shah, Asadullah Saidin, Aznan Zuhid Alshaikhli, Imad Fakhri Taha Zeki, Akram M. Bhatti, Zeeshan |
author_sort |
Shah, Asadullah |
title |
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
title_short |
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
title_full |
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
title_fullStr |
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
title_full_unstemmed |
Similarities and dissimilarities between character frequencies of written text of Melayu, English and Indonesian languages |
title_sort |
similarities and dissimilarities between character frequencies of written text of melayu, english and indonesian languages |
publishDate |
2013 |
url |
http://irep.iium.edu.my/37045/ http://irep.iium.edu.my/37045/ http://irep.iium.edu.my/37045/1/similarities-dissimilarities-2014.pdf |
first_indexed |
2023-09-18T20:53:08Z |
last_indexed |
2023-09-18T20:53:08Z |
_version_ |
1777410127337357312 |