Text normalization algorithm for facebook chats in Hausa language
The rapid increase in using non-standard words (NSWs) in communication through the social media is causing difficulties in understanding contents of the text messages. In addition, it affects the performance of several natural language processing (NLP) task such as machine translation, informat...
Main Authors: | , , , , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English English |
Published: |
IEEE
2014
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/40213/ http://irep.iium.edu.my/40213/ http://irep.iium.edu.my/40213/ http://irep.iium.edu.my/40213/11/40213-Text%20normalization%20algorithm%20for%20facebook%20chats%20in%20hausa%20language-edited.pdf http://irep.iium.edu.my/40213/12/40213-Text%20normalization%20algorithm%20for%20facebook%20chats%20in%20hausa%20language_SCOPUS.pdf |
Summary: | The rapid increase in using non-standard words
(NSWs) in communication through the social media is causing
difficulties in understanding contents of the text messages. In
addition, it affects the performance of several natural language
processing (NLP) task such as machine translation,
information retrievals, summarization and etc. In this study,
we present an automatic text normalization system on
Facebook chatting based on Hausa language. The proposed
algorithm manually developed dictionary that employ
normalization of each non-standard word with its equivalent
standard word. This is accomplished through modification of
the technique employed by [1] to fit Hausa NSWs' formation.
It was found that our proposed algorithm was able to
normalized Hausa NSWs with an accuracy of 100%. The
results of this research can facilitate comprehensive
communication via Facebook using Hausa language. |
---|