Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain

The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theor...

Full description

Bibliographic Details
Main Authors: Suhaimi Ab Rahman, Nazlia Omar
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2017
Online Access:http://journalarticle.ukm.my/11840/
http://journalarticle.ukm.my/11840/
http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf
id ukm-11840
recordtype eprints
spelling ukm-118402018-07-09T04:05:59Z http://journalarticle.ukm.my/11840/ Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain Suhaimi Ab Rahman, Nazlia Omar, The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theory and concept of detection for the head and modifier in a compound noun that have been discussed by language experts. Additionally, the existing research is also limited especially in the areas of computational linguistics. Therefore, research should be conducted to identify appropriate methods especially used in the detection of head and modifier which appear in Malay setences from the cultural heritage domain. The aim of this study is to construct a list of heuristic rules to be used for detecting the position of compound nouns in Malay sentences from cultural heritage domain. By using 15 rules, the position of head and modifier that exist in a compound noun can also be detected. These rules are called heuristic rules. The purpose of formulating these 15 rules is to detect the head and modifier that exist in the Malay sentences from the cultural heritage domain. To measure the accuracy of the results, precision, recall and F1-score values are used. Based on the results of the experiments, Sentence Structure of Malay Cultural Heritage Domain (SADWBM) have an F1-score of 80.4% compared to Noun Phrase Structure (SFN) which is 56%. Consequently, SADWBM shows better scores compared to SFN. Therefore it is clear that the approach used in this study is effective in resolving the identified problems. Penerbit Universiti Kebangsaan Malaysia 2017-06 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf Suhaimi Ab Rahman, and Nazlia Omar, (2017) Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain. Asia-Pacific Journal of Information Technology and Multimedia, 6 (1). pp. 13-21. ISSN 2289-2192 http://ejournal.ukm.my/apjitm/issue/view/899
repository_type Digital Repository
institution_category Local University
institution Universiti Kebangasaan Malaysia
building UKM Institutional Repository
collection Online Access
language English
description The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theory and concept of detection for the head and modifier in a compound noun that have been discussed by language experts. Additionally, the existing research is also limited especially in the areas of computational linguistics. Therefore, research should be conducted to identify appropriate methods especially used in the detection of head and modifier which appear in Malay setences from the cultural heritage domain. The aim of this study is to construct a list of heuristic rules to be used for detecting the position of compound nouns in Malay sentences from cultural heritage domain. By using 15 rules, the position of head and modifier that exist in a compound noun can also be detected. These rules are called heuristic rules. The purpose of formulating these 15 rules is to detect the head and modifier that exist in the Malay sentences from the cultural heritage domain. To measure the accuracy of the results, precision, recall and F1-score values are used. Based on the results of the experiments, Sentence Structure of Malay Cultural Heritage Domain (SADWBM) have an F1-score of 80.4% compared to Noun Phrase Structure (SFN) which is 56%. Consequently, SADWBM shows better scores compared to SFN. Therefore it is clear that the approach used in this study is effective in resolving the identified problems.
format Article
author Suhaimi Ab Rahman,
Nazlia Omar,
spellingShingle Suhaimi Ab Rahman,
Nazlia Omar,
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
author_facet Suhaimi Ab Rahman,
Nazlia Omar,
author_sort Suhaimi Ab Rahman,
title Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
title_short Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
title_full Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
title_fullStr Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
title_full_unstemmed Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
title_sort heuristics-based method for head and modifier detection in malay sentences from the cultural heritage domain
publisher Penerbit Universiti Kebangsaan Malaysia
publishDate 2017
url http://journalarticle.ukm.my/11840/
http://journalarticle.ukm.my/11840/
http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf
first_indexed 2023-09-18T20:01:16Z
last_indexed 2023-09-18T20:01:16Z
_version_ 1777406864105930752