A proposed method for the semantic Similarity use WordNet to handle the ambiguity in Social Media text

The semantic similarity between two concepts is widely used in natural language processing. In this article, we propose a method using WordNet 3.1 to determine the similarity based on feature combinations. This work focuses on overcoming the ambiguity in social media text via the selection of inform...

Full description

Bibliographic Details
Main Authors: Ali Muttaleb, Hasan, Noorhuzaimi@Karimah, Mohd Noor, Rassem, Taha H., Shahrul Azman, Mohamed Noah, Ahmed Muttaleb, Hasan
Format: Conference or Workshop Item
Language:English
English
Published: Springer Singapore 2019
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/27882/
http://umpir.ump.edu.my/id/eprint/27882/
http://umpir.ump.edu.my/id/eprint/27882/1/127.%20A%20proposed%20method%20for%20the%20semantic%20Similarity.pdf
http://umpir.ump.edu.my/id/eprint/27882/2/127.1%20A%20proposed%20method%20for%20the%20semantic%20Similarity.pdf
Description
Summary:The semantic similarity between two concepts is widely used in natural language processing. In this article, we propose a method using WordNet 3.1 to determine the similarity based on feature combinations. This work focuses on overcoming the ambiguity in social media text via the selection of informative features to improve semantic representation. In addition, this research uses social media as its research domain used in this work, and the study is only limited to the politic dataset. A feature-based method is applied to predict the outcome and improve the performance of the proposed method depending on factors related to the fidelity, continuity, and balance of knowledge sources in WordNet 3.1. Semantic similarity measurements among words are insufficient and unbalanced features. However, this study presents a semantic similarity measure of a feature-based method in 1WordNet 3.1 to determine the similarity between two concepts/words depending on the selected features used to measure their similarity, which is also known as a “noun” and “is-a” relations-based method. We evaluate our proposed method using the data set in Agirre et al. (2009) ( 2AG203) and compare our results of our new method as which three of methods taxonomy relation, non-taxonomy and Glosses with those of related studies. The correlation with human judgments is subjective and low based on our results was a better. Experimental results show that our new method significantly outperforms other existing computational methods with the following results: r = 0.73%, p = 0.69%, m = 0.71% and nonzero = 0.95%.