A proposed method for the semantic Similarity use WordNet to handle the ambiguity in Social Media text
The semantic similarity between two concepts is widely used in natural language processing. In this article, we propose a method using WordNet 3.1 to determine the similarity based on feature combinations. This work focuses on overcoming the ambiguity in social media text via the selection of inform...
Main Authors: | , , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English English |
Published: |
Springer Singapore
2019
|
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/27882/ http://umpir.ump.edu.my/id/eprint/27882/ http://umpir.ump.edu.my/id/eprint/27882/1/127.%20A%20proposed%20method%20for%20the%20semantic%20Similarity.pdf http://umpir.ump.edu.my/id/eprint/27882/2/127.1%20A%20proposed%20method%20for%20the%20semantic%20Similarity.pdf |
Summary: | The semantic similarity between two concepts is widely used in natural language processing. In this article, we propose a method using WordNet 3.1 to determine the similarity based on feature combinations. This work focuses on overcoming the ambiguity in social media text via the selection of informative features to improve semantic representation. In addition, this research uses social media as its research domain used in this work, and the study is only limited to the politic dataset. A feature-based method is applied to predict the outcome and
improve the performance of the proposed method depending on factors related to the fidelity,
continuity, and balance of knowledge sources in WordNet 3.1. Semantic similarity measurements among words are insufficient and unbalanced features. However, this study presents a semantic similarity measure of a feature-based method in 1WordNet 3.1 to determine the similarity between two concepts/words depending on the selected features used to measure their similarity, which is also known as a “noun” and “is-a” relations-based method. We evaluate our proposed method using the data set in Agirre et al. (2009) ( 2AG203) and compare our results of our new method as which three of methods taxonomy relation, non-taxonomy and Glosses with those of related studies. The correlation with human judgments is subjective and low based on our results was a better. Experimental results show that our new method significantly outperforms other existing computational methods with the following results: r = 0.73%, p = 0.69%, m = 0.71% and nonzero = 0.95%. |
---|