USING OBJECTIVE WORDS IN THE REVIEWS TO IMPROVE THE COLLOQUIAL ARABIC SENTIMENT ANALYSIS

Page 1

International Journal on Natural Language Computing (IJNLC) Vol. 6, No.3, June 2017

USING OBJECTIVE WORDS IN THE REVIEWS TO IMPROVE THE COLLOQUIAL ARABIC SENTIMENT ANALYSIS Omar Al-Harbi Department of Computer and Information, Community College, Jazan University, Jazan, KSA

ABSTRACT One of the main difficulties in sentiment analysis of the Arabic language is the presence of the colloquialism. In this paper, we examine the effect of using objective words in conjunction with sentimental words on sentiment classification for the colloquial Arabic reviews, specifically Jordanian colloquial reviews. The reviews often include both sentimental and objective words; however, the most existing sentiment analysis models ignore the objective words as they are considered useless. In this work, we created two lexicons: the first includes the colloquial sentimental words and compound phrases, while the other contains the objective words associated with values of sentiment tendency based on a particular estimation method. We used these lexicons to extract sentiment features that would be training input to the Support Vector Machines (SVM) to classify the sentiment polarity of the reviews. The reviews dataset have been collected manually from JEERAN website. The results of the experiments show that the proposed approach improves the polarity classification in comparison to two baseline models, with accuracy 95.6%.

KEYWORDS Arabic sentiment analysis, opinion mining, colloquial Arabic language, colloquial Jordanian reviews

1. INTRODUCTION Measuring the satisfaction and obtaining the feedback from users have always been the concern of companies that offer services or products to make decisions that would improve their business. Years ago, this process was hard, but with the advancement in web 2.0 platforms such as forums, blogs, and social media including their extensive data of opinions made that easier [1]. Platforms that allow the customers to express their opinions and emotions in reviews about the products have become a reliable source of feedback to both companies and customers. These reviews can make the process of satisfaction detection for companies easier and can influence on customer’s decision making whether to buy a product [2]. However, the vast amount of reviews that are published in digital form about products makes it harder for companies and customers to find the true underlying sentiment about the products without an automatic mechanism. Therefore, there was a need for systems that can automatically perform the process of sentiment analysis or opinion mining in the reviews. Sentiment analysis or opinion mining field is a task of Natural Language Processing (NLP) which aims to automatically detect the subjective information in textual information and predict its sentiment polarity [3]. Recently, the attention is being paid to the sentiment analysis and its applications in different domains such as finance, economic, healthcare, etc. The core function of sentiment analysis process is assigning positive, negative, or neutral polarities to opinionated texts [4]. In general, sentiment analysis has been investigated at three levels [5]: document level, sentence level, and entity and Aspect level. The approaches of sentiment classification that have DOI: 10.5121/ijnlc.2017.6301

1


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.