International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.5, September 2019
CUSTOMER OPINIONS EVALUATION: A CASESTUDY ON ARABIC TWEETS Manal Mostafa Ali Al-Azhar University, Faculty of Engineering Computer& System Engineering, Egypt
ABSTRACT This paper presents an automatic method for extracting, processing, and analysis of customer opinions on Arabic social media. We present a four-step approach for mining of Arabic tweets. First, Natural Language Processing (NLP) with different types of analyses had performed. Second, we present an automatic and expandable lexicon for Arabic adjectives. The initial lexicon is built using 1350 adjectives as seeds from processing of different datasets in Arabic language. The lexicon is automatically expanded by collecting synonyms and morphemes of each word through Arabic resources and google translate. Third, emotional analysis was considered by two different methods; Machine Learning (ML) and rulebased method. Finally, Feature Selection (FS) is also considered to enhance the mining results. The experimental results reveal that the proposed method outperforms counterpart ones with an improvement margin of up to 4% using F-Measure.
KEYWORDS Opinion Mining - Arabic - Bag of Words - Feature Selection - Emotions- Adjective Lexicons.
1. INTRODUCTION Mining customer’s opinions aid in gauging reactions, targeting advertising, and evaluation of public voters' opinions. Besides reputation management and public relations, one could perform trend prediction in sales or other relevant data. Hence, by polling of this information, we can give quantitative indications of customer’s positive or negative attitude about products, services or business [1]. In general, extracting useful patterns and detecting customer feedback from natural language is challenging and could be time consuming for several reasons. It is difficult to distinguish between objective and subjective information. News itself can be generally classified as good or bad news without being subjective [2].Text Classification (TC) also requires deeper analysis and understanding of textual features [3]. In opinion texts, lexical content alone can be misleading. Furthermore, most of the current studies related to this topic focus mainly on English texts with very limited resources available for other languages such as Arabic [4] [5]. Most of the recent work in Arabic TC is not yet releasing their resources [6] [7]. Several studies share the same weak points such as using a few features for opinion mining. Some of these attempts are based on statistical approaches applied on Bag of Words (BoW) as in [4]. Most of them neglect semantic analysis as in as in [8], and [9]. Others consider semantic features but ignore morphological information as in [10]. Some systems don’t handle negation that inverts the statement classification as in [4] [8]. Another type of researches do not pay attention to emotions such as [5] [10] and depends only on the linguistic information. As a result, they lack a common framework which combines genre types of features. Lexical features only capture DOI: 10.5121/ijaia.2019.10503
25