Survey of data mining techniques for social

Page 1

A Survey of Data Mining Techniques for Social Network Analysis Mariam Adedoyin-Olowe1, Mohamed Medhat Gaber1 and Frederic Stahl2 1

School of Computing Science and Digital Media, Robert Gordon University Aberdeen, AB10 7QB, UK 2 School of Systems Engineering, University of Reading PO Box 225, Whiteknights, Reading, RG6 6AY, UK

Abstract. Social network has gained remarkable attention in the last decade. Accessing social network sites such as Twitter, Facebook LinkedIn and Google+ through the internet and the web 2.0 technologies has become more affordable. People are becoming more interested in and relying on social network for information, news and opinion of other users on diverse subject matters. The heavy reliance on social network sites causes them to generate massive data characterised by three computational issues namely; size, noise and dynamism. These issues often make social network data very complex to analyse manually, resulting in the pertinent use of computational means of analysing them. Data mining provides a wide range of techniques for detecting useful knowledge from massive datasets like trends, patterns and rules [44]. Data mining techniques are used for information retrieval, statistical modelling and machine learning. These techniques employ data pre-processing, data analysis, and data interpretation processes in the course of data analysis. This survey discusses different data mining techniques used in mining diverse aspects of the social network over decades going from the historical techniques to the up-to-date models, including our novel technique named TRCM. All the techniques covered in this survey are listed in the Table.1 including the tools employed as well as names of their authors.

Keywords: Social Network, Social Network Analysis, Data Mining Techniques 1.

Introduction

Social network is a term used to describe web-based services that allow individuals to create a public/semi-public profile within a domain such that they can communicatively connect with other users within the network [22]. Social network has improved on the concept and technology of Web 2.0, by enabling the formation and exchange of User-Generated Content [46]. Simply put, social network is a graph consisting of nodes and links used to represent social relations on social network sites [17]. The nodes include entities and the relationships between them forms the links (as presented in Fig. 1).

Fig. 1. Social Network showing nodes and links


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
Survey of data mining techniques for social by Dani Hasan - Issuu