Aijrstem15 596

Page 1

International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research)

ISSN (Print): 2279-0063 ISSN (Online): 2279-0071

International Journal of Software and Web Sciences (IJSWS) www.iasir.net Association Mining in Social Networks 1 Sanjeev Dhawan, 2Tamanna Faculty of Computer Science & Engineering, 2Student of M.Tech. (Software Engineering) 1,2 University Institute of Engineering and Technology, Kurukshetra University, Kurukshetra, Haryana, INDIA. __________________________________________________________________________________________ Abstract: WWW opened flood gate to many new concept like emails, net meetings, online advertisement and social media. Social media has revolutionarized the world by bringing lacks of people on a single platform ans sharing their contents. SNS sites like facebook, twitter, google+ etc have millions of people and billions of articles posted by these people on the social media. The performance of these sites is a critical issue in the cut throat competition. There novace association mining technique normally used in mining SNS data. We can create frequent item sets of articles which are shared amongst different set of people. The common technologies in association mining are apriori and fp tree algorithm. Different modifications of these algorithms have been proposed by different researchers each having its own merits. This research proposes technique which is hybrid of apriori and fp split tree. Fp split tree performs better than fp tree avoiding recursive creation of subtree. Apriori growth has efficient mining in comparison to fp tree or fp split tree. So, we propose itemsets to be fp split tree. So, we propose itemsets to be generated and stored in fp split tree and then uses apriori growth for mining the experimental result confirm the effectiveness of proposed technique against traditional techniques. 1

Keywords: SNS, Frequent Patterns, Association mining, Apriori growth, FP-Split algorithm, Access Logs. ____________________________________________________________________________________________________

I. Introduction Web is a vast, explosive, diverse, dynamic and mostly unstructured data repository, which supplies an incredible amount of information, and also raises the complexity of how to deal with the information from different perspectives of users, view, web service providers, business analysts.[2] The users want to have effective search tools to find relevant information easily as well as precisely. The Web service providers want to find the best way to predict the users’ behaviors and personalize information to reduce the traffic load and design the Web site that is suited for different group of users. For on-line social networks analysis, the analysis targets are mainly focused on resources from the web, such as its content, structures and the user behaviors. Using Apriori algorithm for web usage mining is a novel technique. Therefore, association rules can help to discover the hidden relationships between nodes in a social network or even cross networks. For e.g.: the person who read or liked A’s blog article and B’s blog article. ● The design of the whole site (interface, content, structure, usability, etc.) is one of the most important aspects for an institution that wants to survive in the cyberspace. ● Understand the way user browses the site and find out which is the most frequently used link and pattern of using the features available in the site. II. Related work Recently, A.Gupta, et al. in 2014 described that Web usage mining focused and analyzed the useful patterns on websites. It was the technique to finding the facts and figures to better serve the need of web based applications. It was classified into three categories are web server, server and application level data. Apriori Algorithm generated the association rules to work on databases that contain lots of transactions. With the use of Apriori algorithm and Improved frequent pattern tree algorithm gave comparison of memory usage and speed of producing association rules. Drawback: In Apriori Algorithm, candidate generation sets large no. of usage patterns before each scan and it made costly.FP Tree Algorithm lacked a better candidate generation method. Jun Yang et al. in 2013, proposed an improved algorithm named Featured Apriori Algorithm. In the database all the transactions were equally important. Traditional Apriori Algorithm was not designed for looking a particular item. Due to mining all the association rules, it generated many redundancies. In this paper an improved traditional based algorithm was proposed. Every Transaction items had its own features, during mining the association rules of the same featured items would be scanned and computed. After analysis in a book recommendation system, it took less time and got more reasonable association rules. Lastly, Shipra Khare et al. in 2013 in their research stated that web usage mining is the application of data mining techniques to discover interesting usage patterns from web data, to better serve the needs of Web-based applications. This paper implemented the process of Web Usage Mining using basic association rules algorithm called as Apriori Algorithm. Web usage mining architecture divided process in two main parts- the first part included preprocessing, data integration components and transaction identification. Second part included the largely domain independent application of generic data mining and pattern matching. It also contains an efficient improved iterative FP Tree algorithm for generating frequent

AIJRSTEM 15-596; © 2015, AIJRSTEM All Rights Reserved

Page 187


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.