ISSN (ONLINE) : 2045 -8711 ISSN (PRINT) : 2045 -869X
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY & CREATIVE ENGINEERING
JULY 2016 VOL- 6 NO - 7
NOVEMBER 2015 VOL- 5 NO - 11
@IJITCE Publication @IJITCE Publication
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
UK: Managing Editor International Journal of Innovative Technology and Creative Engineering 1a park lane, Cranford London TW59WA UK E-Mail: editor@ijitce.co.uk Phone: +44-773-043-0249 USA: Editor International Journal of Innovative Technology and Creative Engineering Dr. Arumugam Department of Chemistry University of Georgia GA-30602, USA. Phone: 001-706-206-0812 Fax:001-706-542-2626 India: Editor International Journal of Innovative Technology & Creative Engineering Dr. Arthanariee. A. M Finance Tracking Center India 66/2 East mada st, Thiruvanmiyur, Chennai -600041 Mobile: 91-7598208700
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
www.ijitce.co.uk
IJITCE PUBLICATION
International Journal of Innovative Technology & Creative Engineering Vol.6 No.7 July 2016
www.ijitce.co.uk
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
From Editor's Desk Dear Researcher, Greetings! Research article in this issue discusses about motivational factor analysis. Let us review research around the world this month. Knuckleballs baffle baseball players with their unpredictable swerves. A new study suggests a possible cause of the pitch’s erratic flight sudden changes in the drag force on a ball, due to a phenomenon called a drag crisis. The result is at odds with previous research that attributed the zigzags to the effect of airflow over the baseball’s seams. Scientists report the finding July 13 in the New Journal of Physics. Knuckleballs are well known in baseball, but similar phenomena also confound players in soccer and volleyball. Knuckleballs occur when balls sail through the air with very little spin, producing unstable flight. In drag crisis, the thin layer of air surrounding the ball flips between turbulent and smooth flow, abruptly changing the drag forces on the ball. If the transition occurs asymmetrically, it can push the ball to one side. Balls must move at a certain speed to experience a drag crisis, which may be why knuckleballs tend to be thrown slower than other pitches, the researchers suggest. While the fastest pitches can top 100 miles per hour, knuckleballs are usually closer to 60 or 70 miles per hour. Every day, millions of Americans with diabetes have to inject themselves with insulin to manage their bloodsugar levels. But less painful alternatives are emerging. Scientists are developing a new way of administering the medicine orally with tiny vesicles that can deliver insulin where it needs to go without a shot. Today, they share their in vivo testing results. The researchers are presenting their work at the 252nd National Meeting & Exposition of the American Chemical Society (ACS). We have developed a new technology called a Cholestosome. A Cholestosome is a neutral, lipid-based particle that is capable of doing some very interesting things. The biggest obstacle to delivering insulin orally is ushering it through the stomach intact. Proteins such as insulin are no match for the harsh, highly acidic environment of the stomach. They degrade before they get a chance to move into the intestines and then the bloodstream where they're needed. It has been an absolute pleasure to present you articles that you wish to read. We look forward to many more new technologies related research articles from you and your friends. We are anxiously awaiting the rich and thorough research papers that have been prepared by our authors for the next issue.
Thanks, Editorial Team IJITCE
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
Editorial Members Dr. Chee Kyun Ng Ph.D Department of Computer and Communication Systems, Faculty of Engineering,Universiti Putra Malaysia,UPMSerdang, 43400 Selangor,Malaysia. Dr. Simon SEE Ph.D Chief Technologist and Technical Director at Oracle Corporation, Associate Professor (Adjunct) at Nanyang Technological University Professor (Adjunct) at ShangaiJiaotong University, 27 West Coast Rise #08-12,Singapore 127470 Dr. sc.agr. Horst Juergen SCHWARTZ Ph.D, Humboldt-University of Berlin,Faculty of Agriculture and Horticulture,Asternplatz 2a, D-12203 Berlin,Germany Dr. Marco L. BianchiniPh.D Italian National Research Council; IBAF-CNR,Via Salaria km 29.300, 00015 MonterotondoScalo (RM),Italy Dr. NijadKabbaraPh.D Marine Research Centre / Remote Sensing Centre/ National Council for Scientific Research, P. O. Box: 189 Jounieh,Lebanon Dr. Aaron Solomon Ph.D Department of Computer Science, National Chi Nan University,No. 303, University Road,Puli Town, Nantou County 54561,Taiwan Dr. Arthanariee. A. M M.Sc.,M.Phil.,M.S.,Ph.D Director - Bharathidasan School of Computer Applications, Ellispettai, Erode, Tamil Nadu,India Dr. Takaharu KAMEOKA, Ph.D Professor, Laboratory of Food, Environmental & Cultural Informatics Division of Sustainable Resource Sciences, Graduate School of Bioresources,Mie University, 1577 Kurimamachiya-cho, Tsu, Mie, 514-8507, Japan Dr. M. Sivakumar M.C.A.,ITIL.,PRINCE2.,ISTQB.,OCP.,ICP. Ph.D. Project Manager - Software,Applied Materials,1a park lane,cranford,UK Dr. Bulent AcmaPh.D Anadolu University, Department of Economics,Unit of Southeastern Anatolia Project(GAP),26470 Eskisehir,TURKEY Dr. SelvanathanArumugamPh.D Research Scientist, Department of Chemistry, University of Georgia, GA-30602,USA.
Review Board Members Dr. Paul Koltun Senior Research ScientistLCA and Industrial Ecology Group,Metallic& Ceramic Materials,CSIRO Process Science & Engineering Private Bag 33, Clayton South MDC 3169,Gate 5 Normanby Rd., Clayton Vic. 3168, Australia Dr. Zhiming Yang MD., Ph. D. Department of Radiation Oncology and Molecular Radiation Science,1550 Orleans Street Rm 441, Baltimore MD, 21231,USA Dr. Jifeng Wang Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign Urbana, Illinois, 61801, USA
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
Dr. Giuseppe Baldacchini ENEA - Frascati Research Center, Via Enrico Fermi 45 - P.O. Box 65,00044 Frascati, Roma, ITALY. Dr. MutamedTurkiNayefKhatib Assistant Professor of Telecommunication Engineering,Head of Telecommunication Engineering Department,Palestine Technical University (Kadoorie), TulKarm, PALESTINE. Dr.P.UmaMaheswari Prof &Head,Depaartment of CSE/IT, INFO Institute of Engineering,Coimbatore. Dr. T. Christopher, Ph.D., Assistant Professor &Head,Department of Computer Science,Government Arts College(Autonomous),Udumalpet, India. Dr. T. DEVI Ph.D. Engg. (Warwick, UK), Head,Department of Computer Applications,Bharathiar University,Coimbatore-641 046, India. Dr. Renato J. orsato Professor at FGV-EAESP,Getulio Vargas Foundation,São Paulo Business School,RuaItapeva, 474 (8° andar),01332-000, São Paulo (SP), Brazil Visiting Scholar at INSEAD,INSEAD Social Innovation Centre,Boulevard de Constance,77305 Fontainebleau - France Y. BenalYurtlu Assist. Prof. OndokuzMayis University Dr.Sumeer Gul Assistant Professor,Department of Library and Information Science,University of Kashmir,India Dr. ChutimaBoonthum-Denecke, Ph.D Department of Computer Science,Science& Technology Bldg., Rm 120,Hampton University,Hampton, VA 23688 Dr. Renato J. Orsato Professor at FGV-EAESP,Getulio Vargas Foundation,São Paulo Business SchoolRuaItapeva, 474 (8° andar),01332-000, São Paulo (SP), Brazil Dr. Lucy M. Brown, Ph.D. Texas State University,601 University Drive,School of Journalism and Mass Communication,OM330B,San Marcos, TX 78666 JavadRobati Crop Production Departement,University of Maragheh,Golshahr,Maragheh,Iran VineshSukumar (PhD, MBA) Product Engineering Segment Manager, Imaging Products, Aptina Imaging Inc. Dr. Binod Kumar PhD(CS), M.Phil.(CS), MIAENG,MIEEE HOD & Associate Professor, IT Dept, Medi-Caps Inst. of Science & Tech.(MIST),Indore, India Dr. S. B. Warkad Associate Professor, Department of Electrical Engineering, Priyadarshini College of Engineering, Nagpur, India Dr. doc. Ing. RostislavChoteborský, Ph.D. Katedramateriálu a strojírenskétechnologieTechnickáfakulta,Ceskázemedelskáuniverzita v Praze,Kamýcká 129, Praha 6, 165 21 Dr. Paul Koltun Senior Research ScientistLCA and Industrial Ecology Group,Metallic& Ceramic Materials,CSIRO Process Science & Engineering Private Bag 33, Clayton South MDC 3169,Gate 5 Normanby Rd., Clayton Vic. 3168 DR.ChutimaBoonthum-Denecke, Ph.D Department of Computer Science,Science& Technology Bldg.,HamptonUniversity,Hampton, VA 23688
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61 Mr. Abhishek Taneja B.sc(Electronics),M.B.E,M.C.A.,M.Phil., Assistant Professor in the Department of Computer Science & Applications, at Dronacharya Institute of Management and Technology, Kurukshetra. (India). Dr. Ing. RostislavChotěborský,ph.d, Katedramateriálu a strojírenskétechnologie, Technickáfakulta,Českázemědělskáuniverzita v Praze,Kamýcká 129, Praha 6, 165 21
Dr. AmalaVijayaSelvi Rajan, B.sc,Ph.d, Faculty – Information Technology Dubai Women’s College – Higher Colleges of Technology,P.O. Box – 16062, Dubai, UAE Naik Nitin AshokraoB.sc,M.Sc Lecturer in YeshwantMahavidyalayaNanded University Dr.A.Kathirvell, B.E, M.E, Ph.D,MISTE, MIACSIT, MENGG Professor - Department of Computer Science and Engineering,Tagore Engineering College, Chennai Dr. H. S. Fadewar B.sc,M.sc,M.Phil.,ph.d,PGDBM,B.Ed. Associate Professor - Sinhgad Institute of Management & Computer Application, Mumbai-BangloreWesternly Express Way Narhe, Pune - 41 Dr. David Batten Leader, Algal Pre-Feasibility Study,Transport Technologies and Sustainable Fuels,CSIRO Energy Transformed Flagship Private Bag 1,Aspendale, Vic. 3195,AUSTRALIA Dr R C Panda (MTech& PhD(IITM);Ex-Faculty (Curtin Univ Tech, Perth, Australia))Scientist CLRI (CSIR), Adyar, Chennai - 600 020,India Miss Jing He PH.D. Candidate of Georgia State University,1450 Willow Lake Dr. NE,Atlanta, GA, 30329 Jeremiah Neubert Assistant Professor,MechanicalEngineering,University of North Dakota Hui Shen Mechanical Engineering Dept,Ohio Northern Univ. Dr. Xiangfa Wu, Ph.D. Assistant Professor / Mechanical Engineering,NORTH DAKOTA STATE UNIVERSITY SeraphinChallyAbou Professor,Mechanical& Industrial Engineering Depart,MEHS Program, 235 Voss-Kovach Hall,1305 OrdeanCourt,Duluth, Minnesota 55812-3042 Dr. Qiang Cheng, Ph.D. Assistant Professor,Computer Science Department Southern Illinois University CarbondaleFaner Hall, Room 2140-Mail Code 45111000 Faner Drive, Carbondale, IL 62901 Dr. Carlos Barrios, PhD Assistant Professor of Architecture,School of Architecture and Planning,The Catholic University of America Y. BenalYurtlu Assist. Prof. OndokuzMayis University Dr. Lucy M. Brown, Ph.D. Texas State University,601 University Drive,School of Journalism and Mass Communication,OM330B,San Marcos, TX 78666 Dr. Paul Koltun Senior Research ScientistLCA and Industrial Ecology Group,Metallic& Ceramic Materials CSIRO Process Science & Engineering
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61 Dr.Sumeer Gul Assistant Professor,Department of Library and Information Science,University of Kashmir,India Dr. ChutimaBoonthum-Denecke, Ph.D Department of Computer Science,Science& Technology Bldg., Rm 120,Hampton University,Hampton, VA 23688
Dr. Renato J. Orsato Professor at FGV-EAESP,Getulio Vargas Foundation,São Paulo Business School,RuaItapeva, 474 (8° andar)01332-000, São Paulo (SP), Brazil Dr. Wael M. G. Ibrahim Department Head-Electronics Engineering Technology Dept.School of Engineering Technology ECPI College of Technology 5501 Greenwich Road Suite 100,Virginia Beach, VA 23462 Dr. Messaoud Jake Bahoura Associate Professor-Engineering Department and Center for Materials Research Norfolk State University,700 Park avenue,Norfolk, VA 23504 Dr. V. P. Eswaramurthy M.C.A., M.Phil., Ph.D., Assistant Professor of Computer Science, Government Arts College(Autonomous), Salem-636 007, India. Dr. P. Kamakkannan,M.C.A., Ph.D ., Assistant Professor of Computer Science, Government Arts College(Autonomous), Salem-636 007, India. Dr. V. Karthikeyani Ph.D., Assistant Professor of Computer Science, Government Arts College(Autonomous), Salem-636 008, India. Dr. K. Thangadurai Ph.D., Assistant Professor, Department of Computer Science, Government Arts College ( Autonomous ), Karur - 639 005,India. Dr. N. Maheswari Ph.D., Assistant Professor, Department of MCA, Faculty of Engineering and Technology, SRM University, Kattangulathur, Kanchipiram Dt - 603 203, India. Mr. Md. Musfique Anwar B.Sc(Engg.) Lecturer, Computer Science & Engineering Department, Jahangirnagar University, Savar, Dhaka, Bangladesh. Mrs. Smitha Ramachandran M.Sc(CS)., SAP Analyst, Akzonobel, Slough, United Kingdom. Dr. V. Vallimayil Ph.D., Director, Department of MCA, Vivekanandha Business School For Women, Elayampalayam, Tiruchengode - 637 205, India. Mr. M. Moorthi M.C.A., M.Phil., Assistant Professor, Department of computer Applications, Kongu Arts and Science College, India PremaSelvarajBsc,M.C.A,M.Phil Assistant Professor,Department of Computer Science,KSR College of Arts and Science, Tiruchengode Mr. G. Rajendran M.C.A., M.Phil., N.E.T., PGDBM., PGDBF., Assistant Professor, Department of Computer Science, Government Arts College, Salem, India. Dr. Pradeep H Pendse B.E.,M.M.S.,Ph.d Dean - IT,Welingkar Institute of Management Development and Research, Mumbai, India Muhammad Javed Centre for Next Generation Localisation, School of Computing, Dublin City University, Dublin 9, Ireland Dr. G. GOBI Assistant Professor-Department of Physics,Government Arts College,Salem - 636 007 Dr.S.Senthilkumar Post Doctoral Research Fellow, (Mathematics and Computer Science & Applications),UniversitiSainsMalaysia,School of Mathematical Sciences,
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61 Pulau Pinang-11800,[PENANG],MALAYSIA. Manoj Sharma Associate Professor Deptt. of ECE, PrannathParnami Institute of Management & Technology, Hissar, Haryana, India RAMKUMAR JAGANATHAN Asst-Professor,Dept of Computer Science, V.L.B Janakiammal college of Arts & Science, Coimbatore,Tamilnadu, India Dr. S. B. Warkad Assoc. Professor, Priyadarshini College of Engineering, Nagpur, Maharashtra State, India Dr. Saurabh Pal Associate Professor, UNS Institute of Engg. & Tech., VBS Purvanchal University, Jaunpur, India Manimala Assistant Professor, Department of Applied Electronics and Instrumentation, St Joseph’s College of Engineering & Technology, Choondacherry Post, Kottayam Dt. Kerala -686579 Dr. Qazi S. M. Zia-ul-Haque Control Engineer Synchrotron-light for Experimental Sciences and Applications in the Middle East (SESAME),P. O. Box 7, Allan 19252, Jordan Dr. A. Subramani, M.C.A.,M.Phil.,Ph.D. Professor,Department of Computer Applications, K.S.R. College of Engineering, Tiruchengode - 637215 Dr. SeraphinChallyAbou Professor, Mechanical & Industrial Engineering Depart. MEHS Program, 235 Voss-Kovach Hall, 1305 Ordean Court Duluth, Minnesota 55812-3042 Dr. K. Kousalya Professor, Department of CSE,Kongu Engineering College,Perundurai-638 052 Dr. (Mrs.) R. Uma Rani Asso.Prof., Department of Computer Science, Sri Sarada College For Women, Salem-16, Tamil Nadu, India. MOHAMMAD YAZDANI-ASRAMI Electrical and Computer Engineering Department, Babol"Noshirvani" University of Technology, Iran. Dr. Kulasekharan, N, Ph.D Technical Lead - CFD,GE Appliances and Lighting, GE India,John F Welch Technology Center,Plot # 122, EPIP, Phase 2,Whitefield Road,Bangalore – 560066, India. Dr. Manjeet Bansal Dean (Post Graduate),Department of Civil Engineering,Punjab Technical University,GianiZail Singh Campus,Bathinda -151001 (Punjab),INDIA Dr. Oliver Jukić Vice Dean for education,Virovitica College,MatijeGupca 78,33000 Virovitica, Croatia Dr. Lori A. Wolff, Ph.D., J.D. Professor of Leadership and Counselor Education,The University of Mississippi,Department of Leadership and Counselor Education, 139 Guyton University, MS 38677
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL. 6 NO.7 JULY 2016 IMPACT FACTOR: 0.61
Contents A Novel Technique for Prediction of Diabetic Patients Data Using Naives Bayes Classification in WEKA Tool S.Nivetha & Dr.A.Geetha ……………….…………………………………….[363]
www.ijitce.co.uk
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL.6 NO.7 JULY 2016, IMPACT FACTOR:0.61
A Novel Technique for Prediction of Diabetic Patients Data Using Naives Bayes Classification in WEKA Tool S.Nivetha Assistant Professor, Department of Computer Science, Kamban Arts and Science College, Pollachi, Tamil Nadu, India. Email: nive0501@gmail.com Dr.A.Geetha Assistant Professor, PG & Research Department of Computer Science, Chikkanna Govt. Arts College, Tirupur, Tamil Nadu, India. Email: gee_sam@yahoo.com Abstract - Data mining is a process of extracting information from a dataset and transform it into understandable structure for further use, also it discovers patterns in large data sets. Data mining has number of techniques such as pre-processing, classification. Classification is a technique used for predicting group for the diabetic dataset instance. In this paper, classification on diabetes database are developed classifiers are compared with the result based on certain parameters using WEKA tool. India is suffering from diabetes patients of the population with equal rates in both women and men resulted in deaths with worldwide. A comparative analysis has been performed with the classifiers which result in the chance of diabetic patients getting heart disease. The performances compared with precision, recall, F-measure, Kappa statistic, root mean square and time seconds build the model has exhibited a great overall performance. Keywords- Data Mining, Diabetic Data, RF, NB, WEKA.
1. INTRODUCTION Today, information has a great value and the amount of information has been expansively growing during last few years. Especially, text databases are rapidly growing due to the increasing amount of information available in electronic forms. Generally, data mining is the process of analyzing data from different perspectives and summarizing it into useful information. That type of Information can be used to increase revenue, cuts costs or both. It allows users to analyze data from many different dimensions or angles, categorize it and summarize the relationships identified. Data mining has attracted a great deal of attention in the information industry and in society as a whole in recent years, due to the wide availability of huge amounts of data and the imminent need for turning such data into useful information and knowledge. Generally, data mining is the process of analyzing data from different perspectives and summarizing it into useful information. 2. REVIEW OF LITERATURE In this research work, discussed about the methodology for feature extraction and document classification. In order to propose this works are have analyzed various Literatures
363
which are very much relevant and helpful to do this work. The literature where are have retrieved and analyzed are presented in the following section. Thangaraju [1] et al., proposed a preclusion and discovery of skin melanoma risk using clustering techniques. The skin melanoma patients data are gathered from different diagnostic centre which contains both cancer and non-cancer patient’s information. The gathered data are pre-processed and then clustered using K-means algorithm for separating relevant and non- relevant data to skin melanoma. It is implemented in c#.net to predict skin melanoma risk level with suggestions which is easier, cost reducible and time savable. Mohd Fauzi bin Othman [2] et al., examined the performance of different classification and clustering methods for a set of bulk data. The algorithm and methods tested are Bayes Network, Radial Basis Function, Pruned Tree, Single Conjunctive Rule Learner and Nearest Neighbors Algorithm. Bharat Chaudharil [3] et al., analyzed the comparison of three major clustering algorithms are K- Means, Hierarchical clustering and Density based clustering algorithm. The performances of these three algorithms are compared based on the feature of correctly class wise clustering. The performance of these three clustering algorithms is compared using a Data mining tool WEKA. Amandeep Kaur Mann [4] et al., presented a clustering and its different techniques in data mining is done. Kawsar Ahmed [11] et.al proposed a system to detect the Lung cancer risk. Their proposed system was easy, cost effective and time saving. The data are collected from different diagnosis centres. The collected data are pre-processed and clustered using K-means algorithm. Then Apriori and Decision tree algorithm are used to find significant frequent pattern. Then they developed a significant frequent pattern tool for lung cancer prediction system. Rajalingam [6] et al., presented a comparative study of implementation of hierarchical clustering algorithms agglomerative and divisive clustering for various attributes. The Visual Programming Language is used for implementation of these algorithms. The result of this paper study is the performance of divisive algorithm works as twice as fast as the agglomerative algorithm.
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL.6 NO.7 JULY 2016, IMPACT FACTOR:0.61 Khaled Hammouda [5] et al., presented the reviews of four off-line clustering algorithms are K-means clustering, Fuzzy C-means clustering, Mountain clustering, and Subtractive clustering. The algorithms are implemented and tested against a medical problem of heart disease diagnosis. The accuracy and performance are compared. Aastha Joshi [7] et al., proposed a brief review of six different types of clustering techniques are K-means clustering, Hierarchical clustering, DBSCAN clustering, OPTICS, and STING. Manish Verma et al., [8] proposed a analysis of six types of clustering techniques are k-Means Clustering, Hierarchical Clustering, DBSCAN clustering, Density Based Clustering, Optics and EM Algorithm. WEKA tool is used for implemented and analyzed. Shraddha K.Popat et.al [9] focused on different clustering techniques. They are Partition algorithms, Hierarchical algorithms, Density based clustering algorithm. The result was hierarchical clustering can be perform better than the other techniques. Pradeep Rai et al., [10] presented a survey is to provide a comprehensive review of different clustering techniques in data mining.
Fig 3.2 Diabetic Attributes Selected in WEKA 4.1 EXISTING METHODOLOGY 4.1.1 J48 Pruned Tree J48 is a module for generating a pruned or unpruned C4.5 decision tree. When we applied J48 onto refreshed data, got the results shown as below on Figure.
3. DATA MINING TOOL An open-source development model usually means that the tool is a result of a community effort, not necessary supported by a single institution but instead the result of contributions from an international and informal development team. This development style offers a means of incorporating the diverse experiences. The open source tools available for data mining. 3.1 WEKA Waikato Environment for Knowledge Analysis. Weka is a collection of machine learning algorithms for data mining tasks. These algorithms can either be applied directly to a data set or can be called from your own Java code. Weka contains a collection of several tools for visualization and algorithms for analytics of data and predictive modeling, together with graphical user interfaces for easy access to this functionality. Fig 4.1 J48 Diabetic Dataset Classifier Output
Fig 3.1 Explorer in WEKA Fig 4.2 J48 Diabetic Dataset Classifier Accuracy
364
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL.6 NO.7 JULY 2016, IMPACT FACTOR:0.61 4.1.2 Random Forest Random forest is an algorithm that consists of many decision trees. It was first developed by Leo Breiman and Adele Cutler [Bre01]. The idea behind it is to build several trees, to have the instance classified by each tree, and to give a vote at each class. The model uses a bagging approach and the random selection of features to build a collection of decision trees with controlled variance. The instances class is to the class with the highest number of votes, the class that occurs the most within the leaf in which the instance is placed. By using trees that classify the instances with low error the error rate of the forest decreases. The correlation and strength of the forest increases with the number m of variables selected. A smaller m returns a smaller correlation and strength. To improve the prediction’s accuracy, a bootstrap method is used to create different trees. Every time a tree is created, one-third of the bootstrap sample is kept aside to estimate the test error. The subset that is not included in the tree construction is used as a test set for the same tree. Once a tree is built, it is used to predict all data and this allows computing proximities. Every time two instances fall into the same node, it increases the proximities. This computation is done for each tree and proximities can be used to replace missing values.
Fig 4.3 Random Forest Diabetic Dataset Classifier Output
4.2 PROPOSED METHODOLOGY Classification is the process of finding a model that describes and distinguishes data classes or concepts, for the able purpose of being to use the model to predict the class of objects whose class label is unknown. It is a technique which is used to predict group membership for data instances. Classification is a two step process, first, it builds classification model using training data. Every object of the dataset must be pre-classified and the second the model generated in the preceding step is tested by assigning class labels to data objects in a test dataset. Here using diabetes dataset now a days the percentage of diabetes patient is growing very fast. India accounts for the largest number of people suffering from diabetes in the world. The diabetes in country's population is likely to be affected from the disease. It is estimated that every five person with diabetes will be an Indian. It means that India has highest number of diabetes in any one of the country in the world. The attributes predict whether a person having diabetes or not. 4.2.1 Naive Bayes Approach Naive Bayes classifier as a term dealing with a simple probabilistic classifier based on application of Bayes theorem with strong independence assumptions. It assumes that the presence or absence of particular feature of a class is unrelated to the presence or absence of any other feature. It is based on conditional probabilities. It uses Bayes' theorem which finds the probability of an event occurring, given the probability of another event that has already occurred. An advantage of the Naive Bayes classifier is that it requires only a small amount of training data to estimate the parameters necessary for classification. Since independent variables are assumed, only the variances of the variables for each class need to be determined. It can be used for both binary and multi class classification problems. Naive Bayes data mining classifier technique has been applied which produces an optimal prediction model using minimum training set to predict the chances of diabetic patient getting heart disease. The diagnosis of diseases plays vital role in medical field. Using diabetic’s diagnosis, the proposed system predicts attributes such as age, sex, blood pressure and blood sugar and the chances of a diabetic patient getting a heart disease. It should be noted that the attributes used in our proposed method are those used for diagnosis of diabetes and are not direct indicators of heart disease. Each algorithm requires submission of data in a specified format.
Fig 4.4 Random Forest Diabetic Dataset Classifier Accuracy
Fig 4.6 Naive Bayes Diabetic Dataset Classifier Accuracy 365
INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND CREATIVE ENGINEERING (ISSN:2045-8711) VOL.6 NO.7 JULY 2016, IMPACT FACTOR:0.61 5. EXPERIMENTATION & RESULTS 5.1 Performance evaluation To measure the performance sensitivity, accuracy and specificity are used. TP is true positive, FP is false positive, TN is true negative and FN is false negative. TPR is true positive rate, which is equivalent to Recall. Sensitivity= Specificity=
True Positive Rate ...... 1 ( True Positive + False Negative)
True Negative Rate ...... 2 ( False Positive + True Negative)
5.2 False Positive Rate The ratio of false positives to false positive plus true negatives. It can be defined as False Positive Rate=
False Positive ...... 3 ( False Positive + True Negative)
5.3 Precision Precision is calculated as number of correctly classified instances can be defined as Precision=
True Positive ...... 4 ( True Positive + False Positive)
5.4 F-Measure F-measure is combining recall and precision scores into a single measure of performance. F-Measure=
2*recall*precision ...... 5 (recall + precision) J48 tree
Random Forest
Naive Bayes Classifier
768
768
768
TP Rate FP Rate Precision Recall
0.936 0.336 0.839 0.936
1.000 0.001 1.000 1.000
0.842 0.384 0.803 0.842
F-Measure
0.885
1.000
0.822
Kappa statistic
0.6319
1
0.4674
Mean absolute error
0.2383
0.114
0.2811
Root mean squared Time taken
0.3452 0.02 sec
0.1506 0.53 sec
0.4133 0.01 sec
Methods / Parameters Number of Instances
Table.5.1 Comparison Results From the above table 5.1 shows the performance of naive bayes classifier. The fig.5.1 shows comparison graphical representation of methods. The method can over perform the traditional method with classify recall rate of 0.842.
Fig.5.1 Comparison Results 6. CONCLUSION Data mining plays a important role in extracting the hidden information in the diabetic data base. The preprocessing is used in order to improve the quality of
the data. This model is built based as a test case on the diabetic dataset. The experiment has been successfully performed with several data mining classifier techniques and naives bayes gives a better performance than other existing methods. To improve the quality of health care of diabetes patients can be implemented using WEKA tool. In this paper, considered time taken to build a model by the algorithms as a parameter. REFERENCES 1. P.Thangaraju, B.Deepa, “A Case study on Perclusion and Discovery of Skin Melanoma Risk using Clustering Techniques”, International Journal of Advanced Research in Electronics & Communication Engineering, Vol.3,Iss.7, 2014. 2. Mohd Fauzi bin Othman, Thomas Moh Shan Yau, “Comparison ofDifferent Classification Techniques using WEKA for BreastCancer”, F.Ibrahim, N.A. Abu Osman, J. Usman and N.A. Kadri: Biomed 06, IFMBE Proceedings 15, pp.520-523, 2007. 3. Bharat Chaudharil, Manan Parikh, “A Comparative Study of clustering algorithms using weka tools”, International Journal of Application or Innovation in Engineering & Management, Volume 1, Issue 2, 2012. 4. Amandeep Kaur Mann and Navneet Kaur, “Survey Paper on Clustering Techniques”, International Journal of Science, Engineering and Technology Research (IJSETR) Volume 2, Issue 4, 2013. 5. Khaled Hammouda, Prof. Fakhreddine Karray, “A Comparative Study of Data Clustering Techniques”, University of Waterloo, Ontario, Canada. 6. N. Rajalingam, K. Ranjini, “Hierarchical Clustering Algorithm, A Comparative Study”, International Journal of Computer Applications, Vol.19, No 3, 2011. 7. Aastha Joshi, Rajneet Kaur,“A Review: Comparative Study of Various Clustering Techniques in Data Mining”, International journal of Advanced Research in Computer Science & Software Engineering, Vol.3, Iss.3, 2013. 8. Manish Verma, “A Comparative Study of Various Clustering Algorithms in Data Mining” International Journal of Engineering Research and Applications, Vol. 2, Issue 3, pp.1379-1384,2012. 9. Shraddha K. Popat , “Review and Comparative Study of Clustering Techniques” International Journal of Computer Science & Information Technologies,Vol.5(1), 805-812, 2014. 10. K. Ruth Ramya, “A Class Based Approach for Medical Classification of Chest Pain”, International Journal of Engineering Trends and Technology, Vol. 3, Issue.2, pp.89-93, 2012. 11. Ru Xu, “Survey of Clustering Algorithms”, IEEE Transactions on Neural Networks, Vol.16, No 3, 2005. 12. A.Geetha,G.M.Nasira, “Rainfall Prediction using Logistic Regression Technique”,CiiT International Journal of Artficial Intelligence Systems and Machine Learning,Vol.6,No.7,pp.246-250,2014. 13. Sripriya, A.Geetha, “Cyclone Storm Prediction using KNN Algorithm”, Indian Journal of Engineering, Discovery Publication,12(30), pp.350-354,2015.
366
NOVEMBER 2015 VOL- 5 NO - 11
@IJITCE Publication @IJITCE Publication