Tr 00086

IDL - International Digital Library Of Technology & Research Volume 1, Issue 5, May 2017

Available at: www.dbpublications.org

International e-Journal For Technology And Research-2017

Prediction of heart disease using classification mining technique on spark Rashmi G Saboji Computer Science &Engineering C.M.R Institute of Technology Bangalore, India rashmikaneri@gmail.com

This paper identifies the increasing health care data which is being accumulated digitally every day. The healthcare industry is becoming very data intensive. Worldwide digital healthcare data is estimated to be equal to 500 petabytes (1015 bytes), and is expected to reach 25 exabytes (1018 bytes) in 2020 [6].In this paper, heart disease is one such disease selected among variety of disease in healthcare. The purpose of this work is to predict the diagnosis of heart disease with reduced number of attributes. Each dataset stored in HDFS is classified based on attributes. This prediction solution using random forest on apache spark gives massive opportunity for health care analysts to deploy this solution on ever changing, scalable big data landscape for insightful decision making.

(EHR). Concurrently, there is fast progress are being made in clinical analytics, such as techniques for analyzing large volumes of data and derive new insights from that analysis, which is known as big data analytics. As a result of this, we can utilize remarkable opportunities provided by big data to reduce the costs of health care as well as diagnosing the diseases.In this paper, heart disease is one such disease selected among variety of disease in healthcare. Heart disease is a general name for a variety of diseases. Heart disease symptoms may vary depending on the specific type of heart disease.

Keywords: Spark, HDFS, Heart disease, Random forest, verification

So by using big data with data mining algorithms makes it possible to do many things such as,identify healthcare trends, prevent diseases, and diagnose the diseases and so on.

Abstract:

1.

INTRODUCTION

2.

The health care system is rapidly adopting electronic health records, which will drastically increase the quantity of clinical dataâ&#x20AC;&#x2122;s that are available digitally IDL - International Digital Library

The hospitalsuse the hospital database systems to store and manage their patient data. These systems generate large volumes of data, but these data are rarely used to support insightful clinical decision making.

1|P a g e

OBJECTIVES

The purpose of this work is to predict the diagnosis of heart disease with reduced number of attributes. Each Copyright@IDL-2017

Turn static files into dynamic content formats.

Create a flipbook