JournalNX-SURVEY ON RECOGNITION OF S1 AND S2 HEART SOUND USING DEEP NEURAL NETWORKS by journalnx

Novateur Publication’s

International Journal of Research Publications in Engineering and Technology (ISSN No: 2454-7875) Conference Proceedings of A National Conference on “Modern Trends in Electrical Engineering”

(NCMTEE-2K17) 27th March 2017

SURVEYON RECOGNITION OF S1 AND S2 HEART SOUND USING DEEP NEURAL NETWORKS PRERNA R. KUMBHARE, Department of ENTC, Kashibai Navle College of Engineering, Pune, SavitribaiPhule Pune University, Pune India. MRS. ANJALI A. YADAV Assistant Professor Department of ENTC, Sinhgad Education Society's College of Engineering, Pune, SavitribaiPhule Pune University, Pune India. ABSTRACT This study focuses on the first(S1) and second (S2) heart sound affirmation build just concerning acoustic attributes; the suppositions of the individual ranges of S1 and S2 additionally, time breaks of S1–S2 and S2–S1 are excluded in the affirmation handle. The crucial target is to analyze whether tried and true S1 and S2 affirmation execution can even now be accomplished under conditions where the term and interval information won't not be accessible. Methodologies: A significant neural framework (DNN) procedure is used for seeing S1 and S2 heart sounds. In this system, heart sound signs are at first changed over into a course of action of Mel-repeat Cepstrum coefficients (MFCCs). The K-infers Count is associated with pack MFCC highlights into two social events to refine their portrayal and discriminative capacity. The refined components are then urged to a DNN classifier to perform S1 and S2 affirmation. The DNN classifier gives higher evaluation scores differentiated and other without a doubt comprehended case grouping methods. Centrality: The DNN-based system can finish tried and true S1 and S2 affirmation execution in perspective of acoustic qualities without using an ECG reference or joining the assumptions of the individual terms of S1 and S2 likewise, time between times of S1–S2 and S2–S1. INDEX KEYWORD: Heart sound recognition, deep neural networks, acoustic fingerprinting, S1 and S2 recognition. INTRODUCTION In the current years the most widely recognized illness is Heart infection, which causes individuals' passing in vast numbers. Diabetes is one of the fundamental purpose behind heart assault. Around 80% of passing’s occurred in low and Middle wage nations and 25 of them in the age gathering of 26-69 years thus of heart ailments. The Patients are at hazard for vascular entanglements and people groups don't know about the Heart disease.In the

most recent decades, a few apparatuses and different approaches have been proposed by the specialists for creating compelling medicinal choice of supportive networks. Heart auscultation, characterized as tuning in and understanding of the sound delivered by the heart, has been an essential strategy to discover heart infections from the early phases of drug, since most heart sicknesses are reflected to the sound that the heart produces. The passing rate brought on by heart abnormality is progressively high, it gets to be distinctly one of the greatest dangers to human wellbeing .The customary heart auscultation is Under the Support on the ear affectability and the subjective experience of the doctor, which can't meet the high precision necessity under clinical conditions. Heart auscultation is an ordinary physical examination for assessing cardiovascular capacity and exercises. Capable of being heard heart sounds are created via cardiovascular valves snapping close or by turbulent streams. In grown-ups, two typical heart sounds happen in sequence in a cardiovascular cycle. The pitch and event time of heart sounds take after specific examples. The primary heart sound (S1) happens toward the begin of the ventricular systole stage, which comes about because of shutting the mitral and tricuspid valves (all things considered known as the atriventricular valves). The second heart sound (S2) toward the begin of the ventricular diastole stage, which comes about because of shutting the aortic and pulmonic valves. S1 is a low-pitch sound with longer span, though S2 is a high-pitch sound with shorter length. In ordinary circumstances, the S1– S2 interim (systole) is shorter than the S2–S1 interim (diastole).

Fig 1. S1 and S2 waveforms of heart sound

Organized by Department of Electrical Engineering Savitribai Phule Shikshan Prasarak Mandal’s, SKN Sinhgad College of Engineering, Pandharpur

27 | P a g e

Novateur Publication’s

International Journal of Research Publications in Engineering and Technology (ISSN No: 2454-7875) Conference Proceedings of A National Conference on “Modern Trends in Electrical Engineering”

(NCMTEE-2K17) 27th March 2017 LITERATURE REVIEW 1. K NEARSET NEIGHBOUR(KNN) The k-closest neighbor’s calculation (k-NN) is a nonparametric strategy utilized for grouping and regression. In both cases, the information comprises of the k nearest preparing cases in the component space. The yield relies on upon whether k-NN is utilized for grouping or relapse. Nonparametric relapse is an arrangement of methods for evaluating a relapse bend without making solid presumptions about the state of the genuine relapse work. These strategies are thusly valuable for building what are more, checking parametric models, and additionally for information portrayal. Part and closest neighbor relapse estimators are nearby forms of univariate area estimators, thus they can promptly be acquainted with starting understudies and customers who are well known with so much outlines as the specimen mean and middle. K-NN is a sort of occurrence based learning, or apathetic realizing, where the capacity is just approximated locally and all calculation is conceded until order. The k-NN calculation is among the least complex of all machine learning calculations. Both for grouping and relapse, it can be valuable to allocate weight to the commitments of the neighbors, so that the closer neighbors contribute more to the normal than the more removed ones. For instance, a typical weighting plan comprises in giving each neighbor a weight of 1/d, where d is the separation to the neighbor. The neighbors are taken from an arrangement of items for which the class (for k-NN order) or the protest property estimation (for k-NN relapse) is known. This can be considered as the preparation set for the calculation; however no unequivocal preparing step is required. In this review, the KNN classifier utilizes the Euclidean metric for the separation calculation. The working idea of the KNN calculation is generally basic in light of the fact that it includes utilizing just elements as the assessment standard for distances. A weakness of the kNN calculation is that it is delicate to the neighborhood structure of the information. KNN accomplishes more awful execution contrasted and the other three classifiers, perhaps as a result of its excessively basic structure and restricted arrangement ability. 2. GAUSIAN MIXTURE MODELS(GMM) A Gaussian blend model is a probabilistic model that accepts every one of the information focuses are created from a blend of a limited number of Gaussian disseminations with obscure parameters. One can

consider blend models as summing up k-implies bunching to join data about the covariance structure of the information and in addition the focuses of the inert Gaussians. In insights, a blend model is a probabilistic model for speaking to the nearness of subpopulations inside a general populace, without requiring that a watched informational index ought to distinguish the subpopulace to which an individual perception has a place. Formally a blend show relates to the blend circulation that speaks to the likelihood appropriation of perceptions in the general populace. Nonetheless, while issues related with "blend circulations" identify with inferring the properties of the general populace from those of the sub-populaces, "blend models" are utilized to make factual derivations about the properties of the sub-populaces given just perceptions on the pooled populace, without sub-populace character data. GMM is a notable generative model. In the preparing stage, a GMM is prepared for every class. In the testing stage, the probabilities of testing information on GMMs of the whole classes are figured, and the characterization is resolved in light of likelihood scores.For the GMM demonstrate, eight Gaussian blend models were utilized. DNN outflank GMM, proposing that discriminative classifiers can give better execution when S1 and S2 marks are given. 3. LOGISTIC REGRESSION (LR) Strategic relapse is a factual technique for dissecting a dataset in which there are at least one autonomous factors that decide a result. The result is measured with a dichotomous variable in which there are just two conceivable outcomes insights, calculated relapse, or logit relapse, or logit model is a relapse model where the reliant variable is clear cut. This article covers the instance of a twofold ward variable—that is, the place it can take just two qualities, "0" and "1", which speak to results, for example, pass/fall flat, win/lose, alive/dead or solid/debilitated. Situations where the needy variable has more than two result classes might be broke down in multinomial calculated relapse, or, if the numerous classifications are requested, in ordinal strategic regression. In the wording of financial matters, strategic relapse is a case of a subjective reaction/discrete decision show. The LR model is an exceptionally well known order display that plans to expand the contingent logprobability so as to improve the model parameters. LR gives the best S1 precision however the most exceedingly bad S2 exactness, demonstrating that LR

Organized by Department of Electrical Engineering Savitribai Phule Shikshan Prasarak Mandal’s, SKN Sinhgad College of Engineering, Pandharpur

28 | P a g e

Novateur Publication’s

International Journal of Research Publications in Engineering and Technology (ISSN No: 2454-7875) Conference Proceedings of A National Conference on “Modern Trends in Electrical Engineering”

(NCMTEE-2K17) 27th March 2017 won't not give adjusted execution to both classes 4. SUPPORT VECTOR MACHINE(SVM) In machine learning, bolster vector machines SVMs, likewise bolster vector networks are directed learning models with related learning calculations that dissect information utilized for characterization and relapse examination. Given an arrangement of preparing illustrations, each set apart as having a place with either of two classifications, a SVM preparing calculation manufactures a model that doles out new cases to one class or the other, making it a non-probabilistic twofold direct classifier. A SVM model is a portrayal of the cases as focuses in space, mapped so that the cases of the different classes are separated by a reasonable hole that is as wide as could reasonably be expected. New illustrations are then mapped into that same space and anticipated to have a place with a class in view of which side of the crevice they fall. The SVM classifier speaks to information tests as focuses in space and decides to isolate the two classes of information in the preparation stage. In the testing stage, every information test is initially mapped into that same space, and after that anticipated to have a place to a class in view of the side of the crevice on which the specimen falls.For the SVM classifier, the Gaussian distribution capacity was utilized as the bit work. 5. DEEP NEURAL NETWORK (DNN) Deep adapting otherwise called profound organized learning, progressive learning or profound machine learning is a branch of machine learning in light of an arrangement of calculations that endeavor to model abnormal state reflections in information. In a basic case, there may be two arrangements of neurons: ones that get an info flag and ones that send a yield flag. At the point when the info layer gets an information it passes on an altered rendition of the contribution to the following layer. In a profound system, there are many layers between the information and yield (and the layers are not made of neurons but rather it can consider it that way), permitting the calculation to utilize different preparing layers, made out of various straight and nondirect changes. Profound learning is a piece of a more extensive group of machine learning techniques in light of learning portrayals of information. A perception (e.g., a picture) can be spoken to from various perspectives, for example, a vector of power qualities per pixel, or in a more theoretical manner as an arrangement of edges, locales of specific shape, and so on. A few portrayals are

superior to anything others at disentangling the learning undertaking (e.g., confront acknowledgment or outward appearance acknowledgment). One of the guarantees of profound learning is supplanting carefully assembled highlights with effective calculations for unsupervised or semi-regulated element learning and progressive component extraction.

Fig.4 Structure of DNN model Different profound learning structures, for example, profound neural systems, convolution profound neural systems, profound conviction systems and intermittent neural systems have been connected to fields like PC vision, programmed discourse acknowledgment, characteristic dialect handling, sound acknowledgment and bioinformatics where they have been appeared to create best in class comes about on different undertakings. DNN achieves the best performance with higher accuracy, confirming its better classification capability when compared with other classifiers. This table shows comparison between classifiers.

SYSTEM ARCHITECTURE Heart sounds are the clamors created by the pulsating heart and the resultant stream of blood through it. In particular, the sounds mirror the turbulence made when the heart valves snap close. In healthy adults, there are two normal heart sounds regularly depicted as a lub and dub (or dup), that happen in arrangement with every pulse. These are the first heart sound (S1) and second heart sound (S2), produced by the closing of the atrioventricular valves and semi lunar valves, respectively.

Organized by Department of Electrical Engineering Savitribai Phule Shikshan Prasarak Mandal’s, SKN Sinhgad College of Engineering, Pandharpur

29 | P a g e

Novateur Publication’s

International Journal of Research Publications in Engineering and Technology (ISSN No: 2454-7875) Conference Proceedings of A National Conference on “Modern Trends in Electrical Engineering”

(NCMTEE-2K17) 27th March 2017

Fig2: block diagram of system In machine learning, pattern recognition and in image processing, feature extraction begins from an underlying arrangement of measured information and manufactures determined values (features) intended to be informative and non-redundant, facilitating the subsequent learning and generalization steps, and now and again prompting to better human elucidations. Feature extraction is related to dimensionality reduction. Feature extraction includes the amount of assets required to depict a substantial arrangement of information. Overall S1 and S2 recognition architecture Fig. 3 shows the flowchart for S1 and S2 recognition framework. As appeared in this figure, the feature extraction process is initially performed to convert heart sound signals into a set of feature vectors. In the offline phase, the feature vectors for S1 and S2 are utilized to prepare a DNN classifier. In the online stage, testing feature vectors are introduced into the DNN classifier, and the testing information classification is resolved by the classifier yield.

Fig 3: Flowchart for S1 and S2 recognition system. FEATURE EXTRACTION FOR SYSTEM: A. MFCC In sound processing, the Mel-frequency Cepstrum (MFC) is a representation of the transient power range of a sound, in view of a direct cosine change of a log control spectrum on a nonlinear Mel scale of frequency. The MFCC highlight extraction method involves six operations: pre-accentuation, windowing, fast Fourier

transform (FFT), Mel-filtering, nonlinear transformation, and discrete cosine transform (DCT). The pre-emphasis operation enhances the received signals to compensate for signal distortions. The windowing operation partitions a given into a sequence of frames. The FFT operation is applied to the windowed signals for spectral analysis. The Mel-filtering operation is designed based on human perception, and it incorporates the frequency compositions from one Mel-filter band into one energy intensity. The non-linear change operation takes the logarithm of all Mel-filter band intensities. The transformed intensities are then converted into MFCCs using DCT. MFCC feature extraction has been confirmed to be effective in speech recognition, speaker recognition, and various acoustic pattern recognition tasks. B. K-means algorithm The main goal of the K-means algorithm is to decide representative information points from huge numbers of data points. Such information points are called “population centers.” Data compression (i.e., using a low number of data points to represent a high amount of data for compressing data) and grouping (i.e., utilizing a low number of representative points to represent specific categories for lowering the amount of data and avoiding adverse effects caused by noise) are applied to population centers. DNN CLASSIFIER FOR PROPOSED SYSTEM ANN is a scientific model that emulates organic NN structures and permits a PC framework to execute characterization or relapse tasks. Various researchers have proposed diverse NN models for taking care of different issues. CONCLUSION The probability of using the heart sound flag for human character check is researched, and proposes an audit on the usage of MFCC. Heart sound can act normally used for recognizable proof or we can utilize it with other accessible distinguishing proof system basic and trustworthy to execute. In this paper a couple part extraction techniques for sound affirmation were analyzed. MFCC is outstanding strategies used as a piece of sound acknowledgment to portray the flag qualities, relative to the sound discriminative vocal tract properties. The Mel Frequency Cepstrum Coefficient (MFCC) highlight has been utilized for outlining a content ward speaker distinguishing proof system.The rule objective is to look at whether dependable S1 and S2 acknowledgment execution can in any case be

Organized by Department of Electrical Engineering Savitribai Phule Shikshan Prasarak Mandal’s, SKN Sinhgad College of Engineering, Pandharpur

30 | P a g e

Novateur Publication’s

International Journal of Research Publications in Engineering and Technology (ISSN No: 2454-7875) Conference Proceedings of A National Conference on “Modern Trends in Electrical Engineering”

(NCMTEE-2K17) 27th March 2017 achieved under circumstances, where the traverse and between time information won't not be open. The various measures of characterization procedures were looked at, and the outcomes demonstrated that S1 and S2 can be viably perceived with over 91% exactness. Another dedication of this paper is that asserted the adequacy of utilizing DNN for building acoustic models to depict S1 and S2 heart sounds. REFERRENCES 1) Tien-En Chen, Shih-I Yang, Li-Ting Ho, Kun-Hsi Tsai, Yu-Hsuan Chen,” S1 and S2 Heart Sound Recognitionusing Deep Neural Networks” IEEE Transactions on Biomedical Engineering,2016. 2) Y. Xu, J. Du, L.-R. Dai and C.-H. Lee, “A regression approach to speech enhancement based on deep neural networks,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 23, pp. 7-19, 2015. 3) Mohamed, G. E. Dahl, and G. E. Hinton, “Acoustic modeling using deep belief networks,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, pp. 14–22, 2012.

4) D. Kumar, P.Carvalho, M.Antunes, J. Henriques, L. Eugenio, R. Schmidt and J. Habetha, “Detection of S1 and S2 heart sounds by high frequency signatures,” in Proc. EMBS, pp. 1410-1416, 2006. 5) D. Kumar, P. Carvalho, P. Gil, J. Henriques, M. Antunes and L. Eugenio, “A new algorithm for detection of S1 and S2 heart sounds,” in Proc. ICASSP, pp. 1180-1183, 2006. 6) Y. N. Wen, A. P. Lee, F. Fang, C. N. Jin and C. M. Yu, “Beyond auscultation: Acoustic cardiography in clinical prac-tice,” International journal of cardiology, vol. 172, pp. 548-560, 2014. 7) E. G. Dimond and A. Benchimol, “Phonocardiography”, California Medicine, 94(3), pp. 139–146, 1961. 8) M. B. Malarvili, I. Kamarulafizam, S. Hussain and D.Helmi, “Heart sound segmentation algorithm based on instantaneous energy of electrocardiogram,” in Proc. CinC, pp.327-330, 2003. 9) M. El-Segaier, O. Lilja, S. Lukkarinen, L. Srnmo, R. Seppo-nen and E. Pesonen, “Computer-based detection and analysis of heart sound murmur,” Annals of Biomedical Engineering, vol. 33, pp. 937942,2005.

Organized by Department of Electrical Engineering Savitribai Phule Shikshan Prasarak Mandal’s, SKN Sinhgad College of Engineering, Pandharpur

31 | P a g e