Issuu

Proc. of Int. Conf. on Control, Communication and Power Engineering 2010

Machine Learning Approaches to Determine the “Drug-Likeness” of the Proteomic Targets Varun Gopal.K, Jiyesh M.M, Sankaranarayanan.A, P.K Krishnan Namboori

Harilal.P, Sai Krishna.A Computational Chemistry Group Amrita School of Biotechnology Amrita Vishwa Vidyapeetham University, Amritapuri, Kollam – 690 525, INDIA harilal.navami@gmail.com , saikrisharjun@gmail.com

Computational Chemistry Group Computational Engineering and Networking Amrita Vishwa Vidyapeetham University, Ettimadai, Coimbatore-641 105 varungopal19@gmail.com , n_krishnan@cb.amrita.edu

Abstract—Compounds from discovery are often poor candidates for lead optimization or preclinical testing because screening efforts focus on target affinity, while paying limited attention to ADME/Tox properties. So here the ADME/Tox properties of certain drugs have been studied and a mathematical model has been developed using machine learning algorithms. This model will predict whether the given molecule is a proteomic drug or not which will be a preliminary step in drug designing. 630 proteomic drugs and equal number of non-drugs were obtained from database sources. Around 1103 descriptors for both drugs and nondrugs were generated. The obtained datasets were manually validated. Descriptor load was reduced using PCA. Statistical machine learning techniques like ANN and SVM were used to explore the data and study drug-likeness. SVM was found to be the best classifier providing a classification accuracy of 93%.

This allows the identification of the chemical groups responsible for inducing a target biological effect in the organism. This technique was later modified to build a mathematical model to propose a relationship between a chemical structure and its biological function, called as quantitative structure-activity relationships (QSAR) Structural activity relationship is defined as the relationship between a chemical structure of a compound and its pharmacological activity. The structural activity relationship is further classified as ‘Quantitative structureactivity relationships’ (QSAR) and ‘Qualitative structureactivity relationships’ (qSAr). Quantitative structure-activity relationships (QSAR) represent an attempt to compare structural or property descriptors of compounds with activities [1]. These physicochemical descriptors include parameters to account for hydrophobicity, topology, electronic properties and the steric effects are determined empirically by computational methods. Activities used in QSAR include chemical measurements and biological assays. At present QSAR is mainly applied in the field of Drug designing.ADMETox is very important in the field of drug designing Owing to increasing importance in advancing high quality candidate drugs. ADMETox explains how a compound interacts with the rest of the body to cause an activity and toxicity (Fig.1).

Keywords— Machine Learning, Drug-Likness, Proteomic Targets, SVM

INTRODUCTION

Drug likeliness is a qualitative means of analysis to check whether the given molecule is a drug or not and it is defined as a complex balance of various molecular properties and structural features which determine whether a particular molecule is similar to the known drugs. These properties, mainly hydrophobicity, electronic distribution, hydrogen bonding characteristics, molecule size and flexibility and presence of various pharmacophoric features influence the behavior of a molecule in a living organism, including bioavailability, transport properties, affinity to proteins, reactivity, toxicity, metabolic stability and many others.

The success of a drug’s journey through the body is measured in the dimensions of absorption, distribution, metabolism, and elimination (ADME).

Structure-activity relationships (SAR) are the conventional practices of medicinal chemistry which try to alter the effect or the activity of bioactive chemical compounds by changing their chemical structure [1]. Medicinal chemists utilize the techniques of chemical synthesis to introduce new chemical groups into the biochemical compound and test the alteration for their biological effects. Figure 1. Pictorial representation of ADME