9 minute read
Role of Artificial Intelligence and Machine Learning in Nanosafety.
Medical applications of nanomaterials span drug, protein and vaccine delivery, diagnostics, theranostics. There is a need for a ‘safe-by-design’ paradigm for nanomaterials, and machine learning is increasingly used to predict their properties. I briefly review nanomaterials machine learning modelling and provide examples where model predictions have been validated by experiments.
David Winkler
Professor of Biochemistry & Chemistry at La Trobe University, Professor of Pharmacy at the University of Nottingham, and Professor of Medicinal Chemistry at Monash University.
Nanomaterials have been one of the most exciting scientific and technical innovations of the past few decades. Due to their very high surface to volume ratios, they exhibit properties that can differ dramatically from those for the same material in bulk. This, and their ability to be designed and synthesized with multiple surface functionalities, has seen them used for a myriad of bespoke applications in industry and medicine.
Their medical applications span delivery systems for drugs, proteins and DNA/RNA to diagnostics, targeted cancer treatments, to theranostics. They have been used very successfully
for the first-time to deliver mRNA in vaccines against SARS-Cov-2 and have massive potential for revolutionizing vaccine therapies for a wide range of diseases in the future. In industry, nanomaterials are used in hundreds of applications that exploit their unique physicochemical properties, and they are finding use as antimicrobial materials for sanitizing water, clothing, refrigeration etc.
However, commercial and medical applications can outrun the ability of the relevant agencies to regulate nanomaterials. The unusual properties of nanomaterials can potentially generate adverse effects in the workplace, public, and environment and this is particularly an issue for nanomaterials that are introduced to the body, as exposure levels are much higher than the adventitious exposure of workers and the public. Consequently, there is a growing need for nanomaterials that are ‘safe-by-design’ (SbD) that is driving research efforts to achieve that goal. (Guinée et al., 2022; Oksel Karakus et al., 2021)
Why are machine learning models needed?
Experimental assessment of the biological effects of the myriad of commercial and medically applied nanomaterials is infeasible due to time, cost, and resource issues. Computational methods, especially machine learning, are being increasingly used to leverage smaller quantities of experimental data on nanomaterials to predict properties of material not tested or synthesized. Such models can also be invaluable for implementing the SbD paradigm.
Nanomaterials pose particular challenges for computational studies. These materials are much larger than small, discrete molecules like drugs, and their composition is usually not precisely known. Most nanomaterials are distributions of sizes, shapes, surface functionality, and charge. Their surface properties are significantly modulated by the coating of proteins or other macromolecules The so-called nanoparticle corona) that often accumulate on the surface of nanomaterials. (Tomak et al., 2021)
Several key factors determine how effective machine learning models of nanoparticle properties are, and most of these can be problematic for nanomaterials. The first is the quantity, quality, and diversity of data available to train models. There is still a paucity of high-quality data sets for this purpose. The second factor is related to descriptors or features, mathematical entities that capture important physicochemical, provenance, or corona properties of nanomaterials. Given the complexities of nanomaterials mentioned above, converting these structures into meaningful mathematical features is more problematic than for small organic molecules, for example.
The rise of nanosafety consortia
The increasingly large amount of research into modelling the biological (and other) properties of nanoparticles has stimulated the formation of consortia that bring together researchers with complementary skills. In particular, the European Union has supported the development of research networks that aim to provide rapid ways of informing regulators of potential risks of nanomaterials, and create the technologies required to design safety as well as functionality into nanomaterials. This was initiated by EU COST support for a 2011 Maastricht workshop on the use of quantitative structure-activity relationship (QSAR) methods (largely machine learning-based) for predicting the biological effects of nanomaterials. This generated a COST Action, MODENA that generated a network of an impressive core group of researchers in this field, mainly from Europe, but also including Australia and the US.(Winkler et al., 2013) More recently, the EU has provided a large amount of financial support for multiple large nanosafety-related Horizon 2020 projects, the most relevant ones for the theme of this paper being SABYDOMA and NanoSolveIT. These have aimed to generate larger datasets relevant to nanosafety, to explore new ML methods for modelling, and predicting biologically relevant properties and to develop better mathematical descriptions of nanomaterials for training models. (Lynch et al., 2020).
Examples of the application of machine learning to nanosafety
One of the first reports of quantitative modelling of biological properties of nanomaterials
was by Puzyn’s group in Gdansk. (Puzyn et al., 2011) They showed that for a series of 17 metal oxide nanoparticles, they could predict cytotoxicity to bacteria using a single, albeit arcane property, the enthalpy of formation of a gaseous cation having the same oxidation state as that in the metal oxide structure. Subsequently, in a collaboration with Harvard University and the Massachusetts General Hospital, we showed that the smooth muscle apoptosis of different types of metal ion oxide nanoparticles could be predicted from their size, core composition, and surface coating. (Epa et al., 2012) This study also showed that, for a set of 109 nanoparticles whose surfaces were functionalized with small organic molecules, the ability to penetrate pancreatic cancer (PaCa2) and human umbilical vein endothelial cell (HUVEC) cells varied markedly depending on the surface chemistry (Figure 1). This was of particular interest in targeting nanoparticles to specific tissues or tumours. Using machine learning models, it was possible to accurately predict the uptake in these cells (within a factor of 2) for both the training set used to create the models and a separate test set used to validate predictions.
Figure 1. Cell uptake of nanoparticles as a function of surface chemistry
Later work on the nanophase behaviour of amphiphilic, lyotropic liquid crystalline lipid nanomaterials used as drug delivery vehicles (Figure 2) showed that machine learning could predict the existence of multiple, coexisting phases as a function of drug type and concentration, temperature, and time. (Le et al., 2013; Le and Tran, 2019) These models could successfully predict the coexistence of nanophases for new drugs not used to generate the models (Figure 3). As only some of these nanophases are useful for drug delivery, it is important to be able to predict the phase behaviour over time and temperature to ensure drug delivery is optimal and safe.
Figure 2 Types of selfassembled nanophases that form from amphiphilic lyotropic liquid crystalline lipid molecules such as phytantriol and monoloein. Structures of the inversebicontinuous diamond QIID (Pn3m), gyroid QIIG (Ia3d), and primitive QIIP (Im3m) cubic phases.
Figure 3. Individual phases, QIID (Pn3m) cubic (blue), hexagonal HII (yellow), and primitive QIIP (Im3m) cubic (purple), predicted by the best machine learning model for monoolein nanoparticles loaded with drugs at 25 and 37 °C. This model made zero prediction errors for these phases.
Although most of the work to date has focused on the safety of nanomaterials for commercial applications or generally, there is a growing interest in applying ML methods to ensure the safety of medical nanomaterials. For nanomaterials used in medicine, safety is of course a critical issue as the materials are deliberatively introduced to the body at higher concentrations than that acquired from the environment. Singh et al. recently reviewed how AL and ML invitro and in vivo datasets are used to generate in silico models for nanomedicine. (Singh et al., 2020) They described how physiologically based pharmacokinetic and ADMET (absorption,
distribution, metabolism, excretion, and toxicology) in silico methods and dosimetry models can be used to support safety in nanomedicine. Shin and co-workers reviewed the application of ML to multi-omics (transcriptomics, proteomics, metabolomics and phosphoproteomics) data. (Shin et al., 2021) As a complete understanding of the nanoparticle-induced toxicity and other biological changes induced in cells is very challenging, rich omics data combined with ML elucidate mechanistic details, identify beneficial and detrimental nanomaterials properties, and allow prediction of likely biological properties of new medical nanomaterials.
Fig.4. Overview of the ML approaches to modelling multi-omics data. Creative Commons Attribution (CC BY) license. (Shin et al., 2021)
What is needed to generate better models of nanomaterials properties?
Machine learning is a data driven method and the quality and utility of models are strongly dependent on the data and mathematical features used to train them. Recent efforts to synthesize nanoparticles with controlled sizes, shapes, cores, and coatings and adoption of high throughput synthesis and characterization methods should alleviate the current paucity of quality data sets. More extensive use of genomic and proteomic data will also assist in unravelling the mechanisms of biological effects of nanomaterials on organisms and provide additional data sources for modelling. A particularly difficult roadblock is the need for more effective mathematical features (nano-specific descriptors) to describe complex nanomaterials. (Winkler, 2016; 2020; Wyrzykowska et al., 2022) New developments in convolutional neural networks may partially alleviate this problem and lead to enhanced interpretability of models. If more metadata relating to the synthesis and processing of nanomaterials can be captured, then machine learning models can also contribute to the optimisation of bespoke nanomaterials. Other important issues include better data integration at the nanoscale, the need for development of central repositories and databases of nanomaterialsbiodata, robust standards for information storage and exchange, improved nano-ontologies, and better tools for decision support. Ultimately, combined models predicting adverse biological effects and those predicting commercially useful properties will lead to more quantitative, useful, and rational SbD approaches.
AUTHOR BIO
David Winkler is a Professor of Biochemistry & Chemistry at La Trobe University, a Professor of Pharmacy at the University of Nottingham, and a Professor of Medicinal Chemistry at Monash University. He applies computational chemistry, AI, and machine learning to the design of drugs, agrochemicals, electrooptic materials, nanomaterials, and biomaterials. He is ranked 227th of 81,000 medicinal chemists worldwide and has written >250 journal articles and book chapters (4 ISI Highly Cited) and is an inventor on 25 patents.
PRODUCTS & SERVICES
SUPPLIERS GUIDE
Company.......................................................................Page No. Company.......................................................................Page No.
STRATEGY
Clariant .........................................................................................05 Etihad Cargo ................................................................................93 Turkish Cargo...............................................................................99 Valsteam ADCA Engineering .......................................................13 Qatar Cargo .............................................................................OBC
RESEARCH & DEVELO PMENT
Clariant .........................................................................................05 F. P. S. Food and Pharma Systems Srl ........................................21 IMCD ............................................................................................26 Sartorius.......................................................................................03 ThermoFisher...............................................................................41
MANUFACTURING
Clariant .........................................................................................05 F. P. S. Food and Pharma Systems Srl ........................................21 IMCD ............................................................................................26 Valsteam ADCA Engineering .......................................................13 Sartorius.......................................................................................03 ThermoFisher...............................................................................41 SUEZ Water Technologies & Solutions .....................................IFC
INFORMATION TECHNOLOGY
ThermoFisher...............................................................................41
Company.......................................................................Page No. Company.......................................................................Page No.
Clariant ......................................................................................05 www.clariant.com
Etihad Cargo .............................................................................93 etihadcargo.com
F. P. S. Food and Pharma Systems Srl ..................................... 21 www.fps-pharma.com
IMCD .........................................................................................26 www.imcdgroup.com
Qatar Cargo .......................................................................... OBC https://qrcargo.com Sartorius....................................................................................03 www.sartorius.com
ThermoFisher............................................................................41 www.thermofisher.com
Turkish Cargo............................................................................99 www.turkishcargo.com
Valsteam ADCA Engineering .................................................... 13 www.valsteam.com
Water Technologies & Solutions............................................. IFC www.watertechnologies.com
To receive more information on products & services advertised in this issue, please fill up the "Info Request Form" provided with the magazine and fax it. 1.IFC: Inside Front Cover 2.IBC: Inside Back Cover 3.OBC: Outside Back Cover