Dimitra-Danai
Varsou
ab,
Antreas
Afantitis
a,
Andreas
Tsoumanis
a,
Georgia
Melagraki
*a,
Haralambos
Sarimveis
b,
Eugenia
Valsami-Jones
c and
Iseult
Lynch
*c
aNanoinformatics Department, Novamechanics Ltd, Nicosia, 1065, Cyprus. E-mail: melagraki@novamechanics.com
bSchool of Chemical Engineering, National Technical University of Athens, 157 80 Athens, Greece
cSchool of Geography, Earth and Environmental Sciences, University of Birmingham, B15 2TT Birmingham, UK. E-mail: i.lynch@bham.ac.uk
First published on 5th November 2018
Multi-walled carbon nanotubes are currently used in numerous industrial applications and products, therefore fast and accurate evaluation of their biological and toxicological effects is of utmost importance. Computational methods and techniques, previously applied in the area of cheminformatics for the prediction of adverse effects of chemicals, can also be applied in the case of nanomaterials (NMs), in an effort to reduce expensive and time consuming experimental procedures. In this context, a validated and predictive nanoinformatics model has been developed for the accurate prediction of the biological and toxicological profile of decorated multi-walled carbon nanotubes. The nanoinformatics workflow was fully validated according to the OECD principles before it was released online via the Enalos Cloud platform. The web-service is a ready-to-use, user-friendly application whose purpose is to facilitate decision making, as part of a safe-by-design framework for novel carbon nanotubes.
A complete experimental toxicity assessment requires expensive and time consuming in vitro and in vivo practices,9 rendering it unfeasible to thoroughly test the NMs already on the market, as well as novel emerging variants. Additionally, it is currently not known how different or modified a NM needs to be to constitute a unique NM (or nanoform in the emerging regulatory arena) – i.e. are different surface functionalisations considered different nanoforms? For chemicals, their uniqueness is established through their individual Chemical Abstract Number (CAS), while NMs currently share a CAS number with their bulk form. Thus, current approaches to the risk assessment of NMs are undertaken on a case-by-case basis, which has been estimated to require 10 years just for the 500–1000 NMs expected to have been registered in the EU by the May 2018 REACH registration deadline.10 To overcome this obstacle in the risk assessment framework, a significant number of alternative – fast and inexpensive – novel techniques, such as Quantitative Nanostructure Activity Relationship (QNAR) models for the prediction of the biological and toxicological effects of NMs, have been proposed in literature.9,11–14 These approaches are collectively moving knowledge and regulatory practice closer to a future of in silico toxicity analysis based on dramatically reduced, or eventually no, experimental input.
Similar to the Quantitative Structure Activity Relationship (QSAR) models utilised in chemoinformatics, QNAR models are mainly based on well-structured databases or well-organized datasets, to establish robust and predictive correlations between NM properties and their biological or toxicological effects. So far, significant efforts have taken place to organize the already available data, including NM biological and toxicological assessment, such as the eNanoMapper database15 and the NanoMILE database16 which together, and along with other emerging datasets, will form the basis of future data structures (e.g. the NanoCommons nanoinformatics infrastructure and KnowledgeBase, http://www.nanocommons.eu). Targeted datasets are also available for specific NM categories for further exploitation. For example, Zhou and co-workers (2008)17 designed and synthesized a library of 80 combinatorially surface modified Multi-walled Carbon Nanotubes (MWCNTs) and tested their binding with 4 specific proteins, and their cytotoxicity (% cell viability) and immunotoxicity (release of Nitric Oxide, NO) to THP-1 cells, which is a widely-accepted dataset, and has already been used in various in silico nanotoxicity studies.12,13,18,19
Different nanostructures have different levels of structural complexity and heterogeneity (presence of inorganic–organic elements and coatings, varying stoichiometry between the particles etc.) and thus extracting quantitative parameters for the characterization of the structural and chemical properties of the nanostructures is a very challenging task that is not yet fully addressed computationally. The development of in silico methods is thus hindered by the absence of sufficiently large physicochemical, geometrical, structural and biological datasets of different nanostructures in available databases.20 The hypothesis that each nanostructure can be represented by its surface modifiers when the core remains identical, can be considered pragmatic, especially taking into account the near- and long-term hazard and risk assessment goals, and the time and cost required for a full characterization – experimental and/or computational – of all available nanostructures. This hypothesis has already been accepted and used in different studies found in the literature.9,21–23
Fourches et al.13 built and validated classification models for the prediction of the protein binding and cytotoxicity of MWCNTs, and made the underlying experimental dataset at least partially available for further analysis. These models were based on Molecular Operating Environment (MOE) and Dragon molecular descriptors computed only from the surface-modifying compounds, assuming that the MWCNT core was the same in all samples. Support vector machines, random forest and k nearest neighbours, have been employed as machine-learning techniques, and the reported accepted CCR (Correct Classification Rate, mean of sensitivity and specificity) of the validation sets ranged from 73 to 75% for the protein binding, and from 70 to 77% for the toxicity endpoint.
Singh et al.19 reported an ensemble learning approach based nano-QSAR model for predicting biological effects of NMs based on molecular descriptors, calculated with Chemistry Development Kit (CDK). Here, the 29 most toxic surface-modified (decorated) MWCNTs from the Zhou et al.17 dataset have been used for the prediction of their impact on cellular viability. For model development, decision tree boost and decision tree forest methods were implemented based on six molecular descriptors of the decorators. The models resulted in R2 values of 0.903 and 0.922 respectively. Shao et al.12 used the 29 most toxic samples in order to build QSAR models based on different sets of descriptors. The CNT–decorator complex was geometrically optimized using the molecular dynamics simulation package GROMACS with the ffgmx force field. All possible combinations of calculated MOE, VolSurf, and 4D-fingerprints descriptors have been used. Multiple linear regression (MLR) and trial QSAR models were built, in a genetic function approximation scheme. For the carbonic anhydrase protein binding endpoint, using only the decorators for the descriptor calculations, R2 and QLOO2 accuracy was reported as 0.892 and 0.832 respectively, while using the combination of a 10 Å nanotube and the decorators, the R2 and QLOO2 measures were reported as high as 0.903 and 0.851 respectively. For the cell viability endpoint, using only the decorators, R2 and QLOO2 were equal to 0.922 and 0.863 respectively, while using the combination of a 10 Å nanotube and the decorators the R2 and QLOO2 measures were 0.857 and 0.759 respectively. These results suggest that depending on the end-point being modelled, and the role of the core versus surface in the specific interaction, inclusion of both components should be assessed to determine whether the core plays a role or not. Unsurprisingly, in the case of protein binding, a minor contribution from the CNTs was found, whereas in the case of toxicity, the surface functionalization played the dominant role, probably by controlling the amount of cellular adhesion and internalization of the CNTs. This reinforces the hypothesis that the decorated MWCNT with the same core can be represented by their surface modifiers for prediction of protein binding and cellular receptor attachment. Given that following the attachment step, nanoparticles including MWCNTs are actively taken up into THP-1 cells via an active endocytotic process (e.g. phagocytosis), we can safely assume that the particle scaffold (core), which is common to the whole dataset, is the driver once attachment, which is ligand-specific, has occurred, and thus the discrimination in terms of the amount of uptake (and thus toxicity) is driven by the ligands, allowing us to ignore the role of the core.
In this present work, a fully-validated predictive QNAR workflow was developed to assess the biological and toxicological profile of MWCNTs, based solely on calculated molecular descriptors of the surface decorators, in order to avoid computationally challenging and time-consuming molecular dynamics simulations and to achieve a fast classification of the samples employing the kNN method. Each MWCNT sample has been evaluated against two different endpoints; protein binding of carbonic anhydrase and toxicity, and was classified as a “binder” or “non-binder” and “toxic” or “non-toxic”, respectively. The driving force for adsorption of Human Carbonic Anhydrase II (HCAII) to nanoparticles has been shown previously to be electrostatic in nature, driven by attraction to negatively charged particle surfaces, and the hydrophobic effect alone was shown not to be strong enough to drive the initial binding at least to positively charged hydrophobic polystyrene nanoparticles.24 For modelling and validation, we tried to use as many of the available CNT samples as possible, and not only the most toxic ones as in previous computational studies. The main target of the proposed workflow was to offer a computational tool that will simplify the design and screening of novel MWCNTs by allowing prediction of the CA binding and cellular toxicity based only on the chemical structure of the surface decoration molecule, as part of a safe-by-design strategy that would allow elimination of potentially toxic modifications at the design stage. Making the tool available online with a user friendly interface enhances its utility as an aid for the decision making of interested research, industry and regulatory groups.
The following analysis steps were entirely implemented using the KNIME Analytics Platform (Konstanz Information Miner, https://www.knime.com/knime-analytics-platform). In the developed KNIME workflow the available nodes were combined with the Enalos+ nodes, developed by NovaMechanics Ltd (http://enalosplus.novamechanics.com/), in order to build a robust and accurate model development. The workflow was incorporated later in the Enalos Cloud platform (http://www.insilicotox.com/), which hosts predictive models released as web services. Through this platform the need to reduce the amount of time and cost spent in experimental testing can be addressed, using in silico tools for safe-by-design that produce accurate predictions for drug discovery and risk assessment of small molecules and nanomaterials.
The workflow generated also included the EnalosMold2 KNIME node26 that calculates 777 descriptors per CNT-decorating molecule accounting for the topological, geometric and structural characteristics of the organic modifiers (see Fig. 1). An important step in the modelling procedure is the reduction of the original pool of descriptors before the feature selection, in order to increase the model quality.27 Thus, the descriptors containing the same values at a percentage equal or higher than 20% among the samples were excluded from further analysis using the Enalos+/Remove column node.
After the first (external) partition the training set was repeatedly divided into calibration and validation sets. The calibration set was used for variable selection and model development, whereas the validation set was used for the determination of the accuracy of the produced models. The multiple splits of the initial training set into two subsets removed any bias in descriptor selection that may be introduced by the use of only one training set of firm composition.
A variable selection method included in WEKA was used in order to remove noisy variables and to retain only the ones relevant to each endpoint. In both cases, the most significant descriptors were selected using the InfoGain variable selection (InfoGainAttributeEval) with Ranker evaluator.
InfoGainAttributeEval measured the attribute's information gain with respect to the current endpoint, whereas Ranker prioritized the variables and removed the lower-ranking ones.30 In this way the modelling computational time and space were reduced, and the predictive performance was greatly improved.
Consequently, we proceeded with model development with the aim to correlate the available endpoints to the selected molecular descriptors. The machine learning method that proved to best correlate the available data was the k-nearest neighbours (kNN) methodology. The kNN method belongs to the “lazy” (instance-based) learning techniques, that classify an instance based on the closest training examples (neighbours) in the feature space. Each instance is assigned to the class indicated by the weighted majority vote of the k closest neighbours.30 This prediction scheme places the kNN method among the read-across strategies, as it requires only a few neighbouring – in terms of similarity – decorators, in order to predict the MWCNT's endpoint class.31 Among the modelling parameters, an optimal k value has been selected, with Euclidean distance between the chosen descriptors and the inversed distance as the weighting factor for the majority vote.
The kNN method was employed in our workflow, using the EnaloskNN KNIME node.32 With this node, apart from the endpoint predictions, we were able to identify the groups of k neighbours of each test decorated CNT and later map the analogous area, as required by the read-across framework.33
(1) |
(2) |
(3) |
Positive predicted | Negative predicted | |
---|---|---|
Positive observed | TP | FN |
Negative observed | FP | TN |
The selected model was finally validated using the external test-blank set by calculating the same accuracy measurements (eqn (1)–(3)). The final model was considered satisfactory when the values of all the above statistics exceeded 0.7. In the case that the previous criterion was not satisfied, the external partitioning into training and test sets was repeated, as well as the internal partitioning and all the processes of model development and validation.
Moreover, the Y-randomization test was performed in the internal loop, in order to validate the robustness and the statistical significance of the produced models. In this test, all modelling calculations were repeated several times, using the original values of the independent variables, but also using randomly shuffled values for the dependent endpoint. The statistical metrics of the so-produced models were evaluated and were expected to be reduced in comparison to those of the initial model, thus demonstrating that the initial model was not the result of random chance. If this was not the case, both the applied methodology and the training set would not produce reliable predictive models.35 In addition to the previous validation practices, internal validation was performed in order to reduce the bias produced from a possible unbalanced representation of the two classes between the two subsets. Both for the calibration sets (inner loop) and the training set (external loop), leave-one-out (LOO) and leave-five-out (L5O) cross-validation methods were employed for both models (protein binding and cell viability).
APD = 〈d〉 + Zσ | (4) |
For the development of our model, the dataset of 83 MWCNTs with the same core and different organic surface ligands (decorators), tested in vitro for carbonic anhydrase (CA) binding and acute toxicity (% cell viability), as described above, has been used.13,17 Two QNAR models were built to classify samples as “binders” and “non-binders” as well as “toxic” and “non-toxic” to assess their CA binding and toxicity.
Since the surface modification differentiated the MWCNTs, we transformed their structural, topological and geometrical characteristics into numerical values, using Mold2 descriptors.25 EnalosMold2 KNIME node was used to calculate 777 molecular descriptors for each decorator that were then reduced to 403 descriptors for QNAR development after filtering out descriptors that contained the same values at percentage equal or higher than 20%.
For the development of each model, the dataset of decorators was randomly divided into training and test sets in a ratio of 75:25. The descriptor values of the training set were normalized, and the applied normalization parameters were used for the normalization of the test set during external validation. The training set was further divided into calibration and validation sets in a proportion that ensured that the calibration set contained 50% of the samples of the initial dataset (75% of the training set). The variable selection and model building processes followed, and the produced model performances were tested using the corresponding validation set. The processes of partitioning and model development were repeated until a satisfactory model was built (inner loop).
The InfoGain variable selection with Ranker evaluator method (which are included in the WEKA platform), were applied to the calibration data, to select the most critical, among the 403 available descriptors. From the ranked descriptors, six emerged as important for predicting the CA binding endpoint and six descriptors have been selected as the most relevant to predicting the toxicity endpoint, as well.37
The proposed KNIME workflow gave us the flexibility to test the performance of different modelling methodologies and finally select the best performing combination. Among the applied methodologies, the k-nearest neighbours (kNN) appeared to outperform the others, providing the best correlation between the selected descriptors and the endpoints. The kNN method was applied to the calibration data with an optimized value for the number of neighbours equal to 3 for the CA binding model and, equal to 7 for the toxicity model. After model development based on the calibration data, binding and toxicity predictions for the validation set of decorated MWCNTs were performed. In order to test the accuracy of the developed models, several statistical measurements were calculated, as described in the Materials and methods section, consistent with the OECD proposed tests. Table 2 presents the accuracy statistics of the models for validation sets (internal loop). The Y-randomization robustness test when applied, proved the statistical significance of the proposed models. Random shuffles of the endpoints were performed while the descriptor matrix of the calibration set remained intact. Predictions using the validation set demonstrated that the resulting models (same parameters as the proposed ones) presented statistically lower predictive power (0.40–0.55 for the CA binding and 0.33–0.53 for the toxicity model) in comparison to the models using the original training values, thus the possibility of chance correlation was eliminated.
Model | Set | Accuracy | Sensitivity | Specificity |
---|---|---|---|---|
CA binding | Validation | 0.750 | 0.778 | 0.727 |
Test | 0.857 | 0.727 | 1.000 | |
Toxicity | Validation | 0.778 | 0.778 | 0.778 |
Test | 0.842 | 0.875 | 0.818 |
After the selection of the optimal model from the inner loop, predictions were performed using the test set of the external loop, in order to assess their actual performance in a blank dataset. The accuracy statistics using the test sets are also presented in Table 2.
As far as internal validation is concerned, the models' stability to the inclusion–exclusion of data was tested by performing L0O and L5O cross-validation, in the training sets. The accuracy values of cross-validation for both models are presented in Table 3 and are higher than 0.7 thus, both models can be considered stable.
CA binding | Toxicity | |
---|---|---|
L0O | 0.810 | 0.750 |
L5O | 0.833 | 0.722 |
Finally, the domain of applicability (APD) has been determined in order to define the area of reliable predictions. The APD threshold was calculated, according to the training set, to be 2.166 for the CA binding model. All samples in the test set had values in the range of 0.219–2.297. Similarly, for the toxicity model, the APD threshold was calculated equal to 1.805 and the decorators in the test set had values in the range of 0.25–2.305. Therefore, in both cases, the prediction for the samples that exceeded the APD threshold was considered unreliable.
A representative case of the read-across process is presented below using the sample AMOO4AC008 which belongs to both test sets for CA binding and toxicity. In Fig. 3, the 3 CA binding and the 7 toxicity neighbours are presented and their structural similarity in terms of common substituents is depicted using a color code. In Table 4 the neighbours, along with their distance from the AMOO4AC008 sample, are presented.
Sample | AMOO4AC008 |
Experimental | Non-binder/toxic |
Prediction | Non-binder/toxic |
CA binding | Toxicity | ||||
---|---|---|---|---|---|
Neighbours | Distance | Neighbours | Distance | ||
AMOO1AC008 | 0.1793 | Non-binder | AMOO5AC008 | 0.0420 | Toxic |
AMOO3AC008 | 0.2212 | Non-binder | AMOO5AC006 | 0.0704 | Toxic |
AMOO7AC006 | 0.3317 | Non-binder | AMOO3AC008 | 0.0733 | Toxic |
AMOO3AC007 | 0.0909 | Non-toxic | |||
AMOO4AC006 | 0.0928 | Toxic | |||
AMOO2AC006 | 0.1158 | Toxic | |||
AMOO8AC006 | 0.1185 | Toxic |
CA binding | Toxicity | ||
---|---|---|---|
D522 | Mean molecular topological order-2 charge index | D468 | Geary topological structure autocorrelation length-6 weighted by atomic Sanderson electronegativities |
D473 | Geary topological structure autocorrelation length-3 weighted by atomic polarizabilities | D173 | Mohar order-2 index |
D472 | Geary topological structure autocorrelation length-2 weighted by atomic polarizabilities | D454 | Geary topological structure autocorrelation length-8 weighted by atomic masses |
D269 | Information content order-0 index | D254 | Radial centric index |
D133 | Mean value of atomic composition index | D250 | EXP5 of path-distance/walk-distance over all atoms |
D541 | Lowest eigenvalue from Burdex matrix weighted by van der Waals order-2 | D255 | Vertex distance count equality index |
The user can insert one or several structures of compounds being considered as potential decorating molecules for MWCNTs and get, within seconds, the prediction of the CA binding and their toxicity profile, along with a warning on the reliability of the predictions based on the models' domain of applicability limits. The user has three different options for providing the structures of the compounds to be screened: (i) by drawing the chemical structure of interest, (ii) by entering the SMILES notation of the compounds in the appropriate field or (iii) by uploading an .sdf file with a batch of compounds (Fig. 4). During a safe-by-design process, different data sets with decorators of interest can be imported, and their effects on the biological and toxicological behaviour of the resulting decorated MWCNTs can be studied.
The developed models can be used under a virtual screening framework for the development of novel, plus safe, decorated MWCNTs. As an initial case study, we tried to improve the profiles of MWCNT samples identified in the initial dataset as having unsatisfactory toxicity and high protein binding properties (toxic and a CA binder sample). We have to underline at this point that, depending on the nature of the specific proteins that bind, protein binding can increase a NM's engagement with specific cellular receptors thus enhancing uptake, or can increase or reduce the susceptibility to phagocytosis (depending on whether the corona presents opsonising or disopsonising proteins) or can create cryptic epitopes in cellular signaling proteins causing toxic responses.17,44 As a second case study we performed a sensitivity analysis in order to explore the toxicity and the protein binding limits of the samples, by inserting, deleting or modifying substituents at different positions of the decorators. These safe-by-design case studies are presented below.
After filtering the duplicate generated substituent SMILES, we created a list of 942 candidate surface modifiers by combining the different substituents in positions R1 and R2 with the core molecule. We uploaded an .sdf file including these structures to the web-service, and within seconds we acquired the predictions for their CA binding and toxicity profiles, as well as the reliability of these predictions according to the APD limits. According to our initial plan we were only interested in MWCNTs with reduced toxicity and low protein binding, thus from the generated predictions we focused only on non-toxic and CA non-binder results. From these, we excluded the samples with unreliable outcomes and 32 MWCNT samples with desired properties remained. In a final step we checked if the valence on the atoms of the structure is correct in KNIME, using the Valence Checker node. The valence was correct for the structures, therefore they can be considered feasible. Three candidate surface decorators are presented in Table 6.
The predictive power of the proposed models is improved in terms of sensitivity and specificity, especially in the case of the toxicity endpoint, compared to the models developed by Fourches et al.13 and Singh et al.19 Shao et al.12 and Esposito et al.18 reported high accuracy statistics, nevertheless, their findings are not directly comparable with the results reported here, as they considered a decreased dataset focused only on the most toxic 29 MWCNTs.
The main advantages of the models presented here compared to other relevant models proposed in the literature,12,13,19 are: the immediate release and dissemination of the models to all interested parties through the user-friendly interface of the Enalos Cloud platform, the important new insights into the significant molecular descriptors and the determination of the domain of applicability of the model allowing for the discrimination between reliable and unreliable predictions. The web service is publicly available and ready-to-use by any interested user (e.g., experimentalists or regulators) in the computer-aided design of novel MWCNTs or in the prioritization of novel potent MWCNTs based on their predicted toxic effects, taking into account that predictions can be produced rapidly (about 30 seconds) along with an indication of their reliability. Thus, it represents a useful tool within a safety-by-design framework and can contribute to the reduction of in vivo experiments and their replacement by in vitro and in due course only in silico experiments. Finally, the dissemination of the models facilitates their utility as they are easily expandable and adjustable to address the requirements of other NMs, other decorating molecules or other toxicity end-points, provided sufficient experimental data is available to train the extended models.
While it was not possible based on the current dataset to link the binding and toxicity QNARs, since the uptake studies were performed in serum-containing medium rather than on the single protein-bound MWCNTs (i.e. CA-MWCNT complexes) it is clear that as suitable datasets become available where protein binding and toxicity are performed under the same conditions, a linked model, that can determine whether high protein binding correlates with high or low toxicity, would be possible. Indeed, reduction of protein binding via surface decoration of NMs with PEG or other hydrophilic polymers has been suggested as a route to reducing their recognition and phagocytosis as a “stealth” strategy for nanomedicines.47,48 Conversely, corona thickness as driven by use of different media supplemented with 10% foetal bovine serum was shown to affect cellular uptake and toxicity for gold NMs: while DMEM elicited the formation of a large time-dependent protein corona, RPMI showed different dynamics with reduced protein coating which correlated with more abundant internalized by two cell lines (HeLa and U937) cells and higher cytotoxic effects as compared to DMEM.49 Similarly, models predicting which proteins in the NM corona drive cellular association have been developed,14,50 so the ultimate QNAR will link protein binding amount, presence of specific proteins linked to cellular adhesion and uptake, and the toxicity effects, thus enabling safe-by-design based on several critical aspects that must be controlled for drug delivery and for safe utilization of NMs broadly.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8na00142a |
This journal is © The Royal Society of Chemistry 2019 |