Issue 55, 2016, Issue in Progress

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

Abstract

The prediction of naturally occurring plant based compounds as anticancer agents is the key to developing new chemical entities in the area of therapeutic oncology. Therefore, in the present study various machine learning techniques viz. Naive Bayesian classifier (NB), sequential minimal optimization (SMO), instance based learner (IBK) and random forest (RF) have been used to develop models of the relationship between the chemical structures of plant based natural compounds and their anti-cancerous inhibition activity. These models were trained, tested and validated using 549 active and 424 inactive compounds deposited in the NPACT database. We observe that the random forest based model using 881 PubChem fingerprints showed the best performance with an MCC of 0.54 and an accuracy of 77.6% on a five-fold cross-validation set and an MCC of 0.35 with an accuracy of 68.4% on an independent external validation set. Also, a frequency-based feature selection method was used to identify the fingerprints that have differential occurrence percentages in an active inhibitor dataset from an inactive set. We find that almost the entire top 10 fingerprints (FP797, FP818, FP12, FP179, FP3, FP143, FP712, FP704, FP334 and FP711) are present in vincristine, vinblastine and paclitaxel, the three therapeutic drugs that are derived from natural products and used as anticancer drugs in clinics. Finally, we have also developed a web server NPred, to predict the potential of natural compounds as anticancer agents and thus help the researchers working in this area. We expect that the results of this study will pave the way for identifying and designing novel natural products as cancer growth inhibitors.

Graphical abstract: NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

Supplementary files

Article information

Article type
Paper
Submitted
30 Jan 2016
Accepted
06 May 2016
First published
09 May 2016

RSC Adv., 2016,6, 49395-49400

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

K. Dhiman and S. M. Agarwal, RSC Adv., 2016, 6, 49395 DOI: 10.1039/C6RA02772E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements