NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

Kanika Dhiman; Subhash Mohan Agarwal

doi:10.1039/C6RA02772E

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors†

Kanika Dhiman^a and Subhash Mohan Agarwal*^a

* Corresponding authors

^a Bioinformatics Division, Institute of Cytology and Preventive Oncology, I-7, Sector-39, Noida-201301, India
E-mail: smagarwal@yahoo.com

Abstract

The prediction of naturally occurring plant based compounds as anticancer agents is the key to developing new chemical entities in the area of therapeutic oncology. Therefore, in the present study various machine learning techniques viz. Naive Bayesian classifier (NB), sequential minimal optimization (SMO), instance based learner (IBK) and random forest (RF) have been used to develop models of the relationship between the chemical structures of plant based natural compounds and their anti-cancerous inhibition activity. These models were trained, tested and validated using 549 active and 424 inactive compounds deposited in the NPACT database. We observe that the random forest based model using 881 PubChem fingerprints showed the best performance with an MCC of 0.54 and an accuracy of 77.6% on a five-fold cross-validation set and an MCC of 0.35 with an accuracy of 68.4% on an independent external validation set. Also, a frequency-based feature selection method was used to identify the fingerprints that have differential occurrence percentages in an active inhibitor dataset from an inactive set. We find that almost the entire top 10 fingerprints (FP797, FP818, FP12, FP179, FP3, FP143, FP712, FP704, FP334 and FP711) are present in vincristine, vinblastine and paclitaxel, the three therapeutic drugs that are derived from natural products and used as anticancer drugs in clinics. Finally, we have also developed a web server NPred, to predict the potential of natural compounds as anticancer agents and thus help the researchers working in this area. We expect that the results of this study will pave the way for identifying and designing novel natural products as cancer growth inhibitors.

Supplementary files

Article information

DOI: https://doi.org/10.1039/C6RA02772E
Article type: Paper
Submitted: 30 Jan 2016
Accepted: 06 May 2016
First published: 09 May 2016

Download Citation

RSC Adv., 2016,6, 49395-49400

Permissions

Request permissions

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

K. Dhiman and S. M. Agarwal, RSC Adv., 2016, 6, 49395 DOI: 10.1039/C6RA02772E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

RSC Advances

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors†

Abstract

Supplementary files

Article information

Download Citation

Permissions

NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

Social activity

Search articles by author

Spotlight

Advertisements