Issue 13, 2019

Quantitative structure–activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)

Abstract

The carcinogenicity or toxicity information of a substance can be quickly and easily obtained by using a quantitative structure–activity relationship (QSAR) model. In this study, the carcinogenicity of PAHs was analyzed and predicted by using a random forest (RF) model with the molecular structure information and carcinogenicity data of PAHs. The molecular structure information of 91 PAHs was represented by molecular descriptors (such as structure descriptors, topology descriptors, molecular connectivity index and geometric descriptors) which were calculated by using Dragon5.4 software. The model parameters (ntree and mtry) and input variables were optimized and evaluated with respect to the accuracy, positive predictive value (PPV), negative predictive value (NPV) and out-of-bag (OOB) error. Then, based on the optimized model parameters and input variables, the RF, partial least squares-discriminant analysis (PLS-DA) and artificial neural network (ANN) models were constructed to predict the carcinogenicity of PAHs. The results show that the classification accuracy, PPV, NPV and modeling time are 0.9333, 0.8889, 1.0000 and 10.40 s for the RF model, respectively, which shows a better predictive ability than the PLS-DA and ANN models for the prediction of the carcinogenicity of PAHs. Therefore, it is demonstrated that RF are a very promising method for the accurate prediction of the carcinogenicity of PAHs.

Graphical abstract: Quantitative structure–activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)

Supplementary files

Article information

Article type
Paper
Submitted
13 Dec 2018
Accepted
03 Mar 2019
First published
04 Mar 2019

Anal. Methods, 2019,11, 1816-1821

Quantitative structure–activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)

N. Li, J. Qi, P. Wang, X. Zhang, T. Zhang and H. Li, Anal. Methods, 2019, 11, 1816 DOI: 10.1039/C8AY02720J

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements