Integrating multiscale and machine learning approaches towards the SAMPL9 log P challenge

Michael R. Draper; Asa Waterman; Jonathan E. Dannatt; Prajay Patel

doi:10.1039/D3CP04140A

Michael R. Draper,^a Asa Waterman, IV,^a Jonathan E. Dannatt*^a and Prajay Patel*^a

Author affiliations

* Corresponding authors

^a Chemistry Department, University of Dallas, Irving, Texas, USA
E-mail: jdannatt@udallas.edu, pmpatel@udallas.edu

Abstract

The partition coefficient (log P) is an important physicochemical property that provides information regarding a molecule's pharmacokinetics, toxicity, and bioavailability. Methods to accurately predict the partition coefficient have the potential to accelerate drug design. In an effort to test current methods and explore new computational techniques, the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) has established a blind prediction challenge. The ninth iteration of the challenge was to predict the toluene–water partition coefficient (log P_tol/w) of sixteen drug molecules. Herein, three approaches are reported, broadly under the categories of quantum mechanics (QM), molecular mechanics (MM), and data-driven machine learning (ML). The three blind submissions yield mean unsigned errors (MUEs) ranging from 1.53 to 2.93 log P_tol/w units. The MUEs were reduced to 1.00 log P_tol/w units for the QM methods. While MM and ML methods outperformed DFT approaches for challenge molecules with fewer rotational degrees of freedom, they suffered for the larger molecules in this dataset. Overall, DFT functionals paired with a triple-ζ basis set were the simplest and most effective tool to obtain quantitatively accurate partition coefficients.
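As a rough illustration of the quantities named in the abstract, the sketch below shows one common way to convert solvation free energies into log P_tol/w and to score predictions with the MUE. It is not taken from the article: the function names, the use of kcal/mol, the temperature of 298.15 K, and the relation log P = (ΔG_solv,water − ΔG_solv,toluene)/(RT ln 10) are assumptions made here for illustration, and the numbers in the usage lines are invented, not challenge data.

```python
import math

# Gas constant in kcal/(mol K); the unit choice is an assumption of this sketch.
R_KCAL = 1.987204e-3


def log_p_from_solvation(dg_water, dg_toluene, temperature=298.15):
    """Estimate log P_tol/w from solvation free energies in kcal/mol.

    A more negative free energy in toluene than in water gives a positive
    log P, i.e. the solute prefers the toluene phase.
    """
    return (dg_water - dg_toluene) / (R_KCAL * temperature * math.log(10))


def mean_unsigned_error(predicted, reference):
    """Mean unsigned error (MUE) between predicted and reference values."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not predicted:
        raise ValueError("at least one value is required")
    total = sum(abs(p - r) for p, r in zip(predicted, reference))
    return total / len(predicted)


# Invented example values, for illustration only.
example_log_p = log_p_from_solvation(dg_water=-4.0, dg_toluene=-6.0)
example_mue = mean_unsigned_error([1.0, 2.0, 4.0], [2.0, 2.0, 2.0])
```

At 298.15 K, RT ln 10 is about 1.36 kcal/mol, so each log P unit corresponds to roughly that much transfer free energy; under this assumed relation, an MUE of 1.00 log P units is an average free energy error of about 1.4 kcal/mol.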

This article is part of the themed collection: The SAMPL Challenges

Article information

DOI: https://doi.org/10.1039/D3CP04140A
Article type: Paper
Submitted: 28 Aug 2023
Accepted: 12 Feb 2024
First published: 15 Feb 2024

Phys. Chem. Chem. Phys., 2024, 26, 7907–7919

Physical Chemistry Chemical Physics
