Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein–protein interactions using machine learning methods

A. Srivastava; G. Mazzocco; A. Kel; L. S. Wyrwicz; D. Plewczynski

doi:10.1039/C5MB00672D

Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein–protein interactions using machine learning methods†

A. Srivastava,‡^a G. Mazzocco,‡^bc A. Kel,^d L. S. Wyrwicz^a and D. Plewczynski*^b

Author affiliations

* Corresponding authors

^a Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Warsaw, Poland

^b Centre of New Technologies, University of Warsaw, Banacha 2c Str., 02-097 Warsaw, Poland
E-mail: d.plewczynski@cent.uw.edu.pl

^c Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland

^d GeneXplain GmbH, Am Exer 10b, D-38302, Wolfenbüttel, Germany

Abstract

Protein–protein interactions (PPIs) play a vital role in most biological processes. Hence their comprehension can promote a better understanding of the mechanisms underlying living systems. However, besides the cost and the time limitation involved in the detection of experimentally validated PPIs, the noise in the data is still an important issue to overcome. In the last decade several in silico PPI prediction methods using both structural and genomic information were developed for this purpose. Here we introduce a unique validation approach aimed to collect reliable non interacting proteins (NIPs). Thereafter the most relevant protein/protein-pair related features were selected. Finally, the prepared dataset was used for PPI classification, leveraging the prediction capabilities of well-established machine learning methods. Our best classification procedure displayed specificity and sensitivity values of 96.33% and 98.02%, respectively, surpassing the prediction capabilities of other methods, including those trained on gold standard datasets. We showed that the PPI/NIP predictive performances can be considerably improved by focusing on data preparation.

Supplementary files

Article information

DOI: https://doi.org/10.1039/C5MB00672D
Article type: Paper
Submitted: 08 Oct 2015
Accepted: 21 Dec 2015
First published: 21 Dec 2015

Download Citation

Mol. BioSyst., 2016,12, 778-785

Author version available

Download author version (PDF)

Permissions

Request permissions

Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein–protein interactions using machine learning methods

A. Srivastava, G. Mazzocco, A. Kel, L. S. Wyrwicz and D. Plewczynski, Mol. BioSyst., 2016, 12, 778 DOI: 10.1039/C5MB00672D

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Molecular BioSystems

Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein–protein interactions using machine learning methods†

Abstract

Supplementary files

Article information

Download Citation

Author version available

Permissions

Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein–protein interactions using machine learning methods

Search articles by author

Spotlight

Advertisements