Semi-Supervised Transductive Hot Spot Predictor Working on Multiple Assumptions

ISSN: 2212-392X (Online)
ISSN: 1574-8936 (Print)


Volume 9, 5 Issues, 2014


Download PDF Flyer




Current Bioinformatics

Aims & ScopeAbstracted/Indexed in

Ranking and Category:
  • 20th of 52 in Mathematical & Computational Biology

Submit Abstracts Online Submit Manuscripts Online

Editor-in-Chief:
Alessandro Giuliani
Istituto Superiore di Sanitá (Italian NIH) Environment and Health Dept
Roma
Italy


View Full Editorial Board

Subscribe Purchase Articles Order Reprints

Current: 1.726
5 - Year: 1.577

Semi-Supervised Transductive Hot Spot Predictor Working on Multiple Assumptions

Author(s): Jim Jing-Yan Wang, Islam Khaleel Almasri, Yuexiang Shi and Xin Gao

Affiliation: Xin Gao, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.

Abstract

Protein-protein interactions are critically dependent on just a few residues (“hot spots”) at the interfaces. Hot spots make a dominant contribution to the binding free energy and if mutated they can disrupt the interaction. As mutagenesis studies require significant experimental efforts, there exists a need for accurate and reliable computational hot spot prediction methods. Compared to the supervised hot spot prediction algorithms, the semi-supervised prediction methods can take into consideration both the labeled and unlabeled residues in the dataset during the prediction procedure. The transductive support vector machine has been utilized for this task and demonstrated a better prediction performance. To the best of our knowledge, however, none of the transductive semi-supervised algorithms takes all the three semisupervised assumptions, i.e., smoothness, cluster and manifold assumptions, together into account during learning. In this paper, we propose a novel semi-supervised method for hot spot residue prediction, by considering all the three semisupervised assumptions using nonlinear models. Our algorithm, IterPropMCS, works in an iterative manner. In each iteration, the algorithm first propagates the labels of the labeled residues to the unlabeled ones, along the shortest path between them on a graph, assuming that they lie on a nonlinear manifold. Then it selects the most confident residues as the labeled ones for the next iteration, according to the cluster and smoothness criteria, which is implemented by a nonlinear density estimator. Experiments on a benchmark dataset, using protein structure-based features, demonstrate that our approach is effective in predicting hot spots and compares favorably to other available methods. The results also show that our method outperforms the state-of-the-art transductive learning methods.

Keywords: Hot spot prediction, multiple semi-supervised assumptions, nonlinear density estimator, nonlinear manifold, semisupervised learning.

Purchase Online Rights and Permissions

  
  



Article Details

Volume: 9
Issue Number: 3
First Page: 258
Last Page: 267
Page Count: 10
DOI: 10.2174/1574893609999140523124421
Advertisement

Related Journals




Webmaster Contact: urooj@benthamscience.org Copyright © 2014 Bentham Science