Title: Predicting Protein Subcellular Localization by Pseudo Amino Acid Composition with a Segment-Weighted and Features-Combined Approach
Volume: 18
Issue: 5
Author(s): Wei Wang, XingBo Geng, Yongchao Dou, Taigang Liu and Xiaoqi Zheng
Affiliation:
Keywords:
Jackknife test, mature protein, optimal splice site, pseudo amino acid composition, sorting signal, subcellular localization, segment-weighted, SVM classifier, prokaryotic and eukaryotic proteins, fluorescence microscopy, transmembrane proteins, GalNAc-transferase, stereochemical, MCC, Intel XeonJackknife test, mature protein, optimal splice site, pseudo amino acid composition, sorting signal, subcellular localization, segment-weighted, SVM classifier, prokaryotic and eukaryotic proteins, fluorescence microscopy, transmembrane proteins, GalNAc-transferase, stereochemical, MCC, Intel Xeon
Abstract: Information of protein subcellular location plays an important role in molecular cell biology. Prediction of the subcellular location of proteins will help to understand their functions and interactions. In this paper, a different mode of pseudo amino acid composition was proposed to represent protein samples for predicting their subcellular localization via the following procedures: based on the optimal splice site of each protein sequence, we divided a sequence into sorting signal part and mature protein part, and extracted sequence features from each part separately. Then, the combined features were fed into the SVM classifier to perform the prediction. By the jackknife test on a benchmark dataset in which none of proteins included has more than 90% pairwise sequence identity to any other, the overall accuracies achieved by the method are 94.5% and 90.3% for prokaryotic and eukaryotic proteins, respectively. The results indicate that the prediction quality by our method is quite satisfactory. It is anticipated that the current method may serve as an alternative approach to the existing prediction methods.