iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition. Heat shock proteins (HSPs) are a type of functionally related proteins present in all living organisms, both prokaryotes and eukaryotes. They play essential roles in protein-protein interactions such as folding and assisting in the establishment of proper protein conformation and prevention of unwanted protein aggregation. Their dysfunction may cause various life-threatening disorders, such as Parkinson’s, Alzheimer’s, and cardiovascular diseases. Based on their functions, HSPs are usually classified into six families: (i) HSP20 or sHSP, (ii) HSP40 or J-class proteins, (iii) HSP60 or GroEL/ES, (iv) HSP70, (v) HSP90, and (vi) HSP100. Although considerable progress has been achieved in discriminating HSPs from other proteins, it is still a big challenge to identify HSPs among their six different functional types according to their sequence information alone. With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop a high-throughput computational tool in this regard. To take up such a challenge, a predictor called iHSP-PseRAAAC has been developed by incorporating the reduced amino acid alphabet information into the general form of pseudo amino acid composition. One of the remarkable advantages of introducing the reduced amino acid alphabet is being able to avoid the notorious dimension disaster or overfitting problem in statistical prediction. It was observed that the overall success rate achieved by iHSP-PseRAAAC in identifying the functional types of HSPs among the aforementioned six types was more than 87%, which was derived by the jackknife test on a stringent benchmark dataset in which none of HSPs included has ≥40% pairwise sequence identity to any other in the same subset. It has not escaped our notice that the reduced amino acid alphabet approach can also be used to investigate other protein classification problems. As a user-friendly web server, iHSP-PseRAAAC is accessible to the public at http://lin.uestc.edu.cn/server/iHSP-PseRAAAC.

References in zbMATH (referenced in 16 articles )

Showing results 1 to 16 of 16.
Sorted by year (citations)

  1. Adilina, Sheikh; Farid, Dewan Md; Shatabda, Swakkhar: Effective DNA binding protein prediction by using key features via Chou’s general PseAAC (2019)
  2. Chen, Guodong; Cao, Man; Yu, Jialin; Guo, Xinyun; Shi, Shaoping: Prediction and functional analysis of prokaryote lysine acetylation site by incorporating six types of features into Chou’s general PseAAC (2019)
  3. Jia, Jianhua; Li, Xiaoyan; Qiu, Wangren; Xiao, Xuan; Chou, Kuo-Chen: iPPI-PseAAC(CGR): identify protein-protein interactions by incorporating chaos game representation into PseAAC (2019)
  4. Pan, Yi; Wang, Shiyuan; Zhang, Qi; Lu, Qianzi; Su, Dongqing; Zuo, Yongchun; Yang, Lei: Analysis and prediction of animal toxins by various Chou’s pseudo components and reduced amino acid compositions (2019)
  5. Rout, Subhashree; Mahapatra, Rajani Kanta: \textitInsilico analysis of \textitplasmodiumfalciparum CDPK5 protein through molecular modeling, docking and dynamics (2019)
  6. Tian, Baoguang; Wu, Xue; Chen, Cheng; Qiu, Wenying; Ma, Qin; Yu, Bin: Predicting protein-protein interactions by fusing various Chou’s pseudo components and using wavelet denoising approach (2019)
  7. Arif, Muhammad; Hayat, Maqsood; Jan, Zahoor: IMem-2LSAAC: a two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into Chou’s pseudo amino acid composition (2018)
  8. Cheng, Xiang; Xiao, Xuan; Chou, Kuo-Chen: pLoc_bal-mGneg: predict subcellular localization of Gram-negative bacterial proteins by quasi-balancing training dataset and general PseAAC (2018)
  9. Jia, Cangzhi; Yang, Qing; Zou, Quan: NucPosPred: predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC (2018)
  10. Mei, Juan; Fu, Yi; Zhao, Ji: Analysis and prediction of ion channel inhibitors by using feature selection and Chou’s general pseudo amino acid composition (2018)
  11. Sabooh, M. Fazli; Iqbal, Nadeem; Khan, Mukhtaj; Khan, Muslim; Maqbool, H. F.: Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou’s PseKNC (2018)
  12. Zhang, Shengli; Liang, Yunyun: Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou’s PseAAC (2018)
  13. Bakhtiarizadeh, Mohammad Reza; Moradi-Shahrbabak, Mohammad; Ebrahimi, Mansour; Ebrahimie, Esmaeil: Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology (2014)
  14. Lin, Thy-Hou; Tsai, Tsung-Lin: Constructing a linear QSAR for some metabolizable drugs by human or pig flavin-containing monooxygenases using some molecular features selected by a genetic algorithm trained SVM (2014)
  15. Lyons, James; Biswas, Neela; Sharma, Alok; Dehzangi, Abdollah; Paliwal, Kuldip K.: Protein fold recognition by alignment of amino acid residues using kernelized dynamic time warping (2014)
  16. Nanni, Loris; Lumini, Alessandra; Brahnam, Sheryl: A set of descriptors for identifying the protein-drug interaction in cellular networking (2014)