Assessment of optimized Markov models in protein fold classification

J Bioinform Comput Biol. 2014 Aug;12(4):1450016. doi: 10.1142/S0219720014500164. Epub 2014 Jul 14.

Abstract

Protein fold classification is a challenging task strongly associated with the determination of proteins' structure. In this work, we tested an optimization strategy on a Markov chain and a recently introduced Hidden Markov Model (HMM) with reduced state-space topology. The proteins with unknown structure were scored against both these models. Then the derived scores were optimized following a local optimization method. The Protein Data Bank (PDB) and the annotation of the Structural Classification of Proteins (SCOP) database were used for the evaluation of the proposed methodology. The results demonstrated that the fold classification accuracy of the optimized HMM was substantially higher compared to that of the Markov chain or the reduced state-space HMM approaches. The proposed methodology achieved an accuracy of 41.4% on fold classification, while Sequence Alignment and Modeling (SAM), which was used for comparison, reached an accuracy of 38%.

Keywords: Fold classification; Markov chain; hidden Markov model; optimization.

MeSH terms

  • Databases, Protein
  • Markov Chains*
  • Models, Molecular*
  • Protein Folding*