Prediction of Protein-Ligand Interaction Based on the Positional Similarity Scores Derived from Amino Acid Sequences

Int J Mol Sci. 2019 Dec 18;21(1):24. doi: 10.3390/ijms21010024.

Abstract

The affinity of different drug-like ligands to multiple protein targets reflects general chemical-biological interactions. Computational methods estimating such interactions analyze the available information about the structure of the targets, ligands, or both. Prediction of protein-ligand interactions based on pairwise sequence alignment provides reasonable accuracy if the ligands' specificity well coincides with the phylogenic taxonomy of the proteins. Methods using multiple alignment require an accurate match of functionally significant residues. Such conditions may not be met in the case of diverged protein families. To overcome these limitations, we propose an approach based on the analysis of local sequence similarity within the set of analyzed proteins. The positional scores, calculated by sequence fragment comparisons, are used as input data for the Bayesian classifier. Our approach provides a prediction accuracy comparable or exceeding those of other methods. It was demonstrated on the popular Gold Standard test sets, presenting different sequence heterogeneity and varying from the group, including different protein families to the more specific groups. A reasonable prediction accuracy was also found for protein kinases, displaying weak relationships between sequence phylogeny and inhibitor specificity. Thus, our method can be applied to the broad area of protein-ligand interactions.

Keywords: local sequence similarity; prediction of protein targets; protein–ligand interaction.

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Area Under Curve
  • Binding Sites
  • Computational Biology
  • Databases, Protein
  • Humans
  • Ligands*
  • Models, Molecular*
  • Phylogeny
  • Protein Binding
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism
  • ROC Curve

Substances

  • Ligands
  • Proteins