Customizing scoring functions for docking

Tuan A Pham; Ajay N Jain

doi:10.1007/s10822-008-9174-y

Customizing scoring functions for docking

J Comput Aided Mol Des. 2008 May;22(5):269-86. doi: 10.1007/s10822-008-9174-y. Epub 2008 Feb 14.

Authors

Tuan A Pham¹, Ajay N Jain

Affiliation

¹ University of California, San Francisco, Box 0128, San Francisco, CA 94143-0128, USA.

Abstract

Empirical scoring functions used in protein-ligand docking calculations are typically trained on a dataset of complexes with known affinities with the aim of generalizing across different docking applications. We report a novel method of scoring-function optimization that supports the use of additional information to constrain scoring function parameters, which can be used to focus a scoring function's training towards a particular application, such as screening enrichment. The approach combines multiple instance learning, positive data in the form of ligands of protein binding sites of known and unknown affinity and binding geometry, and negative (decoy) data of ligands thought not to bind particular protein binding sites or known not to bind in particular geometries. Performance of the method for the Surflex-Dock scoring function is shown in cross-validation studies and in eight blind test cases. Tuned functions optimized with a sufficient amount of data exhibited either improved or undiminished screening performance relative to the original function across all eight complexes. Analysis of the changes to the scoring function suggest that modifications can be learned that are related to protein-specific features such as active-site mobility.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Binding Sites
Computer-Aided Design*
Databases, Protein
Drug Design*
HIV Protease / chemistry
HIV Protease / metabolism
Ligands
Models, Molecular
Poly(ADP-ribose) Polymerases / chemistry
Poly(ADP-ribose) Polymerases / metabolism
Software
Software Design

Substances

Ligands
Poly(ADP-ribose) Polymerases
HIV Protease
p16 protease, Human immunodeficiency virus 1

Abstract

Publication types

MeSH terms

Substances

Grants and funding