Abstract
Protein structure refinement is an important but unsolved problem; it must be solved if we are to predict biological function that is very sensitive to structural details. Specifically, the Critical Assessment of Techniques for Protein Structure Prediction (CASP) shows that the accuracy of predictions in the comparative modeling category is often worse than that of the template on which the homology model is based. Here we describe a refinement protocol that is able to consistently refine submitted predictions for all categories at CASP7. The protocol uses direct energy minimization of a knowledge-based potential of mean force that is based on the interaction statistics of 167 atom types (Summa and Levitt, Proc Natl Acad Sci USA 2007;104:3177-3182). Our protocol is thus computationally very efficient; it takes only a few minutes of CPU time to run on a typical protein model (300 residues). We observe an average structural improvement of 1% in the Global Distance Test score (GDT-TS) for predictions that have low and medium homology to known PDB structures (GDT-TS between 50 and 80%). We also observe a marked improvement in the stereochemistry of the models. The level of improvement varies among the participants at CASP, but we see large improvements (>10% increase in GDT-TS) even for models predicted by the best-performing groups at CASP7. In addition, our protocol consistently improved the best predicted models in the refinement category at CASP7 and CASP8. These improvements in structure and stereochemistry demonstrate the usefulness of our computationally inexpensive, powerful and automatic refinement protocol.
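The GDT-TS figures quoted in the abstract can be made concrete with a small sketch. The function below is a simplified illustration, not the CASP evaluation code and not part of the authors' protocol: it assumes the model has already been superimposed on the native structure and that the Cα coordinates are paired residue by residue, whereas the full GDT procedure searches over many superpositions to maximize each fraction. The name `gdt_ts` and the toy coordinates are hypothetical.

```python
import numpy as np

# Standard GDT-TS distance cutoffs, in angstroms.
GDT_TS_CUTOFFS = (1.0, 2.0, 4.0, 8.0)


def gdt_ts(model_ca, native_ca, cutoffs=GDT_TS_CUTOFFS):
    """Simplified GDT-TS (in percent) for one fixed superposition.

    Assumption: both inputs are (N, 3) arrays of C-alpha coordinates,
    already superimposed and in the same residue order.
    """
    model_ca = np.asarray(model_ca, dtype=float)
    native_ca = np.asarray(native_ca, dtype=float)
    if model_ca.shape != native_ca.shape or model_ca.ndim != 2 or model_ca.shape[1] != 3:
        raise ValueError("expected two arrays of identical shape (N, 3)")

    # Per-residue C-alpha deviation between model and native.
    distances = np.linalg.norm(model_ca - native_ca, axis=1)

    # Fraction of residues within each cutoff, averaged over the cutoffs.
    fractions = [np.mean(distances <= cutoff) for cutoff in cutoffs]
    return 100.0 * float(np.mean(fractions))


# Toy example (hypothetical coordinates): four residues displaced along x
# by 0.5, 1.5, 3.0 and 10.0 angstroms respectively.
native = np.zeros((4, 3))
model = native.copy()
model[:, 0] = [0.5, 1.5, 3.0, 10.0]

score = gdt_ts(model, native)
print(f"GDT-TS = {score:.2f}%")  # fractions 1/4, 2/4, 3/4, 3/4 -> 56.25%
```

On this scale, the average 1% improvement reported above corresponds to a one-point change in the averaged fraction of residues that fall within the four cutoffs.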
Original language | English |
---|---|
Pages (from-to) | 2668-2678 |
Number of pages | 11 |
Journal | Proteins: Structure, Function and Bioinformatics |
Volume | 78 |
Issue number | 12 |
State | Published - Sep 2010 |
Externally published | Yes |
Keywords
- CASP7
- Comparative modeling
- ENCAD
- Knowledge-based
- MESHI
- Refinement
- Stereochemistry