TY - JOUR
T1 - A novel energy-based stochastic method for positioning polar protons in protein structures from X-rays
AU - Glick, Meir
AU - Goldblum, Amiram
PY - 2000/2/15
Y1 - 2000/2/15
N2 - A novel automated method for the optimal placement of polar hydrogens in a protein structure is presented. The algorithm adds initially, to a protein data bank file of the protein, nonrotatable hydrogens such as peptide backbone hydrogens according to geometric considerations. Then, water protons and polar side chain protons of lysine, serine, threonine, tyrosine, aspartic acid, glutamic acid, and the C and N termini of a protein are added according to energy considerations. A unique stochastic approach has been developed to overcome a combinatorial explosion in the search for the lowest energy structure. First, the system is divided into ensembles. Each ensemble is treated separately: N conformations are sampled at random, their energies computed, whereas common components of high-energy combinations are gathered on one hand, and low-energy combinations on the other. Components that yield only high-energy conformations and do not contribute to any low energies are excluded. This is reiterated while the total amount of combinations is decreased along the iterative process. When the total number of combinations is lower than a user defined threshold, all remaining combinations are evaluated by exhaustive search. Energy evaluations use nonbonding energy expressions alone. The program was tested on five high-resolution crystal structures: bovine pancreatic trypsin inhibitor (Brookhaven Protein Data Bank file 5PTI), RNase-A (5RSA), trypsin (1NTP), and carbon monoxymyoglobin (2MB5), for which neutron diffraction structures are available, as well as phosphate binding protein (1IXH) for which very high resolution X-ray crystallography was used. The low RMS values prove the efficiency of this algorithm as a tool for positioning protons in proteins. It may be used for other biological structures. (C) 2000 Wiley-Liss, Inc.
AB - A novel automated method for the optimal placement of polar hydrogens in a protein structure is presented. The algorithm adds initially, to a protein data bank file of the protein, nonrotatable hydrogens such as peptide backbone hydrogens according to geometric considerations. Then, water protons and polar side chain protons of lysine, serine, threonine, tyrosine, aspartic acid, glutamic acid, and the C and N termini of a protein are added according to energy considerations. A unique stochastic approach has been developed to overcome a combinatorial explosion in the search for the lowest energy structure. First, the system is divided into ensembles. Each ensemble is treated separately: N conformations are sampled at random, their energies computed, whereas common components of high-energy combinations are gathered on one hand, and low-energy combinations on the other. Components that yield only high-energy conformations and do not contribute to any low energies are excluded. This is reiterated while the total amount of combinations is decreased along the iterative process. When the total number of combinations is lower than a user defined threshold, all remaining combinations are evaluated by exhaustive search. Energy evaluations use nonbonding energy expressions alone. The program was tested on five high-resolution crystal structures: bovine pancreatic trypsin inhibitor (Brookhaven Protein Data Bank file 5PTI), RNase-A (5RSA), trypsin (1NTP), and carbon monoxymyoglobin (2MB5), for which neutron diffraction structures are available, as well as phosphate binding protein (1IXH) for which very high resolution X-ray crystallography was used. The low RMS values prove the efficiency of this algorithm as a tool for positioning protons in proteins. It may be used for other biological structures. (C) 2000 Wiley-Liss, Inc.
KW - Combinatorial explosion
KW - Computer algorithm
KW - Crystal structure
KW - Hydrogen bond
KW - Minimization
KW - Polar hydrogens
UR - http://www.scopus.com/inward/record.url?scp=0034651879&partnerID=8YFLogxK
U2 - 10.1002/(SICI)1097-0134(20000215)38:3<273::AID-PROT4>3.0.CO;2-I
DO - 10.1002/(SICI)1097-0134(20000215)38:3<273::AID-PROT4>3.0.CO;2-I
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 10713988
AN - SCOPUS:0034651879
SN - 0887-3585
VL - 38
SP - 273
EP - 287
JO - Proteins: Structure, Function and Genetics
JF - Proteins: Structure, Function and Genetics
IS - 3
ER -