VCU Bioinformatics and Bioengineering Summer Institute

Using HINT to Model Non-covalent Interactions in Electron Density Maps

The structure of molecules invisible to the naked eye has become incredibly important both in better understanding biochemical interaction in life forms and in the design of drugs. Based solely upon the structure of a protein, and prior to any expensive and time-consuming wet lab, it is possible (in certain cases) to determine whether drug candidates have any potential binding affinity for their target. With an ever increasing number of anti-biotic resistant strains of pathogenic bacteria, and the emergence of deadly viruses, the development of effective drugs is more pertinent than ever. Already, structural and medicinal chemists have had success in developing specifically designed targeting drugs. A prime example of this process is the development of HIV1 protease inhibitors However, to build such drugs, one must first have a model into which the drug candidate can be fit.
Ever since it was utilized by Rosalind Franklin during the discovery of the structure of DNA, x-ray crystallography has been the primary tool utilized by structural chemists in determining DNA/RNA structure and the tertiary and quaternary structures of proteins. The process of x-ray crystallography consists of crystalizing (precipitating the protein into a crystalline structure) the target protein and then bombarding the structure with x-rays. X-rays are diffracted by crystalline structures and are thus scatterd by the electron clouds orbiting the nucleus of the atoms. By measuring the diffraction of the x-rays from many angles, and comparing it with the expected diffraction from the particular crystal structure formed by the protein, one can construct an electron density map. Since the highest concentrations of electrons will be orbiting nuclei, the electron density map provides a 3 dimensional estimate of the position of each atom.
Generating the electron density data is only the first step in determining overall structure. Only after threading the known amino acid sequence, atom by atom, through the high density regions of the map is the 3-dimensional structure complete. Despite the fact that x-ray crystallography provides the (theoretically) most accurate view of these microscopic structures the process is imperfect. Several possible problems arise. First, the protein may not fully crystalize (or may not do so at all, in which case the researcher must use NMR). Second, the highest attainable resolution is ~1 angstroms but typical experimental resolutions can be significantly higher than that (i.e. ~2.5 angstroms), at the less precise resolutions, atom position is often ambiguous. Finally, even with high resolution results, determining the precise position and orientation of a side-chain is a difficult process at the early stages of model building.
Like so many other fields, the relatively recent surge in computing power has allowed structural chemists to develop tools that partially alleviate these problems. With regard to x-ray crystallography, the primary benefit of such powerful computers has been the generation of programs that can create 3-dimensional models based on experimental data and perform thousands of calculations each second, effectively modeling, in silico, the interactions between each atom. One such set of programs attempts to fit the protein structure to an electron density map. By coding known “rules of interaction” between atoms into such programs the computer can create a best guess at the proper threading of a protein’s sequence through the electron density map. Thus, even with early, rough results, the computer can begin mapping the structure of the molecule before the conclusion of the experiment. There exists an initiative amongst a consortium of labs to generate enough 3-dimensional structures of proteins that a program could potentially generate the tertiary structure of a protein based solely on its similarity to other proteins. However, both a lack of sufficient models and our incomplete understanding of atomic interactions currently inhibit our ability to correctly predict tertiary structure. In fact, even the programs that attempt to thread a protein through an electronic density field make incorrect assertions about potential structures. A simple example: a program may create an appropriate ring-shaped backbone around an active site, but rather than orienting portions of adjacent amino acids away from each other, it may simply face all the ends of the amino acids into the center of the ring. Any knowledgeable human would instantly detect this error, but as far as the program is concerned, it is a perfectly valid configuration. This is because, while the computer is good at modeling electrostatic interactions between atoms, most software packages do a miserable job at accounting for the obfuscated hydrophobic effect. The hydrophobic effect is, essentially, all non-polar interactions between molecules.
However, certain researchers, realizing that current models will always be inaccurate if they fail to account for the hydrophobic effect have taken steps to account for it. The HINT program, developed by Glen Kellogg, calculates a score based on noncovalent interactions between each atom. The algorithm used to develop the scores is *equation forthcoming*. The equation is explained as follows: “ai and aj are hydrophobic atom constants for the two atoms being evaluated, S is the solvent accessible surface are, T is a function that differentiates polar-polar interactions (acid-acid, acid-base or base-base), and R, r are functions of the distance between atoms i and j” (Cashman, 120). The HINT program sums the score for each atom to obtain a score for the entire molecule.
After the development of HINT, the next step is to utilize the results of the calculations to make existing models more accurate. One program that currently does a good job of generating three dimensional models of proteins from electron density maps is XTalView. The program is especially useful during the beginning stages of the crystallography, before the repetition of many phases. During this time, electron density is noisy, but XTalView can utilize these approximations to create a fairly accurate representation of the molecule. Also, side-chains are difficult to generate because of this noise, and often are only modelled using placeholders and later filled in after acquiring more comprehensive data. However, in threading the amino acid chain through the map, XTalView orients portion of the molecule in a manner that is immediately obvious as incorrect to a researcher examining the data. This project seeks to extend XTalView to consider HINT scores while generating these rough drafts of the final tertiary structure, creating an accurate model from less accurate data. XTalView is a good candidate to interface with HINT because it is extensible (i.e. it was designed so that programmers could expand its functionality) and it already considers several factors during the fitting of the model to the electron density map. These factors are weighted, and thus one could examine the models both with and without the HINT scores, allowing one to judge how the noncovalent interactions help to refine the tertiary structure.