Robust Structure-based Protein NMR Assignments
using Native Structural Ensembles
Time: November 12, 2007, 1pm - 2pm
Place: D344, LSRC
Speaker:Mehmet Serkan Apaydin
An important step in NMR protein structure determination is the
assignment of resonances and NOEs to corresponding nuclei.
Structure-based assignment (SBA) uses a model structure (``template'')
for the target protein to expedite this process. Nuclear Vector
Replacement (NVR) is an SBA framework that combines multiple sources
of NMR data and has high accuracy when the template is close to the
target protein's structure (less than 2 Angstroms backbone RMSD).
However, a close template may not always be available. We extend the
circle of convergence of NVR for distant templates by using an
ensemble of structures. This ensemble corresponds to the low-frequency
perturbations of the given template and is obtained using Normal Mode
Analysis (NMA). Our algorithm assigns resonances and sparse NOEs using
each of the structures in the ensemble separately, and aggregates the
results using a voting scheme based on maximum bipartite matching.
Experimental results on a 76-residue protein, human ubiquitin, using
4 distant template structures (each with a backbone RMSD ranging
between 3.2-7.7 Angstroms to the target protein) show an increase of
up to 22% in the assignment accuracy. Our algorithm also improves the
robustness of NVR with respect to structural noise. We provide a
confidence measure for each assignment using the percentage of the
structures that agree on that assignment. We use this measure to
assign a subset (more than 35\%) of peaks with more than 90\%
accuracy. We further validate our algorithm on data for two additional
proteins with NVR. We then show the general applicability of our
approach by applying our NMA ensemble-based voting scheme to another
SBA tool, MARS. For three test proteins with corresponding templates,
including the 370-residue maltose binding protein, we increase the
number of reliable assignments made by MARS, by up to 4.2-fold.
Finally, we show that our voting scheme is sound and optimal, by
proving that it is a maximum likelihood estimator of the correct
assignments.
Joint work with Bruce Donald and Vincent Conitzer.