Computer Science 663
Algorithms in Structural Biology


Overview | Course Mechanics, and Textbooks | Schedule | Topics | How to give a good talk
Assignments | Supplemental Materials | Some Relevant WWW Links

This webpage is under construction. In particular, the syllabus, schedule, and topics pages are not final, but they give an idea.

For each class, I assign primary reading (textbook chapters or journal papers) which are the main thing to study. Background and introductory material is occasionally given to help you.

Schedule and Readings

Please check this webpage, and schedule frequently, since I will post new papers and new readings and new assignments frequently, as we proceed through the term.

Please note: These dates and times might move some (see "The Queue", below), as we adapt to the time required to discuss the papers, or if I am unexpectedly called to Washington, etc.

*Papers that are not available online (below) have been handed out on paper.

*RECOMB papers (Proceedings of the Nth Annual International Conference on Computational Molecular Biology (N=1,2,3,4,...)) are available online via the ACM Digital Library.

A few papers will be handed out in class. If you miss class, you can copy them from a classmate.

Announcements will be made in class. I will try to post them here, so consult this website.

Here is a useful bibliography of papers (and PDFs) in the area of this course.


Class Schedule

NOTE: Some PDF links may only work when accessed while on Duke's network!

  1. Wednesday, 1/9 LSRC A155
    Presenting: Bruce Donald.
    Introduction to the Course
    Reading:
    ASMB Preface and ASMB Chapters 1-4.

    The Textbook for this class is: Algorithms in Structural Molecular Biology (MIT Press, 2011), also available on Amazon. We call this "ASMB" in these webpages.

    Assignment:
    Fill out Questionaire.

    Please download and install Pymol and King.

    I recommend for you the textbooks Brandon and Tooze, Streyer, and CLRS. They are valuable for this class.


  2. Monday, 1/14 LSRC A155
    Presenting: Bruce Donald.
    Big Questions in Computational Structural Biology
    Reading: Please see the 1/9 Lecture.

  3. Wednesday, 1/16 LSRC A155
    Presenting: Bruce Donald.
    Protein Backbone Geometry, NMR
    Pymol Demo
    Reading:

  4. Monday, 1/21 LSRC A155

    No Class, MLK Holiday

  5. Wednesday, 1/23 LSRC A155
    Presenting: JJ.
    KING Demo/Tutorial
    Due: Assignment #1
    Due: Assignment #2
    Reading:

  6. Monday, 1/28 LSRC A155
    Presenting: Bruce Donald.
    Nuclear Magnetic Resonance and Structural Biology: Introduction to NOESY and Chemical shifts
    Reading:
    ASMB Chapters 1-4.


  7. Wednesday, 1/30 LSRC A155
    Presenting: Bruce Donald.
    The Jigsaw Algorithm, Graphs, and Networks in Structural Biology
    Due: Assignment #3

    Students, to do before class: Solve DLL puzzle posed on Monday!
    Students, to do before class: What is the readout of the (HN,15N)-edited TOCSY experiment?

    Primary Reading: ASMB Chapter 8.
    Background Reading (NMR Review): ASMB Chapters 1-3.
    Biological and Biomedical significance of CBFandbeta;: [PDF, PDB structure model]


  8. Monday, 2/4 LSRC A155
    Presenting: Bruce Donald.
    Dead-End Elimination and Protein Design: Full Sequence Design
    Primary Reading: ASMB Chapter 11
    PDB id 1FSD, Full sequence design 1 (FSD-1) of beta beta alpha motif, NMR
    Background Reading: ASMB Chapters 10 and 9.

  9. Wednesday, 2/6 LSRC A155
    Presenting: Bruce Donald.
    Introduction to Research In Protein Design:
    Computational Protein Interface Design, Cystic Fibrosis, and HIV

    Reading: ASMB Chapter 12
    Zhou, T. et al. Structural Basis for Broad and Potent Neutralization of HIV-1 by Antibody VRC01. Science 329, 811-817 (2010). PDF
    Diskin, R. et al. Increasing the Potency and Breadth of an HIV Antibody by Using Structure-Based Rational Design. Science 334, 1289-1293 (2011).PDF Supplemental Materials

  10. Monday, 2/11 LSRC A155
    Presenting: Jeff Martin.
    Structure Determination of Symmetric Protein Complexes
    Slides
    Reading: ASMB Chapters 15-16, 17.5, and 18.1.4
    Background reading: ASMB 15-18

  11. Wednesday, 2/13 LSRC A155
    Presenting: Dr. Chittu Tripathy.
    Structure Determination of Protein Loops, and Ensembles of Loops
    Slides
    Reading: ASMB Chapters 15-16, 17.1, and 18.1.1
    Background reading: ASMB Chapters 24, 15-18

  12. Monday, 2/18 LSRC A155
    Presiding: Jonathan Jou.
    Recitation 1
    Slides
    Reading:

  13. Wednesday, 2/20 LSRC A155
    Presiding: Jonathan Jou.
    Recitation 2
    Slides
    Due Friday, 2/22/2013: Assignment #4
    Reading:

  14. Monday, 2/25 LSRC A155
    Presenting: Bruce Donald.
    Drug Design: An Example of Core-Binding Factor Beta
    and , the K* Algorithm (with Constant-factor speedup)

    Primary Reading:


    Background Reading (Design Review): ASMB Chapter 11.

  15. Wednesday, 2/27 LSRC A155
    Presenting: Bruce Donald.
    K* Algorithm (continued), MinDEE, and combinatorial speedup
    Reading: K*: ASMB Chapter 12,
    plus read ASMB chapters 15-18
    Slides

  16. Monday, 3/4 LSRC A155
    Presenting: Bruce Donald.
    An Example of Core-Binding Factor, Evidence for the benefit of Continuous Rotamers, Evidence for the benefit of Ensembles.
    Reading:
    Project Proposals Due

  17. Wednesday, 3/6 LSRC A155
    Presenting: Bruce Donald.
    Issues in Empirical Protein Redesign
    Slides
    Reading:

  18. Monday, 3/18 LSRC A155
    Presenting: Bruce Donald.
    RDCs and Structure Determination
    Boards: (all times ante meridiem)
    10:21:51 10:21:59 10:49:02 11:08:52 11:09:04 11:21:06 11:21:15
    Reading:
    ASMB Chapters 7, 15, 16, 17.3, 18.1.1
    Revised Project Proposals Due Friday 3/22

  19. Wednesday, 3/20 LSRC A155
    Presenting: Bruce Donald.
    RDC-based Structure Determiation
    Boards: (all times ante meridiem)
    10:27:15 10:43:27 11:11:00 11:11:09 11:11:16 Reading:
    ASMB Chapters 7, 15, 16, 17.3, 18.1.1

    Revised Project Proposals Due Friday

  20. Monday, 3/25 LSRC A155
    Presenting: Bruce Donald.
    RDC-based Structure Determiation
    Boards: (all times ante meridiem)
    10:22:50 10:39:27 11:03:14 11:17:13 Reading:
    ASMB Chapters 7, 15, 16, 17.3, 18.1.1


  21. Wednesday, 3/27 LSRC A155
    Presenting: Bruce Donald.
    RDC-based Structure Determiation
    Slides
    Boards: (all times ante meridiem)
    11:04:26 Reading:
    ASMB Chapters 7, 15, 16, 17.3, 18.1.1


  22. Monday, 4/1 LSRC A155
    Conferences
    Ryan 10:05~10:25
    Qing 10:25~10:45
    Jae 10:45~11:05
    Reading:

  23. Wednesday, 4/3 LSRC A155
    Conferences
    Ryenne 10:05~10:25
    Julia 10:25~10:45
    Yang 10:45~11:05
    Reading:

  24. Monday, 4/8 LSRC A155
    Presenting: Ryenne and Julia.
    Probing the Correlation between Deamidation of Aggrecan and Osteoarthritis through a Shotgun Proteomics Approach

    Abstract:
    Deamidiation of proteins, at the asparagine and glutamine residues, has been shown to reliably indicate age of a protein, a so-called "molecular clock". The deamidataion of a specific protein, aggrecan, has been linked to osteoarthritis, a degenerative joint disease. An algorithm has been developed to predict deamidation sites of the protein and have been validated through the use of mass spectrometry. Determining a correlation between structural stability of the protein and deamidation sites will allow for a more comprehensive understanding of the link between deamidation and osteoarthritis. The stability of the protein will be probed through the Stability of Proteins by Rates of Oxidation (SPROX) protocol.
    Slides
    Reading:
    Take home final released, due 4/22/2013

  25. Wednesday, 4/10 LSRC A155
    Presenting: Ryan.
    Computational redesign of GtfD from the vancomycin biosynthetic pathway
    As multidrug resistant pathogens become increasingly common, the drug of last resort, vancomycin, is called upon with increasing frequency to treat otherwise incurable infections. As a result, some pathogens have developed resistance to even vancomycin, resulting in an increased need for productive drug development. It has been hypothesized that the glycosylation states of antibiotics such as vancomycin play an important role in recognition and effectiveness. In this study, we aim to change the substrate specificity of the glycosyltransferase GtfD from the vancomycin biosynthetic pathway to diversify vancomycin using computational structure-based protein design methods. Reading:
    Presenting: Adriana
    Reading:

  26. Monday, 4/15 LSRC A155
    Presenting: Yang.
    Algorithms for building protein ensembles and their applications to a multi-domain protein SpA-N Protein A is a crucial virulence factor in Staphylococcus aureus. The N-terminal half of Staphylococcal Protein A, named as SpA-N, has five IgG-binding domains. SpA-N has segmental flexibility because of four conserved short linkers between the five domains. The flexibility is probably necessary for SpA-N to effectively bind maximally five antibody molecules. Consequently, it is important to experimentally characterize the flexibility or the structure ensemble of SpA-N through efficient algorithms. The goal is to analyze current algorithms and find or design an algorithm for building an ensemble of this specific system.
    Reading:
    Presenting: Qing
    Molecular dynamics simulation predicts the unfolding order of Ankyrin repeat protein with coiled-coil insert Co-translational folding of nascent peptide chain on ribosome has a strong vectorial character, which to some extent can be mimicked by force-induced unfolding experiments such as single molecule force spectroscopy measurments via atomic force micoroscope (AFM). The process can be modeled by steered molecular dynamics (SMD) simulation. Recent studies in our group suggested that NI10C, a concensus ankyrin repeat protein consists of 10 Ankyrin repeats, might unfolding vectorially under external force applied by AFM tip. To further test this hypothesis, we engineered a new protein construct that contains a ~70 amino acids coiled-coil in the 8th Ankyrin repeat of NI10C and used a combination of AFM single molecule force spectroscopy and SMD simulation to exploit the unfolding sequence of the protein. Both results show that the new protein unfolds sequentially from C terminus to N terminus.
    Reading:

  27. Wednesday, 4/17 LSRC A155
    Presenting: Jae.
    Molecular Dynamic Simulation of P Loop Ejection in HipA
    Abstract:
    HipA (high persistence A protein) is one of the proteins known to affect persistence of bacteria. Persistence is bacteria avoiding antibiotics by becoming dormant during antibiotic treatment. Then, the dormant bacteria can revert back to the growth phase when the drugs are gone and continue to grow. HipA is believed to be involved in this bacteria's ability to switch back and forth between the dormant and the growth phase. HipA has two conformational states, active and inactive. Active HipA can induce the cells to go into a dormant state, and the conversion to the inactive conformation can allow the cells to revert back to the growth phase. This conversion between two states involves a loop ejection, but the mechanism of how such transition can occur is poorly understood. So this study aims to use molecular dynamic simulation to generate realistic trajectory of the loop ejection. Reading:
    TAKE HOME FINAL DUE 4/22/2013