Transcript
Page 1: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Lessons on Protein Structurefrom Lattice Model

HC Lee 李弘謙

Nanjing University Nanjing, China 2002 May 22 – 25

Page 2: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

What is a protein?

• Large molecule:

chain of amino acids

• Several tens to thousands residues

• Folds to specific shape

• Biological machines

Page 3: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

DNA & Gene

Now we know, for higher life forms: one gene, many proteins

Page 4: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

轉錄與翻譯

Gene to Protein

Page 5: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

What do proteins do?

• Links Genotype & Phenotype 基因型與現象型• Structural and Functional 結構與功能

– Structural• blood, muscle, bone, etc.

– Functional• catalytic (enzyme), metabolic, neural, reproductive

催化、新陳代謝、神經、 複製

Aberrant gene > malfunction protein > disease

Page 6: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Protein Conformation

Page 7: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Alpha helix

Beta sheets

HIV reverse transcriptase 反轉錄脢

Page 8: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Understanding protein folding

Page 9: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Driving Force for Protein Folding

-Most important is interaction of residues with water – hydrophobic and hydrophilic

Page 10: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Miyazawa-Jernigan Statistical Interaction

Page 11: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Li-Tang-Wingreen’s representation of MJ Matrix

one-bodytwo-body

Page 12: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Theoretical analysis [Wang & Lee, PRL 84 (2000)]

Page 13: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Fit to one (a) and two-body (b) terms

M

J-m

atri

x

Theory

Page 14: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Compare with MJ-matrix

Correct to first order; dominatedone-body term - hydrophobicity

Page 15: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Lattice Model

-Simple way to learn something about a very complex subject

Page 16: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Lattice model

• Represent space (or, in field theory, space-time) by a discrete lattice.

• Represent a structure by a path on the lattice.

• A peptide is a string of residues.

• A peptide whose residues occupy a path is in a state, or have a conformation.

• Residues may interact with each other according to relative distance. Or,

• In mean-field model, residue interacts only with lattice sites.

Page 17: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Putting a binary peptide on 2D lattice

Random coil and compact path

Binary rep’n of Peptide:0101011010010110010110010

Page 18: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

• The most important interaction for protein folding is residue with water: residues are hydrophobic ( 厭水 ) or hydrophilic ( 親水 ).

• In real protein in native conformation, hydrophobic residues like to be buried, hydrophilic residues like to be exposed to water.

• Simplest model: divide residues into hydrophobic and hydrophilic, structure into core and surface sites.

• Both peptide and structure are binary sequences.

Mean-Field HP Model

Page 19: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Structure-path on a 2D lattice

Structure-path on a 2D lattice

Pay attention toonly whether path is on a core (1) or a surface (0) site

Structure has a binary representation: 001100110000110000110011000011111100 (from Li et al. PRL 79 (1997) 765-768)

Page 20: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Designability of Structures

-Very, very few structures are good for proteins

Page 21: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Structure space >> observed structures

Page 22: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Protein Designability

Page 23: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

The LTW model

Ground state of peptide p is structure s closest to it in n-dimensional hyperspace. All peptides in Voronoi volume of s has s as ground state.

Page 24: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

The Hamiltonian H = ½ (p – s)**2 is a mapping of the set of peptides P to the set of sructures S that partitions P into equivalent classes labeled by s in S. Target of each class is the ground state/conformation of the class.

Designability of a structure is the number of peptides in the class mapped to that structure

Page 25: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Vonoroi volume Voronoi volume

In hyperspace, all peptide sequences within the Voronoi volume of a structure is closest to that structure (from Li et al. PRL (1997)).

Page 26: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

No. of structures vs designability

Li, Tang and Wingreen, PRL (1997)

Very few structures have high designability

Designability

Num

ber

of s

truc

ture

s

Page 27: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

• Shortest possible Hamming distance btw two paths proportional to difference in switchback numbers (n10)

• Few paths have high n10

• Path with high n10 has large Voronoi volume, hence high designability

Paths with high switchback numbers have high designability

[Shih et al. & HCL, PRL 84 (2000)]

Page 28: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Hi switchback > hi design’ty

Page 29: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Distribution of Hamming dist.

Page 30: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Log distrib’n vs switchback no.

Designability vs n10; (a) 6x6 (b) 21-site triangular

Page 31: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Foldability of Peptides

-Vast majority of peptides do not fold

Page 32: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Alpha helices like paths with high switchback numbers

• Conformation degeneracy – disfavor peptides w/ long strings of identical/similar residues

• Hence proteins rarely have long strings of contiguous hydrophobic or hydrophilic residues

• Alternating short stretches of hydrophobic and hydrophilic residues yields structurally non-degenerate and robust conformations

• 0011 switchback motif simulate alpha helix on the surface

• Empirically most alpha helices on surface

Page 33: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Compare with real proteins

• Compare model high designability peptides with binarized (by hydrphobicity) protein sequences in PDB– Represent peptide by frequency of occurrenc

e of set of all binary words of fixed length l=2k

– Has 22k such words, put frequencies on a 2k x 2k lattce

[Shih et al. & HCL, PRE 65 (2002)]

Page 34: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

PDBAlpha-HP

HP-LS

PDBAll - PDBAlpha

PDBAll-HP

Oligomer length

Ove

rlap

of

bina

ry s

eque

nce

Highly foldable peptides in HP-modelresemble alpha-helices in real proteins

[Shih et al. PRL 84 (2000)]

Page 35: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

In HP model: peptide that folds into high designability conformations correspond to peptides that fold to alph

a helices in real proteins

Page 36: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Many models give designabilitybut not all are correct

• Any Hamiltonian (H) is a mapping of peptide space (P) onto conformation space (C)

• For coarse grained C, H partitions P into equivalent classes, each class corresponding to a point in C

• Designability results from a highly skewed distribution of the SIZES of the classes

• Example. The LS (Large-Small) model: structure dominated by steric effect; small residues inside, large residues outside. Almost same math as HP model; has designability but wrong physics.

Page 37: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

PDBAlpha-LS

HP-LS

PDBAll - PDBAlpha

PDBAll-LS

Oligomer length

Ove

rlap

of

bina

ry s

eque

nce

Highly foldable peptides in LS-modeldoes notresemble alpha-helices in real proteins

[Shih et al. PRL 84 (2000)]

Page 38: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Unlike hydrophobicitySteric effect does not play a domina

nt rolein the determination of native struct

ure

Page 39: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Folding Funneland

Free-energy Barrier

-Why is folding so fast yet so slow ?

Page 40: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Folding funnel

Folding Funnel

Page 41: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Folding funnel (picture)

http://www.npaci.edu/envision/v15.4/proteinfolding.html

Page 42: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Free energy and entropy

Free Energy, Entropy and Monte Carlo

Page 43: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Free-energy barrier

Free-energy barrier [Guan, Su, Shih & Lee (2000)]

(a) Biding energy increase with compactness(b) Entropy lost rapidly as bindin

g energy increases(c) Free-energy barrier formed b

y competition btw energy gain and entropy lost

Log

(S)

|E/Enative|

No.

of

cont

acts (c)

(b)

G =

(E

– T

S)/

Ena

tive

|E/Enative| |E/Enative|

low T

high T

annealing

barrier(a)

Page 44: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Getting over the barrier takes all the folding time

Page 45: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Summary of lessons

• Average hydrophobic/hydrophlic property of residues can be understood by simple physics.

• Lattice model useful for examining coarse-grain phenomena.

• Long folding time caused by need to surmount free-energy barrier formed by rapid lost of entropy.

• Designability of structure is a direct consequence of hydrophobic/hydrophlic dichotomy of residues.

Page 46: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

• Very few structures are highly designable; those that are have large switchback numbers.

• Very few peptides are foldable; many of those that are alternate rapidly between hydrophobic and hydrophlic residues.

• Highly foldable peptides folded into high designability structures form robust proteins.

• They fold easily into alpha-helices and to a lesser extent to beta-sheets; hence alpha-helices are formed very, very early in folding process, then beta-sheets.

Summary of lessons (cont’d)

Page 47: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Molecular Dynamics - atomistic description of protein

folding

-takes one giga-flop PC to run one-million days to fold a medium small protein

Page 48: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

Massively Distributive Computation

• Molecular dynamics. – Atomistic level simulation needed to understand protein f

olding and function relevant to biology and drug design

• Annealing time very long– Boltzmann probability:

one machine x 1 M days = 1 M machines x one day

• Starting a program of massively distributive computation - use screen saver program for simulation

• of Vijay Pande, Stanford

Page 49: Lessons on Protein Structure from Lattice Model HC Lee 李弘謙 Nanjing University Nanjing, China 2002 May 22 – 25

The End謝謝大家


Top Related