d00922025 黃任鋒 r00922102 張庭耀 r00922156 陳子筠 r99922158 蘇宏麒

Post on 13-Jan-2016

59 Views

Category:

Documents

5 Downloads

Preview:

Click to see full reader

DESCRIPTION

Protein multiple sequence alignment by hybrid bio-inspired algorithms Vincenzo Cutello, Giuseppe Nicosia*, Mario Pavone and Igor Prizzi Nucleic Acids Research, 2011. D00922025 黃任鋒 R00922102 張庭耀 R00922156 陳子筠 R99922158 蘇宏麒. 1. Outline. Introduction & background IMSA - PowerPoint PPT Presentation

TRANSCRIPT

Protein multiple sequence alignment by hybrid bio-inspired algorithmsVincenzo Cutello, Giuseppe Nicosia*, Mario Pavone and Igor PrizziNucleic Acids Research, 2011

D00922025 黃任鋒R00922102 張庭耀R00922156 陳子筠R99922158 蘇宏麒

1

Outline

•Introduction & background

•IMSA

•Cloning and hypermutation operators

•Results

•Conclusion

Introduction and BackgroundD00922025 黃任鋒

3

About this paper

4

Problem of MSA

5

Methods for MSA

6

Progressive alignments

7

Exact algorithms

8

Survey of MSA

9

Outline

•Introduction & background

•IMSA

•Cloning and hypermutation operators

•Results

•Conclusion

Immunological Multiple Sequence Alignment(IMSA)

R00922102 張庭耀

11

IMSA

• Two different strategies to create the initial population

• New hypermutation operators

- solving protein MSA that insert or remove gaps

• Gap columns, which have been matched, are moved to the end of the sequence

• The remaining elements(i.e. amino acids) and existing gaps are shifted into the freed space

12

IMSA

•Considers antigens (Ags) and B cells

- Ag is a given MSA instance, i.e. the protein sequences to align

- B cells are a population of alignments that have solved(or approximated) the initial problem

13

14

15

Initial population strategies

16

Random_initialization

17

Random_initialization

18

CLUSTALW-seeding

19

Outline

•Introduction & background

•IMSA

•Cloning and hypermutation operators

•Results

•Conclusion

IMSA-Cloning and hypermutation operatorsR00922156 陳子筠

21

Cloning and hypermutation operators

22

•Represented by “Static cloning operators”

•Clones B cells dup times

•P(clo) of Nc = d * dup B cells, d is population size

Cloning and hypermutation operators

23

InsGap

24

25

InsGap P(gap)

26

RemGap P(gap)

BlockShuffling operator

•Select randomly start point in a sequence

•BlockMove

•BlockSplitHor

•BlockSplitVer

27

28

BlockMove P(block)

29

BlockSplitHor P(block)

30

BlockSplitVer P(block)

STRIP_GAPS

31

Aging operator

32

•Eliminates old B cells in populations P(t), P(gap) and P(block)

•The generation number of B cell is τB

•New population P(t+1) of d B cells selected best survivors by (μ+λ) - selection

33

Outline

•Introduction & background

•IMSA

•Cloning and hypermutation operators

•Results

•Conclusion

ResultsR99922158 蘇宏麒

35

Classical Benchmark

•BAliBASE version 1.0, 2.0 and 3.0

- A benchmark alignment database .

- The evaluation of multiple sequence alignment.

36

BAliBASE version 1.0

•141 reference alignments

•5 reference sets

37

BAliBASE version 1.0, cont.

• Reference 1: equi-distant sequences with various levels of conservation

• Reference 2: family aligned with a highly divergent “orphan” sequence

• Reference 3: subgroups with < 25% residue identity between groups

• Reference 4: sequences with N/C-terminal extensions

• Reference 5: internal insertion

38

BAliBASE version 2.0

•Include all alignments in version 1.0

•Alignments are verified and corrected

39

BAliBASE version 3.0

•same as version 2.0

•contains 218 alignments

40

IMSA - reference 1 lad2

41

IMSA - reference 1 laym3

42

IMSA - reference 1 1hfh

43

IMSA - reference 1 2mhr

44

IMSA - Reference 3 luky

45

IMSA - Reference 5 1qpg

46

IMSA - BAliBASE 1.0

47

IMSA vs CLUSTALW-seeding - BAliBASE 1.0

48

IMSA - BAliBASE 2.0

49

IMSA vs CLUSTALW-seeding - BAliBASE 2.0

50

IMSA vs AIS - BAliBASE 2.0

51

IMSA vs ClonAlign - BAliBASE 2.0

52

IMSA vs COBALT, PROBCONS, PCMA, MUSCLE, CLUSTALW - BAliBASE 3.0

53

IMSA - BAliBASE 3.0 - SP

54

IMSA - BAliBASE 3.0 - CS

55

Running time - BAliBASE 3.0

56

Outline

•Introduction & background

•IMSA

•Cloning and hypermutation operators

•Results

•Conclusion

Final Remarks

•Clonal Selection Algorithm

•IMSA

•IMSA

•CLUSTALW-seeding

•Two specific ad-hoc mutation operators

•Generating more than a single suboptimal alignment, for every MSA instance.

58

Final Remarks, cont.

•BAliBASE 1.0

•IMSA is superior to PRRP, CLUSTALX, SAGA, DIALIGN, PIMA, MULTIALIGN and PILEUP 8.

59

Final Remarks, cont.

•BAliBASE 2.0

•high SP, low CS

•future work - improvement of the CS score

60

Final Remarks, cont.

•BAliBASE 2.0

•IMSA shows best performance, and hence best alignments, than both ClonAlign and AIS.

61

Final Remarks, cont.

•BAliBASE 3.0 - new testbed

•compare with state-of-the-art alignment algorithms, IMSA also shows good alignments.

62

Reference

•http://bips.u-strasbg.fr/fr/Products/Databases/BAliBASE2/

63

Thank you .D00922025 黃任鋒R00922102 張庭耀R00922156 陳子筠R99922158 蘇宏麒

64

top related