whoscall 的 realtime monitoring 經驗分享

121
Architect @ Gogolook How Realtime Monitoring Works in…

Upload: william-yeh

Post on 08-Jan-2017

14.039 views

Category:

Software


3 download

TRANSCRIPT

Page 1: Whoscall 的 Realtime Monitoring 經驗分享

Architect @ Gogolook

How Realtime Monitoring

Works in…

Page 2: Whoscall 的 Realtime Monitoring 經驗分享
Page 3: Whoscall 的 Realtime Monitoring 經驗分享

★ Instant Caller Identification

Whoscall identifies background information of incoming unknown calls in seconds through tags reported by other users, Internet search results, and our comprehensive global database.

Instant Caller Identification

Page 4: Whoscall 的 Realtime Monitoring 經驗分享

Incoming Call Dialogue

Incoming Call DialogueFraud Call

Business Corporation

Restaurant

Page 5: Whoscall 的 Realtime Monitoring 經驗分享

★ Database with over 700 Million Phone Numbers

Whoscall boasts an online database with over 700 million phone numbers. The database of Whoscall covers yellow pages, spammers, telemarketers, costumer services...,etc. with numerous community tags contributed by users and comments based on real users’ experiences.

Database & Number Details

0287871XXX

1111 Job Bank1111 – Job bank of trust

No.1, Lane 35, An Ho Road, Taipei City, Taiwan

http://www.1111.com

Page 6: Whoscall 的 Realtime Monitoring 經驗分享

3 of every 5 strangers’ calls

can be identified.

Over 500M phone calls are identified every month.

3,000 spammer numbers

are reported by Whoscall users

every month in Taiwan.

Number Identification

– 2015.03 – 2015.03

Page 7: Whoscall 的 Realtime Monitoring 經驗分享

Market

United States

Brazil

Saudi Arabia

India

Thailand

Indonesia

Malaysia

Taiwan

Japan

South Korea

Hong Kong

SouthKorea

Taiwan HongKong

Japan India

Top 5 countries of Whoscall users

Page 8: Whoscall 的 Realtime Monitoring 經驗分享

Join us in creating a contact network of trust

Page 9: Whoscall 的 Realtime Monitoring 經驗分享

10

Page 10: Whoscall 的 Realtime Monitoring 經驗分享

❶11

Page 11: Whoscall 的 Realtime Monitoring 經驗分享

12

Page 12: Whoscall 的 Realtime Monitoring 經驗分享

12

Page 13: Whoscall 的 Realtime Monitoring 經驗分享

13

Page 14: Whoscall 的 Realtime Monitoring 經驗分享

14

Page 15: Whoscall 的 Realtime Monitoring 經驗分享

15

Page 16: Whoscall 的 Realtime Monitoring 經驗分享

❷16

Page 17: Whoscall 的 Realtime Monitoring 經驗分享

17

Page 18: Whoscall 的 Realtime Monitoring 經驗分享

18

Page 19: Whoscall 的 Realtime Monitoring 經驗分享

19

👎

Page 20: Whoscall 的 Realtime Monitoring 經驗分享

19

👍👎

Page 21: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

Risk register

Page 22: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

service outage

app crash

Risk register

Page 23: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

service outage

app crash

db outage

Risk register

Page 24: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

service outage

app crash

db outage

malfunction

Risk register

Page 25: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

malfunction

Risk register

Page 26: Whoscall 的 Realtime Monitoring 經驗分享

20

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 27: Whoscall 的 Realtime Monitoring 經驗分享

21

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

short term detection &

recovery

Page 28: Whoscall 的 Realtime Monitoring 經驗分享

21

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

long term diagnosis

short term detection &

recovery

Page 29: Whoscall 的 Realtime Monitoring 經驗分享

22

API servers DB servers

Page 30: Whoscall 的 Realtime Monitoring 經驗分享

22

Load balancer API servers DB servers

Page 31: Whoscall 的 Realtime Monitoring 經驗分享

CDN

22

Load balancer API servers DB servers

Page 32: Whoscall 的 Realtime Monitoring 經驗分享

CDN

Virtual Private Cloud

22

Load balancer API servers DB servers

Page 33: Whoscall 的 Realtime Monitoring 經驗分享

23

VPC

CloudFront ELB API servers MongoDB

Page 34: Whoscall 的 Realtime Monitoring 經驗分享

24

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 35: Whoscall 的 Realtime Monitoring 經驗分享

24

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Service levels: • availability • response time • error

Risk register

Page 36: Whoscall 的 Realtime Monitoring 經驗分享

25

CloudFront

CloudWatchBlack Box

Page 37: Whoscall 的 Realtime Monitoring 經驗分享

25

CloudFront

CloudWatchBlack Box

Page 38: Whoscall 的 Realtime Monitoring 經驗分享

25

CloudFront

CloudWatchBlack Box

Page 39: Whoscall 的 Realtime Monitoring 經驗分享

26

Page 40: Whoscall 的 Realtime Monitoring 經驗分享

27

Page 41: Whoscall 的 Realtime Monitoring 經驗分享

27

Page 42: Whoscall 的 Realtime Monitoring 經驗分享

27

Page 43: Whoscall 的 Realtime Monitoring 經驗分享

28

CloudFront

CloudWatchBlack BoxCrashlytics

Page 44: Whoscall 的 Realtime Monitoring 經驗分享

28

CloudFront

CloudWatchBlack BoxCrashlytics

Page 45: Whoscall 的 Realtime Monitoring 經驗分享

29

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 46: Whoscall 的 Realtime Monitoring 經驗分享

29

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Service levels: • R/W lock • size • load • speed

Page 47: Whoscall 的 Realtime Monitoring 經驗分享

30

VPC

CloudFront ELB API servers MongoDB

Page 48: Whoscall 的 Realtime Monitoring 經驗分享

30

CloudWatch

VPC

CloudFront ELB API servers MongoDB

Page 49: Whoscall 的 Realtime Monitoring 經驗分享

30

CloudWatch

VPC

CloudFront ELB API servers MongoDB

Page 50: Whoscall 的 Realtime Monitoring 經驗分享

31

VPC

CloudFront ELB API servers MongoDB

Page 51: Whoscall 的 Realtime Monitoring 經驗分享

31

VPC

CloudFront ELB API servers MongoDB

Cloud Manager

Page 52: Whoscall 的 Realtime Monitoring 經驗分享

32

Page 53: Whoscall 的 Realtime Monitoring 經驗分享

33

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 54: Whoscall 的 Realtime Monitoring 經驗分享

33

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Diagnosis: • instance metrics • app metrics • resource

Page 55: Whoscall 的 Realtime Monitoring 經驗分享

34

VPC

CloudFront ELB API servers MongoDB

Page 56: Whoscall 的 Realtime Monitoring 經驗分享

34

CloudWatch

VPC

CloudFront ELB API servers MongoDB

Page 57: Whoscall 的 Realtime Monitoring 經驗分享

34

CloudWatch

VPC

CloudFront ELB API servers MongoDB

Page 58: Whoscall 的 Realtime Monitoring 經驗分享

35

Page 59: Whoscall 的 Realtime Monitoring 經驗分享

35

memory space?

disk space?

Page 60: Whoscall 的 Realtime Monitoring 經驗分享

36

Page 61: Whoscall 的 Realtime Monitoring 經驗分享

36

Page 62: Whoscall 的 Realtime Monitoring 經驗分享

36mnemonic?

Page 63: Whoscall 的 Realtime Monitoring 經驗分享

37

CloudWatch

CloudFront ELB API servers MongoDB

StatsD for long-term metrics

Page 64: Whoscall 的 Realtime Monitoring 經驗分享

37

CloudWatch

CloudFront ELB API servers MongoDB

StatsD for long-term metrics

Page 65: Whoscall 的 Realtime Monitoring 經驗分享

37

CloudWatch

CloudFront ELB API servers MongoDB

StatsD

StatsD for long-term metrics

Page 66: Whoscall 的 Realtime Monitoring 經驗分享

38

@ Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd

http://www.slideshare.net/suitingtseng/fluentd-49952996

For more details:

Page 67: Whoscall 的 Realtime Monitoring 經驗分享

39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:

Page 68: Whoscall 的 Realtime Monitoring 經驗分享

39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:

Page 69: Whoscall 的 Realtime Monitoring 經驗分享

39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:

Page 70: Whoscall 的 Realtime Monitoring 經驗分享

40

CloudWatch

CloudFront ELB API servers MongoDB

StatsD

Application-specific metrics • gauges • counters

• histograms • meters & timers

Page 71: Whoscall 的 Realtime Monitoring 經驗分享

41

Page 72: Whoscall 的 Realtime Monitoring 經驗分享

42

Metrics

Page 73: Whoscall 的 Realtime Monitoring 經驗分享

42

http://jcconf.tw/2014/manage-servers-on-the-cloud-with-opensource-tools.html

Metrics

Page 74: Whoscall 的 Realtime Monitoring 經驗分享

43

CloudWatch

CloudFront ELB API servers MongoDB

low-level metrics

profiling

Page 75: Whoscall 的 Realtime Monitoring 經驗分享

43

CloudWatch

CloudFront ELB API servers MongoDB

low-level metrics

profiling

Page 76: Whoscall 的 Realtime Monitoring 經驗分享

44

Page 77: Whoscall 的 Realtime Monitoring 經驗分享

45

Page 78: Whoscall 的 Realtime Monitoring 經驗分享

46

Page 79: Whoscall 的 Realtime Monitoring 經驗分享

47

Page 80: Whoscall 的 Realtime Monitoring 經驗分享

48

Page 81: Whoscall 的 Realtime Monitoring 經驗分享

49

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 82: Whoscall 的 Realtime Monitoring 經驗分享

49

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk registerDiagnosis: • logging • patterns • drill down analysis

Page 83: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

Page 84: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

Page 85: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

Page 86: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

Page 87: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

Page 88: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

StatsD

Page 89: Whoscall 的 Realtime Monitoring 經驗分享

50

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

StatsD

BigQuery

Page 90: Whoscall 的 Realtime Monitoring 經驗分享

51

http://www.slideshare.net/tw_dsconf/elasticsearch-kibana

Aug 23, 2015

Page 91: Whoscall 的 Realtime Monitoring 經驗分享

52

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 92: Whoscall 的 Realtime Monitoring 經驗分享

52

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Diagnosis: • logging • context • aggregation

Page 93: Whoscall 的 Realtime Monitoring 經驗分享

53

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

error logs

Page 94: Whoscall 的 Realtime Monitoring 經驗分享

53

CloudFront ELB API servers MongoDB

CloudWatch

log in S3

error logs

Page 95: Whoscall 的 Realtime Monitoring 經驗分享

54

Page 96: Whoscall 的 Realtime Monitoring 經驗分享

55

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 97: Whoscall 的 Realtime Monitoring 經驗分享

56

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

StatsD

BigQuery

Page 98: Whoscall 的 Realtime Monitoring 經驗分享

57

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

StatsD

BigQuery

Page 99: Whoscall 的 Realtime Monitoring 經驗分享

58

Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd

http://www.slideshare.net/suitingtseng/fluentd-49952996

For more details:

Page 100: Whoscall 的 Realtime Monitoring 經驗分享

58

Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd

http://www.slideshare.net/suitingtseng/fluentd-49952996

For more details:

We’ve built an

unified logging mechanism…

Page 101: Whoscall 的 Realtime Monitoring 經驗分享

59

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

StatSD

BigQuery

Page 102: Whoscall 的 Realtime Monitoring 經驗分享

59

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

StatSD

BigQuery

How about

unified monitoring alerts ?

Page 103: Whoscall 的 Realtime Monitoring 經驗分享

60

Page 104: Whoscall 的 Realtime Monitoring 經驗分享

60

Page 105: Whoscall 的 Realtime Monitoring 經驗分享

60

Page 106: Whoscall 的 Realtime Monitoring 經驗分享

61

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

BigQuery

Page 107: Whoscall 的 Realtime Monitoring 經驗分享

61

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

BigQuery

Page 108: Whoscall 的 Realtime Monitoring 經驗分享

62

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

BigQuery

Page 109: Whoscall 的 Realtime Monitoring 經驗分享

63

Page 110: Whoscall 的 Realtime Monitoring 經驗分享

64

Page 111: Whoscall 的 Realtime Monitoring 經驗分享

64

Page 112: Whoscall 的 Realtime Monitoring 經驗分享

65

Page 113: Whoscall 的 Realtime Monitoring 經驗分享

66

Page 114: Whoscall 的 Realtime Monitoring 經驗分享

66

Page 115: Whoscall 的 Realtime Monitoring 經驗分享

67

Page 116: Whoscall 的 Realtime Monitoring 經驗分享

68

👍👎

Page 117: Whoscall 的 Realtime Monitoring 經驗分享

69

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

Page 118: Whoscall 的 Realtime Monitoring 經驗分享

70

Consequence

Likelihood

service outage

app crash

db outage

poor performance

capacity shortage

internal errors malfunction

Risk register

long term diagnosis

short term detection &

recovery

Page 119: Whoscall 的 Realtime Monitoring 經驗分享

71

CloudFront ELB API servers MongoDB

Cloud Manager

CloudWatch

log in S3

BigQuery

Page 120: Whoscall 的 Realtime Monitoring 經驗分享

72

Page 121: Whoscall 的 Realtime Monitoring 經驗分享

73