whoscall 的 realtime monitoring 經驗分享
TRANSCRIPT
Architect @ Gogolook
How Realtime Monitoring
Works in…
★ Instant Caller Identification
Whoscall identifies background information of incoming unknown calls in seconds through tags reported by other users, Internet search results, and our comprehensive global database.
Instant Caller Identification
Incoming Call Dialogue
Incoming Call DialogueFraud Call
Business Corporation
Restaurant
★ Database with over 700 Million Phone Numbers
Whoscall boasts an online database with over 700 million phone numbers. The database of Whoscall covers yellow pages, spammers, telemarketers, costumer services...,etc. with numerous community tags contributed by users and comments based on real users’ experiences.
Database & Number Details
0287871XXX
1111 Job Bank1111 – Job bank of trust
No.1, Lane 35, An Ho Road, Taipei City, Taiwan
http://www.1111.com
3 of every 5 strangers’ calls
can be identified.
Over 500M phone calls are identified every month.
3,000 spammer numbers
are reported by Whoscall users
every month in Taiwan.
Number Identification
– 2015.03 – 2015.03
Market
United States
Brazil
Saudi Arabia
India
Thailand
Indonesia
Malaysia
Taiwan
Japan
South Korea
Hong Kong
SouthKorea
Taiwan HongKong
Japan India
Top 5 countries of Whoscall users
Join us in creating a contact network of trust
10
❶11
12
12
13
14
15
❷16
17
18
•
•
•
•
19
👎
•
•
•
•
19
•
•
•
•
👍👎
20
Consequence
Likelihood
Risk register
20
Consequence
Likelihood
service outage
app crash
Risk register
20
Consequence
Likelihood
service outage
app crash
db outage
Risk register
20
Consequence
Likelihood
service outage
app crash
db outage
malfunction
Risk register
20
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
malfunction
Risk register
20
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
21
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
short term detection &
recovery
21
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
long term diagnosis
short term detection &
recovery
22
API servers DB servers
22
Load balancer API servers DB servers
CDN
22
Load balancer API servers DB servers
CDN
Virtual Private Cloud
22
Load balancer API servers DB servers
23
VPC
CloudFront ELB API servers MongoDB
24
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
24
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Service levels: • availability • response time • error
Risk register
25
CloudFront
CloudWatchBlack Box
25
CloudFront
CloudWatchBlack Box
25
CloudFront
CloudWatchBlack Box
26
27
27
27
28
CloudFront
CloudWatchBlack BoxCrashlytics
28
CloudFront
CloudWatchBlack BoxCrashlytics
29
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
29
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
Service levels: • R/W lock • size • load • speed
30
VPC
CloudFront ELB API servers MongoDB
30
CloudWatch
VPC
CloudFront ELB API servers MongoDB
30
CloudWatch
VPC
CloudFront ELB API servers MongoDB
31
VPC
CloudFront ELB API servers MongoDB
31
VPC
CloudFront ELB API servers MongoDB
Cloud Manager
32
33
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
33
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
Diagnosis: • instance metrics • app metrics • resource
34
VPC
CloudFront ELB API servers MongoDB
34
CloudWatch
VPC
CloudFront ELB API servers MongoDB
34
CloudWatch
VPC
CloudFront ELB API servers MongoDB
35
35
memory space?
disk space?
36
36
36mnemonic?
37
CloudWatch
CloudFront ELB API servers MongoDB
StatsD for long-term metrics
37
CloudWatch
CloudFront ELB API servers MongoDB
StatsD for long-term metrics
37
CloudWatch
CloudFront ELB API servers MongoDB
StatsD
StatsD for long-term metrics
38
@ Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd
http://www.slideshare.net/suitingtseng/fluentd-49952996
For more details:
39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:
39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:
39http://www.oreilly.com/webops-perf/free/lightweight-systems.cspFor more details:
40
CloudWatch
CloudFront ELB API servers MongoDB
StatsD
Application-specific metrics • gauges • counters
• histograms • meters & timers
41
42
Metrics
42
http://jcconf.tw/2014/manage-servers-on-the-cloud-with-opensource-tools.html
Metrics
43
CloudWatch
CloudFront ELB API servers MongoDB
low-level metrics
profiling
43
CloudWatch
CloudFront ELB API servers MongoDB
low-level metrics
profiling
44
45
46
47
48
49
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
49
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk registerDiagnosis: • logging • patterns • drill down analysis
50
CloudFront ELB API servers MongoDB
CloudWatch
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
StatsD
50
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
StatsD
BigQuery
51
http://www.slideshare.net/tw_dsconf/elasticsearch-kibana
Aug 23, 2015
52
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
52
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
Diagnosis: • logging • context • aggregation
53
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
error logs
53
CloudFront ELB API servers MongoDB
CloudWatch
log in S3
error logs
54
55
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
56
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
StatsD
BigQuery
57
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
StatsD
BigQuery
58
Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd
http://www.slideshare.net/suitingtseng/fluentd-49952996
For more details:
58
Taipei.py — Feb 26, 2015Centralized logging and monitoring in Fluentd
http://www.slideshare.net/suitingtseng/fluentd-49952996
For more details:
We’ve built an
unified logging mechanism…
59
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
StatSD
BigQuery
59
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
StatSD
BigQuery
How about
unified monitoring alerts ?
60
•
•
•
•
60
•
•
•
•
60
61
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
BigQuery
61
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
BigQuery
62
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
BigQuery
63
64
64
65
66
66
67
•
•
•
•
68
•
•
•
•
👍👎
69
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
70
Consequence
Likelihood
service outage
app crash
db outage
poor performance
capacity shortage
internal errors malfunction
Risk register
long term diagnosis
short term detection &
recovery
71
CloudFront ELB API servers MongoDB
Cloud Manager
CloudWatch
log in S3
BigQuery
72
73