dyn roadshow: cory von wallenstein & eric rosenberry talk scalability & availability

35
Scalability and Availability in the Real World Portland, OR – October 1, 2013 Cory von Wallenstein Chief Technologist, Dyn Inc. [email protected] @cvwdyn Eric Rosenberry Principal Infrastructure Architect, iovation Inc. [email protected] @eprosenx

Post on 21-Oct-2014

846 views

Category:

Technology


0 download

DESCRIPTION

As part of a Dyn Roadshow event in Portland, Oregon, Dyn Chief Technologist Cory von Wallenstein & Iovation Principal Infrastructure Architect Erick Rosenberry did a session on scalability and availability in the real world.

TRANSCRIPT

Page 1: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Scalability and Availability

in the Real WorldPortland, OR – October 1, 2013

Cory von WallensteinChief Technologist,

Dyn [email protected]

@cvwdyn

Eric RosenberryPrincipal Infrastructure Architect,

iovation [email protected]

@eprosenx

Page 2: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 2 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

What do we care about?

• Achieving high(er) availability

• Flexibility to change infrastructure without

downtime

• Ability to expand infrastructure beyond current 4

walls

• Disaster resilience

• Performance

Page 3: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 3 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

How can we do it? Go active/active.

• The Iovation Technical Story• Going from one datacenter/cloud region to five• Application architectures and data replication

• How Dyn Helped• Global load balancing & traffic management via DNS

Page 4: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Scalability and Availability Fighting Fraud Non-Stop

Eric Rosenberry

Principal Infrastructure Architect

[email protected]

@eprosenx

Page 5: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

5© 2012 iovation Inc.

What iovation Does

Recognize devices connecting to websites

Understands how these devices are related to each other

Block devices that are known to be associated with fraud or abuse and flag anomalies

Page 6: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

6© 2012 iovation Inc.

Driving Factors• Successful Product

But ran into capacity limits scaling vertically Vertically scaling became cost prohibitive Unable to add features due to compute capacity limits

• Availability As we entered new markets, customers demanded higher

levels of availability Taking monthly downtimes for maintenance and code

upgrades no longer possible

• Disaster Preparedness We were operating out of a single datacenter which

represented risk to the business

Page 7: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

7© 2012 iovation Inc.

Design Criteria• Scalability

System must be able to scale horizontally by adding more nodes

• Availability Code deployments do not require taking any real time

services down Real time services must continue functioning through the

loss of a datacenter PLUS the simultaneous loss of a single server in another datacenter

Serving datacenters must be geographically disparate

• Financial Must be based on commodity x86 hardware, running on

open source software, without depending on SAN’s

Page 8: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

8© 2012 iovation Inc.

Service Oriented Architecture

Consumer Facing

Subscriber Facing

Internal Service

Real-time

Asynchronous

Web Service APIs

Device Recognition

Service

Association & Reputation

Service

Business Rules Service

Admin Console UI Reporting

Message Bus

Web Device Print Distribution

GeoService

VelocityService

Analytics

WWW Internet

iovationsubscribers

consumers

Page 9: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

9© 2012 iovation Inc.

Datacenter Types• Primary Data Processing

Real-time customer requests are sent to these facilities Every piece of the real-time system is N+1 redundant

• Data Storage Each datacenter that is designated for data storage has one

copy of all key data elements Storage nodes do not have storage level redundancy (the

redundancy is across-datacenters)

• Content Delivery These datacenters deliver content to our customers end users

computers Are N+1 redundant such that individual failures do not cause

the loss of the entire node

Page 10: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

10© 2012 iovation Inc.

Network Design

BB1

AMS

MIA

10g

20g

10gInternet

consumers

Content Delivery

#3

Data StoragePDX

MIAData StorageSEA

iovationsubscribers

Data Storage

Content Delivery

Data Processing

Subscriber Queries

Private Network

Content Downloads

Page 11: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

11© 2012 iovation Inc.

Portland to Seattle

http://www.zayo.com/sites/default/files/images/Zayo-US-Network-EXTERNAL-11-1-2012.kmz

Page 12: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 12 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

How does Iovation use Dyn?

•API Interface• Active/Active between two sites

•Admin Console• Active/Active between two sites

•Content Distribution• GSLB among four sites

Page 13: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 13 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

• DNS cache poisoning, DNSSEC and general DNS security

Page 14: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 14 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Anatomy of a HTTP connection

Page 15: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 15 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

How does DNS loadbalancing work?

Page 16: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 16 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Simple active/passive example

• Primary location assumes 100% traffic

• In event of disaster, swing 100% of traffic to

a standby location• Could be a “we’ll be back soon” or “status” page• Could be a backup copy of your app

• We call this Active Failover

Page 17: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 17 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 18: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 18 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 19: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 19 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 20: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 20 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Active/Active Load Balancing

(Global Server Load Balancing, GSLB)

(Hot/Hot Load Balancing)

(High Availability Load Balancing)

Page 21: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 21 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Traffic management with

Dyn Traffic Director

Page 22: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 22 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 23: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 23 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Dyn Traffic Director

• Fast Anycast network enables low TTLs

• Monitor endpoints for health

• Globally load balance among 7 regions

• Use Anycast to gauge “where is the user?”

Page 24: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 24 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 25: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 25 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 26: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 26 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Favor performance over network topology?

• Real-time monitoring of endpoints

• Always serve the fastest endpoint for each

user, regardless of network topology

• That’s real-time traffic management with

Dyn’s Traffic Director

Page 27: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 27 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 28: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 28 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 29: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 29 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 30: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 30 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Favor geopolitical boundaries above all?

• Per query lookup on source address

• Geopolitical IP mapping database

• State by state and country by country

granularity

• That’s geo traffic management with Dyn’s

Traffic Director

Page 31: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 31 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 32: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 32 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Traffic Management Recap

• Active/Passive with health checks• Dyn Active Failover

• Active/Active with health checks• 7 global regions by network topology -> Dyn Traffic

Director• Add in real-time latency measurement -> Dyn Traffic

Director with real-time traffic management• Add in geopolitical granularity -> Dyn Traffic Director

with geo traffic management

Page 33: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 33 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Page 34: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 34 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx

Dyn Delivers Internet Performance

•Traffic management and managed

DNS

•Message management and email

delivery

•Remote access and domain services

Page 35: Dyn Roadshow: Cory von Wallenstein & Eric Rosenberry talk scalability & availability

Pg. 35 Presentation Title (edit from Slide Master) @twitterhandle

Scalability and Availabilityin the Real World

Cory von WallensteinChief Technologist,

Dyn [email protected]

@cvwdyn

Eric RosenberryPrincipal Infrastructure Architect,

iovation [email protected]

@eprosenx

Thank You!