dyn roadshow: cory von wallenstein & eric rosenberry talk scalability & availability
Post on 21-Oct-2014
846 views
DESCRIPTION
As part of a Dyn Roadshow event in Portland, Oregon, Dyn Chief Technologist Cory von Wallenstein & Iovation Principal Infrastructure Architect Erick Rosenberry did a session on scalability and availability in the real world.TRANSCRIPT
Scalability and Availability
in the Real WorldPortland, OR – October 1, 2013
Cory von WallensteinChief Technologist,
@cvwdyn
Eric RosenberryPrincipal Infrastructure Architect,
iovation [email protected]
@eprosenx
Pg. 2 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
What do we care about?
• Achieving high(er) availability
• Flexibility to change infrastructure without
downtime
• Ability to expand infrastructure beyond current 4
walls
• Disaster resilience
• Performance
Pg. 3 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
How can we do it? Go active/active.
• The Iovation Technical Story• Going from one datacenter/cloud region to five• Application architectures and data replication
• How Dyn Helped• Global load balancing & traffic management via DNS
Scalability and Availability Fighting Fraud Non-Stop
Eric Rosenberry
Principal Infrastructure Architect
@eprosenx
5© 2012 iovation Inc.
What iovation Does
Recognize devices connecting to websites
Understands how these devices are related to each other
Block devices that are known to be associated with fraud or abuse and flag anomalies
6© 2012 iovation Inc.
Driving Factors• Successful Product
But ran into capacity limits scaling vertically Vertically scaling became cost prohibitive Unable to add features due to compute capacity limits
• Availability As we entered new markets, customers demanded higher
levels of availability Taking monthly downtimes for maintenance and code
upgrades no longer possible
• Disaster Preparedness We were operating out of a single datacenter which
represented risk to the business
7© 2012 iovation Inc.
Design Criteria• Scalability
System must be able to scale horizontally by adding more nodes
• Availability Code deployments do not require taking any real time
services down Real time services must continue functioning through the
loss of a datacenter PLUS the simultaneous loss of a single server in another datacenter
Serving datacenters must be geographically disparate
• Financial Must be based on commodity x86 hardware, running on
open source software, without depending on SAN’s
8© 2012 iovation Inc.
Service Oriented Architecture
Consumer Facing
Subscriber Facing
Internal Service
Real-time
Asynchronous
Web Service APIs
Device Recognition
Service
Association & Reputation
Service
Business Rules Service
Admin Console UI Reporting
Message Bus
Web Device Print Distribution
GeoService
VelocityService
Analytics
WWW Internet
iovationsubscribers
consumers
9© 2012 iovation Inc.
Datacenter Types• Primary Data Processing
Real-time customer requests are sent to these facilities Every piece of the real-time system is N+1 redundant
• Data Storage Each datacenter that is designated for data storage has one
copy of all key data elements Storage nodes do not have storage level redundancy (the
redundancy is across-datacenters)
• Content Delivery These datacenters deliver content to our customers end users
computers Are N+1 redundant such that individual failures do not cause
the loss of the entire node
10© 2012 iovation Inc.
Network Design
BB1
AMS
MIA
10g
20g
10gInternet
consumers
Content Delivery
#3
Data StoragePDX
MIAData StorageSEA
iovationsubscribers
Data Storage
Content Delivery
Data Processing
Subscriber Queries
Private Network
Content Downloads
11© 2012 iovation Inc.
Portland to Seattle
http://www.zayo.com/sites/default/files/images/Zayo-US-Network-EXTERNAL-11-1-2012.kmz
Pg. 12 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
How does Iovation use Dyn?
•API Interface• Active/Active between two sites
•Admin Console• Active/Active between two sites
•Content Distribution• GSLB among four sites
Pg. 13 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
• DNS cache poisoning, DNSSEC and general DNS security
Pg. 14 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Anatomy of a HTTP connection
Pg. 15 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
How does DNS loadbalancing work?
Pg. 16 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Simple active/passive example
• Primary location assumes 100% traffic
• In event of disaster, swing 100% of traffic to
a standby location• Could be a “we’ll be back soon” or “status” page• Could be a backup copy of your app
• We call this Active Failover
Pg. 17 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 18 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 19 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 20 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Active/Active Load Balancing
(Global Server Load Balancing, GSLB)
(Hot/Hot Load Balancing)
(High Availability Load Balancing)
Pg. 21 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Traffic management with
Dyn Traffic Director
Pg. 22 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 23 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Dyn Traffic Director
• Fast Anycast network enables low TTLs
• Monitor endpoints for health
• Globally load balance among 7 regions
• Use Anycast to gauge “where is the user?”
Pg. 24 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 25 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 26 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Favor performance over network topology?
• Real-time monitoring of endpoints
• Always serve the fastest endpoint for each
user, regardless of network topology
• That’s real-time traffic management with
Dyn’s Traffic Director
Pg. 27 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 28 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 29 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 30 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Favor geopolitical boundaries above all?
• Per query lookup on source address
• Geopolitical IP mapping database
• State by state and country by country
granularity
• That’s geo traffic management with Dyn’s
Traffic Director
Pg. 31 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 32 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Traffic Management Recap
• Active/Passive with health checks• Dyn Active Failover
• Active/Active with health checks• 7 global regions by network topology -> Dyn Traffic
Director• Add in real-time latency measurement -> Dyn Traffic
Director with real-time traffic management• Add in geopolitical granularity -> Dyn Traffic Director
with geo traffic management
Pg. 33 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Pg. 34 Scalability and Availability in the Real World -- @cvwdyn & @eprosenx
Dyn Delivers Internet Performance
•Traffic management and managed
DNS
•Message management and email
delivery
•Remote access and domain services
Pg. 35 Presentation Title (edit from Slide Master) @twitterhandle
Scalability and Availabilityin the Real World
Cory von WallensteinChief Technologist,
@cvwdyn
Eric RosenberryPrincipal Infrastructure Architect,
iovation [email protected]
@eprosenx
Thank You!