bleeding edge databases

38
Bleeding Edge Databases @LynnLangit

Upload: lynn-langit

Post on 14-May-2015

684 views

Category:

Technology


3 download

DESCRIPTION

On Aerospike, AlgebraixData and Google BigQuery for BigDataCampLA

TRANSCRIPT

Page 1: Bleeding Edge Databases

Bleeding Edge Databases

@LynnLang i t

Page 2: Bleeding Edge Databases

Unstructured Data

Page 3: Bleeding Edge Databases
Page 4: Bleeding Edge Databases

Live Tweets on a Building

Page 5: Bleeding Edge Databases
Page 6: Bleeding Edge Databases

What is Aerospike?

Real-time NoSQL• Flash Optimized• In-memory• Exponentially

Scalable

Super Fast• 1M TPS on one

server (reads)• 40K TPS on one

server (writes)

More• ACID

compliance• Tunable

Consistency

Page 7: Bleeding Edge Databases

Benchmark Results• 200,000 tps (read-write) & 300,000 tps (read-heavy)• 10X Faster for R/W loads on SSDs

Page 8: Bleeding Edge Databases

DEMO

Page 9: Bleeding Edge Databases

More Benchmark Results

Config• 10G network• Aerospike 3• Same hardware• 4-node CentOS

Data• 500GB• 50M recordsEach Record • 100 bytes• 23 byte key• 10 fields

Page 10: Bleeding Edge Databases

Aerospike Architecture

Page 11: Bleeding Edge Databases

Example Architecture

Page 12: Bleeding Edge Databases
Page 13: Bleeding Edge Databases
Page 14: Bleeding Edge Databases

How to try it out

• Bare metal or pick a Cloud, set up a VM• Get the free community edition• Go…

Page 15: Bleeding Edge Databases

Linked Open Data Cloud

Page 16: Bleeding Edge Databases
Page 17: Bleeding Edge Databases

What is Algebraix Data?

IoT – Semantic Web

Super Powerful 1 Billion

Triples on 1 Node

Native Mathematical

Engine

Triple store RDF (Graph)

Page 18: Bleeding Edge Databases

SPARQL Server™

W3C & OGC compliant RDF / SPARQL Semantic Database

Natively built with proprietary Math• Algebraix technology (and patents)

Runs on commodity hardware• In the cloud (or on premise)• Scales Up and Down

Significantly better benchmark performance• over leading RDF databases

Page 19: Bleeding Edge Databases

Benchmark Results

• SP2Bench SPARQL Performance Benchmark

Page 20: Bleeding Edge Databases

SP^2 Benchmark Visualized

Page 21: Bleeding Edge Databases

DEMO

Page 22: Bleeding Edge Databases

It’s the Math…

Page 23: Bleeding Edge Databases

Patents

Page 24: Bleeding Edge Databases

Runs on common hardware

• Any Cloud or• On Rremises

High Performance & Capacity

• Needs no indexes

• Works particularly well w/sparse data

Self-tuning

• Retains results & intermediate sets

• Supports point-in-time queries

SPARQL Server™

Page 25: Bleeding Edge Databases

Algebraix Solution Stack

Data Algebra

DatabaseNoSQL Relational

RDF Semantic

ApplicationsMeaning

Organization

Optimization& Execution

Conceptual

Data Loaders Query Translators

• Modern abstract algebra• Zermelo-Fraenkel set theory

• Mathematics-based data management platform• Universal data language• Collection of I.P.

• SPARQL Server – RDF• A2DB - Relational

• Search• Analytics• Business Intelligence• Data Integration

Algebraix Platform

Page 26: Bleeding Edge Databases

How to try it out

• Sign up on their website• Try out when notified (this July)

Page 27: Bleeding Edge Databases
Page 28: Bleeding Edge Databases
Page 29: Bleeding Edge Databases

What is Google Big Query?

QaaS – interactive

RESTful web service

SQL-like language

Queries data stored in Google

cloud

Wide Column Tables

Uses OAuth for

access control

Very Fast 750M

Rows in <10 secs

Page 30: Bleeding Edge Databases

Easy & Fast

• Text or Json• Up to 100k inserts/sec (streaming)

Load it

• Supports core SQL query concepts• SELECT, FROM, JOIN, WHERE, ORDER BY, GROUP BY • Windowing functions (OVER / PARTITION)• Common Aggregates (SUM, COUNT, MAX)

• Includes ‘analytic’ SQL• STDDEV, VARIANCE, CORRELATION• REGEXP_MATCH

Query it

• Query is $ 5 per TB processed• Storage is around $30 TB per month

Pay (for) it

Page 31: Bleeding Edge Databases

Benchmark Results

• TCP-H Benchmark

Page 32: Bleeding Edge Databases

DEMO

Page 33: Bleeding Edge Databases
Page 34: Bleeding Edge Databases

Partners and BigQuery

Google Sheets Tableau QlikView

Bime Excel

Page 35: Bleeding Edge Databases

How to try it out

• Set up a Google Cloud account• Upload or stream data• Query

Page 36: Bleeding Edge Databases

Google Cloud Starter Pack

Use code“gde-in”

Page 37: Bleeding Edge Databases

Next steps

Try them out

@LynnLang i t

Page 38: Bleeding Edge Databases