unlocking data science in the enterprise - with oracle and cloudera

20
Unlocking data science in the enterprise with Cloudera Data Science Workbench for Oracle Big Data Jochen Faltermeier | Partner Manager, Central EMEA Balazs Gaspar | Sales Engineer, Central EMEA / CEE

Upload: cloudera-inc

Post on 21-Jan-2018

175 views

Category:

Business


0 download

TRANSCRIPT

Page 1: Unlocking data science in the enterprise - with Oracle and Cloudera

1© Cloudera, Inc. All rights reserved.

Unlocking data science in the enterprise withCloudera Data Science Workbench for Oracle Big Data

Jochen Faltermeier | Partner Manager, Central EMEA

Balazs Gaspar | Sales Engineer, Central EMEA / CEE

Page 2: Unlocking data science in the enterprise - with Oracle and Cloudera

2CONFIDENTIAL. INTERNAL. ©Cloudera

We believe data can make what is impossible

today, possible tomorrow

Page 3: Unlocking data science in the enterprise - with Oracle and Cloudera

3CONFIDENTIAL. INTERNAL. ©Cloudera 3© Cloudera, Inc. All rights reserved.

Cloudera at-a-glance

Customer successLarge enterprises fueling growth

48% 140%+customer growth net expansion

Last 4 years Global 8000 customers

Expansion driven by data and new

use cases

Open partner networkBest of breed solutions

3000+partners

Vast ecosystem of solution &

service providers

First to marketOpen source innovation

2008founded

1600+Clouderans

Global team doing business in 28 countries

Big data innovators from Google,

Yahoo and Oracle

Page 4: Unlocking data science in the enterprise - with Oracle and Cloudera

4© Cloudera, Inc. All rights reserved.

Teaming strengths

• Executive sponsorship

• Install base: nearly 500 customers worldwide

• Complementary data management platform

• Simplify decision, total cost of ownership & time

to market

• Architecture and outcome led capabilities

• Customer support interlock

Innovation strengths

• Full stack of platform capabilities

(EDW/EDL/OCS)

• On-premise, hybrid and cloud deployment

options

• Tools for LOB, analysts, data scientists, IT

• Very high performance and data management

and analytics capabilities (EDH+ORAAH)

• Product development and integration across

BDA/BDCS/Public Cloud offerings

Partnership strengths

Page 5: Unlocking data science in the enterprise - with Oracle and Cloudera

5© Cloudera, Inc. All rights reserved.

Cost of compute

Data volume

Time

MachineLearning

NOMachineLearning

1950s 1960s 1970s 1980s 1990s 2000s 2010s

Age of machine learning

Page 6: Unlocking data science in the enterprise - with Oracle and Cloudera

6© Cloudera, Inc. All rights reserved.

PATTERN

RECOGNITIO

N

ANOMALY

DETECTIO

N

PREDICTION

SELF-SERVICE

INTELLIGENCE

SECURE

REPORTING

REAL-TIME

ANALYTICS

MACHINE LEARNING ANALYTICS

Enterprise-proven machine learning and analytics

700+CUSTOMERS RUN

ON

750+CUSTOMERS RUN

ON

Page 7: Unlocking data science in the enterprise - with Oracle and Cloudera

7© Cloudera, Inc. All rights reserved.

The data-driven enterprise

Explosion of data and devices

(IoT)

30Bconnected

devices

440x more data

Transformation of IT infrastructure

open source

cloud

machine learning

$200Btotal

market1

1 IDC Worldwide Big Data and Business Analytics Market Through 2020

Page 8: Unlocking data science in the enterprise - with Oracle and Cloudera

8© Cloudera, Inc. All rights reserved.

Data science / machine learning workflowFaster from data to exploration to action in a single platform

Data engineering Data science (Exploratory) Production (Operational)

Data wrangling

Visualization and analysis

Model training & testing

Productiondata pipelines Batch scoring

Online scoringServing

Data GovernanceGovernance

Processing

AcquisitionReports,

dashboards

Page 9: Unlocking data science in the enterprise - with Oracle and Cloudera

9© Cloudera, Inc. All rights reserved.

Good news

Data Engineering Data science (Exploratory) Production (Operational)

Data wrangling

Visualization and analysis

Model training & testing

Productiondata pipelines Batch scoring

Online scoringServing

Data GovernanceGovernance

Processing

AcquisitionReports,

dashboards

Data has never been more plentiful

Open source data science and machine learning libraries are rapidly evolving

Commodity (and on-demand) compute makes scalable production machine learning affordable

Page 10: Unlocking data science in the enterprise - with Oracle and Cloudera

10© Cloudera, Inc. All rights reserved.

Bad news

Data engineering Data science (Exploratory) Production (Operational)

Data wrangling

Visualization and analysis

Model training & testing

Productiondata pipelines Batch scoring

Online scoringServing

Data GovernanceGovernance

Processing

AcquisitionReports,

dashboards

Data needs to move across multiple different systems

Teams have different, conflicting requests for languages & libraries

Most data science done at small scale, individually, and is difficult to replicate

Very few models reach production

Page 11: Unlocking data science in the enterprise - with Oracle and Cloudera

11© Cloudera, Inc. All rights reserved.

Access Scale Developer experience

Additional challenges

Page 12: Unlocking data science in the enterprise - with Oracle and Cloudera

12© Cloudera, Inc. All rights reserved.

Our goal is to enable data science and machine learning at scale

Page 13: Unlocking data science in the enterprise - with Oracle and Cloudera

13© Cloudera, Inc. All rights reserved.

Open data science in the enterprise

ITdrive adoption while maintaining compliance

Data Scientistexplore, experiment, iterate

Page 14: Unlocking data science in the enterprise - with Oracle and Cloudera

14© Cloudera, Inc. All rights reserved.

Our goal: an open platform for data science at scale

Help more data scientistsuse the power of Hadoop

Use a powerful, familiar environment with direct access to

Hadoop data and compute

Data scientistData engineer

Make it easy and secure to add new users, use cases

Offer secure self-service analytics and a faster path to production on common, affordable infrastructure

Enterprise architectHadoop admin

Page 15: Unlocking data science in the enterprise - with Oracle and Cloudera

15© Cloudera, Inc. All rights reserved.

Introducing Cloudera Data Science WorkbenchSelf-service data science for the enterprise

Accelerates data science from development to production with:

• Secure self-service environments for data scientists to work against Cloudera clusters

• Support for Python, R, and Scala, plus project dependency isolation for multiple library versions

• Workflow automation, version control, collaboration and sharing

Page 16: Unlocking data science in the enterprise - with Oracle and Cloudera

16© Cloudera, Inc. All rights reserved.

Demo

Page 17: Unlocking data science in the enterprise - with Oracle and Cloudera

17© Cloudera, Inc. All rights reserved.

With Cloudera Data Science Workbench…

Data scientists can:• Use R, Python, or Scala from a web

browser, with no desktop footprint

• Install any library or framework within isolated project environments

• Directly access data in secure clusters with Spark and Impala

• Share insights with their team for reproducible, collaborative research

• Automate and monitor data pipelines using built-in job scheduling

IT can:• Give their data science team the freedom

to work how they want, when they want

• Stay compliant with out-of-the-box support for full platform security, especially Kerberos

• Run on-premises or in the cloud, wherever data is managed

Page 18: Unlocking data science in the enterprise - with Oracle and Cloudera

19CONFIDENTIAL. INTERNAL. ©Cloudera

Customer Data Center

Customer Managed

19

Big Data Appliance

Customer Data Center

Oracle Managed

Oracle Cloud

Oracle Managed

BDA Cloud Service

On-Premises Cloud @ Customer Public Cloud

Big Data Cloud to

Customer

Portfolio and Product Alignmentpowered by Cloudera

Page 19: Unlocking data science in the enterprise - with Oracle and Cloudera

20CONFIDENTIAL. INTERNAL. ©Cloudera

Why Oracle and Cloudera?

Oracle Exadata

EDW

Relational/Transactional/Data Mining

Oracle Data Integrator/

Golden Gate<-------------->

Oracle Big Data Appliance

DATA LAKEHadoop/NoSQLSocial/Web/IoT

Oracle Big Data SQL

VALUE DRIVERS

• TTM, Data drive decisions

• Reduces Cost

• Reduce Risk

• TCO

TECHNICAL VALUE

• Query more data with BD SQL; a tool you

know and have invested in

• Easily integrate more data with existing app’s

• Secure, integrated, scalable data platform

USE CASES

• Customer 360

• Digital Transformation/Instrument your

business

• Secure your business (Cyber/Fraud)

Oracle Analytics Cloud & BDCS(Visualization, Automatic, BI, Analytics, Discovery,

Preparation)

Page 20: Unlocking data science in the enterprise - with Oracle and Cloudera

21© Cloudera, Inc. All rights reserved.

Thank you

Watch the webinar series:

go.cloudera.com/cdsw-webinar-emea