microsoft azure + r

Post on 14-Apr-2017

353 Views

Category:

Data & Analytics

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Microsoft Azure ♥ RData Science with Microsoft Azure

and R

Dmitry Petukhov,Microsoft Data Platform MVP, C# MCP, Big Data Enthusiast && Coffee Addicted

𝑃 (𝐴 𝑗|𝐵 )=𝑃 (𝐴 𝑗 )𝑃 (𝐵∨𝐴 𝑗)

∑𝑖=1

𝑁

𝑃 (𝐴𝑖 )𝑃 (𝐵∨𝐴𝑖)

⟨Ω ,𝔘 ,ℙ ⟩

Microsoft Azure + R. Prototype to Product Challenge

PrototypingFlexibilityDistributedScalableFault-toleranceReliable

ProductionFlexibilityDistributedScalableFault-toleranceReliable+ Big Data Ready+ LSML

Black Magic!

Migration

Microsoft Azure + R. Hello R!

Python is a COOL language!But R…

Specialized in statistical analyzeTime-effective => ideal for…

…prototyping, competition, researching, and for fun!

Standalone computing => not bad scalable Open sourceBig bearded community

Storage

ResourceManagement

ML Framework

Execution Engine

Local OS

Local Disc

Pyth

on R

untim

e

Yet A

noth

er

Runt

ime

scikitlearn

HDFS

YARN

MapReduce

Mahout

HDFS / S3

YARN / Apache Mesos

Spark

MLlib

HDFS / S3

YARN / Apache Mesos

Python / R on Spark

Python/Rtools

Spark

Local PC Hybrid Model Cluster (on-premises/on-demand)

somelibrar

y

Machine Learning in Finance. Infrastructure for Data Scientist

Low HighCost of deployment/ownership

Distributed FS

Dark Magic…

ML as a Service

Python/Rtools

Microsoft Azure + R. Infrastructures for Data Scientists

Microsoft Azure + R. Microsoft ♥ R

R Server for Azure HDInsightData Science VMAzure Machine Learning

Support R-scripts executionAllow authoring custom R modulesJupyter Notebooks with R kernel support

Azure HDInsightHadoop/Spark-cluster as a Service

SQL Server R ServicesPower BI

Running R Scripts & excellent visualization R Tools for Visual Studio

Microsoft Azure

References

Reference: http://www.r-bloggers.com/using-microsoft-r-server-to-address-scalability-issues-in-r/

Microsoft Azure + R. Microsoft R Server Platform

References

Reference: http://www.r-bloggers.com/using-microsoft-r-server-to-address-scalability-issues-in-r/

Microsoft Azure + R. DistributedR: write once, deploy anywhere

R Server for Azure HDInsight Killer features list:

100% open source R implementation;workload running inside HDInsight (Hadoop/Spark).

Microsoft Azure + R. R Server for Azure HDInsight

R, Python, SQL, C#

Microsoft Azure + R. Data Science VM

Microsoft R Server Developer Edition, Anaconda Python distribution, Jupyter notebooks for Python and R, Visual Studio Community Edition with Python and R Tools, Power BI desktop, SQL Server Express editionML libs: CNTK, xgboost and Vowpal WabbitAzure SDK

Data Science VM inside:

R Tools in Azure Machine Learning:Support R-scripts execution;Allow authoring custom R modules;Jupyter Notebooks with R kernel support.

Microsoft Azure + R. Azure Machine Learning

Microsoft Azure + R. Azure Machine Learning

Team

Data

JupyterNotebook

Azure ML Studio

GitHub/TFS in Azure

Share results

Source codeIdeas

Best practices

Cloud

Contribute

Azure Machine Learning

You

h(θ0, θn)

Azure Storage

Local

Commands flow

Data flow

Publish model

REST API

Test model

Request/response flow

ReferencesCortana Intelligence and Machine Learning BlogR for Azure Machine Learning. QuickstartMachine Learning Algorithm Cheat SheetMachine Learning Hackathon. How to win?Azure ML Repositories on GitHubMicrosoft Azure for all group on FacebookSoon in Slack (invite form)

Microsoft Azure + R. References

© 2016 Dmitry Petukhov All rights reserved. Microsoft and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.

Data Science must win!

Q&ANow or later (send on

d.petukhov@outlook.com)

Ping meHabr: @codezombie

LinkedIn: @dpetukhovFacebook: @code.zombi

Read my tech code instinct blog ( http://0xCode.in/ )

Microsoft Azure + R. Stay in Touch!

top related