presentation title goes heredownload.microsoft.com/download/c/f/f/cff0a653-6cd6-4e52-b97… ·...

Post on 28-May-2020

1 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

HDInsight-微软Azure云中的Hadoop大数据解决方案概览及案例分享 孙巍 资深项目经理 微软亚太研发集团

课程代码DBI-B308

Hadoop?

Apache 开源项目

高扩展性分布式文件系统 (HDFS)

分布式数据处理框架

Microsoft解决方案

Hadoop 2.2 and 2.4

80% data compression with ORC

Hadoop

on

Windows

Hive 100x Query Speed Up

30,000+ code line contributions

HDFS in Cloud

(Azure)

10,000+ engineering hours

Committers to Hadoop

Hadoop 2.0

Data Node Data Node Data Node Data Node

Task Tracker Task Tracker Task Tracker Task Tracker

Name Node

Job Tracker

HMaster Coordination

Region Server Region Server Region Server Region Server

Stream processin

g

Search and query

Data analytics (Excel)

Web/thick client

dashboards

Devices to take action

RabbitMQ /

ActiveMQ

HDInsight on Hadoop 2.2 April 2014

HDInsight on Hadoop 1.1.2 Oct 2013

HDInsight on Hadoop 2.4 June 2014

O/S Upgrades

O/S Patching

$£€¥

Cloud

案例

课后提醒

top related