akhil's hadoop

24
A NEW FILE SYSTEM TECHNOLOGY BASED ON HADOOP Presented by M.V.AKHIL PREM KUMAR

Upload: akhil-prem

Post on 15-Jan-2017

23 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Akhil's hadoop

A NEW FILE SYSTEM TECHNOLOGY BASED ON

HADOOP

Presented byM.V.AKHIL PREM KUMAR

Page 2: Akhil's hadoop

Appendix

1. Hadoop’s history

2. Architecture in detail

3. Hadoop in industry

4. Advantages

Page 3: Akhil's hadoop

What is

HaDooP

Page 4: Akhil's hadoop

Brief History of HadoopDesigned to answer the question:

“How to process big data with reasonable cost and time?”

Page 5: Akhil's hadoop

Search engines in 1990s

1996

1996

1997

1996

Page 6: Akhil's hadoop

Google search engines

1998

2013

Page 7: Akhil's hadoop

Google Origins2003

2004

2006

Page 8: Akhil's hadoop

Hadoop’s Developers

Doug Cutting

2005: Doug Cutting and  Michael J. Cafarella developed Hadoop to support distribution for the Nutch search engine project.

The project was funded by Yahoo.

2006: Yahoo gave the project to Apache Software Foundation.

Page 9: Akhil's hadoop

Hadoop Framework Tools

Page 10: Akhil's hadoop

Hadoop’s Architecture

Page 11: Akhil's hadoop

HDFS ARCHITECTURE

Page 12: Akhil's hadoop

Hadoop MapReduce

Page 13: Akhil's hadoop

Hadoop in the Wild• Advertisement (Mining user behavior to generate

recommendations

• Hadoop is in use at most organizations that handle big data:

o Yahoo! o Facebooko Amazono Netflixo Etc…

Security (search for uncommon patterns)

Page 14: Akhil's hadoop

Why use Hadoop?

Page 15: Akhil's hadoop

Some Hadoop Milestones

Page 16: Akhil's hadoop
Page 17: Akhil's hadoop
Page 18: Akhil's hadoop

CONCLUSION:•Distributed by the user all the world

•Efficently used for job tracker and task tracker in its map reduce engine

•Top level apache project

•Having more capabilites by adapting cloud computing too

Page 19: Akhil's hadoop
Page 20: Akhil's hadoop
Page 21: Akhil's hadoop
Page 22: Akhil's hadoop

QUERIES

Page 23: Akhil's hadoop
Page 24: Akhil's hadoop

Developer(s) Apache Software FoundationInitial release December 10, 2011; 3 years ago[1]

Stable release 2.7.1 / July 6, 2015[2]

Development status ActiveWritten in JavaOperating system Cross-platformType Distributed file systemLicense Apache License 2.0Website hadoop.apache.org

The name "Hadoop" was given by one of Doug Cutting's sons to that son's toy elephant. Doug used the name for his open source project because it was easy to pronounce and to Google. If you have 2 minutes, you can watch a video [1] with more details.