technical presentation on hadoop

13
ABID MERCHANT ZAID KHAN Technical Presentation on

Upload: zaid-khan

Post on 11-Apr-2017

206 views

Category:

Software


1 download

TRANSCRIPT

Page 1: Technical Presentation on Hadoop

ABID MERCHANT ZAID KHAN

Technical Presentation

on

Page 2: Technical Presentation on Hadoop

What is ?

“Hadoop” is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.

Page 3: Technical Presentation on Hadoop
Page 4: Technical Presentation on Hadoop
Page 5: Technical Presentation on Hadoop

Other Notable users

New York Times

Baidu

eHarmony

Rackspace

Page 6: Technical Presentation on Hadoop

in the real world.

Telecommunications

Data Warehousing

Market Research Forecasting

Social Networking

Natural Language Processing (NLP)

Image Video Processing

Academic Research

Financial Analysis

Page 7: Technical Presentation on Hadoop

‘s History Inspired by Big Table and MapReduce papers circa. 2004.

Created By Doug Cutting.

Originally built to support distribution for Nutch Search Engine.

Named after a stuff elephant.

Page 8: Technical Presentation on Hadoop

What is NOT ?

It isn’t a relational database... an online transaction processing

system... a structured data store of any kind!

Page 9: Technical Presentation on Hadoop

Components of :

Hadoop Libraries HDFS

YARN MapReduce

Page 10: Technical Presentation on Hadoop

Why is important ?

Page 11: Technical Presentation on Hadoop

Challenges of using :

There’s a widely acknowledged talent gap. (it can be difficult for entry level programmers who don’t have sufficient skills to be productive with MapReduce)

Data Security.

Full fledged data management and governance.

Page 12: Technical Presentation on Hadoop

References: http://www.sas.com/en_us/insights/big-

data/hadoop.html

http://searchcloudcomputing.techtarget.com/definition/Hadoop

http://wiki.apache.org/hadoop/

Page 13: Technical Presentation on Hadoop