oracle nosql database...running hadoop in nosql database (interactive slide) steps to run a hadoop...
TRANSCRIPT
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Oracle NoSQL Database
Integrating Apache Hadoop with Oracle NoSQL Database
Prathima Trivedi
PROPERTIES
Allow user to leave interaction: Anytime
Show ‘Next Slide’ Button: Don't show
Completion Button Label: Next Slide
PROPERTIES
Allow user to leave interaction: Anytime
Show ‘Next Slide’ Button: Don't show
Completion Button Label: Next Slide
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Road Map
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Road Map
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Background
NoSQL
Database
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Hadoop Framework
• Is a Java framework for processing large sets of data
• Is used for a variety of analytics
• Incorporates MapReduce functionality to segregate the
data
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
MapReduce Technique
• Divides the application into many small blocks of work
• Processes the data that is located in NoSQL Database
• Has two phases:
– Map phase
– Reduce phase
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Road Map
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
NoSQL Database Support to Connect with Hadoop
NoSQL Database acts as the input or output for Hadoop.
NoSQL Database NoSQL Database
Input Data Output Data
Hadoop Cluster
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Example: Scenario
NoSQL Database
Commodity
Customer profile
Preferences
Views
Transactions
Online Display Advertising
MapReduce Job
Search preferences
Prediction
Additional services
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Example: Business Conclusions
NoSQL Database
Commodity
Customer profile
Preferences
Views
Transactions
Online Display Advertising
MapReduce Job
Search preferences
Prediction
Additional services
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Road Map
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
oracle.kv.hadoop Package
oracle.kv.hadoop consists of two classes:
• KVInputFormat
• KVAvroInputFormat
Producer Note: Convert to interactive slide
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Road Map
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Case Study: Tunes.fm
Data collected:
• Customer Schema
– Age
– Gender
– Location
– Date
• Song Schema
– Track title
– Artist
– Number of times a
track is played
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Generating Charts
Track ID 733143757 is played 154 times and is the most
frequently streamed track.
Track ID Number of times that
the track is played
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Hardware and Software Requirements (Interactive Slide)
• Hardware Requirements:
– Disk space
– RAM above 2 GB
• Software Requirements:
– Linux-based operating system
– NoSQL Database
– Oracle Java Development Kit 1.6 or later
– Hadoop 2.2
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Running Hadoop in NoSQL Database (Interactive Slide)
Steps to run a Hadoop operation in NoSQL Database:
1. Start Kvlite.
2. Load data into NoSQL Database.
3. Start the Hadoop interface.
4. Run the MapReduce job.
5. Display the output.
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Demonstration (Interactive Slide)
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Course Summary
Introduction
to Hadoop
Benefits of
Integration
NoSQL
Database
APIs
Case Study:
Tunes.fm
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Quiz
Oracle NoSQL Database provides critical linear scaling and
extremely low latency for those operations closest to the user’s
browser.
a. True
b. False
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Quiz
The KVInputFormat class enables you to modify data in
NoSQL Database.
a. True
b. False
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Quiz
NoSQL Database provides high availability of data storage.
a. True
b. False
PROPERTIES
Allow user to leave interaction: Anytime
Show ‘Next Slide’ Button: Don't show
Completion Button Label: Next Slide
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.
Credits
Author and Narrator Prathima Trivedi
Technical Contributor and Reviewer
Anuj Sahni
Editor
Susan Moxley
Producer
Swarnapriya Shridhar
Thanks for watching!
Credits
Copyright © 2014, Oracle and/or its affiliates. All rights reserved.