Drupal Big Data
Big Data Drupal with Cloudera, Hadoop, MapReduce, Nutch and Solr, by niccolo
http://groups.drupal.org/node/286763
![Page 1: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/1.jpg)
Big Data Drupal
DEMOCRATIZING BIG DATA PROCESSES
![Page 2: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/2.jpg)
Elements
Bonita
Cloudera
Nutch
Solr
Drupal
![Page 3: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/3.jpg)
Bonita
JAVA/ECLIPSE-BASED COMMERCIAL OPEN-SOURCE BUSINESS PROCESS AUTOMATION & MODELLING
![Page 4: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/4.jpg)
Bonita Studio
Design business process models
Human or Service Tasks
Human Tasks have Forms
Service Tasks have Connectors
![Page 5: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/5.jpg)
Bonita Experience
Web-based admin & workflow
![Page 6: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/6.jpg)
Bonita Forms
![Page 7: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/7.jpg)
Shell Script Task
sudo -u hdfs hadoop jar /opt/nutch/basil-apache-nutch-1.6/build/apache-nutch-1.6.job org.apache.nutch.crawl.Crawl /user/nutch/demo-crawl/urls -dir ${dir} -depth ${depth} -topN 10 -threads 50
Runs Nutch job for Hadoop
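The shell task above takes `${dir}` and `${depth}` from the Bonita process. A minimal sketch of how such a command could be assembled and checked before it is handed to Hadoop is shown here. The job path and seed directory are copied from the slide; the helper name `build_nutch_cmd` and the output path `/user/nutch/demo-crawl/run1` are hypothetical, and the script only prints the command rather than running it.

```shell
#!/bin/sh
# Sketch only: assemble the Nutch crawl command from the slide with explicit parameters.
# NUTCH_JOB and SEED_DIR are taken from the slide; build_nutch_cmd is a hypothetical helper.
NUTCH_JOB=/opt/nutch/basil-apache-nutch-1.6/build/apache-nutch-1.6.job
SEED_DIR=/user/nutch/demo-crawl/urls

build_nutch_cmd() {
  dir=$1    # HDFS output directory (${dir} in the Bonita task)
  depth=$2  # number of crawl rounds (${depth} in the Bonita task)
  # Reject a non-numeric depth or an empty directory before anything is run.
  case $depth in
    ''|*[!0-9]*) echo "depth must be a non-negative integer" >&2; return 1 ;;
  esac
  [ -n "$dir" ] || { echo "dir must not be empty" >&2; return 1; }
  printf 'sudo -u hdfs hadoop jar %s org.apache.nutch.crawl.Crawl %s -dir %s -depth %s -topN 10 -threads 50\n' \
    "$NUTCH_JOB" "$SEED_DIR" "$dir" "$depth"
}

# Dry run: print the command for an assumed output directory and depth.
build_nutch_cmd /user/nutch/demo-crawl/run1 3
```

Printing the command first makes it easy to review what the connector would execute; on a real cluster the printed line would be run as-is.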
![Page 8: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/8.jpg)
Cloudera
BIG DATA COMMERCIAL OPEN SOURCE
![Page 9: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/9.jpg)
Cloudera
Cloudera Manager 4 (Free Edition)
HBase
HDFS
Hive
Hue
Impala
MapReduce
Oozie
ZooKeeper
![Page 10: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/10.jpg)
Nutch Job
Hadoop job started by Bonita Shell connector
![Page 11: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/11.jpg)
Apache Foundation
Nutch
Solr
HBase
HDFS
Hive
Impala
MapReduce
Home to many of these projects
![Page 12: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/12.jpg)
Nutch
Industrial-strength, general-purpose web crawler
http://blog.csdn.net/hadoopstudy/article/details/1501123
![Page 13: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/13.jpg)
Nutch
http://blog.csdn.net/hadoopstudy/article/details/1501123
![Page 14: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/14.jpg)
Solr
Search & indexing
![Page 15: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/15.jpg)
Drupal
PHP WEB APPLICATION FRAMEWORK
![Page 16: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/16.jpg)
Aegir BOA
![Page 17: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/17.jpg)
Drupal
Nutch & Solr modules
Integrate with search & views
Created at IAS
Sponsored by Acquia
![Page 18: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/18.jpg)
Apache Solr Module
![Page 19: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/19.jpg)
Apache Solr Examples Module
http://drupal.org/project/apachesolr_examples
![Page 20: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/20.jpg)
Nutch Multisite
![Page 21: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/21.jpg)
Drupal Search
Nutch crawl
Solr indexed
Drupal search & views
![Page 22: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/22.jpg)
Nutch Solr Sandbox
![Page 23: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/23.jpg)
Big Data Drupal
DEMOCRATIZING BIG DATA PROCESSES
![Page 24: Drupal Big Data](https://reader030.vdocuments.us/reader030/viewer/2022020207/554a3e27b4c905293a8b4e64/html5/thumbnails/24.jpg)
Big Data Drupal
Author