shubham bhindwal big data


Upload: shubham-bhindwal

Post on 07-Jan-2017



 

Shubham Bhindwal
Hadoop Developer (Research and Development)

Cell: 09755922969, 09907021341
Email: [email protected]

SUMMARY
I am a tech geek and an experienced big data (data science) developer with out-of-the-box thinking, and I adapt easily to new technologies.

HIGHLIGHTS

● Hadoop, MapReduce, HDFS, HBase, ZooKeeper, Hive, Pig, Sqoop, Flume, MongoDB, Nutch, Solr, and Mahout
● Java (J2SE and J2EE)
● R and Python
● Machine Learning
● Statistical Analysis and NLP (Natural Language Processing)

EXPERIENCE
Junior Software Developer (Research and Development)
Techvalens, Indore (M.P.)

ACHIEVEMENT
Creator of a search engine for developers, www.hadoop.org.in

● Response time varies between 0.2 and 0.8 seconds
● Implemented a new custom PageRank algorithm
● Implemented stop-word removal using StopAnalyzer
● Implemented stemming using the Snowball algorithm

HADOOP PROJECTS

Sentiment Analysis (banking domain)
Environment: Hadoop, Apache Hive, Sqoop, Java, Linux, and MySQL.
Role: Hadoop Developer.
Project Description: The purpose of the project is to store information generated from the bank's historical data, extract meaningful information from it, and use that information to predict each customer's category. The solution is based on the open-source big data software Hadoop. The data is stored in the Hadoop file system and processed using Map/Reduce jobs for product and pricing information.

Roles & Responsibilities:

● Involved in developing the Pig scripts.
● Developed Sqoop scripts to enable interaction between Pig and the MySQL database.
● Fully involved in the requirement analysis phase.
● Developed a MapReduce application using Hadoop, MapReduce programming, and HBase.
● Experienced in loading data into Hive partitions and creating buckets in Hive.
● Worked on the Cloudera distribution, with strong knowledge of creating and monitoring Hadoop clusters on CDH5 with Cloudera Manager on Linux (Ubuntu).


 

Hadoop Administrator

● Deployed several Hadoop multinode clusters as per job requirements
● HDP (Ambari) 2.3 on Rackspace, Ubuntu (12.02), 5 nodes
● CDH4 on AWS (14.04)
● Created Chef scripts for automated installation of Spark, Cassandra, and Kafka with Apache Mesos

Machine Learning Algorithm
Twitter Sentiment Analysis using Mahout

● Naive Bayes binary classification (Mahout), implemented in Java
● Training dataset of 1.6 million tweet records
● Result: 80% of tweet classifications (positive or negative) were correct when tested with 10k samples

9 Million Crawler

● Deployed a Hadoop multinode cluster setup
● Created a Nutch plugin for parsing meta tags
● Enabled indexing and full-text search with Elasticsearch
● Implemented a Mahout clustering algorithm
● Implemented MongoDB as the back-end database

  

Research on R and Python
As part of my research job, I have worked in a very dynamic environment and created POC (proof of concept) demo apps for clients in R and Python.

● R: stock market analysis (time-series regression)
● Python: basic working knowledge

   

EDUCATION
Bachelor of Engineering, Computer Science, 2014
Mandsaur Institute of Technology (Mandsaur, M.P.)

PERSONAL STRENGTH
- High grasping power
- Keen intellect
- Troubleshooter

PERSONAL DETAIL
Date of Birth: 04/08/1992
High language proficiency: English, Hindi

 
