shubham bhindwal big data
Shubham Bhindwal Hadoop Developer (Research and development)
Cell: 09755922969, 09907021341
Email: [email protected]

SUMMARY
I am a tech enthusiast and an experienced big data (data science) developer with out-of-the-box thinking, and I adapt easily to any technology.

HIGHLIGHTS
● Hadoop, MapReduce, HDFS, HBase, Zookeeper, Hive, Pig, Sqoop, Flume, MongoDB
● Nutch, Solr and Mahout
● Java (J2SE and J2EE)
● R and Python
● Machine Learning
● Statistical Analysis and NLP (Natural Language Processing)

EXPERIENCE
Junior Software Developer (Research and Development)
Techvalens, Indore (M.P.)

ACHIEVEMENT
Creator of a search engine for developers, www.hadoop.org.in
● Response time varies between 0.2 and 0.8 seconds
● Implemented a new custom PageRank algorithm
● Implemented stop-word removal using StopAnalyzer
● Implemented stemming using the Snowball algorithm

HADOOP PROJECTS
Sentiment Analysis (banking domain)
Environment: Hadoop, Apache Hive, Sqoop, Java, Linux, and MySQL.
Role: Hadoop Developer.
Project Description: The purpose of the project is to store the bank's historical data, extract meaningful information from it, and use that information to predict each customer's category. The solution is based on the open-source big data software Hadoop. The data is stored in the Hadoop file system and processed using Map/Reduce jobs for product and pricing information.

Roles & Responsibilities:
● Involved in developing the Pig scripts
● Developed the Sqoop scripts to enable interaction between Pig and the MySQL database
● Fully involved in the requirement analysis phase
● Developed a MapReduce application using Hadoop, MapReduce programming, and HBase
● Loaded data into Hive partitions and created buckets in Hive
● Worked on the Cloudera distribution, with strong knowledge of creating and monitoring Hadoop clusters on CDH5 with Cloudera Manager on Linux (Ubuntu)

Hadoop Administrator
● Deployed several Hadoop multinode clusters as job requirements demanded:
● HDP (Ambari) 2.3 on Rackspace, Ubuntu (12.02), 5 nodes
● CDH4 on AWS (14.04)
● Created a Chef script for automated installation of Spark, Cassandra, and Kafka with Apache Mesos

Machine Learning Algorithm
Twitter Sentiment Analysis
● Used Mahout's Naive Bayes binary classification, implemented in Java
● Training dataset of 1.6 million tweet records
● Result: 80% of tweets classified correctly (positive or negative) when tested on 10k samples

9 Million Crawler
● Deployed a Hadoop multinode cluster
● Created a Nutch plugin for parsing meta tags
● Enabled indexing and full-text search with Elasticsearch
● Implemented a Mahout clustering algorithm
● Implemented MongoDB as the backend database

Research on R and Python
As part of my research job, I have worked in a very dynamic environment and created POC (proof of concept) demo apps for clients in R and Python.
● R: stock market analysis (time series regression)
● Python: basic working knowledge

EDUCATION
Bachelor of Engineering, Computer Science, 2014
Mandsaur Institute of Technology (Mandsaur, M.P.)

PERSONAL STRENGTH
● High grasping power
● Keen intellect
● Troubleshooter

PERSONAL DETAIL
Date of Birth: 04/08/1992
High language proficiency: English, Hindi