Download PDF


Big Data Consultant with strong business acumen and technical  experience in Big Data /Data Analytics space. Result Oriented , strong technical knowledge in both Computing and Machine Learning and that combines as hard work spirit with corporate-refined execution in Big Data strategy, Big Data consulting  and implementations.

Specialties: Data Modeling , Data Products, Visualization,  Hadoop framework implementation and Big data strategy.

Work History

May 2016October 2016

Data Scientist Intern

Data2B, Rennes(France)

Project - Developing Open Data Product for Share Bicycle System in Rennes- BIGCYCLE DATA .

  • Data Extraction from  Hadoop clusters(HDFSnode) and synchronization. 
  • Data Cleaning, Feature engineering, Descriptive and Linear regression for bicycle usage prediction using R.
  • Developed open Data Product "choosing best Station". 
Oct 2014Jun 2015

Data Analyst, India
  • Developed Merchant model that predicts future sales as a part of sales optimization system that ranks them based on their current and future potential values.
  • Analyzed Social Media data for marketing purposes.
  • Implemented demand forecast model for the Good's optimizing for maximum coverage while minimizing the overbuy.
Sep 2013Aug 2014

Functional Analyst, India 
  • Developed social media plug-in using Fb-plugin language used PHP.
  • Social media data Analysis using Open source tools
  • Product lead for the Sunday flea Market.
Jan 2013Jun 2013

Asp.Net Developer

Ducat Pvt Ltd., India
  • Worked as developer intern in Ducat 
  • Developed a website for online health solution named as using visual studio and mysql 2008.



Masters of Science in Big Data

National School of Statistics and Information Analysis(ENSAI)
Sep 2014Mar 2015

PG Diploma in Business Analytics

NIIT (India)
Oct 2014Nov 2014

Online Training on Big Data with Hadoop


 license Number Edureka, License 449252030

Aug 2009May 2013

Bachelors of Technology (Computer Science)

Gurgaon College of Engineering

Academic Projects

  •  Moving Clusters - Application to Flow Cytometry Data Analysis - Modelisation of the effect of external parameters on species evolution using Machine Learning and Hadoop.
  • It Tools For BIGDATA- Hadoop platform from Hortonworks was settled up on the local machine. Referred Data was loaded on to the Hadoop platform using Hive queries.The processed data was stored on HDFS tables using Hive queries.Synchronized code scheduling was performed using Oozie scripts.
  • Nosql Databases (MonogoDB)- Best part of NYC to live on the basis of Data which was choosen from NYC open data source. Five diffrent kind of datasets were choosen to make the pridiction and hypothsis to best part to live. Robomongo was used for the MonogoDB platform to process and handle the dataset.


English - Fluent

French - Professional

Hindi - Native