Download PDF

Summary

A product developer, big data architect, DataScientist carry a profound experience in the industry from  Telecom, financial products, CMS and identity management background. Data crunching at tera-bytes scale on graph and also on Hadoop, research on Image Search indexing and information retrieval ,provided architecture on enterprise Integration and portal for the bank, a cloud search Engine for e-commerce and framework for Real-Time Streaming Analysis of News.

Highlights:
-----------------

  • Architected and implemented data pipelines to handle tera bytes of data in realtime etl , ingestion.
  • Realtime Stream Analysis( XtremeData, Spark Streaming, Kafka), Unstructured Information Management Architecture, Middle-ware Integration
  • AWS Cloud with Solr Search and Recommendations Engine.
  • Hadoop administration on Cloudera and MapR
  • Java,J2EE and Oracle with Product Development and DataAnalytics.
  • Performance Tuning, data Modeling, capacity-Planning and Partitioning on Oracle. Multi-tenancy postgres cluster with synchronous replication and Failover.
  • Probabilistic Topic Modelling and Real time Topic inferencing.
  • Graph-Databases: DEX, Neo4J, iGraph, Graph Traversal Algorithms.
  • Created Deep Learning Architecture for image processing using cuDNN and Caffe with CUDA GPUs from NVIDIA.
  • Created a Real-Time Data ingestion Engine using Message Oriented Middleware like Kafka and ActiveMQ.
  • Micro Services with Docker . Geo-Spatial APIs development with ESRI, ArcGIS and GeoJSON.
  • Served as the member of Smart City Industry Round table Conference, IDA Singapore in Real time Streaming Analysis from Sensory devices , networks and other datasources.
  • Created real time metrics product of spark jobs using graffana, graphite and Apache Spark.
  • Created Benchmarks for fast data analytics with various in memory databases and grids.

Education

20012005

B.Technology

JNTU

Publications

Skills

InfraStructure

80 Node Hadoop Cluster with Kerberos Security.

Virtualisation through Docker and service resiliency through NetflixOSS ( Zuul , Eureka and Hystrix ).

AWS cloud Services.

Ansible, Mesosphere, Single Click Cloud deployments

Java

Core Java , Scala, Python , J2EE,  Spring, WebServices, Hibernate.

Machine Learning

Support Vector Machines.

Singular value Decomposition (SVD).

Logistic Regression.

Bayesian Networks.

Graph Traversal Algorithms.

Agent Simulation on Networks.

Deep Learning

Applied Recurrent Neural Nets for the Real time analysis.

Used RBM as a part of unsupervised learning.

Implemented Word2Vector for text analysis and Topic Modelling.

Databases

Oracle , MySQL, Postgres Multi-Tenancy.

Streaming Analysis

Spark Streaming for all Real-Time Jobs.

Objects State Machine in Redis

Time Spatial Stream fusion.

Apache Spark

Created a ETL framework on Spark.

Spark ML.

Spark Metrics Instrumentation Using Graffana and Graphite.

Spark on Mesos and Ec2.

Big Data Analytics and Hadoop

Hadoop BigTop.

Created a Docker based Hadoop EcoSystem Distribution.

Cloudera

Titan Graph DB, DEX.

SOLR and Lucene

Real time Indexing Pipelines on SOLR.

UIMA Integration with Analytic Engines on SOLR.

Image Search using LIRE ( MPEG-7)

Text Analysis, and recommendation engines on SOLR.

NLP using Mallet.

Work History

2015-04Present

Senior Data Engineer

Dataspark, SIngtel
  • Implementation of Geo Analytics Platform at various Telcos.  
  • MRT Traffic analysisusing the Telco Data for Urban Planning and detection using Geo Analytics.
  • Implementation of City in Motion[Realtime Grid] from the Telco Data from Geo Spatial Analysis.
  • Implementation of real time predictive models, for next place predictions.
  • Implementation of Geo Analytics Platform on the scale of Terabytes Telecommunication Systems and ISPs.
  • Created a Custom ETL framework on Apache Spark. Created a SQL like engine using Presto on various hadoop components.
  • Adoption of Kafka to all enterprise datasources.
  • DataMining on Solr and Lucene using NLP techniques for different types of content. Built custom annotators for various field Types during indexing pipelines.
  • GeoSpatial Search using Solr and JTS.
  • MicroServices at scale using docker and dockerOS with Netflix OSS
  • Implemented open-source Postgres multi tenant cluster with 100 TB of data
2014-052015-04

Data Scientist

Singtel
  •  Implementation of Image Search using Lucene Image Retrieval and Extraction(LIRE). Worked on the Support for MPEG-7 formats for the ecommerce Image Search.
  • Implemented the Solr Image Search and full text Search using NLP techniques and metadata-curation and extraction.
  • Unstructured Information Management Architecture(UIMA-AS streaming analysis): Real-time streaming Analysis of twitter News content, RSS feeds, Data Extraction through Crawling, fusion of RSS and twitter; do Named Entity Extraction(NER) using OpenNLP from the content. Usage of AS, Spring-XtremeData for the frameworks. Time Annotation based annotator using Heidel time.
2013-032014-05

BigData Lead

Barclays PLC(Optimum Solutions)
  • Graph processing of the network to dofraud identification.
  • MapReduce Use-cases and Identity Data Management using a high-performance Graph Database called DEX.
  • Created frameworks forRealtime recertificationof identity data of bank assets.
  • Data Modelling, Data Partitioning and capacity Planning on 10X TB data on Oracle.
2011-022013-03

Senior Software Engineer and Framework Specialist

StandardCharteredBank Optimum Solutions
  • Pioneer in building a Relationship Manager's Work bench.
  • The Relationship Manager's Workbench is a one-stop shop for all the activities performed by the RMs in the bank say account opening, client-on boarding, Bookings, Credit and Limit allocations, Risk and credit Pricing etc.,
  • A one stop for the Clients View and Performance graphs of the Clients, which helps in analyzing the success rates of the RM with respect to the Clients.
  • It also contains Social Networking, where the RM's can interact socially and share the deal information with their associates.
  • Implementation of Micro Strategy Business Analytics with various KPIs and Dashboards.
2009-022011-02

Advanced Software Engineer

OpenText Corporation
  • Project I: Open Text Social Communities (OTSC)/CRM
    OTSC is known as Enterprise-Social. I am a Scrum team Member involved in the design-phases of the Product.
  • It is aimed at creating a Enterprise-Social CRM by leveraging the site-creation capabilities of the Portal and the Remote Objects capability of Collaboration.
  • This provides the feasibility to integrate the business Models along with the workflows.
  • Project II: Vignette Community Applications(VCA)
     The Vignette Social MediaSolution makes it possible to enhance the Web presence and enable user-generated content with the integrated, enterprise product offerings.
  • It includes Web 2.0 features such as Walls, blogs, wikis, forums, ratings and tags.I worked on creating the web 2.0 Applications.
2007-042009-01

Senior Software Analyst

Franklin Templeton Investments
2005-112007-04

Senior Software Engineer

LaserSoft Info Systems(Polaris Company)

Courses and Certifications

Apr 2014Apr 2014

Cloudera Developer Training

Cloudera

Cloudera Hadoop developer.

Mar 2014Mar 2014

Computing for Data Analysis

John Hopkins University (Coursera)


Micro-Services & Docker

Recommendation Systems

Honors & Awards

  • Best Employee of the year in my team at Franklin Templeton
  • Received Appreciation Cheque for the Work I have done at OpenText.
  • Appreciation from manager for creating a graph database proof of concept for Master Data Management in Barclays.
  • Received Spot Awards for 6 times in DataSpark and Singtel for achieving critical milestones.

References

Available upon Request