
Ajay Dubey

Delhi, DL

Summary

A fast learner and passionate technologist with 15+ years of proven experience in conceptualizing, architecting, planning and executing enterprise-scale, time-critical software solutions.

I have commercial experience delivering greenfield as well as brownfield projects, with responsibility for making key technology decisions and leading teams of developers. Over the past couple of years I have designed and delivered batch and CDC data pipelines for gaining insights and running analytics on massive-scale data using technologies such as Spark, Hadoop and various cloud services. I have production experience migrating applications and databases to AWS and hold the AWS Certified Solutions Architect Associate certification.

I am a hands-on developer with expertise in the Java, Scala and JavaScript programming languages. I have rich experience developing an array of multithreaded, resilient and scalable microservices using Java and the Akka concurrency framework.

Overview

4 years of post-secondary education
12 years of professional experience

Work History

Sr Technical Lead Big Data

Airtel
Gurgaon
05.2021 - Current
  • Analyzing structured and semi-structured data from different sources such as flat files and relational databases.
  • Designing data lakes for telecom data, primarily to process and store CDRs.
  • Creating tools and frameworks to efficiently process and store data.
  • Creating tools that produce aggregate data for use by different consumers.
  • Creating services to expose data.
  • Collaborating with different product teams to ensure timely delivery of features.
  • Developing code, liaising with testing teams for QC and UAT, and promoting and maintaining the code in production.
  • Driving daily scrum meetings to report the status of work to management and business.

Senior Big Data Developer

Macquarie Global Services
Gurgaon
10.2017 - 05.2021

· Helping teams arrive at the best possible solution and technical architecture within the given enterprise structure and constraints.

· Helping teams leverage the latest Big Data and Data Science tools and technologies to modernize their analytical and operational systems.

· Setting up the CDC pipeline for all General Ledger transactions into Cloudera, an AWS-based corporate data hub.

· Taking key low-level design decisions, hands-on coding and code reviews.

· Carrying out POCs to evaluate technologies to be onboarded.

· Engaged in building a framework to migrate RDBMS and file data from across applications to the Cloudera Data Lake for archival, retrieval and disposal purposes.

Hadoop Developer

Sapient Consulting Ltd
08.2016 - 10.2017

1. Description: Led a team of 5 developers in designing and developing a CDC pipeline to capture real-time data into a Big Data lake for client insights and analytics. The work involved capturing data from 250 data sources (Oracle and DB2) at one of the largest investment banks in the UK.

Technologies: Spark, Kafka, Hive, Elasticsearch, Attunity and IBM CDC, with Scala and Java as programming languages.

2. Description: Led a team of 4 developers in building a collection of microservices for a large investment bank's Big Data repository for risk valuations, sensitivities and trade data. The microservices were designed in accordance with Lambda Architecture principles so as to handle massive quantities of data by taking advantage of both batch and stream processing methods.

Technologies: Spark, Kafka, Hive, Elasticsearch, MongoDB, Akka and Cucumber BDD, with Scala and Java as programming languages.

United Health Group
10.2014 - 07.2016

CSC India Pvt. Ltd
08.2011 - 09.2014

Team Computer Pvt Ltd
05.2010 - 08.2011

Soft Creations
04.2009 - 08.2009

Education

B.Tech - Electrical, Electronics And Communications Engineering

I.P. University
Delhi
06.2004 - 07.2008

Skills

SQL


Additional Information

  • 12+ years of extensive Big Data and application development experience across software design, development, implementation and support of business applications using DevOps methodology.
  • Currently working as Sr Technical Lead Big Data at Airtel International, Gurgaon.
  • 8+ years of extensive experience on Big Data projects; well versed in Hadoop ecosystem components and related tools such as HDFS, Sqoop, Scala, Apache Spark, Apache Spark-SQL, Hive, Drools, Snowflake, Redshift, MongoDB and Microsoft Power BI.
  • 5 years of extensive experience in professional software development with the Microsoft stack.
  • Able to assess business rules, collaborate with stakeholders and build source-to-target Big Data pipelines.
  • Improved Hive query performance using partitioning, bucketing, indexing and other techniques.
  • Used the Spark API over Cloudera Hadoop YARN to perform analytics on data in Hive.
  • Implemented Spark jobs using Scala and Spark SQL for faster data processing.
  • Converted Hive/SQL queries into Spark transformations and actions using Spark RDDs.
  • Analyzed raw data to arrive at conclusions and recommendations.
  • Hands-on experience using Sqoop to transfer data from RDBMSs directly into HDFS and Hive.
  • Analyzed and gathered business requirements and prepared business documents accordingly.
  • Good analytical abilities, a quick grasp of new concepts, and zeal for learning new technologies such as Spark (real-time processing) and machine learning concepts.
  • Proficient in version control using Git, Stash and TFS.
  • Well versed in DevOps and Agile with the Scrum model, using JIRA for development.
  • Creating and executing unit test cases and reviewing code written by junior developers.
  • Good development skills using Spark, Impala, Hive, Kafka, Apache Kudu, AWS Redshift, AWS DynamoDB, MongoDB, Snowflake, Azure Cosmos DB, Scala, Python, ASP.NET, MVC, C#, SQL Server2X, Web Services, WCF, WinForms, multithreading, Web API, ZeroMQ, RabbitMQ, AWS, Azure, LINQ and the FIX protocol.

Technology (Domain: Specific Area in Domain)

  • Languages: C#, SQL, Python, Scala
  • Operating Systems (OS): Windows
  • Databases: SQL Server, Hive, Redis, CosmosDB, Sybase, MongoDB, Redshift, Snowflake
  • Big Data: Hadoop, Map-Reduce, Sqoop, Hive, Flume, Apache Spark, Apache Spark-SQL, Kafka, Scala, Python, Power BI Desktop, IntelliJ, Impala, Apache Druid, AWS S3, Snowflake
  • Reporting Tool: Power BI
  • Version Control Tools: Git/Stash/SourceTree, SVN, TFS
  • Other Tools: Jira, Rally, Database Control Manager (DCM), Confluence
