Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

RAKESH SINGH

Bengaluru

Summary

  • Lead Data Engineer having more than 8 years in cloud and distributed system and overall 15 + year of industry experience.
  • Experience in requirement gathering and architecting data engineering project.
  • Expertise in designing and implementing cloud base data pipeline using AWS tech stack and pyspark.
  • Expertise in implementing ETL solution for datawarehouse project.
  • Experience in DevOps.
  • Experience in implementing Machine learning module using Python based module.
  • Expertise in python programming and used in many project for different need.
  • Solid understanding of statistical model.
  • Solid Understanding of data structure and programming construct.
  • Experience in BI reporting and visualisation.
  • Experience of people management and team handling.

Overview

18
18
years of professional experience
1
1
Certification

Work History

Lead Data Engineer

Agilon Health
Bengaluru, Karnataka
09.2021 - Current
  • Agilon Health is leading heath-care based IT company in United states which is transferring health care from fee for service to value based total care for managed medicare patients . It has created model among payor, provider and patients to achieve this.
  • Involved in requirement gathering for building AWS cloud based data solution.
  • Involved in designed and built the data pipeline using procedure oriented paradigm and used snowflake for data processing.
  • Involved in designed and developed the metadata system for ETL.
  • Used airflow for all data orchestration purpose.
  • Used DBT for data quality check .

Senior Technical Lead

ATMECS
Bangalore, India
11.2019 - 09.2021
  • Client: US based financial research firm.
  • Involved in requirement gathering for building AWS cloud based data solution.
  • Designed and built the data pipeline using object oriented paradigm and used pyspark for data processing.
  • Designed and developed the metadata system for ETL process using Python based API .
  • Used airflow for all data orchestration purpose.
  • Created the ETL workflow architecture and same has adopted across projects for different feed.
  • Designed and built the rejection handling process.
  • Design and build the logic to handle time series data.
  • Perform performance tuning using.
  • Used Pytest and unit test for unit test need.
  • Working collaboratively in scrum agile process using Jira.
  • Used different AWS services for different context in building pipelining
  • AWS S3 Object store for file input place holder and landing zone.
  • AWS EMR cluster for spark engine to process the data and hdfs for storing intermediate data.
  • AWS secret manger to handle sensitive credential.
  • AWS SES for e-mail service.
  • Used Git for version management.
  • and Azure deveops for CICD .

Team Lead

Agreeya Solution
India
02.2017 - 08.2019
  • Client: US based leading collection agency.
  • Played key role in developing and deploying machine learning model for defaulted account for legal department of collection agency using predicative analytics'.
  • Contributed to expose new skill for business perspective using POC and use case presentation to client.
  • Done POC on mainly Azure services like azure stream analytics, Cosmos DB.
  • Worked on ETL.

Team Lead

Ciber
India
11.2014 - 02.2017
  • Client: US based World renowned automobile company.
  • Worked on migration project moving data from EDW system to hive.
  • Worked on ETL project.

Application Consultant

IBM
India
08.2011 - 11.2014
  • Worked on multiple BI projects as ETL developer.

Software Engineer

Accenture
India
08.2010 - 08.2011
  • Client: UK based parcel service.
  • Worked on BI projects as ETL developer.

Software Engineer

09.2007 - 08.2010
  • Patni computer, Noida, India, India Client: US based world famous fast food chain.
  • Worked on BI projects as ETL developer, tester and also supported application.

Education

Master of Science - Computer Science

Banaras Hindu University
India
06.2007

Bachelor of Science - Computer Science, Statistics, Maths

Banaras Hindu University
India
06.2005

Skills


    Cloud & Platforms: AWS (EMR, Glue, S3, Lambda, ECS/EKS), Snowflake (Snowpipe, Streams, Time Travel), Databricks Lakehouse
    Big Data & Processing: PySpark, Spark Structured Streaming, Delta Lake (ACID, Schema Evolution), Hadoop, Hive, Sqoop
    Orchestration & MLOps: Apache Airflow (50 DAGs, TaskFlow), MLflow (Model Registry), dbt Cloud/Core, Great Expectations
    DevOps & IaC: Docker, Kubernetes (EKS), Terraform, GitOps, Azure DevOps, CI/CD Pipelines, Pytest
    Data Governance: Unity Catalog, Metadata-Driven ETL, Data Lineage, Rejection Handling, Schema Registry
    Programming: Python (Pandas, Scikit-learn, Flask), SQL, Bash Scripting
    BI & Analytics: Power BI, SSRS, Tableau (Intermediate), Predictive Analytics
    Leadership: Team Handling, Agile (Jira), Stakeholder Management, US Client Delivery

Certification

  • AWS Certified Developer - Associate

Timeline

Lead Data Engineer

Agilon Health
09.2021 - Current

Senior Technical Lead

ATMECS
11.2019 - 09.2021

Team Lead

Agreeya Solution
02.2017 - 08.2019

Team Lead

Ciber
11.2014 - 02.2017

Application Consultant

IBM
08.2011 - 11.2014

Software Engineer

Accenture
08.2010 - 08.2011

Software Engineer

09.2007 - 08.2010

Bachelor of Science - Computer Science, Statistics, Maths

Banaras Hindu University

Master of Science - Computer Science

Banaras Hindu University
RAKESH SINGH