Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic

Avinash Khajure

Consultant Data Engineering (AWS | Snowflake)
Pune

Summary

Experienced Data Engineer specializing in scalable data platforms using AWS, PySpark, and Snowflake. Designs high-performance data pipelines and processes multi-terabyte datasets while implementing architectures like Data Mesh and Medallion. Delivers resilient, cloud-native data solutions that enhance engineering practices and drive business value.

Overview

3
3
Certifications
13
13
years of professional experience

Work History

Consultant / Lead Data Engineer

Principal Global Services
04.2021 - Current
  • Built and implemented Data Mesh-based architectures with reusable, domain-oriented data products, enabling teams to manage and scale their own data capabilities more effectively across the organization.
  • Developed an AI-powered orchestration solution using LLMs to automate Snowflake data model generation and publish technical documentation directly to Confluence, improving developer productivity, and reducing manual effort.
  • Designed reusable ingestion and transformation frameworks on AWS and Snowflake, introducing consistent data standards, and automated schema validation for both structured and semi-structured data.
  • Built and managed scalable data pipelines to improve data availability, reliability, and overall platform stability.
  • Implemented automated CI/CD workflows using GitHub Actions, including automated data quality checks to improve deployment reliability, and maintain data integrity.
  • Led platform optimization initiatives by tuning Snowflake workloads, isolating compute workloads, and improving storage strategies, helping to reduce cloud processing costs by 20%.
  • Designed and implemented a multi-region disaster recovery architecture with continuous replication and high availability to support critical business workloads and resiliency requirements.
  • Led the development and enhancement of ETL pipelines, improving processing efficiency, scalability, and data accuracy across multiple workflows.

Cloud Data Engineer

InfoCepts Technologies
12.2014 - 03.2021
  • Played a key role in modernizing legacy on-premise big data platforms by helping to migrate them to AWS and Snowflake-based cloud architectures, while coordinating closely with engineering teams and business stakeholders.
  • Designed and built scalable micro-batch data pipelines using PySpark, AWS Glue, Lambda, and Snowflake to process multi-terabyte event data while meeting strict reporting and analytics timelines.
  • Developed a centralized logging and monitoring framework for core data pipelines, improving observability, and reducing incident resolution time by 30%.
  • Built and improved automated deployment pipelines using Jenkins, introducing version control, rollback mechanisms, and more reliable deployment processes.
  • Worked closely with product, analytics, and business teams to translate complex business requirements into scalable data models and backend data solutions that supported key reporting and product insights.

Oracle Developer

BetaMonks Technology Factory Ltd.
07.2013 - 11.2014
  • Developed complex Oracle PL/SQL procedures, functions, and reports to enhance the functionality of financial and prepaid banking systems.
  • Designed reports to provide actionable, operational insights.

Education

Bachelor of Engineering - Information Technology

Nagpur University
KITS Ramtek (Dt. Nagpur)
05-2012

Skills

AWS, Snowflake & PySpark

Batch & Streaming Data Processing

High-Performance Data Pipelines

Event-Driven Architecture

Medallion Architecture

Accomplishments

  • Strategic Migration & Alignment: Successfully steered the technical validation and modernization of multi-terabyte on-premise data ecosystems to cloud-native architectures, serving as the core technical anchor between engineering squads and executive stakeholders.
  • Reusable Platform Assets: Built modular, enterprise-grade frameworks for ingestion, automated CI/CD pipelines, and proactive observability that significantly accelerated deployment stability and engineering delivery speed.
  • Technical Mentorship: Mentored and coached senior and mid-level data engineers across multiple teams, establishing architecture review boards and contributing to internal best-practice forums to elevate engineering quality.

Certification

AWS Certified Solutions Architect – Associate (Expired, Renewal in Progress)

Timeline

Consultant / Lead Data Engineer

Principal Global Services
04.2021 - Current

Cloud Data Engineer

InfoCepts Technologies
12.2014 - 03.2021

Oracle Developer

BetaMonks Technology Factory Ltd.
07.2013 - 11.2014

Bachelor of Engineering - Information Technology

Nagpur University
Avinash KhajureConsultant Data Engineering (AWS | Snowflake)