Results-driven Data Engineer with 5+ years of experience in designing, building, and optimizing scalable data pipelines and cloud-based data platforms. Strong expertise in Python, SQL, Apache Spark, and Databricks, with hands-on experience across AWS and Azure environments. Proven ability to handle batch and real-time data ingestion, implement robust ETL/ ELT workflows, and deliver high-quality, analytics-ready datasets. Experienced in data modeling, performance tuning, data security, and governance, with a strong focus on enabling data-driven business decisions.
Overview
6
6
years of professional experience
Work History
Data Engineer
GLENEAGLES HEALTHCARE INDIA PRIVATE LIMITED
01.2025 - Current
Data Engineer
TCS
03.2022 - 01.2025
Designed and implemented scalable ETL/ ELT pipelines using Python, SQL, Spark, and Airflow for ingesting transactional and behavioral data.
Migrated data from on-premise databases and Hadoop to cloud data warehouses such as Snowflake/ Redshift/ BigQuery.
Developed data models using star and snowflake schemas to support reporting and analytics use cases.
Implemented real-time data ingestion using Kafka for order and user activity events.
Ensured data quality through validation checks, reconciliation, and automated monitoring.
Optimized query performance and reduced data processing cost through partitioning and indexing strategies.
Worked closely with BI and analytics teams to enable dashboards in Power BI/ Tableau.
Implemented security controls including IAM, role-based access, and data masking for sensitive customer data.
Used Git and CI/ CD pipelines for version control and automated deployment of data pipelines.
Junior Technical Consultant (Data Engineer)
FOIWE INFO GLOBAL SOLUTIONS PRIVATE LIMITED
03.2020 - 04.2021
Designed and developed scalable ETL/ ELT pipelines using Python, SQL, Spark, and Airflow for ingesting POS, inventory, and supplier data.
Built and optimized data models using star and snowflake schemas for sales, product, store, and customer analytics.
Implemented near real-time data ingestion using Kafka for transaction and inventory events.
Migrated data from on-premise databases and Hadoop to cloud data warehouses such as Snowflake/ Redshift/ BigQuery.
Applied data quality checks, reconciliation, and monitoring to ensure accuracy and consistency of sales and inventory data.
Optimized query performance through partitioning and indexing, improving report performance and reducing processing cost.
Implemented security and access controls using IAM and role-based permissions for sensitive business data.
Collaborated with BI teams to enable dashboards and reports in Power BI/ Tableau for sales, supply chain, and customer insights.
Set up CI/ CD pipelines and Git-based version control for automated deployment of data pipelines.