

Results-driven Data Engineer and Analyst with 4+ years of experience designing, building, and optimizing scalable data pipelines and cloudbased solutions. Strong expertise in Python, SQL, Apache Airflow, PySpark, AWS Glue, Snowflake, FastAPI, and distributed data systems. Ability to deliver high-performance ETL pipelines, real-time APIs, and analytics solutions that improve data reliability, efficiency, and business outcomes.
Cloud Platforms:
AWS (S3, Lambda, Glue, Redshift, Athena, DynamoDB, EMR, SNS, CloudWatch),
Google Cloud Platform (BigQuery, Cloud Storage, Compute Engine)
Data Engineering & Big Data:
PySpark, Apache Spark, Apache Airflow, Databricks, Hive, Apache Kafka,
ETL/ELT Pipelines, Data Transformation, Workflow Orchestration (Airflow, Step Functions)
Data Warehousing & Databases:
Snowflake, Amazon Redshift, BigQuery, Trino, PostgreSQL, MySQL
Programming Languages:
Python, JavaScript
Backend & API Development:
FastAPI, Django, REST APIs, API Integration, Asynchronous Processing
Business Intelligence & Analytics:
Power BI, Tableau
DevOps & Containers:
Docker, Kubernetes, Podman, Git, GitLab CI/CD
Data Quality & Monitoring:
Schema Validation, Data Validation, Monitoring, Alerting, Pipeline Failure Handling