Results-driven Senior Data Engineer with over 8 years of experience in building and optimizing scalable, production-ready data solutions in cloud-native environments (GCP, AWS). Proven success in reducing data pipeline runtimes, driving forecasting accuracy, and delivering high-impact ETL solutions across healthcare and financial domains. Skilled in Python, SQL, BigQuery, Snowflake, Spark, and Airflow. Known for architecting efficient data models, implementing best practices for observability, and enabling data-driven decision-making across enterprise-scale systems.
Programming & Scripting: Python, SQL, PySpark
Cloud Platforms: GCP (BigQuery, Vertex AI, Cloud Storage), AWS (S3, Lambda, Glue)
Data Warehousing: BigQuery, Snowflake, Redshift, Hive, Teradata, Oracle, MS SQL
Big Data & Processing: Apache Spark, Hadoop, YARN
Workflow Orchestration: Apache Airflow
Infrastructure & DevOps: Kubernetes, Docker, GitHub
Data Governance & Observability: Data quality frameworks, metadata management, JIRA, Confluence
Visualization Tools: Tableau, Looker, LucidChart