
Senior Data Engineer with 8+ years of experience building scalable data platforms, distributed data pipelines, and enterprise-grade data infrastructure. Currently working at Red Hat, focusing on designing and operating robust ETL/ELT systems that support large-scale data and AI-driven use cases.
Expertise in developing pipelines using Python, SQL, and PySpark, along with modern data stack tools such as Snowflake, Databricks, and Starburst/Trino. Strong experience in implementing Medallion Architecture (Raw, Silver, Gold layers) to enable reliable, scalable, and well-governed data processing.
Proficient in data quality frameworks, data governance, and secure data handling including PII adherence.Experienced in building production-grade data systems with CI/CD pipelines using GitLab, ensuring reliability, observability, and performance of data workflows.