Big Data Engineer with 5+ years of experience in building and optimizing large-scale data platforms using PySpark, Hadoop, Spark Streaming, and Kubernetes. Proven track record of improving pipeline efficiency by 50% and leading migrations from on-prem to cloud platforms, including AWS and Microsoft Fabric. Hands-on with containerization, CI/CD, and platform observability.
- Academy Accreditation - Generative AI Fundamentals- Databricks
- Google Cloud Skills Boost – Perform Foundational Data, ML, and AI Tasks in Google Cloud
- Google Cloud Skills Boost – Serverless Data Processing with Dataflow
- Use Apache Spark in Microsoft Fabric
- Use real-time intelligence in MS Fabric