DAYANAND PALLEMONI

Turbito Infotainment Private Limited

10.2018 - 11.2024

Led the design and enhancement of ETL workflows utilizing PySpark and Spark Scala for efficient data processing.
Architected and implemented scalable data solutions on Google Cloud Platform (GCP).
Built and maintained robust data pipelines for both batch and real-time processing using PySpark and Spark Scala.
Partnered with cross-functional teams to gather business requirements and deliver optimized data solutions.
Optimized Spark jobs to enhance processing efficiency, reduce execution time, and improve resource utilization.
Streamlined data pipeline deployment through automation using Apache Airflow.
Integrated multiple data sources and ensured seamless data ingestion and transformation for analytics.
Developed and optimized complex SQL queries for data extraction, transformation, and reporting.
Ensured data integrity, quality, and compliance with industry standards and best practices.
Conducted root cause analysis and troubleshooting to resolve pipeline failures and improve system reliability.
Implemented performance tuning techniques such as partitioning, bucketing, and caching in Spark to enhance query performance.
Worked with distributed computing frameworks, leveraging Hadoop, Hive, and Spark SQL for big data processing.