Joined on 13th August 2024 and completed training in AWS, PySpark, Python, and Oracle SQL, gaining hands-on experience with cloud computing, data processing, and database management.
Proficient in using cloud platforms to build data-driven solutions and optimize data workflows.
Developed a strong understanding of data engineering concepts, including ETL pipelines and big data technologies.
Highly adaptable and quick to learn, with a strong work ethic and excellent problem-solving skills.
A dedicated team player, known for leadership and for driving projects to successful completion.
PySpark Mini Project: Olympic Games Analysis (2008-2016)
Used an ELT (extract, load, transform) approach to ingest Olympic data and modeled it with a Galaxy Schema for efficient analysis. Cleaned and processed large datasets using PySpark, optimizing performance with partitioning and caching. Conducted in-depth analysis of medal counts, country performance, and athlete success across multiple Olympic Games.