Summary
Overview
Work History
Education
Skills
Certification
Work Preference
Languages
Timeline
ResearchAssistant

Madhura Sail

Big Data Engineer
Mumbai

Summary

Results-driven Big Data Engineer with over 3+ years of hands-on experience in data warehousing, data modelling, and orchestration. Skilled in developing ETL workflows using Pentaho Data Integration and Apache Spark. Adept at translating business requirements into effective solutions, with a strong commitment to continuous learning and the application of best practices in data engineering.

Overview

3
3
years of professional experience
1
1
Certification
5
5
years of post-secondary education

Work History

System Engineer

Infosys Ltd
Pune, Maharashtra
06.2021 - Current
  • ETL: Designed and Implement automated ETL processes to streamline data workflows, ensuring efficient and reliable data ingestion, transformation, and loading.
  • Business Insights: Delivered well-structured and optimized datasets to business analysts, ensuring data integrity and enabling accurate and actionable insights.
  • Solving Data Issues: Enhanced business reporting timelines by 70-80% through effective root cause analysis of data issues, significantly reducing turnaround time for taking further actions.
  • Data Lineage: Maintained comprehensive data lineage to track and document data flow from source to destination, ensuring transparency and supporting data governance.
  • Performance enhancements: Enhanced performance of data pipelines by migrating them from Pentaho Data Integration to Spark framework and achieve 5x faster processing.
  • Data Quality checks: Ensured data quality through comprehensive checks and supported business analyst with required data checks using SQL, especially during the time of BAU.
  • Data flow orchestration: Implemented Control-M data orchestration to streamline data flow, enhancing efficiency and ensuring seamless data processing, reducing 99% manual effort for data refreshes throughout the data lake pipeline.
  • Data View Planning: Increased data analysis effectiveness by harnessing IBM TM1 to create and control data views and build calculations, ultimately optimizing structuring of data and empowering informed business decisions.
  • Impact and Requirement Analysis: Performed Impact Analysis and led the team to analyse business requirements for any new changes to be promoted in the existing system, thereby reducing the turnaround time by 50-60%.
  • Agile Collaboration: Worked in cross-functional teams in an agile framework to collect requirements, leading to successful solution implementation and improved business results.
  • Documenting Processes: Documented processes, data flows and pipeline architecture for easy reference and team collaboration.

Education

Master of Computer Applications (Correspondence) -

Jain University
Bengaluru
11.2021 - 11.2023

Bachelors of Computer Science -

University of Mumbai
06.2018 - 04.2021

Skills

  • Languages & API: Python, SQL, PySpark
  • Data Planning & Analysis: IBM TM1 Planning & Analytics
  • Big Data Frameworks: Pentaho, Apache Spark
  • Database: Hive, SQL
  • Version Control and CI/CD: GitHub, Jenkins
  • Tracking and Documentation: JIRA, Confluence Business Intelligence
  • Visualisation: QlikSense

Certification

Python Certificate February 2022- Present HackerRank

Work Preference

Work Type

Full Time

Location Preference

HybridOn-Site

Important To Me

Work-life balanceCompany CultureFlexible work hoursCareer advancement

Languages

English
Bilingual or Proficient (C2)
Marathi
Bilingual or Proficient (C2)
Hindi
Bilingual or Proficient (C2)

Timeline

Master of Computer Applications (Correspondence) -

Jain University
11.2021 - 11.2023

System Engineer

Infosys Ltd
06.2021 - Current

Bachelors of Computer Science -

University of Mumbai
06.2018 - 04.2021
Madhura SailBig Data Engineer