Adept Data Engineer with a proven track record at Probo and Orcablue Technologies, showcasing proficiency in ETL development, data pipeline design, and Apache Airflow. Demonstrated ability to enhance data quality and efficiency, leveraging Python and DBT skills. Excelled in creating impactful data visualizations and ensuring data integrity, driving informed decision-making.
Overview
2
2
years of professional experience
Work History
Data Engineer
Probo
Gurgaon
05.2023 - 12.2024
Developed and managed ETL pipelines using Apache Airflow to automate data extraction, transformation, and loading processes, ensuring efficient data flow across systems.
Designed and implemented data models, and optimized data storage solutions, working with relational and NoSQL databases (e.g., PostgreSQL, MySQL, MongoDB) to support high-performance queries and reporting.
Created and maintained scalable, fault-tolerant data pipelines for processing and transforming large volumes of data, ensuring high availability, and data integrity.
Built and deployed data cleaning and processing workflows to handle inconsistencies, missing values, and outliers, improving data quality for downstream analysis.
Developed data visualization dashboards using tools like Tableau, Power BI, or custom visualizations to present key metrics and insights to stakeholders for informed decision-making.
Implemented data governance and security best practices to ensure the privacy and integrity of sensitive data within cloud environments.
Developed and implemented data validation pipelines in Apache Airflow to ensure the correctness of data by performing checks on raw and processed data against predefined validation rules, improving data reliability and consistency across databases.
Implemented data visualization tools like Tableau and Power BI to create dashboards and reports for business stakeholders.
Managed the 2024 General Election vote count for opinion trading in Probo app, along with complete ownership of the daily live YouTube data automation pipeline at Probo.
Data Engineer Intern
Orcablue Technologies
Bangalore
11.2022 - 04.2023
Extracted diverse datasets including RBI balance of payments, EV sales, and EPFO data.
Implemented Data Build Tool (DBT) for precise modeling, establishing specific staging and mart layers.
Extracted Indian election data covering state and central from 1965.
Created ETL scripts to automate manual processes for efficient data loading.
Built dashboards using Tableau or other visualization software to display key metrics in a clear, concise manner.
Education
Bachelor of Technology - Electronics Instrumentation And Control
Thapar Institute of Engineering & Technology
Patiala, Punjab
04-2023
XIIth -
Kairali School
Ranchi, Jharkhand
01.2019
Xth -
St. Thomas School
Ranchi, Jharkhand
01.2017
Skills
Proficient in Python
Data Structures Proficiency
ETL development
Data Schema Development
Data Pipeline Design
Apache Airflow Proficiency
Proficient in Git
DBT Proficiency
AWS Cloud Services
MySQL, Postgres, and NoSQL databases
Docker
OOPS
Networking
Databases
Timeline
Data Engineer
Probo
05.2023 - 12.2024
Data Engineer Intern
Orcablue Technologies
11.2022 - 04.2023
Bachelor of Technology - Electronics Instrumentation And Control
Thapar Institute of Engineering & Technology
XIIth -
Kairali School
Xth -
St. Thomas School
Similar Profiles
Ciara WebsterCiara Webster
Director, Support Services&Contract Administration at AlphaSource Group (Probo Medical, Acquired 1/2024)Director, Support Services&Contract Administration at AlphaSource Group (Probo Medical, Acquired 1/2024)