Summary
Overview
Work History
Education
Skills
Timeline
Generic

Mihir Sharma

Jaipur

Summary

Results-driven Data Engineer with 3+ years of experience designing and optimizing scalable data pipelines and architectures using Big Data technologies including Hadoop, Apache Spark, and Python. Proficient in ETL processes and data warehousing solutions, prioritizing data quality and integrity. Experienced in creating interactive dashboards with Tableau and Power BI for actionable insights. Skilled in implementing CI/CD practices, leveraging AWS and GCP cloud platforms, and applying data governance principles for compliance and security. Adept at collaborating with cross-functional teams to identify data needs and develop robust solutions that drive business performance.

Overview

3
3
years of professional experience

Work History

Lead Assistant Manager

EXL Services
09.2023 - Current
  • Engaged with clients to gather and document requirements, ensuring data solutions aligned with business objectives and healthcare standards (HL7, FHIR, CCDA).
  • Led the development of scalable data pipelines using Apache Spark, Hive, and Airflow, improving ETL operations and reducing processing times by 30%.
  • Designed and implemented data architectures to enhance interoperability and compliance, streamlining data management.
  • Spearheaded end-to-end data pipelines using Google Dataflow and Apache Beam, increasing efficiency and cutting operational costs.
  • Managed and mentored offshore engineers, focusing on automated, scalable data solutions using Google Cloud Dataflow and Apache Spark.
  • Orchestrated workflows and pipelines with Airflow, Jenkins, and Cloud Composer for seamless data integration and timely reporting.
  • Resolved production issues, improving system stability and reducing downtime by 25%.
  • Delivered real-time insights using Looker and Tableau to support business decisions.
  • Optimized data transformations with SQL and Python, enhancing processing speed by 20%.
  • Implemented Pub/Sub for low-latency data streaming, further improving data processing times.
  • Enhanced Tableau and Power BI dashboard performance by refining data models and optimizing queries.
  • Ensured data security and regulatory compliance, including encryption and access control management

Data Engineer

WeHeal
06.2022 - 06.2023
  • Built and optimized ETL pipelines using Hadoop and Apache Spark for large-scale data processing, improving efficiency and retrieval times.
  • Designed and maintained relational databases and contributed to the development of data lakes to enhance data accessibility.
  • Utilized SQL and Python for ETL processes, reducing data retrieval times by 30%.
  • Developed interactive dashboards using Tableau and Power BI to provide real-time insights for decision-making.
  • Implemented CI/CD pipelines to automate data integration and deployment, improving delivery timelines and reducing manual intervention.
  • Monitored and troubleshot data pipelines, ensuring continuous performance and reliability.
  • Collaborated with data scientists to build scalable data solutions and documented workflows for future use.

Associate Consultant

Atrium
05.2021 - 05.2022
  • Developed scalable ETL pipelines using Python to ingest, process, and load data into data lakes and data warehouses.
  • Managed and analyzed large datasets using BigQuery, Redshift, and Snowflake to optimize data processing.
  • Built data models using dbt, ensuring consistent and efficient data transformations.
  • Designed and optimized SQL queries for data extraction, providing actionable insights to stakeholders.
  • Implemented best practices for data governance and security, ensuring compliance with industry standards.
  • Worked within AWS and GCP environments, optimizing multi-cloud infrastructure for data operations.
  • Monitored and optimized data pipelines, troubleshooting issues to maintain performance and reliability.

Education

Bachelor's of Technology in Computer Science -

JECRC University
08.2021

Skills

  • Big Data & ETL Tools: Hadoop, Apache Spark, Hive, Airflow, Docker, Kubernetes
  • Programming & Data Processing: Python, SQL, BigQuery, Redshift, Snowflake
  • Data Visualization & Modeling: Tableau, Power BI, dbt, Data Governance
  • Cloud & Monitoring: AWS, GCP, Grafana, PromQL

Timeline

Lead Assistant Manager

EXL Services
09.2023 - Current

Data Engineer

WeHeal
06.2022 - 06.2023

Associate Consultant

Atrium
05.2021 - 05.2022

Bachelor's of Technology in Computer Science -

JECRC University
Mihir Sharma