Summary
Overview
Work History
Education
Skills
Websites
Certification
Languages
Honor Awards
Timeline
Generic
Dhiraj Narkar

Dhiraj Narkar

Navi Mumbai

Summary

Results-driven Data Engineer and BI Specialist with over 14 years of experience delivering high-impact data solutions, business intelligence platforms, and advanced analytics across pharmaceutical and IT domains. Currently leading data analytics initiatives at Roche, specializing in end-to-end data lineage mapping, Python-based data pipelines, and Tableau visualization solutions.

Expert in Python development for ETL automation, predictive modeling (Prophet, SARIMAX, XGBoost), FastAPI microservices, and data quality validation. Extensive experience with Tableau for creating complex dashboards, KPI scorecards, and interactive visualizations that drive operational insights.

Proven expertise in data lineage and impact analysis, establishing traceability frameworks across AWS-based data pipelines (Glue, Lambda, Athena), Vertica data warehouses, and downstream BI applications. Skilled in documenting data flows, identifying dependencies, and supporting migration strategies for enterprise-scale projects.

Adept at building scalable data architectures, optimizing SQL performance, managing cross-functional teams, and aligning technical solutions with business objectives to enable data-driven decision-making.

Overview

15
15
years of professional experience
1
1
Certification

Work History

Data Analytics Lead

Roche
12.2021 - Current

• Conducted comprehensive impact analysis and data lineage mapping across data pipelines, tracking data flow from source systems through AWS Glue ETL, Vertica data warehouse, to downstream Tableau dashboards and FastAPI services, documenting dependencies for customer review and migration projects.

• Developed Python-based forecasting solutions using Prophet, SARIMAX, Holt-Winters, and XGBoost to predict lab sample volumes and turnaround times, implementing custom utilities for model training, evaluation, automated forecast generation, and model versioning.

• Designed and deployed scalable data pipelines using AWS Glue and Apache Airflow to automate ingestion, transformation, and orchestration of lab operational data with integrated logging and exception handling.

• Built FastAPI-based microservices in Python to serve processed data and forecasting results to React UI applications, ensuring low-latency, high-availability APIs with robust error handling.

• Developed complex Tableau dashboards and KPI scorecards using Tableau Desktop, implementing advanced statistical methods including standard deviation, moving averages, and statistical process control to detect instrument errors and trend anomalies.

• Created Turn Around Time (TAT) monitoring dashboards in Tableau to track test and order volumes against threshold benchmarks, enabling proactive performance management and operational insights.

• Designed comprehensive QC analytics dashboards displaying Bias%, Coefficient of Variance%, Total Error metrics (90%/95%), QC result counts, error tracking, mean, and standard deviation calculations with interactive filters and parameters.

• Established data lineage documentation frameworks to trace data transformations from source databases through ETL processes, Vertica flat tables, and FastAPI endpoints to Tableau visualizations, ensuring traceability, compliance, and impact assessment capabilities.

• Performed data quality validation by comparing Tableau dashboard outputs with source database queries, ensuring accuracy and implementing automated Python-based validation scripts.

• Optimized Vertica SQL queries and designed denormalized flat tables for high-volume data processing, supporting forecasting and statistical aggregations at daily and hourly granularity, significantly reducing dashboard query execution time.

• Successfully migrated Tableau Server from single-node to multi-node architecture, ensuring enterprise scalability, high availability, and improved performance for reporting needs.

• Delivered prototypes, proof-of-concepts, and requirement estimates for large-scale projects and system migrations, collaborating with onshore teams and stakeholders.

• Led and coached a team of BI analysts, facilitating sprint planning and backlog refinement in Agile environments, fostering technical development and ensuring delivery excellence.

• Collaborated with UI developers to design interactive forecast visualization dashboards with confidence bands, average benchmarks, and drill-down capabilities.

• Deployed end-to-end machine learning and data solutions on AWS infrastructure (Lambda, S3, Athena, Glue), enabling near real-time insights for lab operations teams.

Senior System Analyst

IBM India Pvt. Limited
08.2018 - 12.2021
  • As a BI lead, Perform the impact analysis and share it with customer and submit prototype, Proof of concept for larger projects and migration to new systems.
  • Assist Technical Architect for System understanding and providing use-cases.
  • Involved in Sprint planning and backlog refinement process
  • Creating user stories and updating story points and assigning it to the individuals
  • Providing the estimates for the requirement to onshore team
  • Developed complex dashboards & KPI scorecards in Tableau using Tableau Desktop.
  • Experience in Migrating Dashboards from Netezza DB to Athena (AWS) DB
  • Connected Tableau server to publish dashboard to a central location for portal integration.
  • Data quality testing by comparing output of dashboard with database using manual SQL
  • Publishing dashboards on server and giving access to users
  • Designed, developed and tune queries in Complex SQL, which handle Large volume of data

Software Lead II

Rolta India Pvt. Limited
09.2017 - 08.2018
  • Sr. Tableau & Cognos developer

Consultant

Capgemini India Pvt. Limited
12.2015 - 09.2017
  • Sr. Cognos developer

Software Engineer II

Aon Hewitt
09.2014 - 12.2015
  • Cognos developer

Sr Software Engineer

Capgemini India Pvt. Limited
03.2011 - 09.2013
  • ETL & Cognos developer

Associate

Cognizant
09.2013
  • Cognos developer

Education

BE - Computer Science

University of Mumbai
06.2010

Skills

  • Tableau
  • Cognos
  • Power BI
  • SQL
  • Data Analytics
  • Data Warehousing
  • Vertica
  • AWS Athena
  • DB2
  • Oracle PL-SQL
  • Agile
  • JIRA
  • Data pipeline design
  • Predictive modeling
  • Data visualization
  • SQL optimization
  • Team leadership
  • Forecasting models
  • AWS glue ETL management
  • Apache Airflow
  • FastAPI
  • Python
  • AWS (S3, Lambda, Athena)

Certification

  • Microsoft Certified Azure Data Engineer
  • SAFe 5 Practitioner

Languages

  • English
  • Marathi
  • Hindi

Honor Awards

  • Star of the Month Award for April 2021
  • Star of the Month Award for April 2020
  • Spot award in August 2012 & September 2012
  • Project Star award in November 2012
  • Plaudit Award in December 2014 & May 2015
  • Star Award in Jun 2017

Timeline

Data Analytics Lead

Roche
12.2021 - Current

Senior System Analyst

IBM India Pvt. Limited
08.2018 - 12.2021

Software Lead II

Rolta India Pvt. Limited
09.2017 - 08.2018

Consultant

Capgemini India Pvt. Limited
12.2015 - 09.2017

Software Engineer II

Aon Hewitt
09.2014 - 12.2015

Associate

Cognizant
09.2013

Sr Software Engineer

Capgemini India Pvt. Limited
03.2011 - 09.2013

BE - Computer Science

University of Mumbai
Dhiraj Narkar