

Results-driven Data Engineer and BI Specialist with over 14 years of experience delivering high-impact data solutions, business intelligence platforms, and advanced analytics across pharmaceutical and IT domains. Currently leading data analytics initiatives at Roche, specializing in end-to-end data lineage mapping, Python-based data pipelines, and Tableau visualization solutions.
Expert in Python development for ETL automation, predictive modeling (Prophet, SARIMAX, XGBoost), FastAPI microservices, and data quality validation. Extensive experience with Tableau for creating complex dashboards, KPI scorecards, and interactive visualizations that drive operational insights.
Proven expertise in data lineage and impact analysis, establishing traceability frameworks across AWS-based data pipelines (Glue, Lambda, Athena), Vertica data warehouses, and downstream BI applications. Skilled in documenting data flows, identifying dependencies, and supporting migration strategies for enterprise-scale projects.
Adept at building scalable data architectures, optimizing SQL performance, managing cross-functional teams, and aligning technical solutions with business objectives to enable data-driven decision-making.
• Conducted comprehensive impact analysis and data lineage mapping across data pipelines, tracking data flow from source systems through AWS Glue ETL, Vertica data warehouse, to downstream Tableau dashboards and FastAPI services, documenting dependencies for customer review and migration projects.
• Developed Python-based forecasting solutions using Prophet, SARIMAX, Holt-Winters, and XGBoost to predict lab sample volumes and turnaround times, implementing custom utilities for model training, evaluation, automated forecast generation, and model versioning.
• Designed and deployed scalable data pipelines using AWS Glue and Apache Airflow to automate ingestion, transformation, and orchestration of lab operational data with integrated logging and exception handling.
• Built FastAPI-based microservices in Python to serve processed data and forecasting results to React UI applications, ensuring low-latency, high-availability APIs with robust error handling.
• Developed complex Tableau dashboards and KPI scorecards using Tableau Desktop, implementing advanced statistical methods including standard deviation, moving averages, and statistical process control to detect instrument errors and trend anomalies.
• Created Turn Around Time (TAT) monitoring dashboards in Tableau to track test and order volumes against threshold benchmarks, enabling proactive performance management and operational insights.
• Designed comprehensive QC analytics dashboards displaying Bias%, Coefficient of Variance%, Total Error metrics (90%/95%), QC result counts, error tracking, mean, and standard deviation calculations with interactive filters and parameters.
• Established data lineage documentation frameworks to trace data transformations from source databases through ETL processes, Vertica flat tables, and FastAPI endpoints to Tableau visualizations, ensuring traceability, compliance, and impact assessment capabilities.
• Performed data quality validation by comparing Tableau dashboard outputs with source database queries, ensuring accuracy and implementing automated Python-based validation scripts.
• Optimized Vertica SQL queries and designed denormalized flat tables for high-volume data processing, supporting forecasting and statistical aggregations at daily and hourly granularity, significantly reducing dashboard query execution time.
• Successfully migrated Tableau Server from single-node to multi-node architecture, ensuring enterprise scalability, high availability, and improved performance for reporting needs.
• Delivered prototypes, proof-of-concepts, and requirement estimates for large-scale projects and system migrations, collaborating with onshore teams and stakeholders.
• Led and coached a team of BI analysts, facilitating sprint planning and backlog refinement in Agile environments, fostering technical development and ensuring delivery excellence.
• Collaborated with UI developers to design interactive forecast visualization dashboards with confidence bands, average benchmarks, and drill-down capabilities.
• Deployed end-to-end machine learning and data solutions on AWS infrastructure (Lambda, S3, Athena, Glue), enabling near real-time insights for lab operations teams.