Project: Cloud Migration across AWS environments
- Tech Stack: Databricks ETL Pipelines, AWS, Data Warehousing, CDOCS, Python, PySpark, SQL, GxP
- Migrated clinical modeling workloads from Client-hosted infrastructure to Vendor-hosted cloud
- Reducing maintenance overhead and improving scalability.
- Defined ETL pipelines for ingestion and transformation of clinical trial data under GxP standards
- Created HLDs and data flow diagrams to support regulatory audit documentation
- Acted as the primary liaison with QA and Validation teams to ensure compliance and CAPA coverage
Project: Data Engineering & Compliance
- Tech Stack: Databricks ETL Pipelines, PySpark, AWS, Python, CDOCS, Data Security
- Supported ongoing enhancements and KTLO tasks for biomarker data pipeline.
- Led delivery for Security tool - Privacera Sunset release.
- Designed scalable data ingestion workflows for high-throughput genomic data
- Proposed automation enhancements that improved validation turnaround time by 30%
Validation Leadership across GxP Projects
- Led test case creation, execution, and deviation handling for validated systems
- Ensured audit readiness by managing controlled documentation in CDOCS
- Coordinated functional testing with QA for validated systems hosted on AWS