Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic

Santoshini Pradhan

Bangalore

Summary

Ambitious and strategic data architect with 13+ years of experience working with Databricks, AWS, Spark. My focus on data integrity and cross-functional integration across all company departments has consistently boosted efficiency by >25% and optimized data pipelines. Looking forward to bringing my skills in management and data analytics/analysis to solve problems, develop new platforms, and construct data pipelines.

Overview

14
14
years of professional experience
1
1
Certification

Work History

Staff Data Architect

GE Vernova
Bangalore
02.2017 - Current
  • Led the migration from Greenplum to databricks using Apache spark, airflow, elastic search and S3, resulting in an annual cost savings of 10% and an increase in performance of 14%
  • Built data model for Sourcing , logistics and material management space providing robust data foundation and better scalability by creating reusable codebase for 12 different ERPs
  • Used Python, SQL and ReactJS to collaborate with one intern and a junior data engineer to create a homegrown code review tool that eliminated the manual review for data architects (480 hours per year)
  • With additional responsibility of product owner , gathered business requirements and defined enabler KPIs for each PI on quarterly basis to provide visibility to 50K USD benefits

Technology Consultant

Deloitte USI
Bangalore
10.2013 - 02.2017
  • Created python code using dataframes for API data ingestion to S3, involved in converting SQL queries into spark transformations using Spark RDDs and python
  • Enhanced existing data model to include Infant Care in the healthcare data warehouse in lieu with Reporting user requirements
  • Automated quarterly audit process to ensure EDW data is matching with the source systems for critical reporting metrics and to compare the foundation layer and presentation layer

Data Analyst

Tech Mahindra
Bangalore
09.2010 - 10.2013
  • Worked with client to understand business needs and translated 50+ custom SQLs into ODS and the ODS layer served many actionable reports in Tableau, saving 17 hours of manual work each week
  • Optimized the long running jobs(Informatica) and DB (Teradata) using Teradata utilities like Bteq, Mload, Fload, Tpump which brought down the overall runtime by 45%
  • Designed reusable Unix script to collate workflow duration, load stats, volumetric from session logs
  • Led data certification projects to increase DQI score and automated cleansing rules via stored procedures

Education

Master in Computer Application -

Utkal University
Odisha, India
05-2012

Skills

Databricks
Spark
AWS Services (S3,Glue,Lambda)
Scheduling - Apache Airflow, Ctrl M
ETL - Talend, Informatica
DB - PostgreSQL, Teradata, Oracle
data modelling - Erwin, drawio

Accomplishments

  • Awarded with CDO team award for my contribution on Code checker tool
  • Have received “Spotlight” award in GE for successfully migrating Sourcing360 app from on-prem to AWS cloud.
  • Presented with ‘Applause’ award in Deloitte for consistent performance.
  • Awarded with ‘Bravo’, ‘Pat on Back’, ‘STAR’ performer awards in Tech Mahindra for proactive identification and addressing business challenges.
  • Star Volunteer Award for contribution towards Mahindra Satyam foundation initiatives for Society.

Certification

Databricks Data engineer Associate
Databricks Generative AI fundamentals
AWS Cloud Practitioner
SAFe PO PM

Timeline

Staff Data Architect

GE Vernova
02.2017 - Current

Technology Consultant

Deloitte USI
10.2013 - 02.2017

Data Analyst

Tech Mahindra
09.2010 - 10.2013

Master in Computer Application -

Utkal University
Santoshini Pradhan