Summary
Overview
Work History
Education
Skills
Accomplishments
Projects
Timeline
Generic

Gitesh Gawai

Pune

Summary

Results-driven Data Engineer & AI/ML Professional with 5+ years of experience in designing scalable data pipelines, building NLP & ML solutions, developing Power Platform applications, and implementing cloud-based architectures. Proficient in GCP, AWS, Microsoft Fabric, SQL, Python, and full-stack development. Strong background in BPO analytics, healthcare IoT analytics, and enterprise reporting integrations.

Overview

7
7
years of professional experience

Work History

Sonata Softwares
Bangalore
10.2023 - Current
  • Developed web application for tracking resolution status and customer sentiment of service calls.
  • Integrated application with GenAI (Anthropic) to enhance functionality.
  • Managed backend processes, ensuring robust architecture and scalability.
  • Deployed applications on AWS Lambda for efficient serverless operations.
  • Utilized AWS S3 for optimized file storage and management.
  • Enabled seamless uploading, searching, and categorization of transcript files.
  • Maintained performance metric calculations for PPPC project, tracking operational KPIs.
  • Contributed to SIRIUS project by building end-to-end data pipeline for training and HR integration.

Data Scientist – AI

Reality Premedia Services
Pune
04.2022 - 05.2023
  • Detection of False Alarm is a key element for modern Health Care services as it can significantly contribute to a continuing effort of service quality improvement.
  • In order to meet use expectation and achieve higher quality levels, health care need to develop a specific mechanism of health care (sensor devices) measurement.
  • This study examines the underlying forces of sensors based data influences on person present in a particular room.
  • Responsibilities: Design and conduct examinations to measure the learning outcome of participants, Identify Solve interesting problems involving rich datasets in various domains, Identify key emerging trends in the industry and maintain a rich reference material, Identify key reporting metrics and create dashboards to enable quick decision making.

Project’s Scientist -AI

Amdocs
Pune
08.2018 - 04.2022
  • Music plays a very important role in people’s lives. Music brings like-minded people together and is the glue that holds communities together.
  • The aim of this project is to build a machine learning model which classifies music into its respective genre.
  • Responsibilities: Search for ways to get new data sources and assess their accuracy, Assess the effectiveness of new data sources and data gathering techniques, Coordinate with different functional teams to implement models and monitor outcomes, Develop processes and tools to monitor and analyze model performance and data accuracy.

Education

Bachelor of Science - Computer Engineering

Amravati University
Akola
06-2016

Skills

Programming: Python, C#, SQL, JavaScript
Data Engineering: BigQuery, Dataflow, MySQL, MSSQL, ETL Pipelines, Incremental Loads
Cloud: GCP, AWS (Lambda, S3), Microsoft Fabric
AI / ML: NLP, Sentiment Analysis, Clustering, Classification, Anomaly Detection
Frameworks: ASPNET Web API, Reactjs, Flask, Spring Boot
Power Platform: Power Apps, Power BI
Tools: Git, Azure Storage, MS Word Automation, Anthropic GenAI
Other: Stored Procedures, KPI Calculation, Data Visualization, Word Cloud Generation

Accomplishments

Best teammate for working and delivering the requirements for the customer.

Projects

Project: Call Transcript Analysis Platform

Tech: C#, ASP.NET Web API, React.js, AWS Lambda, AWS S3, Anthropic GenAI

  • Developed a GenAI-powered web application to analyze customer service call transcripts.
  • Integrated Anthropic GenAI for automated sentiment and resolution insights.
  • Designed backend architecture and deployed scalable workloads on AWS Lambda.
  • Implemented transcript upload, search, categorization, and multi-file consolidated reporting.
  • Utilized Amazon S3 for secure transcript storage and retrieval.
Microsoft (Client Project) Project: User Feedback Analytics & Reporting Solution

Tech: Power Apps, Microsoft Fabric, NLP, Word Automation

  • Built Power Apps to capture and manage user feedback across business units.
  • Used Microsoft Fabric for scalable data processing and integration.
  • Applied NLP to extract sentiment, themes, and frequently occurring pain points.
  • Created word cloud visualizations to highlight recurring issues.
  • Automated MS Word report generation with insights and actionable recommendations.
BPO/ITES Client – SIRIUS Platform

Role: Data Engineer / SQL Developer / Python Developer
Duration: [Add Duration]

Project: SIRIUS – Batch Management & Attrition Analytics

Tech: MySQL, BigQuery, GCP, Python, Dataflow, Spring Boot

  • Built end-to-end data pipelines from on-premises systems to Google Cloud using Python & Dataflow.
  • Designed optimized SQL views and DW layers (dwh_batch_info, dwh_trainee_hr_info, dwh_training_phase_info).
  • Implemented surrogate keys, merge keys, and incremental load frameworks.
  • Developed CTE-heavy SQL views for RAG scoring, attendance summary, dispute tracking, and attrition analytics.
  • Integrated Spring Boot microservices to expose batch metrics to dashboards.
  • Ensured zero-duplicate data through strict join logic, deduplication, and optimization.
  • Delivered models supporting dashboards through pivoted views and multi-level unions.
PPPC Project (Contact Center Operations)

Role: SQL Developer / Reporting Analyst
Duration: [Add Duration]

Project: PPPC – Performance Metrics & Forecast Dashboard

Tech: MSSQL, Power BI, Stored Procedures

  • Designed & validated KPI logic (QA Score, Quality Variance %, HTF %, etc.).
  • Developed dynamic update scripts with NULLIF to prevent divide-by-zero errors.
  • Automated ingestion into key tables (PPPC_MetricMasterPBI, PPPC_WeeklyPerformance).
  • Fixed schema validation issues (SQL71501), unresolved references, cross-database joins.
  • Optimized SQL for integration with Power BI dashboards.
  • Ensured data quality through cleansing, deduplication, and mapping validation.
Project: False Alarm Detection (Healthcare – IoT)
  • Built ML models to identify real vs. false alarms triggered by healthcare IoT sensors.
  • Conducted detailed EDA and developed dashboards for insights.
Project: Anomaly Detection (Healthcare – Sensor Data)
  • Processed historical Azure sensor data and applied clustering (K-Means) for behavior pattern detection.
  • Identified and calculated outlier percentages to improve classification accuracy.
Project: Music Genre Classification (Entertainment)
  • Developed ML models to classify music tracks into genres and monitored model performance.
Project: E-commerce Sentiment Analysis
  • Developed sentiment analysis models to analyze customer reviews and provide insights for product improvement.
  • Built predictive models to optimize customer experience and retention.

Timeline

Sonata Softwares
10.2023 - Current

Data Scientist – AI

Reality Premedia Services
04.2022 - 05.2023

Project’s Scientist -AI

Amdocs
08.2018 - 04.2022

Bachelor of Science - Computer Engineering

Amravati University
Gitesh Gawai