Sai Purushoth G

Coimbatore

Summary

As an AWS-certified data engineer, I excel in independently handling tasks from start to finish, offering creative solutions to complex problems, and enhancing system efficiency. I collaborate effectively to guide teams towards timely completion of tasks. I thrive in dynamic environments where my problem-solving skills contribute significantly.

Overview

3 years of professional experience
1 Certification

Work History

Data Engineer

Presidio Cloud Solutions
Coimbatore
08.2023 - Current

EHE Health - Preventive Healthcare Company

  • Received the "Above and Beyond" award for leading the team, guiding them in the right direction, and providing the right solutions to the customer.
  • Automated the process of updating patient eligibility information, reducing manual processing time by approximately 12 hours and minimizing errors. Utilized AWS services (S3, Lambda (Python), Glue (PySpark), and RDS (MySQL)) in an event-based architecture, ensuring efficient operations.
  • Migrated Java-based processes for transmitting patient, exam, and lab-related information to AWS Glue with PySpark, customizing logic for each client to enhance data quality and processing efficiency.
  • Designed and implemented ETL pipelines to migrate data daily from relational databases of microservice applications to data warehouses and Salesforce, incorporating custom schema mapping. Developed and deployed ETL pipelines for loading data from S3 and relational databases into the data warehouse.
  • Created a PySpark-based solution in AWS Glue for generating daily email summaries from a reporting dashboard for EHE leadership.
  • Upgraded AWS Glue from version 2.0 to 4.0, significantly reducing costs and improving data processing performance. Additionally, analyzed the upgrade of MySQL RDS from version 5.7 to 8.0.
  • Collaborated on data modeling for Data Warehouse tables from MySQL RDS to Redshift, with a focus on designing fact and dimension tables for appointment and exam-related datasets.
  • Developed stakeholder dashboards, including "Super" dashboards for all clients and custom solutions for specific clients. Played a key role in troubleshooting ETL pipeline errors and optimizing overall efficiency.

Associate Data Engineer

Presidio Cloud Solutions
Coimbatore
12.2022 - 08.2023

EHE Health - Preventive Healthcare Company

  • Developed a QuickSight migration tool in Python for seamless resource migration between environments, reducing deployment time significantly compared to manual processes.
  • Created a QuickSight backup tool to generate JSON templates for resource backup and restoration, aiding in deletion prevention and facilitating versioning.
  • Designed and implemented stakeholder dashboards in AWS QuickSight by translating complex business logic into SQL queries, providing actionable insights for a large number of stakeholders.
  • Used PySpark to analyze and resolve bugs in the data warehouse ETL pipelines, ensuring data integrity and reliability.
  • Investigated and rectified discrepancies in dashboards with complex SQL queries, enhancing data accuracy and usability.

Associate Software Engineer

Presidio Cloud Solutions
Coimbatore
08.2022 - 11.2022

Event Buzz - Event Management System

  • Developed RESTful APIs with the Serverless Framework (TypeScript) and implemented backend solutions using AWS services (Lambda, API Gateway, AWS Aurora).
  • Designed the schema for an optimized backend.
  • Integrated AWS services including Cognito for authentication, AWS Amplify for backend/frontend coordination, and AWS AppSync for real-time notifications.
  • Contributed to the development and debugging of React UI components.

Associate Engineer Trainee

Presidio Cloud Solutions
03.2022 - 07.2022
  • Engineered a web application for movie ticket booking utilizing Node.js for the backend and Angular for the frontend, ensuring seamless user experience and efficient ticket management.
  • Designed and implemented a microservices architecture for a library management system using .NET Core, facilitating modular and scalable application development while enhancing system performance and maintainability.
  • Completed comprehensive training in Spark, HDFS, and Kafka, gaining proficiency in big data processing and real-time data streaming technologies to contribute effectively to data-driven projects and initiatives.

Education

B.Tech - Computer Science Engineering

SASTRA Deemed University
Thanjavur
07.2022

Skills

Python

PySpark

SQL

Java

JavaScript

TypeScript

HTML & CSS

Certification

Solutions Architect Associate

  • Platform - Amazon Web Services
  • Link - https://www.credly.com/badges/502a7b2b-940d-49a1-b51d-7d413d62a225/public_url

Project Management For Managers

  • Platform - IIT Roorkee on NPTEL
  • Link - https://drive.google.com/file/d/12UuSkS9QuhJpq_ZNFJDyGR-5cfoqCqRb/view?usp=drive_link

FRAMEWORKS AND TOOLS

Tools:
AWS Glue (ETL)
AWS QuickSight
AWS Lambda
AWS Secrets Manager
AWS CloudFormation
AWS CodeCommit
AWS Cognito
AWS AppSync
AWS Amplify

Databases:
AWS Aurora & RDS
AWS DynamoDB
AWS S3

Frameworks:
Apache Airflow
Serverless Framework
Angular
Node with Express JS
Django

SaaS Integration:
Salesforce

CI/CD:
Bitbucket
AWS CloudFormation

Accomplishments

Above and Beyond (Feb 2024)

  • Criteria - Consistent high performance, innovative way of working, collaboration, and customer focus
  • Drive Link - https://drive.google.com/file/d/10zs_VcE0_RiAx3rIw9-4axu6hWplbWBQ/view?usp=drive_link

PROJECTS

CodeDocsGen AI - a VS Code Extension

  • CodeDocsGen AI transforms code documentation using AI, making it easier to generate method documentation in VS Code.
  • This extension integrates seamlessly with your editor, automating documentation creation for cleaner, well-documented code.
  • CodeDocsGen is publicly available to enhance your programming projects.
  • Tech Stack - TypeScript, Amazon Bedrock (Anthropic), AWS Lambda

GitHub Link - https://github.com/hackathon-personal/codedocsgen
