Bharati P

Rourkela

Summary

Dynamic and results-driven Data Engineer with 4 years of experience crafting high-performance data pipelines and optimizing cloud architectures to deliver impactful cost-saving solutions. Successfully reduced storage overhead and boosted system performance, cutting production and maintenance costs. Skilled in data analysis, MySQL, PySpark, AWS, and Qlik, with a sharp focus on scalability and operational efficiency. A fast learner with a passion for innovation, currently pursuing AWS Machine Learning and Databricks certifications to stay ahead of the curve. Eager to apply my skills to unlock growth opportunities, drive data-powered decision-making, and fuel company success through creative, data-driven solutions.

Overview

4 years of professional experience
1 Certification

Work History

Data Engineer/PySpark Developer

Accenture Pvt LTD
Hyderabad
02.2023 - Current
  • Migrated AWS EMR to EMR Serverless, designing and automating infrastructure using the AWS Boto3 library.
  • Tested over 900 jobs, optimizing performance, ensuring data accuracy, and reducing execution times.
  • Developed migration and data validation scripts for the Seattle Police Department using MySQL and AWS Athena, ensuring accurate data transfer between legacy systems and cloud platforms.
  • Cleansed and optimized the criminal records database by implementing fuzzy matching with the Jaro-Winkler algorithm, improving data accuracy and match probability for individuals and addresses.
  • Served as a PySpark developer for the California State Automated Welfare System (CalSAWS), delivering reports across multiple domains (state, fiscal, administrative) and streamlining data processing and reporting.
  • Resolved critical system defects within tight deadlines, performing extensive regression and unit testing to maintain system integrity and functionality.
  • Collaborated with business analysts and onshore teams to gather requirements, design reports, and ensure accurate data validation between source and target systems, improving reporting efficiency.
  • Led the architectural design and upgrade of EMR systems, ensuring scalability and enhanced performance during key system upgrades.
  • Designed and developed Spark Delta Lake 3.0-based vacuum jobs, automating weekly S3 bucket cleanups to reduce storage costs by 44% and enhance performance by compacting files in bulk into fewer, more manageable ones.
  • Optimized SQL queries and database schemas for performance improvements in data retrieval operations and conducted data analysis using SQL and Python to derive insights and support decision-making processes.
  • Designed, constructed, and maintained scalable data pipelines for data ingestion, cleaning, and processing using Python and SQL, and automated data quality checks and error handling processes to ensure the integrity and reliability of datasets.
  • Delivered complex System Change Requests (SCRs), managing dynamic requirements and tight deadlines, while designing over 150 test cases to ensure high code quality and reliability.
  • Monitored data system performance, identifying bottlenecks and implementing solutions to maintain system efficiency. Maintained cloud-based data infrastructure on AWS to enhance data storage and computation capabilities.

Python Developer

Wipro Ltd.
Bangalore
09.2020 - 10.2022
  • Translated business requirements into efficient code designs, achieving minimal regression and maximizing reliability.
  • Developed Python scripts to automate data processing tasks and implemented object-oriented programming in Python to build custom classes and functions.
  • Resolved application faults, vulnerabilities, catastrophic issues, and critical defects identified by customers and the QA team; managed escalations; attended security and technical webinars; and collaborated with internal and cross-functional teams for timely delivery.
  • Independently designed and developed feature enhancements for Data Loss Prevention, Anti-Malware Protection, URL Detection, and Content Scanning for the Email Security Appliance.
  • Partnered with leads, business analysts, SMEs, and technical architects on requirement analysis, design, and development of scalable, consistent solutions, ensuring thorough static analysis and unit testing.
  • Used Python, Perforce, and Linux to implement change control methodologies, minimizing disruptions for end-users during software enhancements.

Education

Bachelor of Technology - I.T.

National Institute of Science & Technology
Odisha
01.2020

Intermediate

D.A.V Public School
01.2016

Matriculation

St. Gregorious School
01.2014

Skills

  • Python and Data Processing with PySpark
  • Apache Spark and Delta Lake Enterprise
  • MySQL, Oracle, PostgreSQL, SQL Server
  • AWS - S3, Athena, Glue, EMR, EMR Serverless
  • Qlik Sense, NPrinting
  • Git, Perforce, Thunderbird, Confluence, Technical Writing for Release Notes and User Guides
  • Debugging, Data Analysis, ETL Process, Database Management, Generative AI

Certification

  • AWS Solutions Architect - Associate - July 2024 - July 2027
  • Android Application Development - NIIT
  • 5 Star HackerRank SQL Developer
  • AWS Machine Learning Specialty - Pursuing
  • Databricks Data Engineering - Associate - Pursuing

Languages

English, Hindi, Telugu, Odia

Accomplishments

  • Accenture - CalSAWS - Star of the Month, September 2024
  • Safecity Champion at Red Dot Foundation
