Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Internships
Timeline
Receptionist
Akash Bolliwar

Akash Bolliwar

Data Engineer (AWS & Generative AI)
Hyderabad

Summary

Results-driven Data Engineer with 5+ years of experience designing and optimizing data pipelines on AWS and big data platforms. Proven expertise in Apache Spark, Python/Scala, and AWS services to deliver scalable ETL solutions. Adept at integrating generative AI capabilities (via Retrieval-Augmented Generation) into data workflows for conversational AI use cases. Demonstrated ability to modernize legacy systems, automate processes, and ensure data integrity - leading to improved performance, enhanced analytics, and significant cost savings.

Overview

5
5
years of professional experience
3
3
Certifications

Work History

Data Software Engineer

EPAM Systems
01.2021 - Current
  • Engineered an AI-driven conversational search solution using Retrieval-Augmented Generation (RAG). Built an OpenSearch index of millions of legal and financial documents and integrated it with LLM-based chatbots, improving answer relevance and reducing lookup time by 30%.
  • Built end-to-end data pipelines on AWS (S3, EMR, Lambda, DynamoDB, Step Functions, API Gateway, OpenSearch) to ingest and process terabyte-scale datasets, enabling real-time analytics and search functionality.
  • Designed scalable indexing and retrieval systems with Elasticsearch/OpenSearch, improving enterprise search response times by 40% and enhancing query relevance.
  • Orchestrated cloud workflows using AWS Step Functions, Lambda, and AWS Batch to automate ETL processes and error handling, reducing manual intervention by 80%.
  • Implemented real-time data migration from on-premises Hadoop (HDFS) to Amazon S3 with zero downtime, ensuring 99.9% data availability during the cloud transition.
  • Migrated on-prem databases from Oracle to Amazon RDS and BerkeleyDB to RocksDB, boosting query performance by 25% and reducing maintenance overhead.
  • Developed Python automation scripts for OpenSearch backups and cluster cost optimization, cutting cloud infrastructure expenses by ~15%.
  • Created custom data extraction and reporting scripts that enabled deeper business insight, speeding up data retrieval for analytics by 50%.
  • Enforced high code quality with CI/CD pipelines (GitHub Actions, AWS CodeBuild/CodePipeline), accelerating deployment cycles by 30% and minimizing errors.
  • Ensured robust data validation at each pipeline stage, maintaining 99+% data accuracy in production ETL outputs.
  • Upgraded legacy Spark 1.6 jobs to Spark 3.x, replacing RDD-based transformations with optimized DataFrame APIs to improve batch processing performance ~2×.
  • Developed and orchestrated data migration workflows from Cloudera Hadoop to AWS, using Spark on EMR and AWS Step Functions to establish a scalable data lake on S3.
  • Processed and transformed large volumes of user data (millions of XML records) to integrate Australia, U.S. and Canadian legal data into unified analytics databases for downstream reporting.

Data Engineer Intern

BridgeI2I Analytics Solutions Pvt. Ltd.
06.2020 - 12.2020
  • Migrated IoT sensor data from MS SQL Server to Azure SQL Database, and automated data ingestion using Python scripts – ensuring seamless integration with Azure Cosmos DB for downstream pipelines.
  • Extracted and transformed semi-structured data from websites and PDFs, feeding into Azure Cosmos DB to enable Power BI analytics and interactive client dashboards.
  • Optimized data processing workflows by refining Python automation scripts using selenium, reducing data extraction and loading time by 20% and improving overall pipeline efficiency.

Education

Bachelor of Engineering - Computer Science And Engineering

MVSR Engineering College
Hyderabad, India
04.2001 -

Diploma - Electronics And Communications Engineering

Govt. Polytechnic College
Hyderabad, India
04.2001 -

SSC -

Z.P. High School
Madnoor, India
04.2001 -

Skills

Apache Spark

undefined

Certification

AWS Certified Developer - Associate

Accomplishments

  • Delivery Excellence Award
  • Pat on the Back
  • Rising Star Nomination, 2020, Nominated

Internships

Bridgelabz Solutions, MumbaiData Engineer Intern (Jan 2020 – May 2020)

Altech Power & Energy Systems, HyderabadSoftware Developer (Intern) (Jan 2019 – Apr 2019)

Timeline

Data Software Engineer

EPAM Systems
01.2021 - Current

Data Engineer Intern

BridgeI2I Analytics Solutions Pvt. Ltd.
06.2020 - 12.2020

Bachelor of Engineering - Computer Science And Engineering

MVSR Engineering College
04.2001 -

Diploma - Electronics And Communications Engineering

Govt. Polytechnic College
04.2001 -

SSC -

Z.P. High School
04.2001 -
Akash BolliwarData Engineer (AWS & Generative AI)