Summary
Overview
Work History
Education
Skills
Timeline
Generic

AKSHAY KUMAR REDDY

Hatfield

Summary

Experienced Data Engineer with an experience of 4+ years with a demonstrated history of working in designing and implementing Data Ingestion and curation pipelines with objective, insightful and shrewd in understanding and applying business requirements. Experience noting patterns and interpreting data.

Overview

4
4
years of professional experience

Work History

Data Engineer

Modak
Hyderabad
10.2021 - 09.2022
  • Defined the process of acquisition, data pre-processing, and data curation by closely working with stakeholders.
  • Employed data cleansing methods, significantly Enhanced data quality.
  • Developed PySpark applications to improve the performance and optimization of the data anonymization process and to ingest the CSV/SAS files data from a remote location to Hive.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Designed and developed an Incremental data unification process by constructing global schema upfront and cleaning, transforming, and merging datasets based on business requirements
  • Managed team resources to effectively deliver on sprint goals.

Software Development Engineer

Modak
Hyderabad
10.2020 - 09.2021
  • Designed and developed a dynamic Hive query generation application which converts the clinical source data to SDTM format based on the conversion mappings
  • Used various java features such as Multi-Threading, Collections
  • Framework to develop Curation pipelines and improve the release time of the clinical data
  • Involved in fine-tuning the Hive/Impala queries and converting complex transformation queries into python scripts using pandas
  • Developed Azure CI/CD pipeline to deploy and trigger the curation pipelines in LINUX machine.

Software Development Engineer

Modak
Hyderabad
12.2018 - 09.2020
  • Developed a java code for loading data into Hive from semi- structured SAS/CSV files
  • Worked with HAR file system and developed scripts to archive
  • SAS/CSV files as part of ingestion to avoid small File problem
  • Worked with Hive to write queries as part of developing an ETL pipeline to transform data for further downstream processing
  • Devised test cases to probe enhancements and expansions of functionality
  • Resolved Jira tickets with comprehensive bug fixes.

Education

Bachelor of Engineering - Electronics and Communication

MVSR Engineering College
2019

Skills

  • PySpark
  • Python
  • Hive
  • PostgreSQL
  • Azure Data Factory
  • Azure Synapse Analytics
  • Azure Data Lake
  • SQL Server
  • Kafka
  • Tableau
  • Bug Fixes
  • Data Cleaning

Timeline

Data Engineer

Modak
10.2021 - 09.2022

Software Development Engineer

Modak
10.2020 - 09.2021

Software Development Engineer

Modak
12.2018 - 09.2020

Bachelor of Engineering - Electronics and Communication

MVSR Engineering College
AKSHAY KUMAR REDDY