Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Suraj Akula

Hyderabad

Summary

Results-driven Data Engineer with 3 years of experience in building, optimizing, and maintaining data pipelines and analytics solutions using Databricks, PySpark, Scala Spark, and cloud platforms like Azure. Hands-on experience in handling large-scale data processing, ETL pipelines, and modern data lake solutions leveraging Apache Iceberg, Kafka, and SnapLogic. Proficient in delivering robust data solutions for real-time and batch processing, enhancing data quality, and supporting critical business analytics. Strong collaborator with experience working in Agile teams using tools like Jira, Azure Boards, and Git.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Syngenta
09.2023 - Current
  • Company Overview: Modak Analytics
  • Designed and developed data ingestion pipelines using SnapLogic, connecting multiple enterprise systems and integrating structured and semi-structured data
  • Processed and transformed large datasets on Databricks, leveraging PySpark for scalable batch processing and data cleansing operations
  • Implemented data quality checks and validations to ensure accurate and reliable data for reporting and analytics
  • Managed code versioning and CI/CD using GitLab, streamlining code integration and deployment processes
  • Worked in an Agile environment, collaborating with team members through Jira for sprint planning and tracking deliverables
  • Modak Analytics

Data Engineer

Abbvie
02.2022 - 08.2023
  • Company Overview: Modak Analytics
  • Developed custom ETL pipelines in PySpark and Scala to ingest data from various sources into Iceberg tables for scalable and optimized data storage
  • Utilized Apache Iceberg to enable ACID transactions, time travel, and schema evolution for large-scale data management
  • Implemented data transformation layers to cleanse, enrich, and standardize data before delivering it to business analytics teams or back to source systems
  • Integrated Kafka for real-time data ingestion and processing, ensuring low-latency and high-throughput pipelines
  • Managed ETL workflows using Nabu, automating data ingestion, transformation, and validation processes
  • Collaborated with cross-functional teams through Azure Boards and Jira for sprint planning, issue tracking, and project delivery
  • Performed code versioning and CI/CD processes using Git, ensuring code quality and automated deployments
  • Modak Analytics

Education

Bachelor's Degree - Information Technology

Gokaraju Rangaraju Institute of Engineering and Technology
05.2022

Skills

  • Databricks
  • Apache Spark
  • Scala
  • PySpark
  • Python
  • Kafka
  • Snaplogic
  • Azure
  • Iceberg
  • SQL
  • PostgreSQL
  • Nabu
  • Jira
  • Azure Boards
  • Data modeling
  • Data warehousing

Certification

Databricks Certified Data Engineer Associate

Timeline

Data Engineer

Syngenta
09.2023 - Current

Data Engineer

Abbvie
02.2022 - 08.2023

Bachelor's Degree - Information Technology

Gokaraju Rangaraju Institute of Engineering and Technology
Suraj Akula