Arunkumar A

KOCHI

Summary

Value-oriented Data Engineer with 6 years of experience in ETL, data warehousing, SQL, and PySpark across the retail, accounting, and insurance domains. Skilled in building scalable data pipelines, automating ETL workflows, and ensuring data integrity, with proficiency in Azure Data Services and AI-driven data solutions.

Overview

6 years of professional experience
1 Certification

Work History

Data Engineer

EY Global Services
Kochi
06.2023 - Current
  • Build, maintain, and refine data pipelines by leveraging SQL, PySpark, and Databricks to facilitate seamless data ingestion, transformation, and storage across enterprise systems
  • Develop and fine-tune scalable ETL processes for data warehousing and integration, ensuring the efficient extraction, transformation, and loading of structured and unstructured data from various sources
  • Implement and optimize SQL-based data solutions by designing data models, tuning query performance, and applying indexing strategies to improve data retrieval and processing efficiency
  • Utilize and manage Azure Data Services, including Azure Data Factory, Azure Data Lake, and Synapse Analytics, to design and maintain cloud-based data solutions for enterprise-scale analytics and reporting
  • Apply generative AI to automate manual comparisons and reviews, resulting in a 60% improvement in performance and operational efficiency

Data/ETL Engineer

Cognizant Technology Solutions
Kochi
01.2022 - 04.2023
  • Optimized and scaled data solutions to improve performance and handle large datasets efficiently
  • Designed and refined SQL queries to improve database efficiency and reduce execution time
  • Led ETL migrations, ensuring seamless transitions to cloud-based architectures
  • Implemented data governance policies to maintain data quality, security, and compliance
  • Automated and monitored data pipelines, ensuring reliability and real-time issue resolution

ETL Developer

Tata Consultancy Services
Kochi
06.2019 - 01.2022
  • Worked on data migration efforts, assisting in on-premises to Azure cloud transitions (Azure SQL, Data Lake) while ensuring minimal disruption through incremental loading and parallel processing
  • Optimized SQL queries to enhance database performance, reduce execution time, and improve overall efficiency in large-scale data processing
  • Created and managed technical documentation, detailing ETL processes, best practices, and troubleshooting guidelines for seamless team collaboration
  • Performed unit testing and debugging on ETL workflows, ensuring data accuracy, integrity, and compliance while proactively identifying and resolving issues
  • Collaborated with database and frontend teams to enhance data pipelines, optimize cloud-based ETL architectures, and improve system performance through efficient data integration and processing

Education

Master of Computer Applications - Computer Science

Kerala Technological University
Thiruvananthapuram
06-2019

Bachelor of Computer Applications - Computer Science

Mahatma Gandhi University
Kottayam
05-2016

Skills

  • Data Engineering – Building and optimizing scalable data pipelines
  • Python & Spark framework – Data transformation, processing, and automation
  • SQL & Database Management – Writing complex queries, data modeling, and performance tuning
  • Data Governance & Integrity – Ensuring data quality, consistency, and compliance
  • Databricks Ecosystem – Delta Lake, Delta Live Tables, Unity Catalog
  • Azure Data Services – Synapse, Data Factory, Cosmos DB
  • ETL Operations – Informatica PowerCenter, Talend Open Studio
  • Generative AI & Vector Databases – ChromaDB, LangChain, Sentence Transformers, Embeddings, Azure OpenAI, LLMs

Certification

  • Microsoft Certified: Azure Data Engineer Associate (DP-203)
  • Microsoft Certified: Azure AI Fundamentals (AI-900)
  • Databricks Certified Data Engineer Associate

Accomplishments

  • Presented a paper on Cryptocurrency Hijacking at an ICSSR-sponsored national conference on Computational Intelligence.
  • Earned a Scrum Alliance badge for completing an in-person Scrum Foundation program.

Timeline

Data Engineer

EY Global Services
06.2023 - Current

Data/ETL Engineer

Cognizant Technology Solutions
01.2022 - 04.2023

ETL Developer

Tata Consultancy Services
06.2019 - 01.2022

Master of Computer Applications - Computer Science

Kerala Technological University

Bachelor of Computer Applications - Computer Science

Mahatma Gandhi University