
LOKESHWARAN R

Coimbatore

Summary

Data Engineer with 8+ years of experience designing and optimizing data pipelines using Databricks, Azure Data Services, and Big Data frameworks. Skilled in scalable ELT/ETL, data governance, and performance tuning (partitioning, bucketing, AQE). Passionate about transforming raw data into actionable insights and driving cloud-based innovation. Seeking to contribute expertise in Databricks, PySpark, and cloud technologies to a data-driven organization.

Overview

8 years of professional experience
1 Certification

Work History

Data Engineering, Management & Governance Specialist

ACCENTURE
Hyderabad
10.2023 - Current

Pharmaceutical client.

  • Implemented an ETL pipeline in PySpark on Databricks, with MySQL and Azure Data Lake, transforming raw data into meaningful insights for pharmaceutical health analytics.
  • Strengthened Delta Lake frameworks using PySpark and Spark SQL in Databricks, improving data accessibility and reliability.
  • Utilized various data file formats, reducing data processing time by 25% and enhancing workflow efficiency.
  • Engineered data governance in Unity Catalog, streamlining access for 200+ users and increasing data accuracy.
  • Executed a proof of concept for Databricks Asset Bundles, enhancing workflow deployments and achieving a 25% efficiency improvement.
  • Implemented a proof of concept for a data pipeline from the source to the Gold layer with Delta Live Tables, reducing latency by 20%.
  • Optimized query performance using Adaptive Query Execution (AQE), reducing data retrieval times across complex datasets.

Data Engineering Senior Analyst

ACCENTURE
Hyderabad
05.2021 - 10.2023

Banking client.

  • Designed and developed data ingestion into ADLS Gen2 using a Synapse Spark pool, with reconciliation.
  • Developed and optimized transformation logic using PySpark, resulting in a 25% reduction in processing time for reporting views.
  • Automated data processing systems to efficiently handle over 10 terabytes of data daily, reducing processing time by 40%.
  • Proficient in handling various data file formats, including Parquet, CSV, JSON, and Delta tables.
  • Optimized query performance by 40% through effective partitioning and bucketing strategies on large-scale datasets.
  • Implemented a CI/CD pipeline using Azure DevOps, reducing deployment time and increasing release frequency.
  • Developed a proof of concept with Databricks Streaming DataFrames, reducing data processing latency in real-time applications.
  • Prioritized and organized tasks to efficiently accomplish service goals.

ETL Developer

COGNIZANT
Coimbatore
11.2016 - 04.2021

Healthcare client.

  • De-identified metadata fields from various applications, classifying them as PHI, PII, or PCI.
  • Processed raw data from the Enterprise Data Warehouse and transformed it based on requirements.
  • Worked on Informatica PowerCenter and Test Data Manager tools.
  • Developed data masking and data subsetting techniques.
  • Designed and developed PowerCenter mappings, mapplets, sessions, and workflows.
  • Migrated code from the Test to the QA repositories.
  • Created metadata patterns and policies in ILM and TDM.
  • Performed unit testing, fixed issues, and validated target tables.
  • Developed Informatica workflow automation using shell scripts.
  • Configured workflow scheduling to run in specific time frames.
  • Worked on various transformations and performance tuning in Informatica PowerCenter.
  • Analyzed HL7/EDI transaction files in B2B Informatica Developer.
  • Developed data transformations for legacy systems (IBM Mainframe) using PowerExchange.

Education

Bachelor of Engineering - Electronics and Communication Engineering

SNS College of Technology
Coimbatore, Tamil Nadu
01.2016

Higher Secondary - General Education - Group 1

Lisieux Matriculation Higher Secondary School
Coimbatore, Tamil Nadu
01.2012

SSLC - Matriculation School Leaving Certificate

Lisieux Matriculation Higher Secondary School
Coimbatore, Tamil Nadu
01.2010

Skills

  • Databricks
  • Databricks Asset Bundles
  • PySpark
  • Spark SQL
  • Hive Query Language
  • GitHub
  • Azure DevOps
  • VS Code
  • Jupyter Notebook
  • Azure Databricks
  • Azure Data Lake Storage (ADLS) Gen2
  • Azure Synapse Analytics
  • AWS Databricks
  • S3 Buckets
  • Redshift

Certification

  • Databricks Certified Data Engineer Professional
  • Databricks Certified Data Engineer Associate
  • Microsoft Certified: Azure Data Engineer Associate (DP-203)
  • Microsoft Certified: Azure Data Fundamentals (DP-900)
