Summary
Overview
Work History
Education
Skills
COURSEWORK
ADDITIONAL WORK
ACHIEVEMENTS / HIGHLIGHTS
Timeline
Generic
Deepika C

Deepika C

Coimbatore

Summary

Results-driven Data Engineer with 6+ years of experience in designing and implementing scalable, cloud-based data solutions. Proficient in Python, SQL, and AWS (S3, Glue, Lambda, Redshift, EC2, Airflow). Skilled in ETL pipeline design, data modeling, and machine learning workflows using Pandas, PySpark, and Scikit-learn. Adept at automation, performance optimization, and delivering actionable insights from complex datasets.

Overview

7
7
years of professional experience

Work History

Senior Developer

Tata Consultancy Services
07.2022 - Current
  • - Designed and automated ETL pipelines using AWS Glue and Lambda for large-scale data ingestion.
  • - Optimized Redshift performance using partitioning and compression techniques for high-volume data queries.
  • - Built reusable PySpark scripts for data cleansing and transformation.
  • - Created materialized views to improve query performance and reduce computation time.
  • - Managed workflow orchestration using Apache Airflow for end-to-end automation and scheduling.
  • - Collaborated with analytics teams to streamline data delivery and reporting pipelines.

Associate

Cognizant Technology Solutions
06.2018 - 05.2022
  • - Migrated analytical models from R to Python ensuring 100% validation accuracy and reproducibility.
  • - Developed predictive analytics scripts using Python (Pandas, NumPy, Scikit-learn) to identify performance trends.
  • - Built automation utilities for report generation using Excel VBA Macros integrated with Python-based data pipelines.
  • - Designed Oozie-triggered Python workflows for Hadoop-based data processing.
  • - Performed static code analysis and vulnerability resolution using Veracode, SonarQube, and SAST tools.
  • - Developed standalone Python applications with PyInstaller for ease of deployment.

Education

B.Tech - Electronics and Communication Engineering

Vellore Institute of Technology
01.2017

Skills

  • Programming: Python, SQL, R, Java
  • Cloud & Data Engineering: AWS (S3, Glue, Lambda, Redshift, EC2), Apache Airflow, Data Go
  • Big Data & Analytics: PySpark, Pandas, NumPy, Scikit-learn
  • Automation & Tools: Excel Macros (VBA), Eclipse, VS Code, JIRA, Git, TeamCity, Qlik Sense, PyInstaller
  • Databases: MySQL, SQL Server

COURSEWORK

Data Structures, Object-Oriented Programming, Algorithms (Design and Analysis), Database Management Systems

ADDITIONAL WORK

  • - Created simulation models with AnyLogic to analyze complex system behavior.
  • - Automated manual reporting tasks using Excel Macros and Python scripts.
  • - Contributed to internal tool enhancements improving data flow efficiency.

ACHIEVEMENTS / HIGHLIGHTS

  • - Improved ETL pipeline performance by 40% through optimized data transformation strategies.
  • - Enhanced data reliability by implementing automated validation and reconciliation scripts.
  • - Recognized for reducing report generation time by 60% using Excel Macro-based automation.
  • - Played a key role in migrating legacy R systems to Python, ensuring seamless transition and scalability.

Timeline

Senior Developer

Tata Consultancy Services
07.2022 - Current

Associate

Cognizant Technology Solutions
06.2018 - 05.2022

B.Tech - Electronics and Communication Engineering

Vellore Institute of Technology
Deepika C