Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic

Ajeet Kumar

Chennai

Summary

  • Experienced Data Engineer: Over 4.5 years of hands-on experience in designing, building, and optimizing robust data pipelines to support business intelligence, analytics, and machine learning initiatives.
  • Technical expertise: Proficient in Spark and Python for distributed data processing, SQL for advanced querying, and AWS for cloud-based data infrastructure and storage solutions.
  • End-to-End Data Solutions: Skilled in using Apache Airflow for orchestrating complex workflows, ensuring seamless data integration, and delivery across systems.
  • Collaborative Development: Adept at using GitLab for version control and collaboration, maintaining clean, efficient codebases in fast-paced team environments.
  • Agile Methodology: Experienced in Agile project management using Jira, delivering iterative improvements, and meeting deadlines while adapting to evolving business needs.
  • Problem-Solving Focus: Proven ability to translate complex data requirements into reliable, high-performance systems, driving actionable insights, and operational efficiency.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Software Engineer

Nielsen
Chennai
03.2024 - Current

1. HDAM (Household Data Adjustment Model):

  • Developed and maintained a predictive model to estimate household demographics from TV viewing patterns.
  • Developed and maintained a predictive model to estimate household demographics from TV viewing patterns.
  • Leveraged Nielsen's panel data to refine and validate demographic predictions, implementing robust data quality checks and validation processes.

2. Subminute TV Audience Measurement Application:

  • Pioneered a novel application to reverse-engineer tuning information at a sub-minute level from known demographics.
  • Utilized advanced PySpark techniques to analyze high-volume television viewing data, significantly enhancing the granularity and accuracy of Nielsen's audience measurement capabilities.

3. Big Data Processing and Optimization:

  • Designed and optimized ETL processes and big data pipelines in AWS for both HDAM and sub-minute applications, reducing processing time, and enabling more frequent updates to household demographic assignments and tuning predictions.
  • Implemented efficient data processing strategies to handle the increased granularity of sub-minute level data without compromising system performance.

4. Data Analysis and Reporting:

  • Conducted an in-depth analysis of TV viewing patterns and demographic predictions using Python and PySpark.
  • Developed automated reporting and visualization tools to communicate insights from both HDAM and sub-minute data to stakeholders, enhancing decision-making processes.

5. Cross-functional collaboration and continuous improvement:

  • Worked closely with data scientists, product managers, and business stakeholders to refine model algorithms and improve data processing efficiency for both HDAM and sub-minute applications.
  • Continuously monitored and optimized system performance, implementing updates to maintain high accuracy in demographic predictions, and subminute tuning data in response to changing viewing patterns and data sources.

Associate Software Engineer

TCS
Chennai
10.2020 - 03.2024

1. Experian Common Homes and SRLD Processing:

  • Developed data pipelines for Experian-matched common homes and Synthetic Respondent Level Data (SRLD).
  • Implemented matching algorithms and synthetic data generation techniques to enhance audience measurement accuracy.
  • Optimized processing of large-scale datasets, handling millions of records efficiently.

2. PPM (Portable People Meter) Fusion:

  • Designed and implemented data fusion algorithms to integrate PPM out-of-home viewing data with in-home panel data.
  • Developed matching processes to fuse PPM donor panelists with National TV recipient panelists based on viewing behavior and demographics.

3. Big Data Processing and Analytics:

  • Utilized Apache Spark and Hadoop to process and analyze large volumes of viewing data across multiple systems.
  • Implemented scalable solutions to accommodate growing data volumes and increasing granularity of audience measurement.

Education

B. Tech - Instrumentation And Control Engineering

Sri Manakula Vinayagar Engineering College
Puducherry
10-2020

Skills

  • Data Processing Framework: Spark
  • Data Processing & Scripting: SQL, Python, Pyspark
  • Cloud and Integration Technologies: AWS Cloud infrastructure for storage, compute, and data solutions (eg, S3, EMR, EC2, ECR, ECS, CloudWatch, Glue, Lambda)
  • Project and team coordination: Jira Agile project management and task tracking for efficient delivery
  • Development Tools & Productivity: GitLab, version control, and collaborative code management
  • Data Analytics Enablement: Databricks processing and preparing data for analytics and visualization workflows

Accomplishments

  • Special Initiative Award – Recognition by the organization for inspiring colleagues and being an inspiring role model
  • Continuous feedback star certificate – recognition by the organization for contributing to making continuous feedback a way of life and being a role model for colleagues
  • Appreciation from the client for delivering the deliverables on time
  • Appreciation from the client for fixing and debugging escalated issues

Certification

Astronomer Certification for Apache Airflow Fundamentals

Timeline

Software Engineer

Nielsen
03.2024 - Current

Associate Software Engineer

TCS
10.2020 - 03.2024

B. Tech - Instrumentation And Control Engineering

Sri Manakula Vinayagar Engineering College
Ajeet Kumar