Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Umesh U

Senior Data Engineer
Bengaluru,Karanataka

Summary

Specializing in ETL pipeline for big data project and data analysis. Experienced with all stages of the development cycle for complex data loads and predictive analysis projects.

Overview

9
9
years of professional experience
5
5
years of post-secondary education
2
2
Certifications

Work History

Senior Data Engineer

Nike India
Bengaluru
07.2022 - Current
  • Envisioned, architected, and solely developed the project in collaboration with the principal engineer, and successfully made it an open-source project through the Nike repository
  • The Spark-Expectations data quality tool is a major project at Nike, with over 50+ data teams having adopted the solution.
  • Designed the clickstream modernization solution and successfully delivered the KPIs for the nikedotcom, nikeapp, and nikesnkrs applications utilizing the c360 database.
  • Migrated the legacy pipelines of 3PP and implemented unified data pipeline solutions for other partners to onboard more efficiently. Enhanced the SLA by reducing processing time by 30 minutes compared to the legacy pipelines.
  • Discovered, identified, and architected the solution for PAO and ATC in collaboration with stakeholders to enhance the experience for Nikenet users.
  • Collaborated closely with principal engineers, stakeholders, and managers to gather information, feedback, and reviews for improving the pipelines and fostering self-development while considering the team's needs.

Data Engineer

Accion Labs
Bengaluru
07.2019 - 07.2022
  • Envisioned, architected and implemented a generic Azure data pipeline and Spark based model for batch processing of enterprise level complex retail data, resulting in saving $5000 per month.
  • The pipeline reduced the effort by 70% of previous architecture for all loads.
  • Sourced, analyzed, processed, validated, transformed, aggregated and distributed data from more than 5+ sources, using generic Azure data pipeline.
  • Configured and developed automated data bricks Spark replication system from PostgreSQL to Snowflake for 100 tables, which improved loading speed of data by 75%.
  • Developed and maintained reporting tool ensure 100% errors were recorded and reported.
  • Used Airflow to orchestrate the ETL solution that helped improve conversion rate by 20%.
  • Mentored, documented and maintained best practices of Git usage in data engineering project.
  • Awarded a prestigious “Team Marvel” award for zero defect go-live of complex feature.
  • Received “Customer Focus” award for architecting complex data load.
  • Client accolade on their LinkedIn page in recognition for my contribution.

Big Data Engineer

Reni Analytics
Bengaluru
09.2018 - 07.2019
  • Implemented a predictive analysis project called ‘Cell Site Degradation (CSD)' that trained a ML algorithm to collate raw network and weather data and predict network site degradation, which resulting in 32% increase in revenue.
  • Involved in design, implemented shipment tracking and alerting product called ‘Dice-Platform' using Scala Spark, Hadoop and Kafka, improving operating efficiency by 80%.
  • Built an algorithm to match vendor delivery quotation and shipment tracking and alert deviation with 100% accurate results.
  • Designed and built an end-to-end user configurable CICD pipeline that was adopted by more than 20+ projects in the company.
  • Developed monitoring and alerting capabilities to ensure 100% of data pipelines were working.
  • Implemented a ‘Log Analyzer' using Spark framework, Elasticsearch and Kibana.

Data Engineer

Mindtree(C2H) Through GVS Infotec
Bengaluru
08.2016 - 09.2018
  • Implemented and automated distributed ETL system for smart data access to determine the aggregation of a traffic of the calls over a network in a particular area for 15mins, half an hour, weekly and monthly, which reduced manual monitoring effort of 24 hours.
  • Created alerting system, dashboard and monitored to ensure 100% data was processed and transferred on time.
  • Participated in Agile planning 40+ data feature request.
  • Identified and fine tuned tombstone issue in Cassandra to speed up the insert and update by 60% for 10M transaction data per day.

Education

Bachelor of Science - Electrical Engineering

Visvesvaraya Technological University
Mangaluru
08.2012 - 08.2017

Skills

Big data framework: Spark, Snowflake, Kafka, Hadoop, Databricks, NumPy, SciPy, Pandas and Matplotlib

CICD Tools: Jenkins, Git and Docker

Cloud: Azure, AWS

Schedulers: Airflow

Database: Postgress, Snowflake, HBase, Hive and Cassandra

Big data filesystem: HDFS, Azure Blob3 and Amazon S3

Big data framework: Spark, Snowflake, Kafka, Hadoop, Databricks, NumPy, SciPy, Pandas and Matplotlib

Certification

Java, Software Testing and SQL

Timeline

Senior Data Engineer

Nike India
07.2022 - Current

Data Engineer

Accion Labs
07.2019 - 07.2022

Big Data Engineer

Reni Analytics
09.2018 - 07.2019

Data Engineer

Mindtree(C2H) Through GVS Infotec
08.2016 - 09.2018

Bachelor of Science - Electrical Engineering

Visvesvaraya Technological University
08.2012 - 08.2017
Umesh USenior Data Engineer