
Hariharasudhan Sabapathi

Chennai

Summary

Sharp and talented Lead Data Engineer with 12 years of product development experience. Skilled in driving projects and leading cross-functional teams to consistently meet key program deliverables. Seeking lead-level assignments in Big Data and Data Analytics with a reputed organization.

Overview

12
years of professional experience

Work History

Lead Data Engineer

LTIMindtree | Client : Marriott
11.2024 - Current
  • Led cross-functional engineering teams, enforced data quality standards, and collaborated with stakeholders to deliver robust, data-driven solutions aligned with strategic goals.

Development Engineer-3

Comcast
07.2021 - 11.2024
  • Ensured the successful delivery of emerging big data solutions; defined strategy, principles, vision, and standards; and provided advisory services around data frameworks, governance, and quality.

Member of Technical Staff

Zoho Chennai
06.2013 - 07.2021
  • Designed, developed, and deployed computationally complex, practical data solutions; built and delivered comprehensive data strategy roadmaps; ensured final deliverables were of the highest quality.

Education

MCA

National Institute of Technology, Tiruchirappalli (NIT Trichy)
06.2013

PG Diploma - Artificial Intelligence and Machine Learning

IIT Madras
Chennai
06.2024

Skills

  • Spark with Scala
  • Coding
  • Debugging
  • Software Development
  • Big Data Analytics
  • Big Data Technologies
  • Hadoop Architecture
  • Spark
  • Scala, Java, Python
  • Kafka
  • Core Java
  • SQL
  • AWS Services
  • ETL
  • Agile Methodologies
  • Motivator
  • Collaborator
  • Planner
  • Analytical skills
  • Problem-solving skills
  • Leadership
  • Machine Learning (Basics)

Timeline

Lead Data Engineer

LTIMindtree | Client : Marriott
11.2024 - Current

Development Engineer-3

Comcast
07.2021 - 11.2024

Member of Technical Staff

Zoho Chennai
06.2013 - 07.2021

MCA

National Institute of Technology, Tiruchirappalli (NIT Trichy)

PG Diploma - Artificial Intelligence and Machine Learning

IIT Madras

Project

Spark Tech Upgrade (Client: Marriott)

Led the Spark and EMR upgrade initiative. Directed the successful upgrade of Apache Spark and Amazon EMR versions, ensuring seamless migration of all Spark jobs with full backward compatibility. Coordinated cross-functional efforts to minimize downtime and maintain data pipeline integrity throughout the transition.

AOS (Ad-server Operative System) Data Mapping

Role: Spark Developer
Migrated the existing on-prem Operative (o1) system to the cloud Operative system. Extracted source data from multiple data lake topics, applied transformations to map the datasets, and loaded the results into the target S3 bucket for downstream teams.
Technologies: Spark with Scala, AWS S3, Terraform, Concourse CI/CD, Databricks.

AOS DataStream To DataLake

Role: Big Data Developer
A Spark Streaming application ingests configured topics from the AOS Kafka broker into the S3 data lake and stores them as Delta-format files. The application uses the AOS Kafka Schema Registry to deserialize Avro payloads from the AOS data stream.
Technologies: AWS (S3, CloudWatch, Elasticsearch, Athena), Spark with Scala, Terraform, Grafana, Kafka, DynamoDB, Concourse CI/CD, Databricks Delta Lake and data versioning.

Zoho CRM Data Migration from Various Sources

Role: Spark Developer
Retrieved the coordinates (latitude and longitude) of each record from its address information and updated or inserted the coordinate values in the corresponding records. Using this geocode information, salespeople can find entities located near them. The project used big data technologies and the Zoho Maps API, with Kafka handling the streaming data.
Technologies: Cloudera, Sqoop, Spark with Scala, Hive.

EffecTV Data Portal

An internal portal for Comcast EffecTV customers (dataportal.comcast.com) where they can request access to tables and buckets and view how many roles/users accessed a table or bucket on any given day. Admins and managers can approve or deny bucket access requests.
Technologies: Node.js, AWS (Glue API, Glue jobs, ECS, CloudWatch, event rules, Lambda), Spark with Scala, Angular, Postgres Aurora Serverless.

Technologies And Frameworks

  • Spark
  • Apache Hadoop
  • Hive
  • Sqoop
  • Databricks
  • Terraform
  • Concourse CI/CD
  • AWS Services (S3, EMR, Lambda, ECS, RDS, CloudWatch, SNS, SES)
  • Monte Carlo

Databases

  • MySQL
  • PostgreSQL
  • NoSQL

Build Tools

  • Apache Maven
  • SBT

Key Result Areas

  • Ensuring the successful delivery of emerging big data solutions
  • Defining strategy, principles, vision, and standards
  • Providing advisory services around data frameworks, governance, and quality
  • Simulating, designing, developing & deploying computationally complex, practical data solutions
  • Leading the development of enterprise technology standards, governance processes, and performance metrics
  • Analyzing & reviewing business, functional, and high-level technical requirements