Vinoth Thoguluva Santharam

Chennai

Summary

Staff Data Engineer with 13+ years of experience building scalable cloud-native and on-prem data pipelines. Proven expertise in Spark, Python, AWS, Databricks, and real-time streaming platforms. Led cross-functional teams, modernized legacy systems, reduced cloud costs, and mentored engineers.

Overview

14 years of professional experience
1 Certification

Work History

Development Engineer 5

Comcast
02.2019 - Current
  • Agent AI Innovator – Integrated Agentic AI solutions (LangChain, Azure AI) for data pipeline monitoring and diagnostics, reducing L1 support effort by 40%.
  • LLM Skill Enabler – Built proof-of-concepts using Large Language Models for failure summarization, intelligent alerting, and contextual root cause analysis.
  • SLA Optimizer – Automated recovery suggestions and pipeline re-runs using GenAI agents, leading to improved SLA adherence and minimized manual interventions.
  • GenAI Tools Developer – Created internal assistants to interpret logs, fetch documentation, and answer run-time questions, accelerating incident response.
  • Solution Architect – Designed scalable, reusable data solutions across teams.
  • Framework Designer – Built custom frameworks to accelerate development.
  • Tech Stack Leader – Collaborated with stakeholders to define cloud-native/hybrid stacks.
  • PoC Champion – Led evaluations to adopt new tools and architectural patterns.
  • Migration Lead – Directed Platform X pipeline redesign, enabling cloud readiness.
  • Cost Optimizer – Reduced cloud infrastructure costs by 60% via architecture revamp.
  • Data Engineering Expert – Implemented CDC pipelines; built base-to-golden transformations.
  • Platform Agnostic Developer – Built Docker/K8s pipelines for hybrid deployments.
  • CI/CD Advocate – Led observability and reliability enhancements.
  • Standards Enforcer – Drove engineering and data governance best practices across DE teams.
  • Mentor & Coach – Supported engineers in adopting best practices and growing technically.

Big Data Consultant

Capgemini
04.2016 - 02.2019
  • Delivered DataHub ETL pipelines for BNP Paribas.
  • Built real-time streaming apps with Spark, Kafka, and HBase.
  • Managed 40+ production pipelines using AVRO, ORC, Parquet.

Lead Engineer

HCL
03.2015 - 03.2016
  • Built automated data quality and monitoring pipelines.

Application Developer

IBM
01.2012 - 03.2015
  • Migrated RDBMS data to Hadoop and developed Hive-based ETL jobs.

Education

M.Tech. - Data Science & Engineering

BITS, Pilani
06.2026

B.E. - ECE

Anna University
05.2011

Skills

  • Languages: Python, Scala, Java, SQL
  • AI/ML: LangChain, Agentic AI, Azure AI, Prompt Engineering, RAG Pipelines, LLMOps, GenAI Workflow Automation, ML Observability & Alerting, Anomaly Detection, Failure Prediction, AutoML for Data Quality
  • Big Data: Spark (Batch & Streaming), Hadoop, Hive, Kafka, Kinesis
  • Cloud: AWS, Databricks, Platform X
  • Tools: Airflow, Docker, Kubernetes, Git, CI/CD
  • Databases: Oracle, MySQL, HBase, NoSQL
  • Formats: Parquet, Avro, ORC, CSV
  • Patterns: CDC Type 1 & 2, Base & Golden Layer

Certification

  • System Design for Large-Scale Analytics – IIT Madras (Diploma)
  • Databricks Certified Associate Developer for Apache Spark
