Summary
Work History
Education
Skills
Technical Expertise
Architecture And Design Experience
Experience
Personal Information
Innovation And Automation
Core Expertise
Timeline
Generic
JAYAKRISHNAN HARIKUMAR

JAYAKRISHNAN HARIKUMAR

Summary

Seasoned Data Engineer and Architectural Specialist with over 12 years of hands-on experience in designing, developing, and scaling robust, data-intensive solutions across diverse domains including Healthcare, Telecom, and Insurance. Core Expertise: End-to-end implementation of Medallion Architecture (Bronze → Silver → Gold) and high-volume ETL/ELT data pipelines using Cloud-native and open-source tools. Technical Stack: Expert in the modern data ecosystem, including PySpark, Apache Airflow, AWS Glue, PostgreSQL, Snowflake, and Redshift. Innovation & Automation: Proven track record in automating data ingestion, transformation, and analytical data modeling, including proficiency with LLM Integrations (Gemini, ChatGPT) to accelerate analytics and insight generation. Seasoned Senior Data Engineer with background in developing, testing, and maintaining data architectures. Possess strong skills in database management systems, Big Data processing frameworks, data modeling and warehousing. Have successfully led teams in creating innovative data solutions to improve system efficiency and business decision-making processes. Demonstrated impact through enhanced data availability and accuracy in previous roles.

Work History

Senior Data Engineer

Innovation Incubator Advisory Pvt Ltd
  • Architected and Implemented a complete, scalable Medallion Architecture pipeline for critical patient data, including vitals, insurance claims, and complex clinical workflows, using PostgreSQL, AWS Glue, PySpark, and Airflow.
  • Engineered patient program ingestion and claims processing pipelines, focusing on advanced JSON-based data transformations and schema enforcement.
  • Delivered the UCM + Cardio1 Analytics Layer, featuring automated dashboards (Power BI, DAX) integrated with Python-based LLM agents for real-time insight generation.
  • Designed a highly layered (Medallion-style) data platform within Redshift and S3 for advanced telecom usage and billing analytics, handling massive transactional datasets.
  • Developed automated ETL processes leveraging AWS Glue and PySpark for data cleansing and aggregation.
  • Implemented end-to-end ETL pipelines for auto retail data and built data ingestion frameworks, including web scraping using Scrapy/Kapow and ETL automation via Talend.
  • Developed large-scale data pipelines for real estate products and built structured ETL workflows for equipment rental marketplaces using Talend, MySQL, and PostgreSQL.
  • Engineered robust pipelines for a digital insurance marketplace, focusing on data integrity and transactional completeness.
  • Built ETL workflows for an AI-driven fashion recommendation engine, processing large volumes of unstructured data.

Education

B.Tech - Electrical and Electronics Engineering - EEE

College of Engineering Perumon
01.2006

Skills

  • AWS S3
  • ETL process automation
  • Data pipeline architecture
  • Advanced data transformations
  • Real-time analytics integration
  • Data quality assurance
  • Problem solving
  • Metadata management
  • Data modeling
  • Data curating
  • Data pipeline design
  • AWS Glue
  • AWS DataBrew
  • AWS Redshift
  • Snowflake
  • PostgreSQL
  • Neo4j (Cypher)
  • Apache Airflow
  • Shell Scripting
  • PySpark
  • Python
  • SQL
  • DAX
  • Talend Big Data Integration
  • Kapow
  • Medallion Architecture
  • Dimensional Modeling
  • ETL/ELT Automation
  • Power BI
  • Tableau
  • Google Data Studio
  • Apache Superset
  • LLM Integrations (Gemini, ChatGPT)
  • Web Scraping (Scrapy)

Technical Expertise

AWS S3, AWS Glue, AWS DataBrew, AWS Redshift, Snowflake, PostgreSQL, Neo4j (Cypher), Apache Airflow, Shell Scripting, PySpark, Python, SQL, DAX, Talend Big Data Integration, Kapow, Medallion Architecture, Dimensional Modeling, ETL/ELT Automation, Power BI, Tableau, Google Data Studio, Apache Superset, LLM Integrations (Gemini, ChatGPT), Web Scraping (Scrapy)

Architecture And Design Experience

  • Medallion Architecture Implementation, Led the design and deployment of the Medallion Architecture (Bronze/Raw, Silver/Refined, Gold/Curated) across multiple high-stakes projects (Cardio1, Pavlov, ACS), ensuring data quality and consumption readiness.
  • Scalable Pipeline Design, Engineered robust and scalable data ingestion and transformation pipelines using AWS Glue and PySpark orchestrated via Apache Airflow.
  • LLM Integration, Developed Python-based LLM agents for the UCM + Cardio1 analytics layer to automate complex insight generation and accelerate data analysis workflows.

Experience

12+ Years

Personal Information

Title: Data Engineer & Data Architecture Specialist

Innovation And Automation

Proven track record in automating data ingestion, transformation, and analytical data modeling, including proficiency with LLM Integrations (Gemini, ChatGPT) to accelerate analytics and insight generation.

Core Expertise

End-to-end implementation of Medallion Architecture (Bronze → Silver → Gold) and high-volume ETL/ELT data pipelines using Cloud-native and open-source tools.

Timeline

Senior Data Engineer

Innovation Incubator Advisory Pvt Ltd

B.Tech - Electrical and Electronics Engineering - EEE

College of Engineering Perumon
JAYAKRISHNAN HARIKUMAR