Summary
Overview
Work History
Education
Skills
Certification
Tools
Skills
Timeline
Generic

Girish Kumar

Solutions Architect Data & AI
Banglore

Summary

Accomplished Data & AI Architect with 12+ years of experience designing and implementing distributed data and AI solutions. Specializes in generative AI, including large language model (LLM) training/optimization and designing retrieval-augmented generation (RAG) workflows. Expert in PyTorch and Hugging Face Transformers for model development, with extensive experience leveraging NVIDIA GPUs for accelerated training and inference. Strong background in big data platforms (Apache Spark, Hadoop) and cloud services (Azure Databricks, Synapse Analytics) to support scalable data pipelines. Excellent communicator skilled in leading technical workshops, customer engagements, and providing pre-sales support.

Overview

7
7
years of professional experience
5
5
years of post-secondary education
1
1
Certification

Work History

Data & AI Solutions Architect

Daimler Truck Innovation Center India (DTICI)
Banglore
12.2021 - Current
  • Designed and deployed end-to-end generative AI solutions using large language models and RAG workflows to support business intelligence and analytics use cases.
  • Fine-tuned transformer models (GPT, BERT, T5, etc.) on domain-specific datasets using PyTorch and Hugging Face libraries, optimizing accuracy and throughput on NVIDIA GPU clusters.
  • Developed custom RAG pipelines, including document ingestion, vector indexing (e.g., FAISS/Pinecone), and context-aware query processing for question-answering systems.
  • Collaborated with clients and internal teams to translate requirements into scalable AI architectures; delivered technical demos and provided pre-sales support for NVIDIA-powered solutions.
  • Conducted hands-on workshops and training sessions on generative AI best practices, GPU-accelerated model training, and distributed computing for engineers and stakeholders.
  • Evaluated emerging technologies to stay current on industry trends, making informed decisions for technology adoption.
  • Developed comprehensive documentation for technical specifications, project plans, and user guides, streamlining communication across teams.
  • Managed project timelines effectively while juggling multiple priorities, consistently meeting deadlines without compromising quality.
  • Enhanced application performance by integrating cloud technologies and microservices architecture.
  • Promoted collaboration between cross-functional teams by serving as a liaison between developers, product managers, and stakeholders during all phases of projects.
  • Delivered highly-available systems utilizing load balancing techniques, allowing for seamless scalability during peak usage periods.
  • Championed a culture of continuous improvement by conducting code reviews and providing constructive feedback to peers.
  • Increased security measures by implementing robust authentication and encryption protocols, protecting sensitive data from potential breaches.
  • Maintained current and in-depth understanding of business processes, needs and objectives.
  • Implemented monitoring tools for proactive issue detection, reducing downtime due to unforeseen technical issues.
  • Streamlined system integrations through the use of RESTful APIs, improving interoperability between various applications.
  • Reduced time-to-market by automating deployment processes using CI/CD pipelines.
  • Mentored junior team members in best practices and coding standards, fostering professional growth within the team.

Senior Big Data Engineer

Reserve Bank of Australia
Sydney
11.2018 - 12.2021
  • Developed and optimized PySpark applications on Hortonworks Hadoop for large-scale data ingestion and ETL, processing multi-terabyte datasets for analytics.
  • Implemented data pipelines integrating AWS S3, Hive, and HBase; used Spark SQL and Impala to execute high-performance queries on distributed data.
  • Employed various data formats (Avro, Parquet, ORC) and SerDes (JSON/XML) in Hive to improve storage efficiency and interoperability.
  • Automated data workflows and scheduling with Oozie; migrated legacy batch processes to Spark for improved reliability.
  • Mentored junior engineers on Spark programming and Hadoop ecosystem tools, fostering team best practices in big data development.
  • Enhanced data quality by developing robust validation strategies to identify and correct inconsistencies.
  • Proactively addressed potential bottlenecks in the ETL process through regular monitoring, enabling seamless workflow operations.
  • Automated routine tasks through scripting languages, reducing manual effort and human error risks.
  • Integrated real-time streaming technologies for accurate monitoring of critical business metrics.
  • Increased operational efficiency by automating repetitive tasks using Python scripts, allowing focus on higher-priority projects.
  • Led a team of engineers in designing an innovative big data solution that significantly impacted business growth and profitability.
  • Migrated legacy systems to modern cloud-based platforms for increased efficiency and scalability.
  • Evaluated emerging big data technologies to ensure relevance with evolving industry trends, maintaining competitive advantage over peers.
  • Collaborated with cross-functional teams to determine business requirements and translate them into functional specifications.

Education

Artificial Intelligence And Machine Learning - M.Tech.

Birla Institute of Technology
Banglore
11.2023 - Current

Electronics Engineering - Electronics Engineering

University of Pune
Pune
07.2008 - 06.2012

Skills

  • Generative AI

  • NLP

  • Training and optimizing LLMs

  • RAG pipelines

  • prompt engineering

  • Deep Learning Frameworks

  • PyTorch

  • TensorFlow

  • Keras

  • Hugging Face Transformers

  • Distributed Systems

  • Big Data

  • Apache Spark

  • DevOps

  • Microsoft Azure

  • Databricks

AWS

  • Docker

  • Kubernetes

  • Python

  • SQL

  • Linux shell scripting

  • Communication

  • Leadership

  • Technical presentations

  • customer workshops

Mentoring

Cross-functional collaboration

  • Libraries

  • spaCy

  • Containers

  • Git

  • NVIDIA

  • GPU

  • CUDA

  • TensorRT

  • cuDNN

  • NVIDIA Triton Inference Server

  • Linux

Certification

Microsoft Certified: Azure Data Engineer Associate (DP-203)

Tools

, , , , , , , , , , , , , , , , , , , , , , ,

Skills

, , , , , , , , , , , , , , , , , , , , ,

Timeline

Artificial Intelligence And Machine Learning - M.Tech.

Birla Institute of Technology
11.2023 - Current

Data & AI Solutions Architect

Daimler Truck Innovation Center India (DTICI)
12.2021 - Current

Senior Big Data Engineer

Reserve Bank of Australia
11.2018 - 12.2021

Electronics Engineering - Electronics Engineering

University of Pune
07.2008 - 06.2012
Girish KumarSolutions Architect Data & AI