Dayananda V

Mysore

Summary

Experienced AI and Deep Learning Framework Solution Engineer with 8 years in AI acceleration and distributed training. Expertise in model optimization and performance tuning for specialized hardware such as Intel Habana Gaudi and Huawei AI/NPU. Contributed to open-source projects like Apache TVM and TensorFlow, enhancing scalability and high-performance computing.

Overview

15 years of professional experience

Work History

AI Solution Engineer

Intel India Private Limited
Bengaluru
03.2022 - Current
Intel Corporation (Habana / XPU Team) — AI Solution Engineer / Distributed Systems Engineer

March 2022 – Present | Bengaluru, Karnataka

  • Designed and implemented scalable solutions to improve distributed AI training stability and performance across multi-node environments.
  • Collaborated with framework and hardware teams to optimize performance, scalability, and memory efficiency for large-scale AI workloads, improving overall system capabilities.
  • Optimized memory and resolved framework-level issues for multi-node distributed workloads on Habana Gaudi 2 and Gaudi 3 hardware, enhancing system reliability.
  • Contributed to PyTorch distributed framework enablement and porting support for Habana Gaudi hardware platforms.
  • Worked on distributed communication and framework optimization to bridge gaps between AI frameworks and Intel accelerator hardware.
  • Integrated the SGLang framework with DeepEP to enable efficient Mixture-of-Experts (MoE) solutions for Intel XPU hardware in multi-node scale-up configurations.

Senior System Architect

Huawei Technologies India Pvt Ltd
Bengaluru
03.2011 - 03.2022
  • Led development of lightweight AI inference frameworks for microcontroller and edge-device-based AI solutions with optimized memory and performance characteristics.
  • Core member of Huawei AI framework and compiler optimization team contributing to multiple AI solutions deployed globally across Huawei device platforms.
  • Enhanced Huawei MindSpore AI compiler framework through development of generic operator fusion and pre-compiler optimization solutions.
  • Specialized in AI model optimization techniques including quantization, pruning, graph optimization, and operator fusion.
  • Deployed advanced AI models on Huawei handset platforms, including computer vision models for dehazing, deblurring, and reflection removal as well as spam filtering, improving user experience.
  • Contributed to TensorFlow and Apache TVM community solutions involving framework integration, kernel optimization, and performance tuning.
  • Worked extensively on Android framework development, Bluetooth stack integration, and AOSP platform optimization.
  • Led and mentored engineering teams while contributing as designer, developer, architect, and technical lead across multiple projects.
  • Developed tailored mobile solutions for AT&T, T-Mobile, Verizon, and other international customers, addressing specific operational needs.

Education

Bachelor of Engineering - Electrical, Electronics And Communications Engineering

Sridevi Institute Of Engineering And Technology
Tumakuru, Karnataka
07-2010

Advanced Embedded System Engineering - Embedded Engineering

India Service Machine (ISM)
Bengaluru, Karnataka
02-2011

Skills

AI and Deep Learning
  • Language model support
  • Distributed training methodologies
  • MoE architecture
  • Performance acceleration
  • Quantization techniques
  • Graph optimization strategies
  • Operator fusion methods
  • Performance tuning
  • Memory efficiency optimization
  • PyTorch and TensorFlow
Frameworks
  • PyTorch Distributed
  • ONNX integration
  • Apache TVM knowledge
Distributed & Hardware Platforms
  • Multi-node systems
  • Infrastructure scalability
  • Framework enhancement

Accomplishments

  • Integrated and optimized DeepEP framework with SGLang for scalable MoE workloads on Intel XPU hardware.
  • Optimized memory utilization and resolved distributed framework issues in Intel Habana multi-node training environments.
  • Contributed to distributed AI framework enablement for Intel Habana Gaudi 2, Gaudi 3, and Intel XPU hardware platforms.
  • Contributed operator fusion and graph optimization solutions to an AI compiler framework.
  • Contributed to Apache TVM, TensorFlow, and PyTorch ecosystem improvements focused on scalability and performance.
  • Deployed production-grade computer vision solutions, including dehazing, deblurring, and reflection removal models, on Huawei devices.
  • Delivered lightweight AI inference frameworks for resource-constrained edge and embedded devices at Huawei.

Awards

  • HQ Gold Award — Huawei Individual Best Contributor Award
  • APRI Award for Innovation in Deep Machine Learning
  • Advanced Coding Assessment and ECQ Recognition from Huawei
  • Recognized contributor for AI framework optimization and distributed AI solutions
  • Appreciated for scalable AI infrastructure and multi-node optimization contributions at Intel

Programming Languages

  • C
  • C++
  • Python
  • Java

Development Tools

  • Git
  • Linux
  • Android Studio
  • Microsoft Visual Studio
  • Cross Compiler Optimization
  • Debugging & Profiling Tools

Timeline

AI Solution Engineer

Intel India Private Limited
03.2022 - Current

Senior System Architect

Huawei Technologies India Pvt Ltd
03.2011 - 03.2022

Bachelor of Engineering - Electrical, Electronics And Communications Engineering

Sridevi Institute Of Engineering And Technology

Advanced Embedded System Engineering - Embedded Engineering

India Service Machine (ISM)