Dynamic HPC-AI Application Engineer with a robust history of optimizing and porting scientific applications for peak performance on high-performance computing systems. Expertise in aligning HPC applications with specific workloads to ensure efficient resource utilization across diverse computing environments, enhancing the capabilities of HPC facilities throughout India. Proven ability to troubleshoot complex challenges and deliver innovative solutions that drive operational excellence. Deep knowledge of parallel programming, optimization techniques, and HPC architectures, complemented by proficiency in C/C++ and Python to create effective and scalable solutions.
HPC Infrastructure Setup & Management
AI/ML Benchmarking & Performance Tuning
Solution Architecture & Customer Engagement
Networking & Storage Optimization
Programming Languages: C/C, Python, Shell
HPC Projects under National Supercomputing Mission:
Project Summary: The National Supercomputing Mission (NSM) is a government initiative in India aimed at boosting the country's supercomputing capabilities. Launched in 2015, the mission involves setting up a network of high-performance computing (HPC) facilities across India. The goal is to empower researchers and scientists with advanced computing power to tackle complex problems in various fields, such as climate modeling, drug discovery, and materials science.
Responsibilities: Parallelization and Optimization of scientific codes on CPUs/GPUs, Parallelize and Optimize given scientific application on CPUs/GPUs (on single node or multi-node cluster), Work with the domain expert on defining the objectives and outcomes of the Parallelization and Optimizations efforts, Mentoring teams on Parallelization and Optimization efforts for ongoing projects, Training on Parallel Computing technologies.
Roles: Porting, Profiling, Optimization, Debugging and getting benchmarks of scientific applications and checking the application scalability. Providing support to researchers to interface with HPC infrastructures. Implementing and serving the management of job scheduling systems. Maintaining application like GROMACS, LAMMPS, NAMD, OpenFOAM, Quantum Espresso, WRF(Weather Research Forecasting), etc. Exposure of having worked on NSM Facilities pan India with 14+ NSM Sites, total computing power up to 22+ PF
Technical Skills: Parallel Programming, HPC, SLURM Workload Management, Optimization, Containerization, Benchmarking Application
Artificial Intelligence Projects under National Supercomputing Mission:
Project Summary: The National Supercomputing Mission (NSM) in India aims to enhance the nation's capabilities in the field of HPC and make supercomputing resources accessible to research and industry. Under this mission, the Indian government has been pushing forward various artificial intelligence (AI) and machine learning (ML) projects to boost innovation and provide solutions in critical sectors.
Responsibilities: Developed a real-time image processing based application pipeline using TensorFlow and a GPU-accelerated CNN and Transformers for tasks like computer vision, natural language processing (NLP). Leading the benchmarking activities for evaluating AI application across GPU platforms ranging from different OEMs. Created in house benchmark application kernels for system acceptance test and evaluation. Working with cross-functional teams to deliver AI solutions.
Roles: Develop Machine Learning and Deep Learning models to solve complex problems, Implement advanced AI techniques (e.g., CNNs, RNNs and Transformers) for tasks like computer vision and Natural Language Processing (NLP). Gathering the data for image segmentation and classification task, data annotation, and preprocessing for creating the model. Deployment of ML applications on platforms like HuggingFace, Streamlit, Gradio, Heroku, etc. Actively support NSM sites and internal groups of AI/ML workloads. Benchmarking AI Application on open consortium platform like MLCommons.
Technical Skills: Python Programming, ML, DL, Relevant libraries awareness (Tensorflow, Pytorch, etc), OpenCV, NLP, LLMs, MLOps
PARAM Shavak Product Development:
Project Summary: PARAM Shavak - solution, aims to provide computational resource (Capacity building) with advanced technologies to perform high-end computations for scientific, engineering and academic programs to address and catalyze the research using modelling, simulation and data analysis. This initiative is expected to create HPC aware skilled workforce (Capability building) and for promoting research by integrating leading-edge emerging technologies at grass root level.
Responsibilities: System setup and configuration according to client requirement, Updating the product documentation, Ensuring the integrity with latest technology support. Technical support for client along with training on different variants of PARAM Shavak series like HPC, DL-GPU and VR.
Roles: System Solution Design, Development & Integration, Technical evaluation, System development, Product Documentation, Training activities for the marketing teams, Deployments at the client site.
Technical Skills: Shell & Python Scripting, Product Management