Summary
Overview
Work History
Education
Certification
Affiliations
Timeline
Generic

K. H. Vishwanathan Iyer

Cloud Support, Technical Support And Server Admin

Summary

Skilled MLOps engineer and team leader/manager with 10+ years of experience in AI/ML, DevOps, and cloud technologies. Specializes in designing and deploying scalable machine learning pipelines using Azure, Kubeflow, and MLflow. Expertise includes automation through CI/CD, cross-functional collaboration, and AI-driven solution architecture. Holds an MS in Data Science and has a strong track record of leading successful AI/ML projects. Hands-on experience across leading observability and monitoring stacks including OpenTelemetry, Cribl, DataDog, Prometheus, Grafana, and ELK Stack.

Overview

12
12
years of professional experience
5
5
Certificates
8
8
years of post-secondary education

Work History

Team Lead AI/ML

Kion Group AG
07.2024 - 12.2024
  • Spearheaded implementation and deployment of scalable machine learning pipelines using Azure, KEDA functions, Kubeflow, and MLflow.
  • Architected and managed cloud environments on Azure, ensuring robust support and automation for machine learning pipelines through Infrastructure as Code (Terraform) and scripting (Python, Bash).
  • Developed and maintained CI/CD pipelines using GitHub Actions and Azure DevOps to enable continuous integration and deployment, significantly reducing release cycle times.
  • Collaborated with cross-functional teams to integrate security and compliance measures, aligning with enterprise cloud adoption best practices.
  • Provided end-to-end management and monitoring of containerized applications in Azure Kubernetes Service (AKS) and OpenShift clusters, optimizing performance and ensuring high availability.
  • Automated artifact management, container deployments, and environment provisioning using tools such as Docker, Jenkins, and Ansible.
  • Integrated OpenTelemetry and Cribl into cloud observability pipelines.
  • Implemented proactive monitoring and logging solutions (Prometheus, Grafana, ELK Stack) to maintain system reliability and facilitate rapid troubleshooting.

Technical Lead- LLM /GenAI

Encora Software India Pvt.Ltd
03.2024 - 06.2024
  • Led a team to design, evaluate, and deploy LLM-based solutions (e.g., an internal chatbot) with an emphasis on scalability and robust production-readiness.
  • Integrated Azure services (Cognitive Search, OpenAI) within a secure, automated pipeline using Infrastructure as Code (Terraform) and scripting.
  • Implemented CI/CD processes using GitHub Actions and Azure DevOps, ensuring reliable, continuous delivery of updates.
  • Established monitoring protocols for performance and security compliance across the cloud environment.
  • Implemented OpenTelemetry distributed tracing.

Site Reliability Engineer

CloudZenix
02.2023 - 07.2023
  • Assisted in monitoring and maintaining high availability and performance of production systems using tools like Prometheus and Grafana.
  • Developed and automated scripts in Python and Bash to support incident management and streamline troubleshooting processes, thereby minimizing system downtime.
  • Collaborated with the SRE team to implement and maintain CI/CD pipelines, ensuring seamless deployments and rapid recovery from issues.
  • Contributed to infrastructure automation efforts using Infrastructure as Code (Terraform) and configuration management tools, enhancing scalability and security.
  • Participated in on-call rotations and incident response, providing timely resolutions to system alerts and performance bottlenecks.
  • Worked cross-functionally to integrate security and compliance measures into the cloud infrastructure, aligning with enterprise best practices.

Sr. Product Engineer- DevOps, App Maintenance

Verint Systems India Pvt Ltd
12.2020 - 09.2022
  • Built and managed CI/CD pipelines across multiple cloud environments (AWS, GCP, Oracle, Azure) to streamline the deployment of AI models and applications.
  • Automated cloud infrastructure provisioning using IaC (Terraform) and configuration management tools.
  • Developed scripts (Python, Bash) for automating deployment, monitoring, and incident response processes.
  • Integrated advanced monitoring and logging solutions (Prometheus, Grafana, ELK) to ensure high availability, performance, and security compliance.

Sr. IT Engineer

UBS India business pvt ltd
03.2020 - 12.2020
  • Provided technical support and maintenance for Microsoft 365 and cloud-based solutions, incorporating automation to streamline operations.
  • Developed scripts in Python and PowerShell to automate routine tasks and system monitoring.
  • Assisted in the configuration and management of cloud environments to ensure compliance with security and enterprise standards.

Sr. DSS Tech engineer

AXA XL Business India Pvt Ltd
11.2017 - 06.2019
  • Designed and implemented ETL procedures using cloud-based platforms (AWS, Azure, GCP) for data warehousing solutions.
  • Monitored systems using tools such as CloudWatch, Opsgenie, and Splunk to ensure optimal performance and security.
  • Collaborated with cross-functional teams to maintain data integrity, security compliance, and operational efficiency.

Technical Consultant

Nice Interactive solutions India Pvt ltd
08.2015 - 08.2017
  • Sizing, tuning and pre-installation server checks before installation of NICE WFO-WFM product
  • Providing Technical support for NICE WFO-WFM application to the customers across Globe
  • Troubleshooting infrastructural and application issues related to the product
  • Integration and configuration of ACD (Cisco, Avaya, Nortal, Symposium) with WFM application
  • Managing application Hosted on cloud, overcoming cloud infrastructural issues
  • Maintaining and managing Platform, Application and Apache Web servers
  • Troubleshooting issues from backend by querying Database like Oracle, MSSQL and Postgres
  • Setting up scheduled tasks in application server for various smartsync jobs
  • Managing basic functions of recording system NICE Integration Management recording solution
  • Managing the complex distributed server environment both Linux and Windows
  • Migration of application from current version to latest version
  • Setting up remote connection to the different customer WFM server environments.

Senior Systems Engineer - MLOps Engineer

EPAM Systems (Contractual with ServiceNow)
01.2025 - Current
  • Built and maintained a Retrieval-Augmented Generation (RAG)-based LLM solution using Azure cloud services.
  • Developed production-grade pipelines integrating Azure AI Search, Azure Data Factory, Azure Document Intelligence, and other AI/ML components.
  • Implemented secure document ingestion with PII redaction, malware scanning, and robust parsing techniques. •Deployed and managed services in Azure private clusters with end-to-end security controls.
  • Integrated Kong API Gateway for traffic management and access control.
  • Enabled observability and real-time monitoring using Splunk. •The project is currently in production; engagement continues under contract until June 2025.

Windows Administrator Jr

Amdocs DVCI
12.2014 - 07.2015
  • Support Internal Amdocs Employees with Application, Hardware and Remote related issue
  • Had large level of access to Windows Server 2008 and Active Directory services
  • Handle calls, Emails mostly on issues such as VPN connection, Network, Outlook and office suite package problems and account related issues
  • Cater to issues relating to Cisco Configuration, Backup and storage
  • Handle issues on VMware supporting end customers with VDI related issues
  • Administering DNS, DHCP servers
  • Resolving DNS,WINS and AD Issues
  • Troubleshooting connectivity issue between different domains
  • Troubleshooting problems pertaining to System Performance, Network Administration and System Bugs
  • Auditing Users Authentication and Resource access
  • Co-ordinate with Dell/HP through site to troubleshoot the Hardware issues and replace the faulty Server hardware if it is under warranty
  • Analyzing logs to find root cause of an issue
  • Monitoring Windows 2003, 2008, 2008 and 2012 R2 Servers and trouble shooting
  • Providing hardware / software / network problem diagnosis / resolution for customer's end users
  • Windows Server 2008 and 2012 Active Directory Users and Groups
  • Resolving routine customer problem (Customers from UK, US and EMEA)
  • Routing problems to internal 3rd level IT support staff
  • Coordinate and manage relationships with vendors and support staff that provide hardware / software / network problem resolution
  • Responding to email, instant messages, and assigned tickets from users and assigning work orders / incidents to appropriate support teams and follow up until closure
  • Involved in deploying and supporting patch management, updating services and software installation.

Tech Support Associate

Mphasis
06.2013 - 07.2014
  • Support clients via phone with diagnosing and solving hardware as well as software issues pertaining to their Cargill imaged computers
  • Provided assistance and support of core load and web applications
  • Worked on RSA Pin reset, SDI hard token synchronization, Activation of token and deactivation
  • Provide resolutions to Internet and wireless issues
  • Install and Configure Lotus Notes and Cisco IP communicator
  • Add Blackberry users through Blackberry Delegation tool, Adding Blackberry users with BES Admin rights in Active Directory, Activating Blackberry phones and synchronizing the email, calendar and contacts
  • Troubleshoot and fix outlook issues
  • Install Symantec Endpoint Protection(SEP) anti-virus, updating policy
  • Use of Remedy system to create tickets and escalate issues to respective support teams.

Education

Master of Science - Data Science

International Institute of Information Technology
07.2021 - 08.2022

B.tech - computer science engineering

Kakatiya University
03.2009 - 03.2013

intermediate -

Narayana Junior College Under A.P Board
03.2007 - 03.2009

Matriculation -

Bhashyam Public School
03.2006 - 03.2007

Certification

ITIL Foundation Certificate in IT Service Management

Affiliations

Highly passionate for new technological applications with good working knowledge and would like to cherish any opportunity given to me in this field. Captivated in studying new languages and will easily mingle with other members.

Timeline

Senior Systems Engineer - MLOps Engineer

EPAM Systems (Contractual with ServiceNow)
01.2025 - Current

Team Lead AI/ML

Kion Group AG
07.2024 - 12.2024

Technical Lead- LLM /GenAI

Encora Software India Pvt.Ltd
03.2024 - 06.2024

Site Reliability Engineer

CloudZenix
02.2023 - 07.2023

Master of Science - Data Science

International Institute of Information Technology
07.2021 - 08.2022

Sr. Product Engineer- DevOps, App Maintenance

Verint Systems India Pvt Ltd
12.2020 - 09.2022

Sr. IT Engineer

UBS India business pvt ltd
03.2020 - 12.2020

Sr. DSS Tech engineer

AXA XL Business India Pvt Ltd
11.2017 - 06.2019

Technical Consultant

Nice Interactive solutions India Pvt ltd
08.2015 - 08.2017

Windows Administrator Jr

Amdocs DVCI
12.2014 - 07.2015

Tech Support Associate

Mphasis
06.2013 - 07.2014

B.tech - computer science engineering

Kakatiya University
03.2009 - 03.2013

intermediate -

Narayana Junior College Under A.P Board
03.2007 - 03.2009

Matriculation -

Bhashyam Public School
03.2006 - 03.2007
K. H. Vishwanathan IyerCloud Support, Technical Support And Server Admin