Summary
Overview
Work History
Education
Skills
Certification
DECLEARATION
Timeline
Generic

Brijesh Kumar Mishra

Pratapgarh,uttar pradesh

Summary

Strategic and technically adept Senior Site Reliability Engineer (SRE) with 12+ years of experience in Cloud Infrastructure, DevOps automation, and cloud-native application deployments across public and private cloud infrastructures including Microsoft Azure, VMware vSphere, Nutanix AHV, and Red Hat OpenShift. Proven expertise in managing large-scale Kubernetes and Docker environments, implementing CI/CD pipelines, and driving infrastructure automation using Terraform, Ansible, Helm, Git, and Azure DevOps. Experienced in Linux system administration across RHEL, CentOS, and Ubuntu environments with strong knowledge of cloud infrastructure operations, hybrid cloud management, and Site Reliability Engineering practices focused on automation, scalability, resiliency, and high availability. Hands-on experience in cloud infrastructure integration and production-grade deployments across enterprise environments. Extensive expertise in enterprise hardware provisioning, troubleshooting, and infrastructure maintenance involving Dell XR11, Dell R750, HP DL360 Gen10 Plus, PowerFlex, PowerMax, and Dell EMC Unity platforms. Skilled in monitoring and observability tools including Grafana, Prometheus, Azure Monitor, and vROPS for proactive incident management and performance optimization. Strong ability to collaborate with cross-functional teams, troubleshoot complex infrastructure issues, and deliver reliable, secure, and scalable cloud-native solutions across hybrid and multi-cloud environments.

Knowledgeable [Desired Position] with solid background in maintaining and enhancing system reliability. Demonstrated success in implementing automated solutions to streamline processes and improve system performance. Proven ability to troubleshoot complex issues and optimize infrastructure through proactive monitoring and collaboration.

Overview

2
2
Certification
14
14
years of professional experience

Work History

Senior Site Reliability Engineer

Dell Technologies
11.2023 - Current
  • managing cloud infrastructure operations on Microsoft Azure public cloud alongside VMware vSphere 7 & private cloud data centers leveraging Red Hat OpenShift and RHEL environments.
  • Managed daily data center hardware provisioning, maintenance, and troubleshooting involving Dell XR11, HP DL360 Gen10+, Dell R650s, Supermicro, and Dell EMC Unity servers.
  • Administered Azure cloud services featuring virtual machines, VNets, NSGs, load balancers, Azure Monitor, storage accounts, RBAC, and Azure policies.
  • Managed operational aspects of hybrid cloud infrastructure including Azure, VMware, vSphere, Nutanix, AHV, OpenShift, and RHEL environments.
  • Mitigated significant data center hardware challenges including HDD failures, PCIe riser card faults, memory errors, NIC connectivity disruptions, CPU thermal alerts, and BIOS updates while preserving minimal downtime.
  • Directed cloud infrastructure operations across Microsoft Azure public cloud and private cloud environments.
  • Executed and directed Ansible playbooks to facilitate automation of BIOS configurations, OS patching, and application stack provisioning across both physical and virtualized environments.
  • Conducted comprehensive Linux system administration, including user management, service setup, OS updates, and performance optimization on RHEL and Ubuntu systems.
  • Administered hybrid cloud infrastructure uniting Azure, VMware, RHEL, and OpenShift.
  • Implemented automated provisioning for over 50 services through Terraform and Ansible.
  • Managed hardware troubleshooting operations related to BIOS, HDD, NIC, and CPU alerts.
  • Orchestrated CI/CD pipeline deployment through GitHub Actions and Azure DevOps.leading site reliability engineering (SRE) initiatives prioritizing automation of system resilience and improvements in service availability across telco cloud infrastructure.
  • Coordinate feature testing, regression testing, and defect management through Jira while maintaining detailed technical documentation in Confluence.
  • Optimized cloud governance and security by utilizing Azure policies, applying role-based access control (RBAC), and deploying automated auditing scripts for minimizing misconfigurations.
  • Administered user permissions, cron jobs, and service configurations (systemd) to ensure seamless operations for development and operations teams.
  • Conducted comprehensive system monitoring log analysis with tools including journalctl and syslog, addressing performance discrepancies in network and disk through tools such as top, htop, vmstat, and netstat.

System Administrator and Tester

Nokia Solutions and Networks R&D Centre
07.2022 - 11.2023
  • Spearheaded installation and commissioning of Nokia NetAct NMS systems using Bare Metal Servers (HP DL360 Gen10 Plus and Dell R650s), integrating with VMware vSphere and Red Hat OpenShift (3+3+1 architecture).
  • Developed and executed Ansible roles and inventories to streamline OpenShift post-install configurations and automate node/cluster-level setup tasks in lab and staging environments.
  • Performed Linux administration on RHEL systems including service management, network configuration, and troubleshooting of deployment issues.
  • Managed backup and disaster recovery processes using Dell Avamar, ensuring data resilience for OpenShift-based deployments.
  • Executed NZDT (Near Zero Downtime) upgrade and rollback activities with thorough planning and coordination across lab and production environments.
  • Implemented and maintained CI/CD pipelines for automated software testing, deployment, and configuration validation workflows.
  • Developed and executed Ansible roles and inventories to streamline OpenShift post-install configurations and automate node/cluster-level setup tasks in lab and staging environments.
  • Performed Linux administration on RHEL systems including service management, network configuration, and troubleshooting of deployment issues.
  • Delivered NetAct NMS cluster deployments (3+3+1 & SNO).
  • Led feature testing, RCA, EDA, and FTS documentation.
  • Handled OpenShift & VMware provisioning, backup via Dell Avamar.
  • Administered Azure resources and automated via ARM/Terraform.
  • Permanent
  • R&D NMS (MN)

Senior Technical Lead

HCL Technologies Limited
10.2021 - 07.2022
  • Migrated 15+ legacy monolithic applications into Docker containers, deployed on Kubernetes clusters, resulting in 35% improved system uptime and faster release cycles.
  • Led Telco Cloud CNF deployments for Samsung and Mavenir across multiple telecom customers using Kubernetes, Helm, and CI/CD automation.
  • Migrated legacy apps to Docker/K8s with OpenShift & AKS.
  • Designed Helm charts, automated CI/CD in Azure DevOps.
  • Configured RAID/LUN, OS, BIOS/firmware upgrades.
  • Permanent
  • Rakuten, One-to-One

Sr. RAN deployment engineer

Amdocs development Centre
06.2018 - 10.2021
  • CI/CD pipeline setup (Azure DevOps, GitLab).
  • Monitoring via Prometheus, Azure Monitor.
  • Automated infrastructure with Terraform, Ansible.
  • Improved system reliability by implementing health checks, liveness probes, and readiness probes in Kubernetes deployments.
  • Permanent
  • Open RAN

Network Integration specialist

Altran Technologies (Nokia GDC Chennai)
01.2018 - 06.2018
  • Administered and maintained Linux servers (RHEL, CentOS, Ubuntu) for production and staging environments, ensuring 99.9% uptime across all systems.
  • Handled user management, file permissions, cron jobs, and service configurations (systemd), supporting development and operations teams.
  • Performed system monitoring, log analysis (journalctl, syslog), and troubleshooting for performance, network, and disk issues using tools like top, htop, vmstat, and netstat.
  • Permanent
  • AT&T US

Technical Support Engineer & Ran Trainer

Mobile COMM Professionals
02.2017 - 12.2017
  • Configured and maintained Linux servers (RHEL, CentOS, Ubuntu) in cloud and on-prem environments, ensuring high availability and secure operations.
  • Monitored infrastructure health and performance using Prometheus, Grafana, Nagios, and Azure Monitor, setting up real-time alerts for CPU, memory, disk, and service status.
  • Performed log analysis using journalctl, syslog, and Cloud-native tools like Azure Log Analytics, enabling proactive issue detection and resolution.
  • Automated health checks and reporting scripts using Bash and Python, reducing manual monitoring efforts by 60%.
  • Permanent
  • AT&T US and T Mobile US

FAULT MANAGEMENT ENGINEER

N R SWITCH-N-RADIO SERVICES PRIVATE LIMITED
10.2015 - 07.2016
  • Installed, configured, and updated Linux OS, security patches, and applications.
  • Automated user account management, backups, and patching using Shell scripts.
  • Monitored infrastructure with Prometheus, Grafana, and Azure Monitor, reducing mean time to resolution (MTTR) by 50%.
  • Permanent
  • Bharti Airtel India

O&M engineer

Nokia Solutions and Networks India (P) Ltd
11.2012 - 02.2015
  • Installed, configured, and updated Linux OS, security patches, and applications.
  • On the payroll of InTarvo Technologies Ltd
  • Vodafone India

Education

B.Tech. - Electronics & Communications

RTU Rajasthan
JAIPUR RAJASHTHAN
01-2010

12th UP BOARD -

GIC PRATAPGARH
PRATHAPGARH UTTAR PRADESH
01-2004

10th UP BOARD -

GIC PRATAPGARH
PRATHAPGARH UTTAR PRADESH
01-2002

Skills

  • Cloud Platforms: Azure, VMware vSphere 7/8, OpenShift, RHEL, HCI Nutanix
  • Infrastructure Automation: Terraform, Helm Charts, Ansible, YAML
  • Containers: Docker, Kubernetes, OpenShift
  • Bare Metal Hardware: Installation, provisioning, and deep-level troubleshooting for Dell XR11, Dell R750, HP DL360 Gen10 Plus
  • Helm Chart Management: Designing, templating, packaging, and upgrading Kubernetes applications using Helm
  • Hyper-Converged Infrastructure (HCI) & Enterprise Storage: Worked on HCI infrastructure platforms including Nutanix AHV, Dell PowerFlex, PowerMax storage, and Metalsoft for cloud infrastructure provisioning, storage management, virtualization, and data center automation
  • Programming & Scripting: Bash, Shell, Python
  • CI/CD & Version Control: Azure DevOps, Git, GitHub Actions
  • Linux & Systems: RHEL, CentOS, Ubuntu, User/Service/File System Mgmt
  • Monitoring & Backup: Grafana, Prometheus, Dell Avamar, Azure Monitor
  • Networking & Protocols: DNS, CIDR, SR-IOV, Multus, MACVLAN
  • Storage: Dell EMC Unity, Unisphere, RAID/LUN, NAS/SAN
  • Project & Issue Tracking: JIRA, Confluence, ServiceNow, Infoblox
  • System Maintenance: Patching changes, firmware updates, BIOS upgrades, OS installation, and troubleshooting
  • Monitoring Tools: Grafana, Prometheus, V-Center, V-ROPS
  • Version Control: Git, Azure Repos

Certification

  • Microsoft Azure Administrator Associate (AZ-104)
  • Certified Kubernetes Administrator (CKA)
  • Nutanix Certified Professional - Multicloud Infrastructure

DECLEARATION

I hereby declare that all the information provided above is true to the best of my knowledge. Date: - Place: - Bangalore (Brijesh Kumar Mishra)

Timeline

Senior Site Reliability Engineer

Dell Technologies
11.2023 - Current

System Administrator and Tester

Nokia Solutions and Networks R&D Centre
07.2022 - 11.2023

Senior Technical Lead

HCL Technologies Limited
10.2021 - 07.2022

Sr. RAN deployment engineer

Amdocs development Centre
06.2018 - 10.2021

Network Integration specialist

Altran Technologies (Nokia GDC Chennai)
01.2018 - 06.2018

Technical Support Engineer & Ran Trainer

Mobile COMM Professionals
02.2017 - 12.2017

FAULT MANAGEMENT ENGINEER

N R SWITCH-N-RADIO SERVICES PRIVATE LIMITED
10.2015 - 07.2016

O&M engineer

Nokia Solutions and Networks India (P) Ltd
11.2012 - 02.2015

B.Tech. - Electronics & Communications

RTU Rajasthan

12th UP BOARD -

GIC PRATAPGARH

10th UP BOARD -

GIC PRATAPGARH
Brijesh Kumar Mishra