Bishwajeet Dey

Bangalore

Summary

Principal Site Reliability Engineer (SRE) with over a decade of experience designing, building, and operating large-scale, high-availability systems. Proven expertise in architecting scalable platforms, strengthening system reliability, and leading cross-functional initiatives that improved operations. Skilled in system design, automation, and observability, with a focus on delivering resilient, business-critical solutions.

Overview

11 years of professional experience
1 Certification

Work History

Principal Engineer, Site Reliability Engineering

Oracle
Bangalore
11.2021 - Current
  • Designed and implemented an in-house monitoring and observability platform that provided real-time status dashboards for various applications, centralized them in Grafana with Prometheus as the data source, and integrated OCI-based alerting. The platform enabled faster root cause analysis, improved mean time to recovery (MTTR), and delivered scalable cross-service visibility for operations teams.
  • Architected an organizational-level change automation platform integrating OCI Functions, APIs, and Jira workflows, significantly streamlining release and patch operations.
  • Built a self-service infrastructure provisioning portal using GitLab CI, Terraform, and Ansible, reducing manual dependencies and empowering teams with on-demand resource access.
  • Defined technical design blueprints and security architecture for internal tools, ensuring compliance with corporate security assurance processes before production rollout.
  • Established best practices for system scalability and reliability, mentoring junior engineers and fostering a culture of resilience.
  • Delivered robust documentation for service design and operations, improving onboarding efficiency and enabling cross-team collaboration.
  • Led on-call operations for critical production systems, performing incident triage, root cause analysis, and driving long-term reliability fixes through blameless postmortems.

Senior Site Reliability Engineer

GE Digital
Bangalore
01.2020 - 11.2021
  • Led Kubernetes (EKS) cluster design and optimization, improving application scalability, resilience, and resource utilization.
  • Designed container security enhancements with Falco, embedding runtime protection into Kubernetes workloads.
  • Integrated CI/CD pipelines with observability platforms (Jenkins + Splunk), improving proactive issue detection and release confidence.
  • Built synthetic monitoring solutions with Python and AppDynamics to simulate real-world user experience and improve availability.
  • Managed large-scale microservices deployments on AWS, enabling faster feature delivery and resilient rollback strategies.
  • Collaborated on incident response and postmortems, embedding reliability engineering principles into service design.

Senior DevOps Engineer

Oracle
11.2019 - 12.2019
  • Supported infrastructure automation and deployment pipelines, enabling seamless integration into OCI environments.

Senior Software Engineer

Amadeus Software Labs
Bangalore
04.2018 - 10.2019
  • Automated deployment workflows and logging pipelines, enhancing observability and reducing operational overhead.
  • Designed infrastructure-as-code solutions on AWS with Terraform, scaling environments for high-traffic travel applications.

Systems Engineer

Cerner Healthcare
Bangalore
07.2016 - 03.2018
  • Managed production and non-production healthcare systems with a focus on uptime, compliance, and automation.

Enterprise System Analyst

Unisys
Bangalore
07.2014 - 07.2016
  • Delivered system support and operational improvements across enterprise environments.

Education

Bachelor of Engineering - Electronics & Telecommunication

Chhattisgarh Institute Of Technology
Apr 2014

Skills

  • Cloud Services: AWS, Oracle Cloud (OCI)
  • Operating Systems: Linux administration and internals
  • System Design & Architecture: High availability, scalability, observability, fault tolerance
  • Automation & Infrastructure as Code: Python, Shell, Terraform, Ansible, GitLab CI, Jenkins
  • Containerization & Orchestration: Kubernetes, Docker
  • Serverless Computing: AWS Lambda, OCI Functions
  • Monitoring & Logging: Prometheus, Grafana, Splunk, AppDynamics
  • Version Control & Collaboration: Git, Bitbucket, Jira, Confluence
  • Incident & Change Management: RCA, process automation, service health dashboards, on-call operations

Certification

  • Site Reliability Engineering Foundation
  • Oracle Cloud Infrastructure Certified Architect Associate
  • ITIL Foundation
