Summary
Overview
Skills
Work History
Education
Accomplishments
Timeline
SeniorSoftwareEngineer
Surbhi Bhadviya

Surbhi Bhadviya

Summary

Surbhi is passionate about building and running complex

systems, and can quickly grab & understand how system

works. She's always curious to learn, implement and love

to share knowledge.

Overview

8
8
years of professional experience

Skills

  • Monitoring & Logging: Grafana, Instana, Prometheus, Kibana, OpenSearch, Logstash, Amazon Cloudwatch, Ingress, Opsgenie
  • Kubernetes & Orchestration: Docker, Harbor, Kubernetes, Helm
  • Certificate Management: Venafi, SSL/TLS certificates
  • Infra as Code (IaC): Terraform
  • Alerting & Notification Tool: OpsGenie
  • Cloud Services: AWS, GCP, Databricks
  • Data Streaming & Messaging: Kafka
  • CI/CD: Jenkins, GitLab, GitHub Actions
  • Platform: Salesforce (SFCC), SFMC
  • Security & Compliances: Fortanix, Vault, Venafi, AWS Security Best Practices
  • Database: MongoDB, SQL, NoSQL
  • Service Mesh: Consul
  • Programming Languages: NodeJs, JavaScript, Python, GO, Shell scripting, Microservices, ReactJs
  • Collaboration Tools: Jira, Confluence, Slack
  • Version Controlling Tools: GitHub, Bitbucket
  • AI Technologies: Generative AI, AutoML

Work History

Software Engineer (Site Reliability Engineer)

adidas
03.2023 - Current
  • Infrastructure & Automation: Spearheaded the migration of AWS resources to Terraform and automated CI/CD pipelines using Jenkins and GitLab, reducing deployment times by 40%.
  • Monitoring & Alerts: Implemented automated Grafana dashboards using Jsonnet for real-time data visualization and proactive monitoring, maintaining 99% uptime.
  • Cost & Resource Optimization: Identified and removed unused AWS resources, achieving significant cost savings; optimized Kafka consumers for efficient data streaming.
  • Holiday Readiness: Established a 24x7 support channel for peak seasons, ensuring minimal downtime
  • Security & Compliance: Designed and integrated security measures and compliance protocols for email notification systems using AWS API Gateway and Lambda; managed certificate upgrades using Venafi.
  • Disaster Recovery & Performance Monitoring: Developed comprehensive disaster recovery plans; produced and maintained technical documentation
  • Collaboration & Process Improvement: Worked with cross-functional teams to develop, test, and deploy scalable solutions, reducing downtime for critical applications through regular maintenance.
  • Hackathon Project: Implemented Send Time Optimization (STO) using AI to analyze email responses and send emails at optimal times, increasing open rates by up to 25%. Utilized Generative AI for personalized email content, improving engagement rates.

Technology Lead | Site Reliability Engineer

RattanIndia Enterprises
08.2022 - 03.2023

NeoSky is a part of RattanIndia group, the project is focused on developing consumer drones

  • Team Leadership: Led a team of 6 to develop technical specifications for APIs and web-based user experiences.
  • SRE Principles: Implemented SLA, SLO, and SLI concepts, focusing on availability and observability.
  • Project Management: Integrated Jira to effectively work with the engineering teams and management, to prioritize, manage the backlogs and ensure to follow agile process
  • Cloud Management: Managed AWS instances, S3 buckets, and CI/CD pipelines, ensuring security and quality.
  • Monitoring Systems: Integrated monitoring & logging systems like Grafana and Kibana.
  • Collaboration: Engaged with technical and cross-functional teams to reduce latency and error budgets.

Specialist Programmer

Infosys
01.2021 - 08.2022
  • KPI Monitoring Application: Developed KPI monitoring application highly used in cyber weeks & critical sales in ReactJS
  • Feature Development: Collaborated with peers on the development of new feature (eg. Loqate, Mutinode) or fix, review code & the code quality, and apply monitoring to deploy the feature successfully on production
  • System Availability: Responsible for maintaining the availability of systems and services once they're in production, starting by setting service-level objectives (SLOs), service-level agreements (SLAs) and service-level indicators (SLIs) for the underlying service
  • Sprint Planning: Sprint Refine & Planning discussion, feature prioritization, impact analysis, Implemented Logging, monitoring dashboard in Grafana using Jsonnet to monitor performance metrics (like PUDO, CnC, chkapi)
  • Pre-Production Checks: Ensure secret keys, metrics, readiness, cluster usage, server availability before production build
  • Release Management: Ensured production release guidelines (Release Fitness Go/No GO) and implementation are adhered to for changes to Production
  • Defect Tracking: Track down defects and come up with innovative solutions to improve observability, resiliency and availability
  • Incident Management: Provided support to client on-call bridge 24*7, incident report, troubleshoot production issues and fixed bugs and errors, and resolved any IT or Ops related issues
  • Mentorship: As a SRE Lead, mentored 6 team members and worked with cross-functional team members

Software Developer

Amdocs
07.2016 - 01.2021
  • Application Development: Implemented apps using the MERN stack, Jira, ServiceNow, Slack, chatbot, and Alexa
  • CI/CD Automation: Boosted testing capabilities from manual to 70% automatic by implementing e2e testing pipeline as part of CI/CD architecture
  • Development: As a full stack developer, worked on View360 (a crossplatform application with the intention to provide management on the go).The application is based on the MeteorJs framework to provide real time updates to the client devices
  • Logging & Monitoring: Implemented and setup logging, alerting, monitoring of the application based on the running services, servers up time, restart server automatically to reduce human interaction and send the push notification over registered mobile to notify developers once threshold breach
  • API Architecture: Designed and developed API flow architecture increasing API maintainability by 80%
  • Automated Testing: Boosted testing capabilities from manual to 70% automatic by implementing e2e testing pipeline as part of CI/CD architecture
  • Resource Optimization: Improved server utilization, toil elimination by identifying unnecessary resources and decommissioning them when possible.

Education

Computer Engineering -

Institute of Engineering and Technology, DAVV
04.2016

Accomplishments

    Hackathon:

  • During a recent hackathon, developed an AI-driven Send Time Optimization (STO) system to enhance email management. By analyzing email response patterns, the system determined optimal send times, boosting open rates by up to 25%. Additionally, leveraged Generative AI and AutoML in Databricks to personalize email content on a 1:1 basis, significantly improving user engagement through individualized subject lines and messages. This innovative project demonstrated her ability to apply advanced AI techniques to real-world challenges, resulting in improved email campaign performance.
  • Dopple Ganger Application

  • Designed and Implemented a user friendly UI for a client, where user can do the web search by Voice/Text or image search and get the expected result with approx. 99% accuracy.
  • Appreciated for delivering a user-friendly UI with 99% accuracy before the deadline.
  • Alexandrians

  • Awarded for integrating VR chatbot with Alexa during a Voicathon event.

Timeline

Software Engineer (Site Reliability Engineer)

adidas
03.2023 - Current

Technology Lead | Site Reliability Engineer

RattanIndia Enterprises
08.2022 - 03.2023

Specialist Programmer

Infosys
01.2021 - 08.2022

Software Developer

Amdocs
07.2016 - 01.2021

Computer Engineering -

Institute of Engineering and Technology, DAVV
Surbhi Bhadviya