Senior Software Engineer
1. Routing Next Gen
- Collaborated with a senior architect to execute the migration of NGINX routing from PKS to Kubernetes on Bare Metal (KoB), enhancing infrastructure control and aligning with long-term architectural goals.
- Contributed to the infrastructure setup and helped build GitLab CI/CD runners to automate the generation and deployment of NGINX configurations through a structured release strategy.
- Supported the integration of observability tooling including Loki, Prometheus, and Grafana, enabling real-time insights into routing performance and deployment health across environments.
2. EIROS (Expedite Incident Resolution and Optimized Solutions)
- Led the deployment of Root Cause Analysis (RCA) and incident prediction models on Kubernetes with Persistent Volume Claims (PVCs) for model storage and retrieval.
- Developed a data pipeline integrating multidimensional incident and telemetry sources, achieving a 15% reduction in MTTR and MTRS, and improving service reliability.
- Coordinated a feedback initiative with five user groups to align solutions with user workflows and pain points; received recognition for improving product line/owner identification in major incident triage.
3. SRE-AI-Hub
- Consolidated multiple internal AI tools into a unified SRE platform, featuring intelligent incident chatbots, dynamic knowledge base search, and RCA explainability, all seamlessly integrated with EIROS.
- Incorporated advanced AI techniques including RAG (Retrieval-Augmented Generation) and Agentic AI to deliver context-aware, generative responses for operational efficiency and troubleshooting.
- Built a FastAPI-based NGINX configuration generator that dynamically converts structured JSON input into production-ready NGINX configurations, supporting scalable routing for GenAI endpoints.