Summary
Overview
Work History
Education
Skills
Websites
Certification
It Skill Set
Timeline
Generic

Kapil Dev

Jammu,Jammu and Kashmir

Summary

Results-driven Site Reliability and DevOps Engineer with extensive experience at Weave and Think Future Technologies. Proven expertise in managing large-scale Kubernetes clusters, developing Terraform modules for enhanced alerting and monitoring, building custom Kubernetes operators, and implementing security and compliance through Open Policy Agent and advanced telemetry solutions. Led the development of a Go-based GRPC API to facilitate migration from PostgreSQL to starrocks, contributing to critical API endpoints, and crafted Python scripts for cluster monitoring to track upgrades and proactively manage incidents. Demonstrated success in migrating HashiCorp Vault storage to a Raft-based system and delivering robust multi-cloud architectures. Proficient in leveraging diverse cloud services, integrating multiple data sources, and designing scalable application architectures to drive operational excellence and innovation.

Overview

7
7
years of professional experience
1
1
Certification

Work History

Site Reliability Engineer

Weave
09.2021 - Current
  • Deploy services on Kubernetes using GitHub Actions and Argocd
  • Infrastructure management using Terraform and Atlantis
  • GitHub Action workflows for application deployment and Docker image builds
  • Use of distroless base images to reduce CVEs
  • Deadman switch to alert team for infrastructure alerts
  • Migration of Vault storage to raft storage
  • Python script to alert the team on Kubernetes upgrade via Slack
  • Deployment of Cloud Functions and Cloud Run using Terraform
  • MySQL migration and read replica setup for HA
  • Analytics migration from PostgreSQL to starRocks database
  • Modified Analytics API in Go to support starRocks and fetch data from Kafka
  • Opentelemetry distributed tracing with quickwit
  • Maintain config Connector and Kubernetes operators (Prometheus, strimzi, Helm charts)
  • Implemented security scanning tools to detect passwords or sensitive information in GitHub repositories
  • Monitored automated build and continuous software integration process to drive build/release failure resolution.
  • Worked on GCP to build custom models and data labeling services for call intel in analytics.
  • Implemented monitoring and logging solutions using Prometheus and Grafana, enhancing system visibility and reducing incident response.
  • Implemented robust Kubernetes security by integrating Open Policy Agent (OPA) for compliance and developing custom operators, enhancing overall system security and operational efficiency.
  • Developed and deployed AWS Lambda functions while automating infrastructure provisioning using AWS CDK, resulting in improved scalability and streamlined deployment processes.

DevOps Engineer

Think Future Technologies
12.2018 - 09.2021

As a DevOps Engineer at Think Future Technologies, responsibilities included working on Kubernetes, implementing in-house Kubernetes clusters to manage containerized applications efficiently. Infrastructure automation was a key focus, utilizing Ansible for configuration management and Terraform for provisioning and managing cloud resources. Additionally, Jenkins pipelines were developed and maintained to streamline CI/CD processes, ensuring smooth and automated software deployments. The role involved optimizing workflows, improving infrastructure reliability, and enhancing deployment automation.

System Admin

Appster LLC
04.2018 - 12.2018

As a System Administrator at Appster, i am responsible for managing cloud infrastructure on AWS and GCP, ensuring the smooth deployment and maintenance of resources, set up new environments for development, staging, and production, configuring servers, networking, and storage to support application deployment. Additionally, implemented alerting and monitoring systems to track system performance, detect issues. Automation was a key part of role, where I used shell scripts to streamline repetitive tasks, improve efficiency, and reduce manual effort.

Education

Electronics And Communication Engineering -

University of Jammu
Jammu And Kashmir
06-2016

Skills

  • Cloud Infrastructure Oversight
  • Proficient in Shell Scripting
  • Experienced in Python Development
  • Git Workflow Optimization
  • Bitbucket Repository Management
  • Containerization Expertise
  • Proficient in Kubernetes Management
  • ECS Automation Expertise
  • Terraform Automation Expertise
  • Infrastructure Automation Expertise
  • Incident management
  • Software development
  • Linux administration
  • Load balancing
  • System monitoring
  • Disaster recovery
  • Capacity planning
  • Continuous deployment
  • Strategic planning
  • Security best practices

Certification

  • AWS Certified Solution Architect and Developer Associate
  • Terraform Certified Associate
  • Certified Kubernetes Administrator
  • DevOps Certification Training - Edureka, 06/01/18, https://www.edureka.co/devops
  • AWS for DevOps: Continuous Delivery and Process Automation (Lynda.com)
  • Certificates of Completion (Lynda.com), Azure: Create and Manage Virtual Machines, Design and Implement Storage Strategy, Microsoft Azure: Design and Deploy ARM Templates

It Skill Set

AWS, GCP, Microsoft Azure, Paperspace, N'cloud, Shell, Python, Git, Bitbucket, Jenkins, Ansible, Docker, Kubernetes, ECS, Terraform, CloudFormation, Apache2, Nginx, Tomcat, Puma, Windows 8, Windows 7, Windows XP, UNIX, Linux, macOS, Sublime Text 2, Sublime Text 3, Git, Elasticsearch, Zabbix, Nagios, New Relic, Redshift, BigQuery

Timeline

Site Reliability Engineer

Weave
09.2021 - Current

DevOps Engineer

Think Future Technologies
12.2018 - 09.2021

System Admin

Appster LLC
04.2018 - 12.2018

Electronics And Communication Engineering -

University of Jammu
Kapil Dev