DevOps and SRE professional with 5+ years of experience enhancing system performance and security using AWS services, including IAM, EC2, and Kubernetes. Expertise in CI/CD pipeline management and infrastructure automation with Terraform and Docker. Proficient in incident management and root cause analysis, ensuring seamless application releases through cross-team collaboration.
• Proficient in AWS services such as IAM, EC2, EBS, EFS, S3, CloudFront, Route53, RDS, VPC, SG, NACL, Lambda, ECR, EKS, CloudWatch, ALB, NLB, SNS, with a focus on optimizing performance, security, and scalability.
• Managed IAM policies, roles, and permissions to enforce least privilege access and secure resource access across AWS services. Integrated IAM with AWS SSO for centralized authentication and configured trust relationships for cross-account access.
• Managed Amazon S3 buckets for scalable storage, ensuring proper access controls using IAM policies and S3 bucket policies. Implemented versioning, lifecycle policies, and data archival strategies using S3 Glacier for cost-effective storage management.
• Skilled in designing and implementing CI/CD pipelines using Jenkins, GitHub, and Maven, ensuring continuous, efficient
code delivery, and enhancing software quality.
• Hands-on experience with Terraform for infrastructure as code (IaC), enabling the automated provisioning and management of cloud resources.
• Proficient in containerization and orchestration using Docker and Kubernetes (EKS) for deploying scalable, highly available applications in cloud environments.
• Managed and maintained Kubernetes clusters on Amazon EKS, ensuring high availability and scalability across multiple availability zones. Deployed containerized applications on EKS, utilizing Kubernetes features like deployments, services, and Ingress controllers.
• Managed Docker container images and deployed them through Amazon ECR for seamless application delivery.
• Configured Horizontal Pod Autoscaler and Cluster Autoscaler in EKS for workload management and optimization.
• Ensured security in EKS by configuring IAM roles, Kubernetes RBAC, and using AWS Secrets Manager for sensitive data.
• Managed Amazon RDS databases (MySQL), ensuring high availability, backups, and performance optimization. Automated RDS backups and snapshots, and used Multi-AZ deployments for improved availability and reliability.
• Experienced in monitoring solutions with CloudWatch, Prometheus, Grafana, and ELK Stack to track system health, analyze metrics, and optimize performance.
• Strong understanding of security practices and best practices for cloud services to ensure compliance, mitigate risks, and
safeguard sensitive data.
• Focused on incident management, prioritizing issues based on SLA, SLO, and SLI, ensuring rapid resolution and improved system reliability.
• Conducted Root Cause Analysis (RCA) and Post-incident reviews (PIRs) to identify issues, prevent recurrence, and improve
system performance and stability.
• Collaborated with development, QA, and operations teams to streamline workflows, enhance deployment processes, and enable seamless application releases across Dev, QA, and Production environments.