Results-driven Site Reliability Engineer with 4 years of experience in ensuring the reliability, performance, and scalability of complex distributed systems. Expertise in proactive monitoring, issue resolution, and infrastructure optimization. Adept in deployment automation, cost optimization, and supporting development teams to achieve seamless production releases.
Infrastructure Operations & Optimization:
Deployment & Release Management:
Distributed Systems & Troubleshooting:
Teamwork & Mentorship:
Key Achievements:
Programming Languages: Python, Shell Script
Server administration: Linux
Databases: PostgreSQL, MongoDB, Redis
Cloud Platforms: AWS Services (EC2, S3, CloudWatch, RDS, etc)
Containerization: Docker
IAC: Ansible, Packer
Monitoring and logging Tools: AWS CloudWatch, Grafana, graylog, kibana, elasticsearch
Alerting Tools: Nagios, Cloudwatch alarms
CI/CD Tools: Ansible