About
Senior DevOps and Site Reliability Engineer with over 5 years of experience designing, deploying, and managing scalable cloud infrastructure across AWS, Azure, and GCP. Uses Kubernetes, Docker, Terraform, and CI/CD pipelines to improve operational efficiency, system performance, and availability. Track record in automation, cloud security, and performance monitoring with Prometheus, Grafana, and New Relic, including a 25% reduction in IT costs and a 35% improvement in system observability for enterprise-grade applications.
Work
Summary
Led the development and management of critical infrastructure supporting high-volume financial transactions.
Highlights
Configured and managed Kubernetes clusters using Helm charts and Kustomize with GitOps, streamlining deployments and enhancing system stability.
Deployed and maintained 9+ payment gateways processing over 2 million transactions daily with high availability.
Automated secure code scanning pipelines, resulting in a 25% reduction in deployment vulnerabilities.
Built dynamic monitoring dashboards with Prometheus, Grafana, and ELK stack, improving issue detection by 40%.
Designed and managed 15+ multi-branch CI/CD pipelines, significantly accelerating deployment cycles and improving error tracking efficiency.
Enhanced security posture by implementing and managing SSL/TLS certificates and their rotation.
Summary
Engineered and optimized cloud infrastructure solutions for a global data center provider.
Highlights
Deployed and managed 20+ private Kubernetes clusters using SUSE Rancher, supporting applications for over 500,000 users.
Automated 80% of infrastructure deployments using Terraform, reducing provisioning time by 60%.
Developed templates for repeatable deployments, cutting manual setup effort by 30% and ensuring consistency.
Participated in post-mortem reviews and implemented the resulting recommendations, raising system reliability to 99.99%.
Enhanced system observability with tools like Loki and Prometheus, boosting troubleshooting efficiency by 35%.
Created comprehensive documentation and knowledge base articles, reducing issue resolution times for internal and external stakeholders by 40%.
Summary
Played a key role in cloud migration and infrastructure automation initiatives for a global logistics company.
Highlights
Migrated over 100 on-premises servers to Azure, achieving 25% cost savings on infrastructure.
Developed and maintained CI/CD pipelines with Jenkins and Azure DevOps, reducing deployment times by 40%.
Provisioned and managed Azure Virtual Desktop across 17 host pools, facilitating remote collaboration for teams across geographies.
Automated infrastructure and application monitoring with Python, Bash, Terraform, and New Relic, using auto-triggered pipelines to raise overall monitoring coverage to 98%.
Monitored Azure-hosted services with Application Insights and New Relic, improving performance tracking by 90%.
Administered Linux systems, including user management, permissions, and performance tuning.
Summary
Contributed to the optimization and management of cloud infrastructure and CI/CD processes.
Highlights
Implemented and optimized CI/CD pipelines, enabling 50% faster software releases.
Managed over 200 AWS resources, including EC2, S3, and RDS, ensuring seamless scaling and high availability.
Designed and optimized relational database schemas for high-performance applications using MySQL and PostgreSQL.
Configured Kubernetes clusters for 15+ containerized applications, significantly improving workload distribution.
Automated deployment processes using Terraform, decreasing provisioning times by 50%.
Secured containerized environments with standard security measures, reducing vulnerabilities by 20%.
Languages
English
Fluent
Certificates
Skills
Cloud Platforms
AWS, Azure, GCP, Multi-Cloud Environments.
Containerization & Orchestration
Kubernetes, Docker, Helm, Kustomize, SUSE Rancher.
Infrastructure as Code (IaC)
Terraform, Ansible, GitOps.
CI/CD
Jenkins, GitHub Actions, Azure DevOps, Pipeline Automation.
Monitoring & Observability
Prometheus, Grafana, Zabbix, New Relic, ELK Stack (Elasticsearch, Logstash, Kibana), Loki, Application Insights.
Scripting & Programming
Python, Bash Scripting.
Databases & Messaging
MySQL, PostgreSQL, Redis, RabbitMQ, Kafka.
Operating Systems
Linux, Windows Server.
Security
Cloud Security, SSL/TLS Certificates, Vulnerability Management, Secure Code Scanning.
Networking
Networking Concepts, DNS, Load Balancing.
Version Control
Git, GitHub.