Senior SRE professional involved in the analysis, design, development, deployment, and maintenance of applications and infrastructure. Proficient with configuration management tools and in developing CI/CD pipelines. Strong domain knowledge of AWS cloud, infrastructure configuration management, and container services.
Lead Site Reliability Engineer
- Designed and implemented continuous integration and delivery pipelines for multiple projects, resulting in faster and more reliable software releases
- Secured GCP load balancers for Cloud Run and Kubernetes-based user-facing products
- Infrastructure and DevOps Engineer: actively managed, improved, and monitored infrastructure on Kubernetes with Splunk and Grafana, and on AWS with CloudWatch
- Developed new monitoring and enhanced existing monitoring solutions to prevent downtime of Prisma Access and its components
- Automated, in Python, upgrade workflows that were not available in the platform for various components of the vSphere environment
- Automated the entire CI/CD pipeline using Jenkins, GitHub, and JFrog Artifactory, with automation scripts written in Python and Bash
- Created and maintained a fully automated CI/CD pipeline using Jenkins, Spinnaker, and Kubernetes on the Azure platform
- Reduced site downtime by implementing an automated monitoring and response system
- Experience in Linux system administration and in build engineering and release management processes, including end-to-end code configuration, building binaries, and deployments
- Ensured site reliability by monitoring site performance and investigating site issues
Site Reliability Engineer Fashion Ecommerce Company
- Troubleshot infrastructure-level issues on core AWS services such as EC2, VPC, load balancers, S3, Route 53, CloudWatch, RDS, IAM, Lambda, Elasticsearch, CloudFront, and CloudTrail
- Managed the build server and set up and monitored daily continuous builds and deployments to the development, test, and production environments
- Managed cloud infrastructure on Google Cloud, Microsoft Azure, and Amazon Web Services.
- Created various automations using different technologies (Python, Go, Bash)
- Set up and managed CI/CD pipeline scripts for UAT and PROD deployments
- Developed a Splunk dashboard to display reports and used Splunk alerts to handle critical failure scenarios
- Developed a multi-cloud microservices architecture on Kubernetes
- Automated SRE platform installation using configuration management tools
- Developed APIs using the Django framework to support SSO authentication and to list, insert, update, and delete device details
- Responsible for high availability of the environment, OS updates, software security patches, KPIs, alerting, and monitoring
Cloud Support Engineer Workplace/Company
- Handled setup, configuration, upgrades, maintenance, performance monitoring, and troubleshooting of servers running on different Linux OS platforms
- Worked with developers to convert applications into microservices using Docker
- Designed POCs within scrum sprints, and implemented and supported CI/CD (Bitbucket code pipeline)
- Created pipelines using different tools (Bamboo, GitLab CI/CD, Azure Pipelines, Jenkins) for different tech stacks (Node.js, Java, Python, Go)
- Performed setup and troubleshooting of multiple services that use Kubernetes
- Provided support for various AWS services, including EMR, Glue, DynamoDB, EC2, VPC, IAM, S3, and CloudWatch
- Created an alerting platform with Prometheus, Nagios, Alertmanager, Grafana, and PagerDuty
- Secured AWS infrastructure using IAM, KMS, CloudTrail, CloudWatch, security groups, and NACLs