Remote
We're looking for a Sr. Site Reliability Engineer
About the Role
Join our team to design, build, and scale the infrastructure that keeps critical systems running for a large U.S. client. You'll own reliability end to end, automate everything you can, and lead the response when things go wrong. If architecting resilient cloud-native platforms and making sure production never sleeps excites you, this role is for you.
What You'll Do
- Own the reliability and scalability of critical infrastructure on AWS, using Terraform, Helm, and ArgoCD within GitOps workflows.
- Design and manage multi-region, highly available systems, and lead incident response from first alert through postmortem.
- Define SLOs, SLIs, and error budgets, and build the observability stack (Datadog, Prometheus, Grafana) to back them up.
- Keep CI/CD pipelines running smoothly with GitHub Actions and ensure security best practices are embedded across the board.
- Drive cost optimization, capacity planning, and performance tuning for cloud workloads.
- Work closely with engineering and infrastructure teams, and help junior engineers grow along the way.
What We're Looking For
- 5+ years of professional experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
- Strong proficiency with AWS (EKS, RDS, S3, IAM, Lambda) and infrastructure as code (CloudFormation or Terraform).
- Hands-on experience with Terraform, Helm, Kubernetes, and CI/CD automation.
- Solid understanding of networking, DNS, TLS, and load balancers.
- Experience with monitoring and alerting tools such as Datadog, Prometheus, and Grafana.
- Strong scripting skills in Python, Bash, or Go.
- English at an upper-intermediate level or higher; comfortable working directly with U.S. teams.
- Bachelor's degree in Computer Science, Computer Engineering, Information Technology, or a related field.
Nice to Have
- AWS Certified Solutions Architect or DevOps Engineer certification.
- Experience with Crossplane, OpenSearch, or multi-cloud architecture.
- Prior experience in a dedicated SRE team or initiative.
- Master's degree in Computer Science or a related field.