Back to jobs

Infra Application - SIte Reliability Engineer

Zoom
Remote Remote - India
$85k - $130k (est.) -30% vs avg
Posted Apr 14, 2026
Apply on themuse

Leaving for themuse in 10s

About This Role

Key technical requirements and skills: - Proficiency in Python or Golang for building automation tools and services - Experience with Infrastructure as Code practices using Terraform, Pulumi, or Ansible - Knowledge of authentication and security practices, including IAM, MFA, SSO, and PKI - Experience administering AWS cloud environments and Kubernetes clusters - Ability to configure and tune monitoring stacks like Prometheus, Grafana, or ELK/Splunk - Familiarity with building CI/CD pipelines using Git Team/project information: - The engineering team drives internal technology innovation and operates with a DevOps/SRE approach - They manage infrastructure across AWS, Azure, GCP, and essential SaaS platforms like Okta and Zoom - The team's mission is to transition from manual processes to automated, self-service solutions Unique or notable aspects: - The role

What You Can Expect Our engineering team drives internal technology innovation for the organization. We act as "Customer Zero," refining our cloud infrastructure for security and efficiency. Operating with a DevOps and SRE approach, we manage AWS, Azure, GCP, and essential SaaS platforms like Okta and Zoom. Our mission is to transition from manual processes to automated, self-service solutions. This ensures all employees can excel in their roles using seamless and reliable infrastructure. About the Team We deliver the internal technology infrastructure that powers the organization across AWS, Azure, GCP, and critical SaaS platforms. Our team operates with a DevOps/SRE mindset, building programmatic automation to replace manual processes. We exist to enable friction-free infrastructure for everyone. Responsibilities Architecting and maintaining secure multi-cloud infrastructure and identity management systems across AWS, Azure, GCP, Okta, and Active Directory Developing Infrastructure as Code using Terraform and Ansible to automate provisioning of cloud resources, Kubernetes clusters, and configuration management Building and scaling unified observability solutions using Prometheus, Loki, and Grafana to monitor infrastructure health and performance Implementing GitOps workflows with ArgoCD to manage infrastructure deployments with Git as the single source of truth Leading incident response and conducting blameless post-mortems while defining SLOs to ensure service reliability and continuous improvement What We're Looking For Demonstrate proficiency in Python or Golang for building automation tools and services Apply Infrastructure as Code practices using Terraform, Pulumi, or Ansible to automate infrastructure provisioning Implement authentication and security practices including IAM, MFA, SSO, and PKI Administer AWS cloud environments and Kubernetes clusters Configure and tune monitoring stacks such as Prometheus, Grafana, or ELK/Splunk Build CI/CD pipelines using G

Similar Jobs at Zoom