Jobgether
Senior Site Reliability Engineer (Remote Build)
Accountabilities:
- You will design and maintain scalable infrastructure-as-code solutions using tools like Terraform and Kubernetes, ensuring robust, repeatable, and secure deployments across environments. You will also support platform evolution by improving automation and deployment workflows.
- You will build and operate observability systems including monitoring, logging, and alerting, while leading incident response, postmortems, and reliability improvements to ensure high system availability.
- You will embed security and compliance practices into infrastructure and operational workflows, ensuring adherence to global regulatory requirements while minimizing friction for engineering teams.
- You will optimize system performance, reliability, and cloud costs through continuous analysis and tuning of infrastructure and workloads across distributed systems.
- You will eliminate operational toil by developing automation tools and scalable processes that reduce manual intervention and improve engineering efficiency.
- You will partner with product and platform teams to improve APIs, deployment systems, and developer experience, ensuring infrastructure supports long-term scalability and maintainability.
- You bring senior-level experience in Site Reliability Engineering, DevOps, or Systems Engineering, with a proven track record of operating production systems at scale in cloud environments. You are comfortable owning reliability end-to-end.
- You have deep hands-on expertise with Kubernetes and AWS, including networking, compute, storage, and managed services, and understand how to operate resilient distributed systems.
- You are highly proficient with infrastructure-as-code tools such as Terraform and apply software engineering principles to infrastructure design and management.
- You have strong experience with CI/CD pipelines and deployment automation using tools like GitHub Actions, GitLab, or similar, including rollback strategies and safe deployment practices.
- You are comfortable working with Linux systems, debugging production issues, writing scripts (especially in Bash), and understanding system-level behavior.
- You are an effective communicator who can translate complex infrastructure concepts into clear explanations, documentation, and runbooks for both technical and non-technical stakeholders.
- Competitive salary aligned with global benchmarks and experience level
- Fully remote work with flexible scheduling and async-first culture
- Equity or stock option opportunities depending on role eligibility
- Flexible paid time off and generous parental leave policies
- Learning and development budget to support continuous growth
- Home office and equipment support to set up your workspace
- Mental health and wellness support services
- Opportunities to work on globally distributed, high-impact infrastructure systems
Requirements:
Benefits:
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1
Jobgether runs the largest remote job platform, effectively linking job seekers with over 200,000 flexible and remote opportunities that match their unique skills and preferences. Our focus is on enhancing the hiring process, ensuring efficiency while prioritizing the candidate experience, particularly in the growing health and wellness sector.
- Founded
- Founded 2020
- Employees
- 11-50 employees
- Industry
- Professional Services
Senior Site Reliability Engineer