Senior Site Reliability Engineer (SRE)
TLDR
Lead design and operation of scalable, high-availability infrastructure to support millions of users, fostering DevOps culture and cross-team reliability improvements.
- Design, build, and operate scalable and reliable shared infrastructure platforms supporting product growth and international expansion.
- Analyze complex technical problems and deliver pragmatic, end-to-end engineering solutions.
- Collaborate with multiple engineering teams to ensure systems are resilient, efficient, and production-ready.
- Improve developer experience by treating the platform as a product, focusing on automation, simplification, and faster delivery cycles.
- Enable and coach software engineers in DevOps and SRE best practices to support autonomous “you build it, you run it” teams.
- Ensure platform security, compliance, reliability, and cost optimization across systems and services.
- Monitor system performance, proactively identify risks, and implement reliability improvements.
- Stay updated on emerging technologies and assess their applicability to platform evolution.
- Participate in code reviews, providing constructive feedback and supporting engineering quality standards.
- Mentor engineers and contribute to technical growth across teams.
- Strong experience as a Site Reliability Engineer or in similar infrastructure/platform engineering roles.
- Solid understanding of distributed systems, scalability, reliability, and performance engineering.
- Experience designing and operating cloud infrastructure (AWS preferred).
- Proficiency with infrastructure as code tools such as Terraform and Kubernetes.
- Experience with observability tools and practices (monitoring, logging, alerting, tracing).
- Strong software engineering skills in at least one programming language (e.g., Python, Go, Java, Node.js, or similar).
- Experience working in high-growth, fast-paced, product-oriented environments.
- Ability to design simple, scalable, and maintainable architectures.
- Strong communication skills and ability to collaborate across multidisciplinary teams.
- Nice to have: experience with CI/CD, cost optimization, platform security, or developer experience initiatives.
- Competitive salary and benefits package.
- Health and dental insurance.
- Flexible and remote-friendly work environment.
- Meal and mobility allowances.
- Mental health and well-being support programs.
- Learning and professional development opportunities.
- International and collaborative engineering environment.
- Exposure to large-scale, high-impact distributed systems.
Requirements:
Benefits:
Benefits
Flexible Work Hours
Flexible and remote-friendly work environment.
Free Meals & Snacks
Meal and mobility allowances.
Health Insurance
Health and dental insurance.
Learning Budget
Learning and professional development opportunities.
exposure to large-scale distributed systems
Exposure to large-scale, high-impact distributed systems.
Wellness Stipend
Mental health and well-being support programs.
Jobgether runs the largest remote job platform, effectively linking job seekers with over 200,000 flexible and remote opportunities that match their unique skills and preferences. Our focus is on enhancing the hiring process, ensuring efficiency while prioritizing the candidate experience, particularly in the growing health and wellness sector.
- Founded
- Founded 2020
- Employees
- 11-50 employees
- Industry
- Professional Services