DevOps Engineer IV (Obs)
TLDR
Senior contributor driving observability, reliability, and scalable cloud services across global production systems, mentoring teams and shaping architecture.
- Design, deploy, and operate highly available and scalable services in public cloud environments, ensuring reliability across production systems.
- Own end-to-end observability and operational excellence, including monitoring, logging, alerting, performance tuning, and automation of infrastructure workflows.
- Lead incident response for production issues, ensuring effective mitigation, communication, and long-term remediation of system failures.
- Develop and improve cloud infrastructure solutions using Infrastructure as Code, container orchestration, and automation frameworks to enhance system resilience.
- Drive technical initiatives across teams, collaborating with engineering and business stakeholders to deliver cross-functional infrastructure projects.
- Identify operational gaps and introduce innovative solutions that improve system efficiency, reliability, and business impact.
- Mentor junior engineers and contribute to establishing engineering best practices across DevOps and platform reliability domains.
Requirements:
- 8+ years of experience in DevOps, infrastructure engineering, or site reliability engineering, including significant hands-on cloud experience.
- Strong expertise in cloud platforms (especially AWS), including orchestration tools such as Kubernetes, ECS, or EKS.
- Deep understanding of distributed systems, networking, operating systems, and large-scale system architecture.
- Proven experience building and maintaining observability stacks using tools such as Datadog, New Relic, Prometheus, Grafana, or ELK.
- Strong scripting and automation skills using Python, Bash, or similar languages, with experience in production-grade tooling.
- Hands-on experience with Infrastructure as Code tools such as Terraform, CloudFormation, or Packer, and containerization with Docker.
- Strong debugging and troubleshooting skills with the ability to resolve complex production issues under pressure.
- Excellent communication skills with experience working across global, cross-functional teams and stakeholder groups.
- Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent practical experience.
Benefits:
- Competitive compensation package aligned with experience and market benchmarks
- Equity opportunities in addition to base salary
- Fully remote work setup within India
- Comprehensive health, dental, and vision insurance coverage
- Retirement savings support and employer contribution programs where applicable
- Professional development reimbursement and learning support
- Paid time off, holidays, and additional long-term tenure benefits
- Home office and internet reimbursement allowances
- Wellness and mental health support programs
- Opportunity to work on global-scale infrastructure and high-impact systems
Jobgether runs the largest remote job platform, effectively linking job seekers with over 200,000 flexible and remote opportunities that match their unique skills and preferences. Our focus is on enhancing the hiring process, ensuring efficiency while prioritizing the candidate experience, particularly in the growing health and wellness sector.
- Founded: 2020
- Employees: 11-50
- Industry: Professional Services