Jobgether
Jobgether

Senior DevOps Engineer

Accountabilities:

You will be responsible for ensuring the stability, scalability, and efficiency of platform operations while enabling engineering teams to deliver software reliably and safely. This includes:

  • Operating and improving platform tooling to support reliable software delivery, including ticket triage, issue resolution, and service request handling
  • Maintaining and evolving self-service workflows, including documentation, templates, and deployment guardrails
  • Managing Kubernetes environments, including Helm deployments, namespace management, rollout troubleshooting, and incident response support
  • Supporting and enhancing CI/CD pipelines (primarily GitLab CI), including job configuration, deployment strategies, and quality gates
  • Monitoring and improving observability systems using tools such as Prometheus, Alertmanager, Thanos, and OpenTelemetry
  • Maintaining dashboards, alerts, and SLO/SLA indicators while reducing noise and improving signal quality
  • Supporting service instrumentation across metrics, logs, and traces using OpenTelemetry
  • Participating in on-call rotations, incident response, and post-incident documentation and improvements
  • Driving automation and cost optimization efforts, including resource right-sizing and operational efficiency improvements
  • Contributing to documentation, runbooks, onboarding guides, and operational playbooks
  • Requirements:

    The ideal candidate is an experienced DevOps or SRE professional with strong automation skills, deep cloud-native expertise, and a focus on operational excellence in production environments.

    • 8+ years of experience in DevOps, SRE, or platform engineering roles
    • Strong hands-on experience with Kubernetes and related ecosystem tools (Helm, Docker, ingress controllers, etc.)
    • Solid experience with CI/CD systems, preferably GitLab CI, including pipeline design and deployment strategies
    • Strong scripting ability in Bash or Python (Go is a plus) for automation and tooling
    • Practical experience with AWS services such as IAM, EC2/EKS, S3, CloudWatch, and Secrets Manager
    • Deep understanding of observability concepts including metrics, logs, tracing, and alerting systems
    • Experience with Prometheus, Alertmanager, Thanos, and OpenTelemetry
    • Comfortable working in ticket-driven environments (Jira, ServiceNow) and following change management processes
    • Strong communication skills and ability to collaborate with engineering and product teams
    • Bonus: Terraform experience for infrastructure as code and AWS/Kubernetes provisioning
    • Bonus: API integration experience (Python, Java, or Go) for internal tooling
    • Bonus: Strong Linux and container runtime debugging knowledge
    • Bonus: Exposure to regulated industries such as finance or insurance environments
    • Benefits:

      • Competitive compensation package aligned with experience
      • Fully remote role within the United States
      • Opportunity to work on large-scale, cloud-native infrastructure systems
      • High-impact role focused on reliability, automation, and platform engineering excellence
      • Exposure to modern DevOps tooling including Kubernetes, CI/CD, and observability stacks
      • Collaborative engineering culture focused on continuous improvement and innovation
      • Opportunity to work in fast-paced environments solving complex technical challenges
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
 
 
#LI-CL1

Jobgether runs the largest remote job platform, effectively linking job seekers with over 200,000 flexible and remote opportunities that match their unique skills and preferences. Our focus is on enhancing the hiring process, ensuring efficiency while prioritizing the candidate experience, particularly in the growing health and wellness sector.

Founded
Founded 2020
Employees
11-50 employees
Industry
Professional Services
View company profile
Apply for this job