InVision is hiring a

Manager, Site Reliability Engineering

InVision is the world's leading product design platform, powering the future of digital product design through our deep understanding of the dynamics of collaboration. We provide two million people with the power to prototype, review, refine, manage and user-test web and mobile products. InVision drives the product design process at leading Fortune 100 companies, including Disney, IBM, Walmart, Apple, Verizon and General Motors. Backed by Accel, ICONIQ Capital, FirstMark Capital, Tiger Global and others, InVision is a distributed team with over 200 employees around the world.

At InVision, you will play a crucial role in the design, implementation and management of our application infrastructure using today’s cutting-edge technologies, including Docker, Kubernetes, and CoreOS. Help us implement a highly available, scalable infrastructure and create the tooling we need to get it done. Here you can directly influence how our application is configured and deployed, and create tools to improve our processes, monitoring and application infrastructure, all in a container-centric environment!

Automation is our mantra!

What you love to do...

  • Implement highly available, scalable infrastructure leveraging Docker and container schedulers - Kubernetes preferred!
  • Create operational tooling for monitoring, self-healing infrastructure, and continuous integration
  • Code in Golang!
  • Lead agile scrum and project planning
  • Leverage AWS cloud services and its cloud ecosystem
  • Promote Docker best practices for designing images and containers for microservice architectures
  • Identify performance bottlenecks, and troubleshoot and fix application and database issues!
  • Collaborate on problem solving and design
  • Participate in the on-call process for application monitoring and incident resolution
  • Mentor other developers and site reliability engineers on the new technologies we implement