interface.ai

Site Reliability Engineer

TLDR

Design and troubleshoot large-scale distributed systems, automate reliability and deployment, and maintain production-grade cloud infrastructure.

About Us

interface.ai provides an out-of-the-box AI Assistant that acts as a “Personal Bank Teller,” helping financial institutions’ customers 24x7 through every step of the journey from prospect to customer. It is used by several financial institutions (FIs) across 5 countries and has handled millions of conversations. Our customers have already seen over $50M in ROI in just under 12 months.

Our vision is to put an AI-powered Personal Banking Assistant in everyone's pocket that not only helps with day-to-day banking needs but also helps each individual achieve financial wellness.

We have built an NLU platform from the ground up just for financial institutions, based on novel techniques such as zero-shot learning. It is also built on a fully event-driven processing engine, so little or no manual configuration is required to manage the context in a dialog. You can learn more here - https://interface.ai/platform/

 

As a Site Reliability Engineer, you will be in charge of:

  • Designing, analyzing and troubleshooting large-scale distributed systems
  • Engaging in cross-functional team discussions on design, deployment, operation, and maintenance in a fast-moving, collaborative setup
  • Building automation scripts to validate the stability, scalability, and reliability of interface.ai’s products & services and to enhance the productivity of interface.ai’s employees
  • Debugging and optimizing code and automating routine tasks
  • Troubleshooting and diagnosing issues (hardware or software), then proposing and implementing solutions so that they recur less often
  • Performing periodic on-call duty to maintain the security, availability, and reliability of interface.ai’s products
  • Writing good code and following solid engineering practices

 

You can be a great fit if you bring the following:

  1. Strong self-motivation
  2. Ability to learn quickly
  3. Growth mindset (read this if you don't know what it means - link)
  4. Emotional maturity (read this if you don't know what it means - link)
  5. Passion for the possibilities at the intersection of AI + Banking
  6. Experience working in a startup of 5 to 30 employees
  7. A developer's background with a strong interest in systems design. You will be building, maintaining, and scaling our cloud infrastructure through software tooling and automation.
  8. 3+ years of industry experience developing and troubleshooting large-scale infrastructure on the cloud
  9. A solid understanding of system availability, latency, and performance
  10. Strong programming skills in at least one major programming language and the ability to learn new languages as needed
  11. Strong system and network debugging skills
  12. Experience with management/automation tools such as Terraform, Puppet, Chef, or Salt
  13. Experience setting up production-level monitoring and telemetry
  14. Expertise in container management and AWS
  15. Experience with Kubernetes (a plus)
  16. Experience building CI/CD pipelines
  17. Experience working with WebSockets, Redis, Postgres, Elasticsearch, and Logstash
  18. Experience working in an agile team environment and a proficient understanding of code versioning tools such as Git
  19. Ability to effectively articulate technical challenges and solutions
  20. A proactive outlook for ways to make our systems more reliable

 

 
