interface.ai

Site Reliability Engineer

Bengaluru, India

Full-Time

Remote

TLDR

Design and troubleshoot large-scale distributed systems, automate reliability and deployment, and maintain production-grade cloud infrastructure.

About Us

interface.ai provides an out-of-the-box AI Assistant that acts as a “Personal Bank Teller” to help financial institutions’ customers 24x7 through every step of the journey from a prospect to a customer. It is used by several financial institutions (FIs) across 5 countries spanning millions of conversations. Our customers have already witnessed over $50M+ ROI in just under 12 months.

Our vision is to put an AI-powered Personal Banking Assistant in everyone's pocket that not only helps with day-to-day banking needs but also helps each individual achieve financial wellness.

We have built an NLU platform ground up just for financial institutions based on some of the novel techniques like zero-shot learning. It is also based on a fully event-driven processing engine leading to minimal or no manual configuration required to manage the context in a dialog. You can learn more here - https://interface.ai/platform/

As a Site Reliability Engineer you will be in charge of :

Designing, analyzing and troubleshooting large-scale distributed systems
Engaging in cross-functional team discussions on design, deployment, operation, and maintenance, in a fast-moving, collaborative set up
Building automation scripts to validate the stability, scalability, and reliability of interface.ai’s products & services as well as enhance interface.ai’s employees’ productivity
Debugging and optimizing code and automating routine tasks
Troubleshoot and diagnose issues (hardware or software), propose and implement solutions to ensure they occur with reduced frequency
Perform the periodic on-call duty to handle security, availability, and reliability of interface.ai’s products
You will follow and write good code and solid engineering practices

You can be a great fit if you are :

Extremely self motivated
Ability to learn quickly
Growth Mindset (read this if you don't know what it means - link)
Emotional Maturity (read this if you don't know what it means - link)
Passionate about the possibilities at the intersection of AI + Banking
Worked in a startup of 5 to 30 employees
Developer with a strong interest in systems Design. You will be building, maintaining, and scaling our cloud infrastructure through software tooling and automation.
3+ years of industry experience developing and troubleshooting large-scale infrastructure on the cloud
Have a solid understanding of system availability, latency, and performance
Strong programming skills in at least one major programming language and the ability to learn new languages as needed
Strong System/network debugging skills
Experience with management/automation tools such as Terraform/Puppet/Chef/SALT
Experience with setting up production-level monitoring and telemetry
Expertise in Container management & AWS
Experience with kubernetes is a plus
Experience building CI/CD pipelines
Experience working with Web sockets, Redis, Postgres, Elastic search, Logstash
Experience working in an agile team environment and proficient understanding of code versioning tools, such as Git.
Ability to effectively articulate technical challenges and solutions.
Proactive outlook for ways to make our systems more reliable

Apply for this job

interface.ai

View company profile

Site Reliability Engineer