Site Reliability Engineer
TLDR
Build and maintain CI/CD pipelines, monitoring, and disaster recovery for scalable, secure infrastructure while collaborating with development teams and mentoring junior engineers.
Job Role - Site Reliability Engineer
Location - Pune
We are seeking SRE Engineer to join our team and help us build, deploy, and maintain our infrastructure and applications. The ideal candidate will have experience working in a fast-paced environment and a strong background in Site Reliability Engineering (SRE). You will be responsible for ensuring the reliability, scalability, and security of our applications and infrastructure.
- Build and maintain our CI/CD pipeline and deployment automation tools
- Design and implement monitoring and alerting systems to ensure the health of our applications and infrastructure
- Work closely with development teams to ensure that code is deployed in a reliable and scalable manner
- Participate in on-call rotations to provide 24/7 support for our production systems
- Develop and maintain disaster recovery plans and processes
- Continuously improve our infrastructure and processes to ensure scalability, reliability, and security
- Mentor and provide technical leadership to junior team members
- Keep up-to-date with industry best practices and emerging technologies in SRE
- Bachelor’s degree in Computer Science, Engineering, or a related field
- 5+ years of experience in SRE
- Strong programming skills in at least one of the following languages: Python, Go, Ruby, or Java
- Experience with infrastructure as code tools such as Terraform or CloudFormation
- Experience with containerization technologies such as Docker and Kubernetes
- Strong understanding of networking concepts such as TCP/IP, DNS, and load balancing
- Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK stack
- Excellent problem-solving skills and the ability to troubleshoot complex issues in a fast-paced environment
- Strong communication and collaboration skills with both technical and non-technical stakeholders
- Experience with cloud providers such as AWS or Azure
- Experience with building and maintaining large-scale distributed systems
- Experience with database technologies such as MySQL, PostgreSQL, or MongoDB
- Experience with automation tools such as Ansible or Chef
- Experience with Agile development methodologies such as Scrum or Kanban
About CrelioHealth
CrelioHealth is a fast-growing health-tech product company delivering cloud-based LIMS, RIS, CRM, and Inventory solutions to diagnostics labs and hospitals globally. We are building scalable systems that power modern healthcare operations.
Website: www.creliohealth.com
CrelioHealth builds cloud-based solutions like LIMS, RIS, CRM, and Inventory management systems tailored for diagnostics labs and hospitals. Our technology empowers healthcare organizations to enhance their operations with scalable, efficient systems that modernize how they deliver services.