DevOps / Site Reliability Engineer
TL;DR
Build and optimize multi-cloud AIOps infrastructure, monitoring, cost observability, and security operations while evolving internal R&D tooling.
Who We Are
What You’ll Be Doing
- Build and maintain the core infrastructure of the AIOps platform, including the unified monitoring & alerting system and the FinOps cost observability platform.
- Maintain and continuously optimize internal R&D infrastructure (GitLab, Nexus, Sonar, etc.).
- Manage monitoring data collection, alert governance, and cost data visualization across multi-cloud environments (Alibaba Cloud / AWS).
- Support cloud security operations, including cloud security alert management and compliance auditing.
What We Look For In You
- 3+ years of DevOps or SRE experience; experience with AIOps or observability platform development is a plus.
- Proficient in Python; familiar with at least one of Go or Java. Full-stack capability (React/Vue frontend + backend API) is a plus.
- Hands-on experience with at least one major cloud platform (Alibaba Cloud or AWS); familiar with cloud monitoring products (CloudWatch / Alibaba Cloud CloudMonitor) and cost management tools.
- Familiar with monitoring and logging stacks such as Prometheus, Grafana, and ELK.
- Experience maintaining and optimizing CI/CD toolchains (GitLab CI, Nexus, container registries).
- Experience with AI/LLM application development (e.g., LLM API integration, RAG, Agent frameworks) is a plus.
- Good written and verbal English communication skills.
Perks & Benefits
- Competitive total compensation package
- L&D programs and education subsidy for employees' growth and development
- Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we would love to tell you about along the way!
OKX is a cryptocurrency exchange where users can buy, sell, and trade a wide range of digital assets, including Bitcoin and Ethereum. Beyond crypto trading, it has developed OKX Wallet, a widely used platform for accessing decentralized applications and exploring the Web3 landscape.
- Founded: 2017
- Employees: 500+
- Industry: Diversified Financial Services