White Circle

Data Labeler

$30,000 – $50,000 per year

TLDR

Evaluate and label AI conversations and model outputs for safety and quality, provide structured feedback to researchers, and help advance AI safety evaluations.

About us

White Circle is an AI Safety company building the safety, reliability, and optimization layer for AI systems. At the core of our platform are policies – simple natural-language rules that define what an AI model should and shouldn’t do. We automatically test, enforce, and continuously improve these policies at scale.

  • We’ve raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, HuggingFace, Mistral, DeepMind, Datadog, Sentry, and others

  • We process over one hundred million API calls every month

  • We fine-tune and train our own LLMs so they run faster and cheaper than any open or proprietary model

We’re a small, highly focused team. If you want to work deeply on hard problems, see your work ship to production quickly, and influence how AI safety is actually built – you’re the one we need.

In this role, you will

  • Review and evaluate AI conversations and model outputs

  • Assess responses for safety, quality, accuracy, policy compliance, and user intent

  • Identify harmful, unsafe, misleading, or low-quality behavior

  • Label and categorize model outputs according to internal evaluation frameworks

  • Moderate sensitive content and identify policy violations

  • Compare, rank, and score model responses

  • Investigate edge cases and ambiguous situations

  • Provide structured feedback to researchers and engineers

  • Help improve evaluation guidelines and annotation processes

  • Contribute to the datasets used to train and evaluate AI systems

We're looking for someone who

  • Has exceptional attention to detail

  • Can make consistent decisions across large volumes of data

  • Enjoys analysing nuanced situations where there isn't always a clear answer

  • Can follow guidelines while exercising good judgment

  • Has strong written English skills

  • Communicates clearly and explains reasoning well

  • Is curious about AI and how these systems work

You might be a great fit if you

  • Have experience with content moderation, trust & safety, quality assurance, compliance, or policy enforcement

  • Have experience in data annotation, AI evaluation, RLHF, or model assessment

  • Have worked with AI tools extensively and understand their strengths and limitations

  • Enjoy finding edge cases and unusual model behavior

Important note

This role may involve reviewing content that is offensive, harmful, violent, sexual, or otherwise disturbing. We provide tooling and support, but candidates should be comfortable working with sensitive content when necessary.

Why White Circle

  • Salary of $30,000 to $50,000 per year, plus equity

  • Paid time off in line with your local regulations, no matter where you work from

  • All the hardware, tools, and services you need

How we hire

  1. Intro call with HR (25 min)

  2. Take-home assignment

  3. Final conversation with our CEO (35 min)

Please submit your application in English.

