A new kind of profession is emerging: AI tutor for models.
Here’s the twist - if you’ve ever nitpicked a code review or triaged a messy bug queue, you already qualify. You literally get paid to teach robots how to behave.
I had my AI research agent pull raw data from Reddit worker threads and platform pages. The signal is noisy, but the pattern is clear: RLHF work is a real side hustle with real money for people who can write, reason, or code.
Top 10 platforms not on the usual list
- Outlier AI - expert RLHF across writing, math, code. High activity, best pay.
- DataAnnotation.tech - evals for writing, safety, coding. High activity, selective onboarding.
- Amazon MTurk - find AI eval, ranking, response rating. Always on, quality varies.
- Remotasks - constant microtasks, RLHF pops up. Volume beats rate.
- Toloka - agent skills, coding, safety. Relaunched, mixed volume.
- Clickworker + UHRS - relevance and judgment tasks, some LLM eval. Locale dependent.
- Prolific - academic and industry studies, AI eval in waves. Strong hourly when it hits.
- OneForma - translation, data, growing GenAI eval. Bursty and competitive.
- Defined.ai - historically active; recent low volume in many regions.
- Hive Micro - micro-annotation; occasional LLM-adjacent tasks.
Top 5 right now by community activity
- Outlier AI
- DataAnnotation.tech
- MTurk
- Remotasks
- Clickworker + UHRS
What this pays in the real world
- Generalist - about 8-20 per hour on better requesters, 3-10 on microtask hubs.
- STEM - roughly 30-60 on Outlier, around 20-35 on DataAnnotation when slots open.
- Coding - about 30-50 on Outlier, around 40 on DataAnnotation for code eval.
Heads up: advertised rates can melt if time caps are tight or work gets rejected. Effective hourly is what matters.
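Here is a rough way to put a number on effective hourly. This is a minimal Python sketch, not any platform's actual payout formula: the function name, its parameters, and the sample figures are made up for illustration. It assumes rejected work is simply unpaid and that anything over a time cap (or spent on quals) counts as unpaid hours.

```python
def effective_hourly(advertised_rate, paid_hours, unpaid_hours=0.0, rejection_rate=0.0):
    """Earnings per hour actually worked (hypothetical model).

    advertised_rate: rate the platform quotes per credited hour
    paid_hours:      hours the platform actually credits (after time caps)
    unpaid_hours:    extra time you spent: over-cap work, unpaid quals, etc.
    rejection_rate:  fraction of credited work later rejected, between 0 and 1
    """
    if not 0.0 <= rejection_rate <= 1.0:
        raise ValueError("rejection_rate must be between 0 and 1")
    total_hours = paid_hours + unpaid_hours
    if total_hours <= 0:
        raise ValueError("total hours worked must be positive")
    # Assumption: rejected work earns nothing.
    earned = advertised_rate * paid_hours * (1.0 - rejection_rate)
    return earned / total_hours


# Illustrative numbers only: a 40-per-hour listing, 10 credited hours,
# 4 unpaid hours, 10% of work rejected.
print(round(effective_hourly(40, 10, unpaid_hours=4, rejection_rate=0.1), 2))
```

With those made-up inputs, a 40-per-hour listing works out to roughly 25.71 per hour worked.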
The catch
Work is inconsistent. Qual tests can be unpaid and picky. Region lock is real. Some safety tasks are rough. And algorithmic management can nuke your access without a clean appeal. This is a side stream, not rent money.
How to start without wasting weekends
- Pick 3 and rotate: Outlier, DataAnnotation, plus one of MTurk or UHRS or Toloka.
- Pass quals fast: build a simple rubric for consistent ratings and stick to it.
- Specialize: math proofs, code review, chemistry, finance. Niches get the better queues.
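If "build a simple rubric" sounds abstract, here is one way it could look in Python. Everything in it is an assumption for illustration: the criteria names, the weights, and the 1-5 scale are hypothetical, and real platforms supply their own guidelines that should always take priority.

```python
# Hypothetical criteria and weights; adjust to the task's own guidelines.
RUBRIC = {
    "correctness": 0.4,
    "instruction_following": 0.3,
    "clarity": 0.2,
    "safety": 0.1,
}


def score_response(ratings):
    """Weighted score for one response; each criterion is rated 1-5."""
    missing = set(RUBRIC) - set(ratings)
    if missing:
        raise ValueError(f"missing ratings for: {sorted(missing)}")
    for name in RUBRIC:
        if not 1 <= ratings[name] <= 5:
            raise ValueError(f"{name} must be rated 1-5")
    return sum(weight * ratings[name] for name, weight in RUBRIC.items())


def pick_better(ratings_a, ratings_b):
    """Return 'A', 'B', or 'tie' by comparing weighted scores."""
    score_a = score_response(ratings_a)
    score_b = score_response(ratings_b)
    if abs(score_a - score_b) < 1e-9:
        return "tie"
    return "A" if score_a > score_b else "B"


a = {"correctness": 5, "instruction_following": 4, "clarity": 3, "safety": 5}
b = {"correctness": 3, "instruction_following": 5, "clarity": 5, "safety": 5}
print(pick_better(a, b))
```

The point is not the exact weights but applying the same ones every time, so your ratings stay consistent across a qual test.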
Bottom line for 2026
AI tutor for models is a legit micro-profession. If you can write clearly, reason under time pressure, or review code, you can turn spare hours into 20-60 per hour when the queues are hot. Keep your expectations cold and your spreadsheets open. 📈
Which platform has treated you best lately - and what was your real effective hourly?