Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Select an option

  • Save TravnikovDev/6206d6c8e170892a2d90e8508d2dd188 to your computer and use it in GitHub Desktop.

Select an option

Save TravnikovDev/6206d6c8e170892a2d90e8508d2dd188 to your computer and use it in GitHub Desktop.
LinkedIn Post - 2025-12-05 15:35

A new type of profession is developing: AI tutors for models.

Here’s the twist - if you’ve ever nitpicked a code review or triaged a messy bug queue, you already qualify. You literally get paid to teach robots how to behave.

I had my AI research agent pull the raw data across Reddit worker threads and platform pages. The signal is noisy, but the pattern is clear: RLHF work is a real side hustle with real money for people who can write, reason, or code.

Top 10 platforms not on the usual list

  • Outlier AI - expert RLHF across writing, math, code. High activity, best pay.
  • DataAnnotation.tech - evals for writing, safety, coding. High activity, selective onboarding.
  • Amazon MTurk - find AI eval, ranking, response rating. Always on, quality varies.
  • Remotasks - constant microtasks, RLHF pops up. Volume beats rate.
  • Toloka - agent skills, coding, safety. Relaunched, mixed volume.
  • Clickworker + UHRS - relevance and judgment tasks, some LLM eval. Locale dependent.
  • Prolific - academic and industry studies, AI eval in waves. Strong hourly when it hits.
  • OneForma - translation, data, growing GenAI eval. Bursty and competitive.
  • Defined.ai - historically active; recent low volume in many regions.
  • Hive Micro - micro-annotation; occasional LLM-adjacent tasks.

Top 5 right now by community activity

  1. Outlier AI
  2. DataAnnotation.tech
  3. MTurk
  4. Remotasks
  5. Clickworker + UHRS

What this pays in the real world

  • Generalist - about 8-20 per hour on better requesters, 3-10 on microtask hubs.
  • STEM - roughly 30-60 on Outlier, around 20-35 on DataAnnotation when slots open.
  • Coding - about 30-50 on Outlier, around 40 on DataAnnotation for code eval. Heads up: advertised rates can melt if time caps are tight or work gets rejected. Effective hourly is what matters.

The catch Work is inconsistent. Qual tests can be unpaid and picky. Region lock is real. Some safety tasks are rough. And algorithmic management can nuke your access without a clean appeal. This is a side stream, not rent money.

How to start without wasting weekends

  • Pick 3 and rotate: Outlier, DataAnnotation, plus one of MTurk or UHRS or Toloka.
  • Pass quals fast: build a simple rubric for consistent ratings and stick to it.
  • Specialize: math proofs, code review, chemistry, finance. Niches get the better queues.

Bottom line for 2026 AI tutor for models is a legit micro-profession. If you can write clearly, reason under time pressure, or review code, you can turn spare hours into 20-60 per hour when the queues are hot. Keep your expectations cold and your spreadsheets open. 📈

Which platform has treated you best lately - and what was your real effective hourly?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment