All AI training jobs

Mercor · Finance & specialist

Cybersecurity Labeling Expert - review AI outputs in your specialty

Listed on Mercor as “Cybersecurity Labeling Expert

$100-$150/hrRemoteContractPaid in USD

What this actually is

You bring your specialist expertise to AI evaluation. The shape of the work varies but the pattern is the same: review outputs, rate quality, write prompts, flag errors. The platform title (Cybersecurity Labeling Expert) reflects the rate band and the expertise required, not the day-to-day work.

Can you do this on your visa?

F-2 / F-4 / F-5 / F-6: open. E-1 to E-7: needs concurrent-employment permit. D-2 / D-4 students: S-3 permit, 20 hr/week cap. D-10 / D-8: case by case.

Korean tax on USD income

First 5 years in Korea: foreign-source income only taxed if remitted into Korea. After year 5: worldwide income. Full tax guide.

Original posting from Mercor

About the role

Cyberattacks cause billions in damages annually - ransomware cripples hospitals, data exfiltration exposes millions. As a Cybersecurity Labeling Expert, you'll be on the front lines of AI safety: reviewing real-world conversations flagged as potentially malicious and determining whether they represent genuine threats. Your judgments directly train the systems that keep AI out of the hands of bad actors.

What you'll do

Analyze flagged AI conversations - ranging from plain text to code-heavy exchanges - and apply your security expertise to assess intent and harm across four domains:

  • Scaled data exfiltration
  • Ransomware
  • Worms / self-replicating code
  • Local & remote exploits

Some conversations may involve POC exploit development; your expertise will determine what crosses the line.

Why it matters

The difference between a security researcher and a threat actor often comes down to context, specificity, and intent - exactly what automated systems struggle to detect. Your ground-truth labels directly improve the classifiers that decide what AI will and won't help with.

What we're looking for

  • Hands-on offensive security background: red team, malware analysis, pen testing, or exploit research
  • Ability to read between the lines - distinguishing legitimate security work from genuine attack intent
  • Comfort interpreting code-heavy conversations
  • Tier 2-3 experience: Masters / early-career through Senior / Principal

You're a strong fit if you've done red team consulting, threat intelligence analysis, vulnerability research, or AI safety labeling where nuanced judgment under ambiguity is routine.

Logistics

  • ~2-3 week engagement, remote
  • Minimum commitment: at least 20 hours per week - this is a firm floor, not a target. Candidates who cannot commit to 20+ hours per week will not be considered.
  • Requires access to a secure review interface and ability to handle PII
  • Compensation: $100-$150/hr depending on experience tier

Quoted from Mercor’s public listing on 2026-06-02. We don’t edit platform copy; honest framing is in the title and the “what this actually is” block above.