Mercor · STEM & research

Research Physics Expert - review AI outputs in your specialty

Listed on Mercor as “Research Physics Expert”

$80-$140/hr · Remote · Contract · Paid in USD

What this actually is

You design problems that stump current AI models, evaluate AI reasoning against the correct answer, write rubrics, and provide expert feedback. This is often the highest-paid category because the pool of qualified experts is small. The platform title (Research Physics Expert) reflects the rate band and the expertise required, not the day-to-day work.

Can you do this on your visa?

F-2 / F-4 / F-5 / F-6: open. E-1 to E-7: needs concurrent-employment permit. D-2 / D-4 students: S-3 permit, 20 hr/week cap. D-10 / D-8: case by case.

Korean tax on USD income

First 5 years in Korea: foreign-source income is taxed only if remitted into Korea. After year 5: worldwide income is taxed.

Original posting from Mercor

Role Overview

We are seeking expert physics researchers to author and verify golden reference solutions for the CritPt benchmark (arXiv:2509.26574v3) - a frontier research-level physics benchmark. Participants will solve CritPt research-level problems end-to-end, audit solutions from other experts, or adjudicate between parallel solution attempts, producing 100%-human-verified reference data used to evaluate large language models on frontier physics reasoning.

Physics Subdomains Covered

High Energy Physics & Mathematical Physics, Biophysics & Statistical Physics, Condensed Matter & AMO, Gravitation / Cosmology / Astrophysics, Quantum Information, Optical Properties of Materials, Magnetic Materials, Measurements in QM.

Key Responsibilities

  • Solve research-level physics challenges end-to-end with verifiable derivations, code, and peer-reviewed references
  • Decompose challenges into standalone checkpoint sub-problems that require genuine physical reasoning
  • Author Python answer templates with auto-grading functions for symbolic or numerical answers
  • Audit submitted solutions for correctness, scope, and method soundness; deliver actionable feedback across iterations
  • Adjudicate between parallel solver attempts and decide which solution becomes the golden reference
  • Document chain-of-thought reasoning, error tolerances, equivalent symbolic forms, and verification test cases

Ideal Qualifications

  • Solver: PhD or postdoc in the relevant subfield (senior PhD student minimum)
  • Auditor: Postdoc or junior professor in the relevant subfield (PhD minimum)
  • Adjudicator: Full professor or industry research PI in the relevant subfield (senior postdoc or junior professor minimum)
  • Hands-on familiarity with at least two canonical methods of the target subfield, demonstrable through publications (broader coverage strongly preferred)
  • 3-5 representative publications (arXiv ID or DOI), ideally within the last ~5 years and in the target subfield
  • Working proficiency with LaTeX, Python, Jupyter, and SymPy
  • Strong written English (B2/C1/C2 minimum; native or near-native preferred)

More About the Opportunity

  • Expected commitment: ~10 hours/week, sustained across an 8-10 week window per task pool
  • Pay range: $80-$140 per hour, based on role and demonstrated expertise
  • Asynchronous work

Quoted from Mercor’s public listing on 2026-06-02. We don’t edit platform copy; honest framing is in the title and the “what this actually is” block above.
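
To make the “Python answer templates with auto-grading functions” responsibility more concrete, here is a minimal sketch of what such a grader might look like. It is an illustration only, not the CritPt or Mercor template format: the function names grade_symbolic and grade_numeric, the default tolerance, and the example expressions are all hypothetical. It assumes SymPy is installed, which the posting lists as a required skill.

```python
import math

import sympy as sp


def grade_symbolic(submitted: str, reference: str) -> bool:
    """Return True if two expressions are symbolically equivalent.

    Hypothetical helper: parses both strings with SymPy and checks
    whether their difference simplifies to zero. Note that sympify
    evaluates its input, so it should only see trusted strings.
    """
    difference = sp.sympify(submitted) - sp.sympify(reference)
    return sp.simplify(difference) == 0


def grade_numeric(submitted: float, reference: float, rel_tol: float = 1e-3) -> bool:
    """Return True if a numerical answer is within a relative tolerance.

    The 1e-3 default is an assumed placeholder; a real template would
    document the tolerance chosen for each problem.
    """
    return math.isclose(submitted, reference, rel_tol=rel_tol)


if __name__ == "__main__":
    # Two equivalent forms of the harmonic-oscillator energy levels.
    print(grade_symbolic("hbar*omega*(n + 1/2)", "hbar*omega*(2*n + 1)/2"))
    # A numerical answer checked against a reference value.
    print(grade_numeric(1.0005, 1.0))
```

A simplification-based check like this can miss equivalent forms that SymPy fails to reduce to zero, which is one reason the posting asks experts to document equivalent symbolic forms and verification test cases alongside each answer.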