Mercor · STEM & research
Research Physics Expert - review AI outputs in your specialty
Listed on Mercor as “Research Physics Expert”
What this actually is
You design problems that stump current AI models, evaluate AI reasoning against the correct answer, write rubrics, and provide expert feedback. Often the highest-paid category because the expertise pool is small. The platform title (Research Physics Expert) reflects the rate band and the expertise required, not the day-to-day work.
Can you do this on your visa?
F-2 / F-4 / F-5 / F-6: open. E-1 to E-7: needs concurrent-employment permit. D-2 / D-4 students: S-3 permit, 20 hr/week cap. D-10 / D-8: case by case.
Korean tax on USD income
First 5 years in Korea: foreign-source income only taxed if remitted into Korea. After year 5: worldwide income. Full tax guide.
Original posting from Mercor
Role Overview
We are seeking expert physics researchers to author and verify golden reference solutions for the CritPt benchmark (arXiv:2509.26574v3) - a frontier research-level physics benchmark. Participants will solve CritPt research-level problems end-to-end, audit solutions from other experts, or adjudicate between parallel solution attempts, producing 100%-human-verified reference data used to evaluate large language models on frontier physics reasoning.
Physics Subdomains Covered
High Energy Physics & Mathematical Physics, Biophysics & Statistical Physics, Condensed Matter & AMO, Gravitation / Cosmology / Astrophysics, Quantum Information, Optical Properties of Materials, Magnetic Materials, Measurements in QM.
Key Responsibilities
- Solve research-level physics challenges end-to-end with verifiable derivations, code, and peer-reviewed references
- Decompose challenges into standalone checkpoint sub-problems that require genuine physical reasoning
- Author Python answer templates with auto-grading functions for symbolic or numerical answers
- Audit submitted solutions for correctness, scope, and method soundness; deliver actionable feedback across iterations
- Adjudicate between parallel solver attempts and decide which solution becomes the golden reference
- Document chain-of-thought reasoning, error tolerances, equivalent symbolic forms, and verification test cases
Ideal Qualifications
- Solver: PhD or postdoc in the relevant subfield (senior PhD student minimum)
- Auditor: Postdoc or junior professor in the relevant subfield (PhD minimum)
- Adjudicator: Full professor or industry research PI in the relevant subfield (senior postdoc or junior professor minimum)
- Hands-on familiarity with at least two canonical methods of the target subfield, demonstrable through publications (broader coverage strongly preferred)
- 3-5 representative publications (arXiv ID or DOI), ideally within the last ~5 years and in the target subfield
- Working proficiency with LaTeX, Python, Jupyter, and SymPy
- Strong written English (B2/C1/C2 minimum; native or near-native preferred)
More About the Opportunity
- Expected commitment: ~10 hours/week, sustained across an 8-10 week window per task pool
- Pay range: $80-$140 per hour, based on role and demonstrated expertise
- Asynchronous work
Quoted from Mercor’s public listing on 2026-06-02. We don’t edit platform copy; honest framing is in the title and the “what this actually is” block above.
More on this platform
About Mercor
AI-interview-based talent network. One application, voice interview with their AI, then matched to projects across coding, research, and specialist work. Pay scales with track and seniority.
Mercor review: AI-interview talent network
4.1/5 on Glassdoor, fastest-growing platform in the category (+509% YoY). What the AI video interview actually asks, real pay across coding/research/medical/legal/finance tracks ($25-$200/hr), and the project-availability problem.
See all AI training jobs
Browse by category and compare across all eight platforms we cover.