Physics AI Research Project

Skills

Benchmarking, PhD, Remote Collaboration

Mercor is recruiting Physics PhD holders to participate in a short-term, high-impact pilot with a leading frontier AI laboratory. This remote, asynchronous opportunity focuses on advancing AI reasoning and problem-solving capabilities within advanced physics domains. Collaborate with top researchers, contribute to cutting-edge benchmarks, and help shape the future of scientific AI.

Key Responsibilities
  • Review and evaluate physics reasoning and proofs for AI systems.
  • Author, refine, and validate technical content in advanced physics.
  • Design and assess research-style questions and benchmarking tasks.
  • Provide structured feedback to improve AI scientific reasoning.
  • Collaborate closely with AI researchers in a select cohort.

Required Skills & Qualifications
  • PhD in Physics from a top 20 institution (required).
  • Recent or current academic/research experience strongly preferred.
  • Expertise in advanced physics domains and problem-solving.
  • Ability to articulate complex reasoning formally and precisely.
  • Experience designing or evaluating scientific benchmarks/questions.
  • Strong written and verbal communication skills.
  • Comfort working remotely and asynchronously.
  • Availability for flexible engagement (5 to 40 hours per week).
  • Eligible to work as an independent contractor.
  • Commitment to high-quality, impactful contributions.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months
