Outlier AI Logo

Outlier AI

Humanity's Finest - Referrals

Job Posted 19 Days Ago Posted 19 Days Ago
Be an Early Applicant
Remote
8 Locations
Expert/Leader
Remote
8 Locations
Expert/Leader
Join a team of top experts to craft challenging problems for AI models. Contribute to datasets and cite co-authorship in research papers.
The summary above was generated by AI

Note: At the moment, this opportunity is not available to applicants from California and New York.

We'll cut to the chase: we're looking for the world's best experts to take on the world's smartest models.

In Humanity’s Last Exam, we introduced the most challenging reasoning benchmark for frontier AI models. So far, the highest-performing system—OpenAI’s Deep Research—has achieved only 26% accuracy. We are collaborating with leading AI labs to identify the most effective data to improve AI reasoning capabilities in expert domains, and we aim to publish a new paper presenting our findings.

To do this, we are assembling a team of elite individuals who are the utmost experts in their respective fields. Our shared goal is to create PhD+ level problems that current state-of-the-art LLMs cannot correctly solve. This team will work collaboratively to produce datasets that will be available to our partner research groups, the world’s most advanced AI laboratories.

We're looking for unicorns to help us write some of the hardest problems intelligence has ever seen — do you think you can do this?

Why Join?

  • Have the opportunity to co-author a research paper analyzing how effectively this data enhances model reasoning.
  • Get access to an exclusive community of world-class researchers in a variety of domains; Each task submission will be open to peer review.
  • Receive up to $540 for each contributed problem and solution pair

What’s Next?

  • Still unsure? Watch this quick video of what comes after you hit "Apply Now" below
  • If you're up for the task, apply below!

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with the Outlier Privacy Policy and our internal policies and programs designed to protect personal data.

This is a 1099 contract opportunity on the Outlier.ai platform. Because this is a freelance opportunity, we do not offer internships, sponsorship, or employment. You must be authorized to work in your country of residence. If you are an international student, you may be able to sign up for Outlier if you are on a visa. You should contact your tax and/or immigration advisor with specific questions regarding your circumstances.

Top Skills

Ai Research
Data Analysis

Similar Jobs

6 Hours Ago
Remote
Bengaluru, Karnataka, IND
Junior
Junior
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The Security GRC Analyst at Atlassian will implement and manage security risk and governance processes, collaborating with various teams and enhancing security operations through automation and technical guidance.
Top Skills: AutomationCybersecurityGoJqlPythonRisk ManagementSQL
Yesterday
Remote
India
Senior level
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The Senior Machine Learning Systems Engineer will lead infrastructure for AI & ML tools, tackling complex challenges, mentoring junior members, and collaborating across teams.
Top Skills: Java,Kotlin,Aws,Sagemaker,S3,Cloud Formation
Yesterday
Remote
Hybrid
India
Mid level
Mid level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Team Leader in Technology Services oversees testing execution, enhances UAT processes, conducts post-production testing, and manages defect resolution while collaborating with the Product Owner.
Top Skills: AzureExcelMs PowerpointMs VisioMs Word

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account