Humanity's Last Exam - Search News

OpenAI’s deep research can complete 26% of Humanity’s Last Exam—a benchmark for the frontier of human knowledge

Artificial intelligence may be more than a quarter of the way to surpassing the boundaries of human knowledge. OpenAI’s new autonomous agent, deep research, has stormed past competing models and set a ...

AOL

Humanity’s last exam, the test that modern AI still struggles to pass

Artificial intelligence systems now breeze through many academic tests that once challenged both machines and people. That success created an unexpected problem. The benchmarks used to measure AI ...

RealClearScience

AI Is Failing 'Humanity's Last Exam'

How do you translate ancient Palmyrene script from a Roman tombstone? How many paired tendons are supported by a specific sesamoid bone in a hummingbird? Can you identify closed syllables in Biblical ...

Psychology Today

'Humanity's Last Exam' Exposes AI's Strengths and Weaknesses

Artificial intelligence (AI) is outpacing traditional benchmarks according to a new peer-reviewed study published in Nature. To effectively measure AI, a global consortium of domain experts from 50 ...

Hosted on MSN

AI is just one year away from beating 'humanity's last exam' - a bank of 2,500 expert questions

AI will be ready to score full marks on one of the world's most challenging knowledge tests branded Humanity's Last Exam (HLE) in a matter of months, developers claim. HLE was set up by tech bosses to ...
