Test Math Problems - Search News

AI scores a ‘C–’ on its hardest math test yet

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...

Nature

Humans outperform AI at this highly rigorous mathematics test

A new benchmark pitting AI against previously unseen maths problems shows systems still fall short of top human expertise.

Four models compared: Humans remain ahead in math tests

An independent test evaluated four artificial intelligence models on 10 previously unpublished mathematical problems: the ETH Zurich model solved six problems, while other publicly available systems ...

Chronicle

A Grueling Math Test So Hard, Almost No One Gets a Perfect Score

Every year, thousands of college students from across the U.S. and Canada give up a full Saturday before finals begin to take a notoriously difficult, 6-hour math test — and not for a grade, but for ...

17d on MSN

A famous math problem stumped humans for 80 years. AI just cracked it.

The math world is losing its mind over the new solution to an Erdős problem. This is what AI found, how we missed it—and why it matters.

Ars Technica

New secret math benchmark stumps AI models and PhDs alike

On Friday, research organization Epoch AI released FrontierMath, a new mathematics benchmark that has been turning heads in the AI world because it contains hundreds of expert-level problems that ...

MSN on MSN

Simple-looking math test question leaves people baffled - can you solve it in 30 seconds?

Get out your timer and number two pencil to see if your arithmetic skills from grade school are still intact.

The Conversation

Girls and boys solve math problems differently – with similar short‑term results but different long‑term outcomes

Among high school students and adults, girls and women are much more likely to use traditional, step-by-step algorithms to solve basic math problems – such as lining up numbers to add, starting with ...
