Mathematical Reasoning

Nvidia's Nemotron-Cascade 2 wins math and coding gold medals with 3B active parameters — and its post-training recipe is now open-source

Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold ...

National Academies of Sciences, Engineering, and Medicine

AI to Assist Mathematical Reasoning: A Workshop

The National Academies of Sciences, Engineering, and Medicine are private, nonprofit institutions that provide expert advice on some of the most pressing challenges facing the nation and world. Our ...

Decrypt

Forget AGI—Top AI Models Still Struggle With Math

New benchmark study results show leading AI models, including ChatGPT, Claude, and Gemini, still lag humans in visual math ...

EdSource

A greater role in math education for parents: mathematical reasoning at home

March 12, 2026 - A settlement was recently reached in a class-action case, requiring LAUSD to offer intensive "high-dose" tutoring to 100,000 students for three years. While policymakers, researchers ...

AI solves 20-year math challenge that researcher thought machines could not crack

A Polish mathematician spent two decades crafting a problem meant to test the limits of artificial intelligence. A new AI ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

Futurism

Top “Reasoning” AI Models Can be Brought to Their Knees With an Extremely Simple Trick

A team of Apple researchers has found that advanced AI models’ alleged ability to “reason” isn’t all it’s cracked up to be. But marketing aside, there’s no agreed-upon industrywide definition for what ...

National Academies of Sciences, Engineering, and Medicine

AI to Assist Mathematical Reasoning: A Workshop

A National Academies of Sciences, Engineering, and Medicine-appointed ad hoc committee will plan and organize a workshop that will bring together academic, industry, and government stakeholders to ...

National Academies of Sciences, Engineering, and Medicine

AI to Assist Mathematical Reasoning: A Workshop

Any project, supported or not by a committee, that has not deposited records to the Records Office. A National Academies of Sciences, Engineering, and Medicine-appointed ad hoc committee will plan and ...
