News
Mathematics is the foundation of countless sciences, allowing us to model things like planetary orbits, atomic motion, signal frequencies, protein folding, and more. Moreover, it’s a valuable ...
Microsoft is doubling down on the potential of small language models (SLMs) with the unveiling of rStar-Math, a new reasoning technique that can be applied to small models to boost ...
The models use inference-time compute, breaking queries into smaller, manageable tasks. DeepSeek's DeepThink mode shows its thought process as it works through challenging math problems. Chinese AI lab ...
Google DeepMind researchers recently developed a technique to improve math ability in AI language models like ChatGPT by using other AI models to improve prompting—the written instructions that ...
What happens when two bright and conscientious parents, without planning to do so, create one of the most ambitious math acceleration programs in the country? Jason and Sandy Roberts started the ...
The benchmark tests AI language models (such as GPT-4o, which powers ChatGPT) against original mathematics problems that typically require hours or days for specialist mathematicians to complete.
Microsoft's Phi-4-Reasoning Models Bring AI Math and Logic Skills to Smaller Devices
Microsoft has introduced a new set of small language models called Phi-4-reasoning, Phi-4-reasoning-plus, and Phi-4-mini-reasoning, which are described as "marking a new era for efficient AI."
[W]e investigate the fragility of mathematical reasoning in these models and demonstrate that their performance significantly deteriorates as the number of clauses in a question increases.
DeepSeek’s Prover series consists of domain-specific models designed to solve math-related problems. The company has yet to provide any details about the model on its Hugging Face page.