Examples of Deductive Reasoning Math

OpenAI Just Solved 5 of 10 ‘Impossible’ Math Problems

OpenAI’s unreleased model solved five of 10 unpublished research-level math problems and proposed a breakthrough physics formula, signaling a new era for AI in science.

Communications of the ACM

Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification

A marriage of formal methods and LLMs seeks to harness the strengths of both.

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

SiliconANGLE

Harmonic AI raises $120M at $1.45B valuation to advance mathematical reasoning

Artificial intelligence for formal mathematical reasoning startup Harmonic AI Inc. announced today that it has raised $120 million in new funding on a $1.45 billion valuation. The funding is intended ...

Business Wire

Harmonic Builds Momentum Towards Mathematical Superintelligence with $120 Million Series C

Ribbit Capital Leads Round at $1.45B Valuation of Math-Based AI Venture; Emerson Collective Joins Existing Backers Including Sequoia & Kleiner Perkins. PALO ALTO, Calif.--(BUSINESS WIRE)--Harmonic, the ...

Dark Reading

'K2 Think' AI Model Jailbroken Mere Hours After Release

Hardly two days since its public release, a researcher has publicized how to jailbreak a major new artificial intelligence (AI) reasoning model called "K2 Think." K2 Think was released to the public ...

Morningstar

Baidu Unveils Reasoning Model ERNIE X1.1 with Upgrades in Key Capabilities

ERNIE X1.1 shows major advancements in factuality, instruction following, and agentic capabilities; it surpasses DeepSeek R1-0528 in overall performance while performing on par with top-tier models ...

Scientific American

Can Writing Math Proofs Teach AI to Reason Like Humans?

A few months before the 2025 International Mathematical Olympiad (IMO) in July, a three-person team at OpenAI made a long bet that they could use the competition’s brutally tough problems to train an ...

Futurism

Detailed Logs Show ChatGPT Leading a Vulnerable Man Directly Into Severe Delusions

As the AI spending bubble swells, so too are the numbers of people being drawn into delusional spirals by overly confident chatbots. Joining their ranks is Allan Brooks, a father and business owner ...
