Logical Thinking Performance Task

New Gemini 3.1 Pro is Google's most advanced reasoning model

Google has announced the release of Gemini 3.1 Pro, a tool designed specifically for tackling complex problems where simple ...

19m

Google Launches Gemini 3.1 Pro, Says It Delivers 'Smarter Reasoning' for Complex Tasks

Google has launched Gemini 3.1 Pro, its latest AI model designed for complex reasoning tasks. With a benchmark score of 77.1% on ARC-AGI-2, Gemini 3.1 Pro doubles the reasoning performance of Gemini 3 ...

Interesting Engineering on MSN

New Gemini 3.1 Pro crushes previous benchmarks, outperforms GPT 5.2 reasoning

Google has rolled out Gemini 3.1 Pro, the latest update to its flagship AI ...

OfficeChai

Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark

ARC-AGI 2 had been created when ARC-AGI 1 seemed all but saturated, but it appears that ARC-AGI 2 won’t remain unsolved for ...

21h

High Token Usage in Claude Sonnet 4.6 Limits Value for Long Reasoning Tasks

Sonnet 4.6 adds adaptive thinking and browser task gains with 4x higher token use than Sonnet 4.5, budget planning changes by task type.

13d

Anthropic's Claude Opus 4.6 AI model improves reasoning, task handling

Anthropic has launched Claude Opus 4.6, bringing improvements to long-context reasoning, accuracy in coding, and extended ...

Hosted on MSN

Scientists just developed a new AI modeled on the human brain — it's outperforming LLMs like ChatGPT at reasoning tasks

Scientists have developed a new type of artificial intelligence (AI) model that can reason differently from most large language models (LLMs) like ChatGPT, resulting in much better performance in key ...

Geeky Gadgets

Deepseek-r1 vs OpenAI-o1 – AI Reasoning Performance Comparison

Deepseek, a Chinese company, has introduced its Deepseek R1 model, attracting attention for its potential to rival OpenAI’s latest offerings. Reportedly outperforming OpenAI’s o1 Preview in benchmarks ...

Medical Xpress

How the brain deploys different reasoning strategies to tackle challenging mental tasks

The human brain is very good at solving complicated problems. One reason for that is that humans can break problems apart into manageable subtasks that are easy to solve one at a time. This allows us ...

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results