CommunityNews

Solving a Million-Step LLM Task with Zero Errors

LLMs have achieved remarkable breakthroughs in reasoning, insights, and tool use, but chaining these abilities into extended processes at the scale of those routinely executed by humans, organizations, and societies has remained out of reach. The models have a persistent error rate that prevents scale-up: for instance, recent experiments in the Towers of Hanoi benchmark domain showed that the process inevitably becomes derailed after at most a few hundred steps. Thus, although LLM research is often still benchmarked on tasks with relatively few dependent logical steps, there is increasing attention on the ability (or inability) of LLMs to perform long-range tasks. This paper describes MAKER, the first system that successfully solves a task with over one million LLM steps with zero errors and that, in principle, scales far beyond this level. The approach relies on an extreme decomposition of a task into subtasks, each of which can be tackled by focused microagents. The high level of modularity resulting from the decomposition allows error correction to be applied at each step through an efficient multi-agent voting scheme. This combination of extreme decomposition and error correction makes scaling possible. The results therefore suggest that, instead of relying on continual improvement of current LLMs, massively decomposed agentic processes (MDAPs) may provide a way to efficiently solve problems at the level of organizations and societies.
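To make the idea of per-step error correction concrete, here is a minimal sketch. It is not the MAKER implementation, and the abstract does not spell out the voting rule. The sketch assumes that each step is a single Towers of Hanoi move, that an LLM call can be modelled as a simulated agent returning a wrong move with a fixed, independent probability, and that voting stops once one answer leads every other answer by a fixed number of votes. All function names and parameters (`hanoi_moves`, `noisy_agent`, `vote`, `run`, `lead`) are hypothetical. Under the independence assumption, an unvoted chain of n steps with per-step error rate p finishes cleanly with probability (1 - p)^n, which is why even small error rates derail long runs.

```python
import random
from collections import Counter


def hanoi_moves(n, src=0, dst=2, aux=1):
    """Yield the optimal move sequence for n disks as (from_peg, to_peg)."""
    if n == 0:
        return
    yield from hanoi_moves(n - 1, src, aux, dst)
    yield (src, dst)
    yield from hanoi_moves(n - 1, aux, dst, src)


def noisy_agent(correct_move, error_rate, rng):
    """Hypothetical stand-in for one LLM call on one step.

    Returns the correct move, or a random wrong move with
    probability error_rate (an assumed, independent error model).
    """
    if rng.random() < error_rate:
        wrong = [(a, b) for a in range(3) for b in range(3)
                 if a != b and (a, b) != correct_move]
        return rng.choice(wrong)
    return correct_move


def vote(correct_move, error_rate, rng, lead=3):
    """Sample agents until one answer leads all others by `lead` votes.

    The stopping rule is an assumption made for this sketch.
    """
    counts = Counter()
    while True:
        counts[noisy_agent(correct_move, error_rate, rng)] += 1
        ranked = counts.most_common(2)
        runner_up = ranked[1][1] if len(ranked) > 1 else 0
        if ranked[0][1] - runner_up >= lead:
            return ranked[0][0]


def run(n_disks, error_rate, seed, use_voting):
    """Count wrong steps over a full solve, with or without voting."""
    rng = random.Random(seed)
    errors = 0
    for move in hanoi_moves(n_disks):
        if use_voting:
            answer = vote(move, error_rate, rng)
        else:
            answer = noisy_agent(move, error_rate, rng)
        errors += answer != move
    return errors


if __name__ == "__main__":
    # 10 disks means 2**10 - 1 = 1023 dependent steps.
    print("errors without voting:", run(10, 0.05, seed=0, use_voting=False))
    print("errors with voting:   ", run(10, 0.05, seed=0, use_voting=True))
```

Voting costs several agent calls per step, but because each step is small and self-contained, a wrong answer has to out-vote the correct one repeatedly before it is accepted. Raising `lead` trades more calls for a lower chance of accepting a wrong step.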

#llm

2025-11-20 04:01:18 UTC
