xiji2646-netizen

Anthropic's agents now review their own past sessions and self-improve. Thoughts?

Anthropic shipped something called Dreaming for Managed Agents this week. It’s a scheduled background process that runs between sessions — the agent reviews its own past conversation transcripts, extracts patterns, and writes learnings into memory. No human in the loop unless you want one.

The framing that stuck with me: individual sessions are blind to cross-session patterns. A support agent won’t notice it made the same classification error 12 times this month. Dreaming is designed to surface exactly that kind of signal.

It ships alongside Outcomes (automated output grading against developer-defined rubrics) and multi-agent orchestration (coordinator + up to 20 parallel subagents, now in public beta). The three are meant to work as a loop: orchestration decomposes work, Outcomes grades it, Dreaming remembers the failures.

Still in research preview, not GA.

A few things I’m genuinely uncertain about:

The “automatic” mode lets the agent write directly to its own memory without approval. That’s a meaningful amount of autonomy over its own behavior. How do you audit what it’s actually learning? If it develops a subtly wrong heuristic over three months of self-reinforcement, how do you catch that before it’s deeply embedded?

Also curious about the human-review mode in practice — if you’re approving every proposed memory update, does that scale? Or does it become a bottleneck that defeats the purpose?

For those building on Managed Agents or similar systems: are you thinking about self-improvement loops as a feature you want, or a risk you’d rather control tightly? And does the “agent with three months of experience vs. freshly deployed agent” framing change how you think about agent versioning and rollbacks?

View thread on forum

#anthropic

0 1 0

2026-05-07 21:55:50 UTC

Where Next?

View thread on forum

anthropic

Home AI>Chat

#anthropic

0 1 0

Last post

Popular Ai topics

AI>Chat

How are you using AI in your professional and personal life?

How are you using AI in my life? How the day to day life is changed around you? professional and in personal life? I it use for autocom...

#ai

12 844 7

2025-07-10 08:42:52 UTC

New

AI>Chat

Tucker Carlson confronts Sam Altman about 'murder' of OpenAI engineer

Tucker: You’ve had complaints from one programmer who said you steal people’s stuff without paying them and he winded up being murdered.

#video #openai #suchir-balaji #sam-altman #tucker-carlson

4 1079 4

2026-02-26 21:14:15 UTC

New

AI>Chat

Anyone else hit breaking changes migrating from Opus 4.6 to 4.7?

Just went through the Anthropic migration guide for Opus 4.7 and there are more gotchas than the announcement implied. Curious if others ...

#blog-post #opus

0 0 0

2026-04-18 15:17:30 UTC

New

AI>Chat

DeepSeek V4 dropped today — $0.28/M output on 1M context, running on Huawei Ascend. Are you routing workloads to it?

DeepSeek just released V4 and the pricing is hard to ignore. V4-Flash: $0.28/M output tokens. V4-Pro: $2.19/M. Both with 1M token contex...

#ai /deepseek #llms

0 50 1

2026-04-25 00:18:04 UTC

New

AI>Chat

How are you handling the Claude Opus 4.7 migration?

Anthropic shipped Opus 4.7 last week and the agentic coding improvements look real. But the breaking changes are giving me pause. Specif...

#claude

0 1 0

2026-04-30 15:01:59 UTC

New

AI>Chat

Anyone else hitting Claude Code rate limits way too fast?

Been using Claude Code on Max for a few weeks and kept running into rate limits by early afternoon. Same tasks as colleagues who weren’t ...

#claude

0 0 0

2026-05-06 15:28:40 UTC

New

AI>Chat

Claude Code, Markdown, and the Case for HTML Artifacts

Claude Code, Markdown, and the Case for HTML Artifacts I do not think Markdown is going away. It is still the right format for README f...

#claude

0 0 0

2026-05-09 15:24:57 UTC

New

AI>Chat

Anyone else running multiple coding agents right now?

Cursor cloud agent development This month’s updates: Codex got real Windows sandboxing (May 13) ...

#claude

0 0 0

2026-05-14 16:48:14 UTC

New

AI>Chat

Anyone using Codex hooks in production?

Codex mobile in the ChatGPT app https://techcrunch.com/wp-content/uploads/2026/05/App-view.png?resize=1200,675) Codex shipped a batch o...

#chatgpt #codex

0 0 0

2026-05-15 16:41:49 UTC

New

AI>Chat

Gemini 3.5 Flash launched today - quick breakdown for anyone running agent workloads

Google shipped 3.5 Flash at I/O 2026. The “budget” Flash model now beats 3.1 Pro on coding and tool-calling benchmarks. Key numbers (fro...

#gemini

0 0 0

2026-05-20 17:01:45 UTC

New

Other popular topics

General Dev>Dev Chat

What dev-related stuff have you been up to?

Reading something? Working on something? Planning something? Changing jobs even!? If you’re up for sharing, please let us know what you’...

#community

1063 23050 405

2026-05-25 12:34:11 UTC

New

General Dev>Learning Resources

A Common-Sense Guide to Data Structures and Algorithms, Second Edition

Algorithms and data structures are much more than abstract concepts. Mastering them enables you to write code that runs faster and more e...

pragprog.com

#pragprog /python /ruby #published-book /book-a-common-sense-guide-to-data-structures-and-algorithms-second-edition #math #algorithms /js

19 6022 5

2020-08-14 00:58:37 UTC

New

Science/Tech>Tech Chat

What are you watching?

Or looking forward to? :nerd_face:

#community

503 14512 277

2026-05-11 08:52:14 UTC

New

General Dev>Dev Chat

Which vertical monitor do you use?

I’m thinking of buying a monitor that I can rotate to use as a vertical monitor? Also, I want to know if someone is using it for program...

#monitors #programming

51 4892 20

2023-06-28 07:23:42 UTC

New

General Dev>Hardware

BIIP MT3 Extended 2048 Custom Keycap Set (Drop)

This looks like a stunning keycap set :orange_heart: A LEGENDARY KEYBOARD LIVES ON When you bought an Apple Macintosh computer in the e...

/keyboards #apple #keycaps #mechanical-keyboards

14 6713 7

2020-12-12 19:58:26 UTC

New

Frontend>Learning Resources

Modern CSS with Tailwind

Tailwind CSS is an exciting new CSS framework that allows you to design your site by composing simple utility classes to create complex e...

pragprog.com

#pragprog /tailwind #published-book /book-modern-css-with-tailwind

12 5813 4

2021-05-13 14:50:23 UTC

New

General Dev>In The News

Jan: An open source alternative to ChatGPT that runs on the desktop

Jan | Rethink the Computer. Jan turns your computer into an AI machine by running LLMs locally on your computer. It’s a privacy-focus, l...

jan.ai

#desktop #chatgpt

4 5652 4

2024-03-29 08:42:30 UTC

New

Backend>Learning Resources

Ash Framework

Explore the power of Ash Framework by modeling and building the domain for a real-world web application. Rebecca Le @sevenseacat and ...

pragprog.com

#pragprog /elixir #published-book /ash /book-ash-framework

15 7555 9

2025-02-06 12:19:21 UTC

New

Backend>Learning Resources

Simplicity

Fight complexity and reclaim the original spirit of agility by learning to simplify how you develop software. The result: a more humane a...

pragprog.com

#pragprog #published-book /book-simplicity

10 6553 8

2025-03-14 21:53:12 UTC

New

Game Dev>In The News

Grand Theft Auto: Vice City | DOS games in browser

Open-source implementation of the classic GTA engine now running directly in your browser. Experience the reVC technology demo on DOS.Zon...

dos.zone

#games #browser

0 173 0

2025-12-20 02:36:57 UTC

New

AI>Chat

Gemini 3.5 Flash launched today - quick breakdown for anyone running agent workloads

AI>Chat

Anyone using Codex hooks in production?

AI>Chat

Anyone else running multiple coding agents right now?

AI>Chat