xiji2646-netizen

Anthropic's agents now review their own past sessions and self-improve. Thoughts?

Anthropic shipped something called Dreaming for Managed Agents this week. It’s a scheduled background process that runs between sessions — the agent reviews its own past conversation transcripts, extracts patterns, and writes learnings into memory. No human in the loop unless you want one.

The framing that stuck with me: individual sessions are blind to cross-session patterns. A support agent won’t notice it made the same classification error 12 times this month. Dreaming is designed to surface exactly that kind of signal.

It ships alongside Outcomes (automated output grading against developer-defined rubrics) and multi-agent orchestration (coordinator + up to 20 parallel subagents, now in public beta). The three are meant to work as a loop: orchestration decomposes work, Outcomes grades it, Dreaming remembers the failures.

Still in research preview, not GA.

A few things I’m genuinely uncertain about:

The “automatic” mode lets the agent write directly to its own memory without approval. That’s a meaningful amount of autonomy over its own behavior. How do you audit what it’s actually learning? If it develops a subtly wrong heuristic over three months of self-reinforcement, how do you catch that before it’s deeply embedded?

Also curious about the human-review mode in practice — if you’re approving every proposed memory update, does that scale? Or does it become a bottleneck that defeats the purpose?

For those building on Managed Agents or similar systems: are you thinking about self-improvement loops as a feature you want, or a risk you’d rather control tightly? And does the “agent with three months of experience vs. freshly deployed agent” framing change how you think about agent versioning and rollbacks?

View thread on forum

#anthropic

0 1 0

2026-05-07 21:55:50 UTC

Where Next?

View thread on forum

anthropic

Home AI>Chat

#anthropic

0 1 0

Last post

Popular Ai topics

AI>Chat

DeepSeek - the free, open source “ChatGPT killer”

Loads of news stories about DeepSeek here in the last few days, no surprise as it’s been making headlines across the world! Currently a h...

#ai #chatgpt #openai #llm /deepseek

3 440 6

2025-02-03 22:26:29 UTC

New

AI>Chat

Post your DeepSeek results

Curious what kind of results others are getting, I think actually prefer the 7B model to the 32B model, not only is it faster but the qua...

/deepseek

15 4275 15

2025-03-06 23:29:12 UTC

New

AI>Chat

General opinion on Google Gemini for code?

General thoughts on google gemini ? IMHO , when compared chatgpt and claude sonnnet its pretty shit, and its feels broken,

#ai

9 759 6

2025-06-17 04:11:13 UTC

New

AI>Chat

Need help with chatbot development (can pay $300 - $500)

Hello I hope you’re doing well. I’m looking to develop a custom chatbot and would love to collaborate with you on this project. The chat...

/python

0 1 0

2026-02-04 11:52:53 UTC

New

AI>Chat

Claude Code's entire source just leaked (512K lines) - anyone else digging through it?

Woke up to this today: Claude Code’s complete source code exposed via npm source map. Not a snippet. All 512,000 lines. 1,900 TypeScript ...

#claude

6 8359 5

2026-05-25 18:22:56 UTC

New

AI>Chat

Anyone else hit breaking changes migrating from Opus 4.6 to 4.7?

Just went through the Anthropic migration guide for Opus 4.7 and there are more gotchas than the announcement implied. Curious if others ...

#blog-post #opus

0 0 0

2026-04-18 15:17:30 UTC

New

AI>Chat

DeepSeek V4 is live in preview — should your team switch?

DeepSeek officially launched deepseek-v4-flash and deepseek-v4-pro in preview on April 24, 2026. The legacy routes (deepseek-chat, deepse...

/deepseek

0 0 0

2026-04-25 20:33:47 UTC

New

AI>Chat

Has anyone tried the Karpathy CLAUDE.md rules? (97.8k stars)

There’s a GitHub repo at forrestchang/andrej-karpathy-skills that’s sitting at 97.8k stars. It’s a single CLAUDE.md file with four behavi...

#claude

0 37 1

2026-04-30 01:16:04 UTC

New

AI>Chat

Anyone tried mattpocock/skills for Claude Code? Here is what I found after a week

Been using the skills repo (77K stars, #1 on GitHub Trending recently) with Claude Code. Sharing what worked and what did not. What work...

#claude

0 0 0

2026-05-13 16:00:07 UTC

New

AI>Chat

Gemini 3.5 Flash launched today - quick breakdown for anyone running agent workloads

Google shipped 3.5 Flash at I/O 2026. The “budget” Flash model now beats 3.1 Pro on coding and tool-calling benchmarks. Key numbers (fro...

#gemini

0 0 0

2026-05-20 17:01:45 UTC

New

Other popular topics

General Dev>Hardware

Poll: Which keyboard layout do you use?

poll poll Be sure to check out @Dusty’s article posted here: An Introduction to Alternative Keyboard Layouts It’s one of the best write-...

colemakmods.github.io

#polls /keyboards

10 6048 11

2020-10-31 23:12:33 UTC

New

Android>Learning Resources

Kotlin and Android Development featuring Jetpack: Build Better, Safer Android Apps

Start building native Android apps the modern way in Kotlin with Jetpack's expansive set of tools, libraries, and best practices. Learn h...

pragprog.com

#pragprog #android #game-dev /kotlin #published-book /book-kotlin-and-android-development-featuring-jetpack

7 5084 1

2020-11-03 20:38:30 UTC

New

Backend>Questions

Erlang's not installing on macOS Big Sur "You are natively building Erlang/OTP for a later version of MacOSX than current version"

Just done a fresh install of macOS Big Sur and on installing Erlang I am getting: asdf install erlang 23.1.2 Configure failed. checking ...

#macos /erlang #big-sur #asdf

10 6212 8

2021-01-16 12:33:23 UTC

New

General Dev>Dev Chat

The V Programming Language

The V Programming Language Simple language for building maintainable programs V is already mentioned couple of times in the forum, but I...

#programminguages /v

21 13874 7

2021-04-12 15:13:42 UTC

New

Community>In The Spotlight

Spotlight: Jamis Buck (Author) Interview and AMA!

Author Spotlight Jamis Buck @jamis This month, we have the pleasure of spotlighting author Jamis Buck, who has written Mazes for Prog...

#author-spotlight /ruby /book-the-ray-tracer-challenge /book-mazes-for-programmers

21 6352 9

2022-09-28 18:21:15 UTC

New

Backend>Questions

Psql: error: connection to server on socket "/tmp/.s.PGSQL.5432" failed: No such file or directory

If you’re getting errors like this: psql: error: connection to server on socket “/tmp/.s.PGSQL.5432” failed: No such file or directory ...

#macos /rails /postgresql

1 5553 1

2024-10-17 02:03:48 UTC

New

AI>Chat

How to: Run DeepSeek on Mac, Windows, and Linux!

This is a very quick guide, you just need to: Download LM Studio: https://lmstudio.ai/ Click on search Type DeepSeek, then select the o...

#macs /deepseek #guides #lm-studio

14 9328 10

2025-06-19 15:11:16 UTC

New

AI>Chat

Post your DeepSeek results

Curious what kind of results others are getting, I think actually prefer the 7B model to the 32B model, not only is it faster but the qua...

/deepseek

15 4275 15

2025-03-06 23:29:12 UTC

New

AI>Chat

Claude Code's entire source just leaked (512K lines) - anyone else digging through it?

Woke up to this today: Claude Code’s complete source code exposed via npm source map. Not a snippet. All 512,000 lines. 1,900 TypeScript ...

#claude

6 8359 5

2026-05-25 18:22:56 UTC

New

macOS>In The News

Millions of iCloud users could claim share of £3bn after Apple case given UK green light

Apple rejected the suggestion its practices are anti-competitive, saying many customers rely on third-party alternatives.

bbc.com

#apple #green

0 1 0

2026-06-23 16:35:04 UTC

New

AI>Chat

Has anyone seen a model's cost swing 60x on the same task?

AI>Chat

Gemini 3.5 Flash launched today - quick breakdown for anyone running agent workloads

AI>Chat

Anyone using Codex hooks in production?

AI>Chat

Anyone else running multiple coding agents right now?

AI>Chat