CommunityNews

LLMs can't self-correct in reasoning tasks, DeepMind study finds

LLMs can’t self-correct in reasoning tasks, DeepMind study finds - TechTalks.
A study by Google’s DeepMind and the University of Illinois at Urbana-Champaign has found that self-correction in large language models (LLMs) isn’t universally effective.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.


#deepmind


2023-10-09 23:42:52 UTC




