CommunityNews

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
The capabilities and limitations of Large Language Models have been sketched out in great detail in recent years, providing an intriguing yet conflicting picture. On the one hand, LLMs demonstrate a general ability to solve problems. On the other hand, they show surprising reasoning gaps when compared to humans, casting doubt on the robustness of their generalisation strategies. The sheer volume of data used in the design of LLMs has precluded us from applying the method traditionally used to measure generalisation: train-test set separation. To overcome this, we study what kind of generalisation strategies LLMs employ when performing reasoning tasks by investigating the pretraining data they rely on. For two models of different sizes (7B and 35B) and 2.5B of their pretraining tokens, we identify what documents influence the model outputs for three simple mathematical reasoning tasks and contrast this to the data that are influential for answering factual questions. We find that, while the models rely on mostly distinct sets of data for each factual question, a document often has a similar influence across different reasoning questions within the same task, indicating the presence of procedural knowledge. We further find that the answers to factual questions often show up in the most influential data. However, for reasoning questions the answers usually do not show up as highly influential, nor do the answers to the intermediate reasoning steps. When we characterise the top ranked documents for the reasoning questions qualitatively, we confirm that the influential documents often contain procedural knowledge, like demonstrating how to obtain a solution using formulae or code. Our findings indicate that the approach to reasoning the models use is unlike retrieval, and more like a generalisable strategy that synthesises procedural knowledge from documents doing a similar form of reasoning.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

View thread on forum

#knowledge

0 51 0

2024-12-02 19:16:23 UTC

Where Next?

View thread on forum

knowledge

Home General Dev>In The News

#knowledge

0 51 0

Last post

Popular General Dev topics

General Dev>In The News

Steve Wozniak sues YouTube over Bitcoin scam

Apple co-founder Steve Wozniak is suing YouTube for allegedly allowing scammers to use images and videos of him to defraud people. The s...

bbc.co.uk

#google #youtube #bitcoin

13 1123 8

2020-08-03 17:46:18 UTC

New

General Dev>In The News

I am lonely will anyone speak to me

en.wikipedia.org

/diversity #mental-health

0 1210 1

2020-12-26 08:45:20 UTC

New

General Dev>In The News

Neovim nightly, v0.5.0 and v0.4.4 released!

Neovim nightly, v0.5.0 and v0.4.4 has been released. Link: Release Nvim development (prerelease) build · neovim/neovim · GitHub Link:...

github.com

#official-news /neovim

0 1185 0

2021-07-11 23:08:05 UTC

New

General Dev>In The News

It's not what programming languages do, it's what they shepherd you to

It’s not what programming languages do, it’s what they shepherd you to. How many of you have listened, read or taken part in a discussio...

nibblestew.blogspot.com

#programming #languages

50 1714 19

2022-05-10 15:41:49 UTC

New

General Dev>In The News

There’s No Such Thing as Clean Code

Everyone seems to be striving for ‘clean’ code at the moment. You can’t read a blog post without the author telling you how clean their a...

steveonstuff.com

#code

31 1262 9

2022-03-28 00:29:57 UTC

New

General Dev>In The News

Developing Godot Projects with Neovim

Developing Godot Projects with Neovim. When I started using Godot Engine, what surprised me the most is the built-in Language Server Pro...

devpoga.org

/neovim

0 1598 0

2022-07-27 13:30:06 UTC

New

General Dev>In The News

Why study functional programming?

…or, “why make programming even harder?” Learning functional programming is an opportunity to discover a new way to represent programs, t...

acm.wustl.edu

#programming

4 843 1

2022-08-07 00:35:55 UTC

New

General Dev>In The News

Why Python keeps growing, explained

Why Python keeps growing, explained | The GitHub Blog. A deep dive into why more people are using Python than ever, its key use cases, a...

github.blog

/python

9 974 9

2023-08-19 11:34:00 UTC

New

General Dev>In The News

Fintech engineering mistakes

9 fintech engineering mistakes. Read this list unless you want to build a money dissappearing system

startupwin.kelsus.com

0 1282 0

2023-06-28 15:09:41 UTC

New

General Dev>In The News

Should managers still code?

Ah, the eternal question, straight from the mailbag.

theengineeringmanager.substack.com

#code

0 181 0

2025-03-13 01:41:39 UTC

New

Other popular topics

Science/Tech>Tech Chat

Games! Which do you play?

Which, if any, games do you play? On what platform? I just bought (and completed) Minecraft Dungeons for my Nintendo Switch. Other than ...

#games

245 5335 101

2024-08-22 11:09:29 UTC

New

General Dev>Hardware

What monitor(s) do you have for programming?

Please tell us what is your preferred monitor setup for programming(not gaming) and why you have chosen it. Does your monitor have eye p...

#monitors #coding #programming #development

227 8684 88

2022-02-01 12:02:08 UTC

New

Game Dev>Learning Resources

Apple Game Frameworks and Technologies

Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...

pragprog.com

#pragprog #ios #game-dev #macos /swift #published-book #apple /book-apple-game-frameworks-and-technologies

30 3963 10

2021-04-22 16:51:02 UTC

New

General Dev>Code Editors

SpaceVim vs SpaceMacs

SpaceVim seems to be gaining in features and popularity and I just wondered how it compares with SpaceMacs in 2020 - anyone have any thou...

/vim #spacevim #spacemacs /emacs #code-editors

30 3579 14

2020-08-27 17:53:29 UTC

New

General Dev>Dev Chat

Which language or framework do you want to learn next?

Curious to know which languages and frameworks you’re all thinking about learning next :upside_down_face: Perhaps if there’s enough peop...

#community #learning

243 5922 95

2025-06-05 19:34:43 UTC

New

General Dev>Hardware

Seen any cool new keyboards?

We have a thread about the keyboards we have, but what about nice keyboards we come across that we want? If you have seen any that look n...

/keyboards #mechanical-keyboards

49 5284 39

2025-05-10 22:54:44 UTC

New

macOS>Chat

My thoughts on macOS vs Linux

Small essay with thoughts on macOS vs. Linux: I know @Exadra37 is just waiting around the corner to scream at me “I TOLD YOU SO!!!” but I...

#macos #linux

166 7775 69

2021-04-10 22:36:29 UTC

New

General Dev>Dev Chat

Languages Without Garbage Collection

Continuing the discussion from Thinking about learning Crystal, let’s discuss - I was wondering which languages don’t GC - maybe we can c...

#garbage-collection

21 4800 7

2021-05-06 05:54:58 UTC

New

Data Science

Can AI/ML predict a lottery win?

Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...

#ai #machine-learning

19 3178 10

2021-10-18 19:01:41 UTC

New

AI>Chat

How to: Run DeepSeek on Mac, Windows, and Linux!

This is a very quick guide, you just need to: Download LM Studio: https://lmstudio.ai/ Click on search Type DeepSeek, then select the o...

#macs /deepseek #guides #lm-studio

14 5193 10

2025-06-19 15:11:16 UTC

New

General Dev>In The News

Writing memory efficient C structs

General Dev>In The News

Opsqueue: lightweight batch processing queue for heavy loads

General Dev>In The News

The Secret Stanford Program No One's Heard About

General Dev>In The News

The UK is slogging through an online age-gate apocalypse

General Dev>In The News

Weather Model based on ADS-B

General Dev>In The News

Software Development at 800 Words Per Minute | Dickson Tan's blog

General Dev>In The News

How I hacked my washing machine - Nex's Blog

General Dev>In The News

Keyboard Patents

General Dev>In The News

VPNs top download charts as age verification law kicks in

General Dev>In The News

Protest footage blocked as online safety act comes into force

General Dev>In The News

General Dev In The News ❯

Latest on Devtalk

New Debian Developers and Maintainers (May and June 2025)

Linux>Official News

A Python dict that can report which keys you did not use - Peterbe.com

Backend>In The News

The HTML Hobbyist

Frontend>In The News

Writing memory efficient C structs

General Dev>In The News

Opsqueue: lightweight batch processing queue for heavy loads

General Dev>In The News

500 virtual Linux devices on ARM64 (a Nerves story)

Backend>Blogs/Talks

React Native v0.81.0-rc.3 released!

Hybrid>Official News

Quarkus 3.25.0 released!

Backend>Official News

Apple releases iOS 18.6, macOS 15.6, and other updates as current gen winds down

iOS>In The News

Linux 6.16 brings faster file systems, improved confidential memory support, and more Rust support

Linux>In The News

The Secret Stanford Program No One's Heard About

General Dev>In The News

CentOS Board Meeting Recap, July 2025

Linux>Official News

Ash v3.5.33 released!

Backend>Official News

WebSharper 9.1.5.591 released!

Backend>Official News

The UK is slogging through an online age-gate apocalypse

General Dev>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

CommunityNews

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Where Next?

Popular General Dev topics

Steve Wozniak sues YouTube over Bitcoin scam

I am lonely will anyone speak to me

Neovim nightly, v0.5.0 and v0.4.4 released!

It's not what programming languages do, it's what they shepherd you to

There’s No Such Thing as Clean Code

Developing Godot Projects with Neovim

Why study functional programming?

Why Python keeps growing, explained

Fintech engineering mistakes

Should managers still code?

Other popular topics

Games! Which do you play?

What monitor(s) do you have for programming?

Apple Game Frameworks and Technologies

SpaceVim vs SpaceMacs

Which language or framework do you want to learn next?

Seen any cool new keyboards?

My thoughts on macOS vs Linux

Languages Without Garbage Collection

Can AI/ML predict a lottery win?

How to: Run DeepSeek on Mac, Windows, and Linux!

Sponsor Spotlight

General Dev>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta