CommunityNews

Defeating Nondeterminism in LLM Inference

Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models.
For example, you might observe that asking ChatGPT the same question multiple times provides different results. This by itself is not surprising, since getting a result from a language model involves “sampling”, a process that converts the language model’s output into a probability distribution and probabilistically selects a token.
What might be more surprising is that even when we adjust the temperature down to 0This means that the LLM always chooses the highest probability token, which is called greedy sampling. (thus making the sampling theoretically deterministic), LLM APIs are still not deterministic in practice (see past discussions here, here, or here). Even when running inference on your own hardware with an OSS inference library like vLLM or SGLang, sampling still isn’t deterministic (see here or here).

Read in full here:

View thread on forum

#llm

0 1270 0

2025-09-11 00:21:17 UTC

Where Next?

View thread on forum

llm

Home AI>In The News

#llm

0 1270 0

Last post

Popular Ai topics

AI>In The News

AI Is Discovering Patterns in Pure Mathematics That Have Never Been Seen Before

AI Is Discovering Patterns in Pure Mathematics That Have Never Been Seen Before. We can add suggesting and proving mathematical theorems...

sciencealert.com

#mathematics

0 1139 0

2021-12-11 23:07:15 UTC

New

AI>In The News

Actors launch campaign against AI 'show stealers'

Equity, the performing arts workers union, says actors need protection from computer-generated substitutes.

bbc.co.uk

6 773 2

2022-04-22 16:38:10 UTC

New

AI>In The News

Artificial Intelligence and Machine Learning– Explained

Steve Blank Artificial Intelligence and Machine Learning– Explained. Artificial Intelligence is a once-in-a lifetime commercial and defe...

steveblank.com

#basics

0 1375 0

2022-05-20 02:17:36 UTC

New

AI>In The News

DeepMind breaks 50-year math record using AI; new record falls a week later

AlphaTensor discovers better algorithms for matrix math, inspiring another improvement from afar.

arstechnica.com

#math #deepmind

0 1154 0

2022-10-14 13:13:23 UTC

New

AI>In The News

OpenAI debuts DALL-E API so devs can integrate its AI artwork into their apps

OpenAI offers integrated AI image generation on a demand—for 2 cents an image.

arstechnica.com

#apps #api #artwork

0 907 0

2022-11-04 00:29:13 UTC

New

AI>In The News

Nvidia and Microsoft team up to build massive AI cloud computer

AI supercomputer will use “tens of thousands” of Nvidia A100 and H100 GPUs.

arstechnica.com

#microsoft #nvidia

1 986 1

2022-11-19 23:00:18 UTC

New

AI>In The News

Mind-reading AI recreates what you're looking at with amazing accuracy

Giving AI systems the ability to focus on particular brain regions can make them much better at reconstructing images of what a monkey is...

newscientist.com

2 687 1

2025-05-18 17:45:18 UTC

New

AI>In The News

Adobe to automatically move subscribers to pricier, AI-focused tier in June

Monthly fees for multi-app subscribers to rise by up to 16.7 percent.

arstechnica.com

#adobe

6 893 5

2025-06-19 04:32:01 UTC

New

AI>In The News

Read That F*cking Code!

Stop vibe-coding blindly! Why reading AI-generated code is crucial in 2025. Avoid security flaws, architectural decay, and knowledge loss...

etsd.tech

#code

3 607 3

2025-08-12 20:59:43 UTC

New

AI>In The News

Vibe coding has turned senior devs into ‘AI babysitters,’ but they say it’s worth it

TechCrunch spoke to experienced coders about their time using AI-generated code about what they see as the future of vibe coding.

techcrunch.com

#coding #techcrunch

2 744 2

2025-09-22 11:45:51 UTC

New

Other popular topics

Backend>Chat

What is the reason behind Rust’s web framework, Rocket, not performing as well as expected in the Techempower benchmarks?

I know that these benchmarks might not be the exact picture of real-world scenario, but still I expect a Rust web framework performing a ...

#web-frameworks /rust

36 7463 11

2020-06-21 10:50:02 UTC

New

Game Dev>Learning Resources

Apple Game Frameworks and Technologies

Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...

pragprog.com

#pragprog #ios #game-dev #macos /swift #published-book #apple /book-apple-game-frameworks-and-technologies

30 7995 10

2021-04-22 16:51:02 UTC

New

General Dev>Code Editors

Poll: Which code editor do you use?

You might be thinking we should just ask who’s not using VSCode :joy: however there are some new additions in the space that might give V...

#community #polls /vim /emacs #code-editors /vscode #notepad /sublime-text #atom /textmate #codespaces #brackets /onivim #geany

121 5796 61

2025-09-05 00:52:19 UTC

New

General Dev>Code Editors

Dendron: a personal knowledge management tool on top of VSCode

/vscode #visual-studio-code

30 7372 9

2021-05-05 12:15:29 UTC

New

Backend>Chat

How to install Ruby 3 with ASDF

In case anyone else is wondering why Ruby 3 doesn’t show when you do asdf list-all ruby :man_facepalming: do this first: asdf plugin-upd...

/ruby #asdf

11 5961 4

2021-02-02 08:02:13 UTC

New

General Dev>Dev Chat

Roc Language - a new purely functional programming language built for speed and ergonomics

Hi folks, I don’t know if I saw this here but, here’s a new programming language, called Roc Reminds me a bit of Elm and thus Haskell. ...

#programminguages #functional-programming

49 5164 14

2021-11-10 20:03:09 UTC

New

Game Dev>Questions

Can I use Java to program a game for Nintendo switch?

I am trying to crate a game for the Nintendo switch, I wanted to use Java as I am comfortable with that programming language. Can you use...

/java #nintendo

8 4771 3

2023-09-15 11:15:04 UTC

New

AI>In The News

How to fix the eyes in AI-generated images

aidemos.info

0 4508 0

2022-09-10 13:54:33 UTC

New

General Dev>In The News

Jan: An open source alternative to ChatGPT that runs on the desktop

Jan | Rethink the Computer. Jan turns your computer into an AI machine by running LLMs locally on your computer. It’s a privacy-focus, l...

jan.ai

#desktop #chatgpt

4 5652 4

2024-03-29 08:42:30 UTC

New

General Dev>Reviews

Keyboard Review: UHK60V2 vs Defy vs Voyager vs Glove80 vs Svalboard

Ok, well here are some thoughts and opinions on some of the ergonomic keyboards I have, I guess like mini review of each that I use enoug...

/keyboards #uhk60v2 #defy #voyager #glove80 #svalboard

5 5681 7

2025-04-21 21:44:45 UTC

New

AI>In The News

Snowflake Cortex AI Escapes Sandbox and Executes Malware

AI>In The News

Toward automated verification of unreviewed AI-generated code

AI>In The News

Introducing Unsloth Studio

AI>In The News

Introducing GPT-5.4 mini and nano

AI>In The News

Claude tips for 3D work

AI>In The News

NVIDIA Launches Vera CPU, Purpose-Built for Agentic AI

AI>In The News

Amazon Tightens Code Guardrails After Outages Rock Retail Business

AI>In The News

Grammarly pulls AI author-impersonation tool after backlash

AI>In The News

Anthropic’s Claude AI can respond with charts, diagrams, and other visuals now

AI>In The News

An AI Agent Published a Hit Piece on Me – The Operator Came Forward

AI>In The News

AI In The News ❯

Latest on Devtalk

Nova v0.13.15 released!

Backend>Official News

Snowflake Cortex AI Escapes Sandbox and Executes Malware

AI>In The News

I haven't used a mouse for 14 years, and how to enable three fingers drag on macOS

macOS>In The News

FBI is buying location data to track US citizens, director confirms

General Dev>In The News

Remove Your Ring Camera With a Claw Hammer

General Dev>In The News

A Comprehensive Look at GPT-5.4 Mini and Nano: OpenAI’s ‘Small’ Models with ‘Big’ Ambitions

AI>Blogs/Talks

Nova v0.13.14 and v0.13.13 released!

Backend>Official News

React Native v0.85.0-rc.5 released!

Hybrid>Official News

Nova v0.13.12 and v0.13.11 released!

Backend>Official News

Quarkus 3.32.4 released!

Backend>Official News

Laravel v13.1.0 released!

Backend>Official News

Toward automated verification of unreviewed AI-generated code

AI>In The News

Introducing Unsloth Studio

AI>In The News

Introducing GPT-5.4 mini and nano

AI>In The News

Kagi Small Web

General Dev>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Defeating Nondeterminism in LLM Inference

CommunityNews

Defeating Nondeterminism in LLM Inference

Where Next?

Popular Ai topics

AI Is Discovering Patterns in Pure Mathematics That Have Never Been Seen Before

Actors launch campaign against AI 'show stealers'

Artificial Intelligence and Machine Learning– Explained

DeepMind breaks 50-year math record using AI; new record falls a week later

OpenAI debuts DALL-E API so devs can integrate its AI artwork into their apps

Nvidia and Microsoft team up to build massive AI cloud computer

Mind-reading AI recreates what you're looking at with amazing accuracy

Adobe to automatically move subscribers to pricier, AI-focused tier in June

Read That F*cking Code!

Vibe coding has turned senior devs into ‘AI babysitters,’ but they say it’s worth it

Other popular topics

What is the reason behind Rust’s web framework, Rocket, not performing as well as expected in the Techempower benchmarks?

Apple Game Frameworks and Technologies

Poll: Which code editor do you use?

Dendron: a personal knowledge management tool on top of VSCode

How to install Ruby 3 with ASDF

Roc Language - a new purely functional programming language built for speed and ergonomics

Can I use Java to program a game for Nintendo switch?

How to fix the eyes in AI-generated images

Jan: An open source alternative to ChatGPT that runs on the desktop

Keyboard Review: UHK60V2 vs Defy vs Voyager vs Glove80 vs Svalboard

Sponsor Spotlight

AI>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta