CommunityNews

LLM inference speed of light

LLM inference speed of light.
In the process of working on calm, a minimal from-scratch fast CUDA implementation of transformer-based language model inference, a critical consideration was establishing the speed of light for the inference process, and measuring the progress relative to that speed of light. In this post we’ll cover this theoretical limit and its implications.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

View thread on forum

#llm

0 362 0

2024-03-17 16:17:38 UTC

Where Next?

View thread on forum

llm

Home General Dev>In The News

#llm

0 362 0

Last post

Popular General Dev topics

General Dev>In The News

F# Is The Best Programming Language Today

F# Is The Best Coding Language Today. If you want to personally pick up a programming language in order to become a better coder in what...

danielbmarkham.com

#f-sharp #programming

41 1882 14

2022-02-27 23:57:32 UTC

New

General Dev>In The News

Doom-emacs: An Emacs framework

GitHub - hlissner/doom-emacs: An Emacs framework for the stubborn martian hacker. An Emacs framework for the stubborn martian hacker - G...

github.com

/emacs #doom-emacs

55 3346 16

2022-08-11 18:02:08 UTC

New

General Dev>In The News

LiveKit – open-source, high performance WebRTC infrastructure

GitHub - livekit/livekit: Scalable, high-performance WebRTC SFU. SDKs in JavaScript, React, React Native, Flutter, Swift, Kotlin, Unity/C...

github.com

#infrastructure #performance #webrtc

1 1773 1

2022-12-02 07:18:47 UTC

New

General Dev>In The News

LG 28-inch 16:18 DualUp Monitor

LG 28-inch 16:18 DualUp Monitor with Ergo Stand and USB Type-C™ (28MQ780-B) | LG USA. Shop LG 28MQ780-B on the official LG.com website ...

lg.com

12 2239 12

2022-09-01 19:28:37 UTC

New

General Dev>In The News

Tim Cook to take 50% pay hit after shareholder feedback

Apple’s Tim Cook to take 50% pay hit after shareholder feedback. ‘Target compensation’ for CEO down from $99.4m in 2022 to an expected $...

theguardian.com

#feedback

0 1856 0

2023-01-13 17:12:56 UTC

New

General Dev>In The News

Whatever happened to Elm, anyway?

Whatever happened to Elm, anyway?. I see this question pop up quite frequently in lots of different arenas - folks are curious as to wha...

derw.substack.com

/elm

17 1275 12

2025-04-21 03:57:49 UTC

New

General Dev>In The News

ONNX Runtime merges WebGPU backend

[js/web] WebGPU backend via JSEP by fs-eire · Pull Request #14579 · microsoft/onnxruntime. Description This change introduced the follo...

github.com

#backend

0 1372 0

2023-04-25 15:04:03 UTC

New

General Dev>In The News

Testing Intel’s Arc A770 GPU for Deep Learning

Christian Mills - Testing Intel’s Arc A770 GPU for Deep Learning Pt. 2. This post covers my experience training image classification mod...

christianjmills.com

#testing #learning #intel

0 1940 0

2023-08-09 15:00:13 UTC

New

General Dev>In The News

Ladybird: Truly independent web browser

Truly independent web browser. Contribute to LadybirdBrowser/ladybird development by creating an account on GitHub.

github.com

#browser #web #github

4 988 3

2025-03-10 13:45:11 UTC

New

General Dev>In The News

Everything Is Chrome

The power is in Google’s hands.

vale.rocks

#chrome

2 701 1

2025-03-11 21:52:03 UTC

New

Other popular topics

Backend>Learning Resources

Web Development with Clojure, Third Edition

Stop developing web apps with yesterday’s tools. Today, developers are increasingly adopting Clojure as a web-development platform. See f...

pragprog.com

#pragprog #web-development /clojure #published-book /book-web-development-with-clojure-third-edition

5 4584 1

2022-01-06 05:27:09 UTC

New

General Dev>Dev Chat

Standing Desks

No chair. I have a standing desk. This post was split into a dedicated thread from our thread about chairs :slight_smile:

#workspace #opinions

177 9886 77

2022-09-27 18:40:05 UTC

New

General Dev>Hardware

Planck vs Preonic vs Subatomic (Keyboards)

I ended up cancelling my Moonlander order as I think it’s just going to be a bit too bulky for me. I think the Planck and the Preonic (o...

/keyboards #mechanical-keyboards #ortholinear #planck #preonic

105 17596 47

2021-05-28 21:32:35 UTC

New

Frontend>Learning Resources

Modern CSS with Tailwind

Tailwind CSS is an exciting new CSS framework that allows you to design your site by composing simple utility classes to create complex e...

pragprog.com

#pragprog /tailwind #published-book /book-modern-css-with-tailwind

12 5813 4

2021-05-13 14:50:23 UTC

New

General Dev>Dev Chat

Warp—The blazingly fast, Rust-based terminal

A few weeks ago I started using Warp a terminal written in rust. Though in it’s current state of development there are a few caveats (tab...

/rust #terminal

52 6785 22

2025-02-26 17:47:24 UTC

New

Game Dev>Questions

Can I use Java to program a game for Nintendo switch?

I am trying to crate a game for the Nintendo switch, I wanted to use Java as I am comfortable with that programming language. Can you use...

/java #nintendo

8 4771 3

2023-09-15 11:15:04 UTC

New

Backend>Learning Resources

Engineering Elixir Applications

Develop, deploy, and debug BEAM applications using BEAMOps: a new paradigm that focuses on scalability, fault tolerance, and owning each ...

pragprog.com

#pragprog /elixir #published-book /book-engineering-elixir-applications

40 7136 21

2024-11-08 15:13:02 UTC

New

AI>In The News

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...

#ai #macs /deepseek

0 6695 1

2025-01-29 18:43:37 UTC

New

Backend>Official News

Node.js v22.14.0 released!

Node.js v22.14.0 has been released. Link: Release 2025-02-11, Version 22.14.0 'Jod' (LTS), @aduh95 · nodejs/node · GitHub

github.com

/nodejs #official-news

0 4251 0

2025-02-11 15:30:14 UTC

New

Backend>Learning Resources

Advanced Functional Programming with Elixir

Use advanced functional programming principles, practical Domain-Driven Design techniques, and production-ready Elixir code to build scal...

joekoski.com

#pragprog /elixir #published-book #functional-programming /book-advanced-functional-programming-with-elixir

43 4989 22

2025-10-06 09:04:44 UTC

New

General Dev>In The News

Please do not use auto-scrolling content on the web and in applications

General Dev>In The News

Xous 0.10.0 - Introducing Baochip-1x Support

General Dev>In The News

Analytic Fog Rendering With Volumetric Primitives

General Dev>In The News

How I Dropped Our Production Database and Now Pay 10% More for AWS

General Dev>In The News

Google's new command-line tool can plug OpenClaw into your Workspace data

General Dev>In The News

World-first gigabit-per-second laser link between aircraft and geostationary satellite

General Dev>In The News

Converting dash cam videos into Panoramax images

General Dev>In The News

Google Safe Browsing missed 84% of confirmed phishing sites

General Dev>In The News

Good software knows when to stop

General Dev>In The News

A GitHub Issue Title Compromised 4,000 Developer Machines

General Dev>In The News

General Dev In The News ❯

Latest on Devtalk

What AI coding costs you

AI>In The News

Please do not use auto-scrolling content on the web and in applications

General Dev>In The News

Emulator error 139 with latest android studio version

Android>Questions

Xous 0.10.0 - Introducing Baochip-1x Support

General Dev>In The News

Analytic Fog Rendering With Volumetric Primitives

General Dev>In The News

How I Dropped Our Production Database and Now Pay 10% More for AWS

General Dev>In The News

Feds take notice of iOS vulnerabilities exploited under mysterious circumstances

iOS>In The News

Google's new command-line tool can plug OpenClaw into your Workspace data

General Dev>In The News

React Native v0.83.4 released!

Hybrid>Official News

Symfony v8.0.7, v7.4.7 and v6.4.35 released!

Backend>Official News

TypeScript v6.0-rc released!

Frontend>Official News

World-first gigabit-per-second laser link between aircraft and geostationary satellite

General Dev>In The News

Converting dash cam videos into Panoramax images

General Dev>In The News

Comparing Python packages for A/B test analysis: tea-tasting, Pingouin, statsmodels, and SciPy

Backend>In The News

Google Safe Browsing missed 84% of confirmed phishing sites

General Dev>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

LLM inference speed of light

CommunityNews

LLM inference speed of light

Where Next?

Popular General Dev topics

F# Is The Best Programming Language Today

Doom-emacs: An Emacs framework

LiveKit – open-source, high performance WebRTC infrastructure

LG 28-inch 16:18 DualUp Monitor

Tim Cook to take 50% pay hit after shareholder feedback

Whatever happened to Elm, anyway?

ONNX Runtime merges WebGPU backend

Testing Intel’s Arc A770 GPU for Deep Learning

Ladybird: Truly independent web browser

Everything Is Chrome

Other popular topics

Web Development with Clojure, Third Edition

Standing Desks

Planck vs Preonic vs Subatomic (Keyboards)

Modern CSS with Tailwind

Warp—The blazingly fast, Rust-based terminal

Can I use Java to program a game for Nintendo switch?

Engineering Elixir Applications

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

Node.js v22.14.0 released!

Advanced Functional Programming with Elixir

Sponsor Spotlight

General Dev>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta