CommunityNews

Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192

A paper presented at SOSP 2025 details how token-level scheduling helped one GPU serve multiple LLMs, reducing demand from 1,192 to 213 H20s.
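A quick check of the headline arithmetic, as a minimal sketch: the two GPU counts come from the summary above, while the function names are made up for illustration. It shows that going from 1,192 to 213 GPUs is indeed about an 82% cut.

```python
# GPU counts quoted in the summary above.
GPUS_BEFORE = 1192
GPUS_AFTER = 213


def reduction_percent(before: int, after: int) -> float:
    """Percentage of GPUs no longer needed (hypothetical helper)."""
    return (before - after) / before * 100


def consolidation_ratio(before: int, after: int) -> float:
    """How many of the original GPUs each remaining GPU stands in for."""
    return before / after


if __name__ == "__main__":
    print(f"GPU reduction: {reduction_percent(GPUS_BEFORE, GPUS_AFTER):.1f}%")
    print(f"Consolidation ratio: {consolidation_ratio(GPUS_BEFORE, GPUS_AFTER):.1f}x")
```

The fleet-wide ratio works out to roughly 5.6x, so the "up to 9x" figure in the headline does not follow from these two counts alone. It is presumably a separate measurement reported in the paper.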


2 comments

#nvidia #cloud #gpu


2025-10-25 12:41:32 UTC

Most Liked

jmagnani

They probably want to become less dependent on Nvidia GPUs.

Post #2

jkdiaz

Some companies in China are probably already building their own GPUs that can rival Nvidia’s.

Post #3
