CommunityNews

Diffusion Training from Scratch on a Micro-Budget

Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget.
As scaling laws in generative AI push performance, they also concentrate the development of these models among actors with large computational resources. Focusing on text-to-image (T2I) generative models, we aim to address this bottleneck by demonstrating very low-cost training of large-scale T2I diffusion transformer models. Because the computational cost of transformers increases with the number of patches in each image, we propose randomly masking up to 75% of the image patches during training. We propose a deferred masking strategy that preprocesses all patches with a patch-mixer before masking, which significantly reduces the performance degradation caused by masking and makes it superior to model downscaling for reducing computational cost. We also incorporate the latest improvements in transformer architecture, such as mixture-of-experts layers, to improve performance, and we further identify the critical benefit of using synthetic images in micro-budget training. Finally, using only 37M publicly available real and synthetic images, we train a 1.16 billion parameter sparse transformer at an economical cost of only $1,890 and achieve a 12.7 FID in zero-shot generation on the COCO dataset. Notably, our model achieves competitive FID and high-quality generations while incurring 118$\times$ lower cost than stable diffusion models and 14$\times$ lower cost than the current state-of-the-art approach, which costs $28,400. We aim to release our end-to-end training pipeline to further democratize the training of large-scale diffusion models on micro-budgets.
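
For readers who want to see the order of operations the abstract describes (mix all patches first, mask afterwards), here is a rough toy sketch in plain Python. It is not the authors' code: `toy_patch_mixer`, `sample_keep_indices`, and `deferred_mask` are hypothetical names, each patch is reduced to a single number, and the "mixer" is just a blend with the global mean standing in for whatever the real patch-mixer does. The last two lines assume standard self-attention, whose pairwise cost grows with the square of the token count.

```python
import random


def sample_keep_indices(num_patches, mask_ratio, rng):
    """Return sorted indices of the patches that survive random masking."""
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    num_keep = max(1, round(num_patches * (1.0 - mask_ratio)))
    return sorted(rng.sample(range(num_patches), num_keep))


def toy_patch_mixer(patches):
    """Hypothetical stand-in for the patch-mixer: blend each patch with the global mean."""
    mean = sum(patches) / len(patches)
    return [0.5 * p + 0.5 * mean for p in patches]


def deferred_mask(patches, mask_ratio, rng):
    """Mix every patch first, then drop the masked ones (the deferred order)."""
    mixed = toy_patch_mixer(patches)
    keep = sample_keep_indices(len(mixed), mask_ratio, rng)
    return keep, [mixed[i] for i in keep]


rng = random.Random(0)
patches = [float(i) for i in range(256)]  # toy 16x16 grid, one scalar per patch
keep, visible = deferred_mask(patches, 0.75, rng)

# Fraction of tokens the backbone still sees, and the matching fraction of
# token-pair interactions under the quadratic-attention assumption.
token_fraction = len(visible) / len(patches)
attention_fraction = token_fraction ** 2
```

With a 75% mask ratio the backbone sees a quarter of the patches, so under that assumption the attention pair count falls to one sixteenth. Every kept patch has already been blended with information from the masked ones, which is the intuition the abstract gives for why masking after the mixer hurts less.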

This thread was posted by one of our members via one of our news source trackers.

#training

2024-07-30 16:05:23 UTC
