CommunityNews

Imagen: An AI system that creates photorealistic images from input text

We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model. Imagen achieves a new state-of-the-art FID score of 7.27 on the COCO dataset, without ever training on COCO, and human raters find Imagen samples to be on par with the COCO data itself in image-text alignment. To assess text-to-image models in greater depth, we introduce DrawBench, a comprehensive and challenging benchmark for text-to-image models. With DrawBench, we compare Imagen with recent methods including VQ-GAN+CLIP, Latent Diffusion Models, and DALL-E 2, and find that human raters prefer Imagen over other models in side-by-side comparisons, both in terms of sample quality and image-text alignment.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

1 comment

3 1307 1

2022-09-15 14:50:29 UTC

Most Liked

AstonJ

It’s amazing how far AI art has come

A photo of a raccoon wearing an astronaut helmet, looking out of the window at night.

Post #3

Where Next?

View thread on forum

Home AI>In The News

3 1307 1

Last post

Popular Ai topics

AI>In The News

Nvidia Uses AI to Slash Bandwidth on Video Calls

NVIDIA Uses AI to Slash Bandwidth on Video Calls. NVIDIA Research has invented a way to use AI to dramatically reduce video call bandwid...

petapixel.com

#video #nvidia

1 966 0

2020-10-09 15:35:49 UTC

New

AI>In The News

Nvidia Announces A100 80GB GPU for AI

NVIDIA Doubles Down: Announces A100 80GB GPU, Supercharging World’s Most Powerful GPU for AI Supercomputing. SC20—NVIDIA today unveiled ...

nvidianews.nvidia.com

#nvidia

0 1351 1

2020-11-19 00:28:58 UTC

New

AI>In The News

Everyone wants to do the model work, not the data work: Data Cascades in High-Stakes AI (pdf)

AI models are increasingly applied in high-stakes domains like health and conservation. Data quality carries an elevated signifi- cance i...

storage.googleapis.com

#pdf

0 1634 0

2021-03-30 14:41:01 UTC

New

AI>In The News

In the metaverse, responsible AI must be a priority

Language technology powered by AI can perpetuate bias if we are not careful. We need to be sure that language AI is trained to be ethical...

techcrunch.com

#metaverse

0 973 0

2022-03-05 14:57:25 UTC

New

AI>In The News

How to fix the eyes in AI-generated images

aidemos.info

0 4508 0

2022-09-10 13:54:33 UTC

New

AI>In The News

Adobe plays catch-up with Project Blink, an AI-powered video editor

AI video editor can recognize objects, people, and sounds, allowing editing via text.

arstechnica.com

#project #video #adobe

0 1271 0

2022-10-20 22:31:08 UTC

New

AI>In The News

Crush: The glamourous AI coding agent for your favourite terminal 💘

The glamourous AI coding agent for your favourite terminal :heart_with_arrow: - charmbracelet/crush

github.com

#terminal #coding #github #crush

0 1121 0

2025-07-31 01:27:58 UTC

New

AI>In The News

Claude-code - native LSP support

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing rout...

github.com

#code #changelog #claude

0 1 0

2025-12-23 13:53:12 UTC

New

AI>In The News

Moltbook - the front page of the agent internet

A social network built exclusively for AI agents. Where AI agents share, discuss, and upvote. Humans welcome to observe.

moltbook.com

#internet #agent

0 11 0

2026-01-30 14:53:02 UTC

New

AI>In The News

Reasonix — DeepSeek-native AI coding agent

Open-source AI coding agent for your terminal. Engineered around DeepSeek’s prefix-cache. MCP first-class · plan mode · MIT.

esengine.github.io

#coding /deepseek #agent #native

1 1166 1

2026-06-22 17:11:22 UTC

New

Other popular topics

Linux>Questions

AMD or Intel for Programming with Linux as the OS?

I am thinking in building or buy a desktop computer for programing, both professionally and on my free time, and my choice of OS is Linux...

#mobile #android #web-development #linux #desktop-computer #mobile-development

36 6006 10

2020-07-12 20:50:05 UTC

New

General Dev>Hardware

Moonlander Keyboard (Mechanical) (Ergonomic) (Split) (Ortholinear)

Bought the Moonlander mechanical keyboard. Cherry Brown MX switches. Arms and wrists have been hurting enough that it’s time I did someth...

#hardware /keyboards #moonlander #mechanical-keyboards #ortholinear #ergonomic

212 17779 90

2021-07-13 15:33:55 UTC

New

Data Science

Genetic Algorithms in Elixir

From finance to artificial intelligence, genetic algorithms are a powerful tool with a wide array of applications. But you don't need an ...

#pragprog #ai /elixir #published-book /book-genetic-algorithms-in-elixir

25 5243 6

2021-02-09 12:32:09 UTC

New

General Dev>Hardware

Keyboard thock (sound)

I’ve been hearing quite a lot of comments relating to the sound of a keyboard, with one of the most desirable of these called ‘thock’, he...

/keyboards #mechanical-keyboards

14 11197 8

2020-11-11 11:59:23 UTC

New

General Dev>Dev Chat

PragProg’s Medium Posts

Hello everyone! This thread is to tell you about what authors from The Pragmatic Bookshelf are writing on Medium.

#pragprog #blog-post

1147 29994 760

2025-07-10 13:36:16 UTC

New

Backend>Learning Resources

Python Testing with pytest, Second Edition

Create efficient, elegant software tests in pytest, Python's most powerful testing framework. Brian Okken @brianokken Edited by Kat...

pragprog.com

#pragprog /python #published-book /book-python-testing-with-pytest-second-edition

16 7461 4

2021-06-25 16:57:39 UTC

New

General Dev>Code Editors

Doom-Emacs: Can't find emacs in your PATH

If you get Can't find emacs in your PATH when trying to install Doom Emacs on your Mac you… just… need to install Emacs first! :lol: bre...

#macos /emacs #doom-emacs

4 5837 0

2022-02-04 00:32:03 UTC

New

Android>Questions

Clipboard readtext not working in android webview

Inside our android webview app, we are trying to paste the copied content from another app eg (notes) using navigator.clipboard.readtext ...

#android #clipboard

1 5651 0

2022-09-27 18:52:03 UTC

New

General Dev>In The News

Review of Linux on Minisforum V3 AMD Ryzen Tablet

A Brief Review of the Minisforum V3 AMD Tablet. Update: I have created an awesome-minisforum-v3 GitHub repository to list information fo...

mudkip.me

#linux #review #amd

0 4635 0

2024-06-24 02:26:38 UTC

New

Backend>Learning Resources

Advanced Functional Programming with Elixir

Use advanced functional programming principles, practical Domain-Driven Design techniques, and production-ready Elixir code to build scal...

joekoski.com

#pragprog /elixir #published-book #functional-programming /book-advanced-functional-programming-with-elixir

43 4989 22

2025-10-06 09:04:44 UTC

New

AI>In The News

Agents Are Invention Machines

AI>In The News

Claude Code: Anatomy of a Misfeature

AI>In The News

Kimi K3 - Intelligence, Performance & Price Analysis

AI>In The News

Introducing LM Studio Bionic: the AI agent for open models

AI>In The News

Grok Build is open source

AI>In The News

The Agentic Loop: Three loops in a trench coat

AI>In The News

How OpenAI Plans To Win Over Doctors, Patients And Hospitals

AI>In The News

Google revamps image search for its 25th anniversary with more images and more AI

AI>In The News

Zig Creator Calls Spade a Spade, Anthropic Blows Smoke

AI>In The News

Old and new apps, via modern coding agents

AI>In The News

AI In The News ❯

Latest on Devtalk

Grails v8.0.0-M4 released!

Backend>Official News

Agents Are Invention Machines

AI>In The News

Fable 5.11.0 released!

Frontend>Official News

Claude Code: Anatomy of a Misfeature

AI>In The News

Linus Torvalds to critics of AI coding in Linux: "Fork it. Or just walk away."

Linux>In The News

It's official: EU will force Google to share search data and open up AI on Android

Android>In The News

Kimi K3 - Intelligence, Performance & Price Analysis

AI>In The News

Introducing LM Studio Bionic: the AI agent for open models

AI>In The News

Fable 5.10.0 released!

Frontend>Official News

Crystal 1.21.0 released!

Backend>Official News

Space Datacenters - if you see a datacenter in orbit, it means something went wrong on Earth

General Dev>In The News

Distinguishing variables from parameters

General Dev>In The News

"One Hot Node"

General Dev>In The News

Salary information to be shown on job ads under new laws

General Dev>In The News

Digital Bandung

General Dev>In The News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Imagen: An AI system that creates photorealistic images from input text

CommunityNews

Imagen: An AI system that creates photorealistic images from input text

Most Liked

AstonJ

Where Next?

Popular Ai topics

Nvidia Uses AI to Slash Bandwidth on Video Calls

Nvidia Announces A100 80GB GPU for AI

Everyone wants to do the model work, not the data work: Data Cascades in High-Stakes AI (pdf)

In the metaverse, responsible AI must be a priority

How to fix the eyes in AI-generated images

Adobe plays catch-up with Project Blink, an AI-powered video editor

Crush: The glamourous AI coding agent for your favourite terminal 💘

Claude-code - native LSP support

Moltbook - the front page of the agent internet

Reasonix — DeepSeek-native AI coding agent

Other popular topics

AMD or Intel for Programming with Linux as the OS?

Moonlander Keyboard (Mechanical) (Ergonomic) (Split) (Ortholinear)

Genetic Algorithms in Elixir

Keyboard thock (sound)

PragProg’s Medium Posts

Python Testing with pytest, Second Edition

Doom-Emacs: Can't find emacs in your PATH

Clipboard readtext not working in android webview

Review of Linux on Minisforum V3 AMD Ryzen Tablet

Advanced Functional Programming with Elixir

Sponsor Spotlight

AI>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta