CommunityNews

DeepSeek-V3.2: Pushing the frontier of open large language models

We introduce DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance. The key technical breakthroughs of DeepSeek-V3.2 are as follows:

(1) DeepSeek Sparse Attention (DSA): We introduce DSA, an efficient attention mechanism that substantially reduces computational complexity while preserving model performance in long-context scenarios.

(2) Scalable Reinforcement Learning Framework: By implementing a robust reinforcement learning protocol and scaling post-training compute, DeepSeek-V3.2 performs comparably to GPT-5. Notably, our high-compute variant, DeepSeek-V3.2-Speciale, surpasses GPT-5 and exhibits reasoning proficiency on par with Gemini-3.0-Pro, achieving gold-medal performance in both the 2025 International Mathematical Olympiad (IMO) and the International Olympiad in Informatics (IOI).

(3) Large-Scale Agentic Task Synthesis Pipeline: To integrate reasoning into tool-use scenarios, we developed a novel synthesis pipeline that systematically generates training data at scale. This methodology facilitates scalable agentic post-training, yielding substantial improvements in generalization and instruction-following robustness within complex, interactive environments.
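The abstract does not say how DSA decides which tokens to attend to, so the following is only a generic illustration of why sparse attention can be cheaper in long contexts, not DeepSeek's method. It assumes a simple top-k rule (keep the `k` keys with the highest dot-product score for a query); the function names `dense_attention` and `topk_sparse_attention` and the parameter `k` are hypothetical and chosen for this sketch.

```python
import math


def _softmax_weighted_sum(scores, values):
    """Softmax over `scores`, then the weighted average of `values`."""
    peak = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [
        sum(w * v[d] for w, v in zip(weights, values)) / total
        for d in range(dim)
    ]


def _scores(query, keys):
    """Scaled dot-product score of one query against every key."""
    scale = 1.0 / math.sqrt(len(query))
    return [scale * sum(q * x for q, x in zip(query, key)) for key in keys]


def dense_attention(query, keys, values):
    """Standard attention: the query attends to all keys."""
    return _softmax_weighted_sum(_scores(query, keys), values)


def topk_sparse_attention(query, keys, values, k):
    """Illustrative sparse attention: attend only to the k best-scoring keys.

    Assumption for this sketch: selection is plain top-k on the attention
    score. Scoring here still touches every key, so a practical design
    would need a cheaper selector for the saving to matter.
    """
    scores = _scores(query, keys)
    keep = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    return _softmax_weighted_sum(
        [scores[i] for i in keep], [values[i] for i in keep]
    )


if __name__ == "__main__":
    query = [1.0, 0.0]
    keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    values = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
    print(dense_attention(query, keys, values))
    print(topk_sparse_attention(query, keys, values, k=2))
```

With `k` equal to the number of keys the sparse version reduces to dense attention; with a small fixed `k`, the softmax and value aggregation per query no longer grow with context length.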

Read in full here:

https://huggingface.co/deepseek-ai/DeepSeek-V3.2/resolve/main/assets/paper.pdf

#pdf /deepseek

2025-12-02 17:55:35 UTC
