DeepSeek - starting this week we'll open-source 5 repos

We’re a tiny team @deepseek-ai pushing our limits in AGI exploration.

Starting this week, Feb 24, 2025, we’ll open-source 5 repos – one daily drop – not because we’ve made grand claims, but simply as developers sharing our small-but-sincere progress with full transparency.

These are humble building blocks of our online service: documented, deployed and battle-tested in production. No vaporware, just sincere code that moved our tiny yet ambitious dream forward.

Why? Because every line shared becomes collective momentum that accelerates the journey. Daily unlocks begin soon. No ivory towers - just pure garage-energy and community-driven innovation.

Stay tuned – let’s geek out in the open together.

Hello, DeepSeek Open Infra!

202502 Open-Source Week

Day 1 - FlashMLA

Efficient MLA Decoding Kernel for Hopper GPUs
Optimized for variable-length sequences, battle-tested in production

FlashMLA GitHub Repo
BF16 support
Paged KV cache (block size 64)
Performance: 3000 GB/s memory-bound | BF16 580 TFLOPS compute-bound on H800
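
To make the "paged KV cache (block size 64)" bullet concrete, here is a minimal sketch of how a paged cache typically maps a logical token position to a physical block. It is not FlashMLA's code: the function names and the example block table are hypothetical, and only the block size of 64 comes from the post.

```python
BLOCK_SIZE = 64  # block size quoted in the post


def locate_token(block_table, token_index, block_size=BLOCK_SIZE):
    """Map a logical token position to (physical_block, offset_in_block).

    block_table[i] holds the physical block id backing the i-th logical
    block of one sequence (hypothetical layout, for illustration only).
    """
    logical_block, offset = divmod(token_index, block_size)
    return block_table[logical_block], offset


def blocks_needed(seq_len, block_size=BLOCK_SIZE):
    """Number of blocks a sequence of seq_len tokens occupies (ceiling division)."""
    return -(-seq_len // block_size)


# Hypothetical sequence whose three logical blocks live in physical blocks 7, 2, 9.
block_table = [7, 2, 9]
print(locate_token(block_table, 130))  # (9, 2): third logical block, offset 2
print(blocks_needed(130))              # 3
```

Because sequences only claim whole blocks as they grow, variable-length sequences can share one physical pool without reserving the maximum length up front.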

Day 2 - DeepEP

Excited to introduce DeepEP - the first open-source EP communication library for MoE model training and inference.

DeepEP GitHub Repo
Efficient and optimized all-to-all communication
Both intranode and internode support with NVLink and RDMA
High-throughput kernels for training and inference prefilling
Low-latency kernels for inference decoding
Native FP8 dispatch support
Flexible GPU resource control for computation-communication overlapping
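
The all-to-all "dispatch" step in expert parallelism can be pictured with a small single-process simulation. This is only a sketch of the general idea, not DeepEP's API: the function name, the contiguous expert-to-rank layout, and the choice to send a token at most once per rank are all assumptions made for illustration.

```python
from collections import defaultdict


def dispatch(token_experts, experts_per_rank):
    """Group token ids by the rank hosting each selected expert.

    token_experts[i] lists the expert ids chosen for token i.
    Assumes experts are laid out contiguously: expert e lives on
    rank e // experts_per_rank (hypothetical layout).
    """
    per_rank = defaultdict(list)
    for token_id, experts in enumerate(token_experts):
        # In this sketch a token is sent once per destination rank,
        # even if several of its experts live on that rank.
        for rank in sorted({e // experts_per_rank for e in experts}):
            per_rank[rank].append(token_id)
    return dict(per_rank)


# Hypothetical top-2 routing for three tokens over 8 experts on 2 ranks.
routing = [[0, 5], [2, 3], [4, 7]]
print(dispatch(routing, experts_per_rank=4))  # {0: [0, 1], 1: [0, 2]}
```

In a real system each rank runs this grouping on its own tokens and the resulting buffers are exchanged over NVLink or RDMA; the reverse "combine" step returns expert outputs to the originating rank.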

Day 3 - DeepGEMM

Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference.

DeepGEMM GitHub Repo
Up to 1350+ FP8 TFLOPS on Hopper GPUs
No heavy dependency, as clean as a tutorial
Fully Just-In-Time compiled
Core logic at ~300 lines - yet outperforms expert-tuned kernels across most matrix sizes
Supports dense layout and two MoE layouts
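
For readers who want to relate the TFLOPS figure to wall-clock time, the usual convention counts 2*m*n*k floating-point operations for an (m x k) by (k x n) matrix product. The helpers below are a sketch of that arithmetic only; the function names and the 4096-cubed shape are hypothetical, and 1350 TFLOPS is the "up to" figure from the post, not a guaranteed rate.

```python
def gemm_tflops(m, n, k, seconds):
    """Achieved TFLOPS for an (m x k) @ (k x n) product, counting 2*m*n*k FLOPs."""
    return 2 * m * n * k / seconds / 1e12


def min_seconds(m, n, k, peak_tflops):
    """Lower bound on runtime if the kernel sustained peak_tflops throughout."""
    return 2 * m * n * k / (peak_tflops * 1e12)


# Hypothetical square problem; 1350 is the "up to" figure quoted above.
t = min_seconds(4096, 4096, 4096, peak_tflops=1350)
print(f"{t * 1e3:.3f} ms")  # roughly a tenth of a millisecond
```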

Day 4 - Optimized Parallelism Strategies

DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
GitHub Repo

EPLB - an expert-parallel load balancer for V3/R1.
GitHub Repo
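
The problem an expert-parallel load balancer addresses can be illustrated with a classic greedy heuristic: place the heaviest experts first, always onto the currently least-loaded GPU. This is a generic sketch, not the EPLB algorithm, which this post does not describe; the function name and the load numbers are hypothetical.

```python
import heapq


def balance_experts(expert_loads, num_gpus):
    """Greedy longest-processing-time placement of experts onto GPUs.

    expert_loads maps an expert name to its estimated load (e.g. tokens routed).
    Returns (placement, totals), both keyed by GPU index.
    """
    heap = [(0.0, gpu) for gpu in range(num_gpus)]  # (current load, gpu)
    heapq.heapify(heap)
    placement = {gpu: [] for gpu in range(num_gpus)}
    # Heaviest experts first, each onto the least-loaded GPU so far.
    for expert, load in sorted(expert_loads.items(), key=lambda kv: -kv[1]):
        total, gpu = heapq.heappop(heap)
        placement[gpu].append(expert)
        heapq.heappush(heap, (total + load, gpu))
    totals = {gpu: sum(expert_loads[e] for e in experts)
              for gpu, experts in placement.items()}
    return placement, totals


# Hypothetical per-expert loads.
loads = {"e0": 90, "e1": 60, "e2": 50, "e3": 40, "e4": 30, "e5": 30}
placement, totals = balance_experts(loads, num_gpus=2)
print(placement)  # {0: ['e0', 'e3', 'e5'], 1: ['e1', 'e2', 'e4']}
print(totals)     # {0: 160, 1: 140}
```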

Analyze computation-communication overlap in V3/R1.
GitHub Repo

Day 5 - 3FS, Thruster for All DeepSeek Data Access

Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks.

6.6 TiB/s aggregate read throughput in a 180-node cluster
3.66 TiB/min throughput on GraySort benchmark in a 25-node cluster
40+ GiB/s peak throughput per client node for KVCache lookup
Disaggregated architecture with strong consistency semantics
Training data preprocessing, dataset loading, checkpoint saving/reloading, embedding vector search & KVCache lookups for inference in V3/R1
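
As a quick sanity check on the headline numbers above, the aggregate figures can be turned into rough per-node averages. This is back-of-the-envelope arithmetic only: it assumes load is spread evenly over every node counted in the post, and the helper name is hypothetical.

```python
GIB_PER_TIB = 1024


def per_node_gib_s(aggregate_tib, nodes, seconds=1.0):
    """Average GiB/s per node, given aggregate TiB moved in `seconds` by `nodes` nodes."""
    return aggregate_tib * GIB_PER_TIB / seconds / nodes


read = per_node_gib_s(6.6, 180)               # 6.6 TiB/s over 180 nodes
sort = per_node_gib_s(3.66, 25, seconds=60)   # 3.66 TiB/min over 25 nodes
print(f"read: {read:.1f} GiB/s per node")      # about 37.5
print(f"GraySort: {sort:.2f} GiB/s per node")  # about 2.50
```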

3FS → deepseek-ai/3FS on GitHub: a high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Smallpond, a data processing framework on 3FS → deepseek-ai/smallpond on GitHub: a lightweight data processing framework built on DuckDB and 3FS.

2024 AI Infrastructure Paper (SC24)

Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

Paper Link
Arxiv Paper Link


2025-02-28 16:43:30 UTC
