ManningBooks

ManningBooks

Devtalk Sponsor

Build a DeepSeek Model (From Scratch) (Manning)

In Build a DeepSeek Model (From Scratch) you’ll build your own DeepSeek clone from the ground up. First, you’ll quickly review LLM fundamentals, with an eye to where DeepSeek’s innovations address the common problems and limitations of standard models. Then, you’ll learn everything you need to create your own DeepSeek-inspired model, including the innovations that put DeepSeek on the map: Multihead Latent Attention (MLA), Multi-Token Prediction (MTP), Mixture of Experts (MoE), model distillation, and reasoning.

Raj Abhijit Dandekar, Rajat Dandekar, Sreedath Panat, Naman Dwivedi

Build a DeepSeek Model (From Scratch) is a hands-on guide to creating your own DeepSeek-style model step by step. You’ll start with a base LLM, then implement reasoning, retrieval, and optimization components to reproduce DeepSeek’s key architectural ideas. It’s an accessible deep dive into how multi-stage inference, reasoning traces, and reinforcement loops come together to create models capable of logical, multi-step problem solving.

In this book, you’ll learn how to:

  • Implement a DeepSeek-style reasoning framework from the ground up

  • Integrate retrieval augmentation, reasoning tokens, and self-reflection

  • Apply reinforcement learning for reasoning improvement

  • Measure reasoning performance across math, logic, and code tasks

  • Understand how reasoning models differ from traditional LLMs

Whether you’re a machine learning engineer, researcher, or developer curious about reasoning-centric AI, Build a DeepSeek Model (From Scratch) offers a transparent, implementation-first look into one of the most exciting frontiers in modern AI.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

Most Liked

iPaul

iPaul

I own the Build an LLM from scratch, this books seems like a good follow up on that.

ManningBooks

ManningBooks

Devtalk Sponsor

Exactly! Even the authors agree with this.

alvinkatojr

alvinkatojr

I own the same book and I too I’m looking forward to this book :slight_smile:

Where Next?

Popular Ai topics Top

ManningBooks
In Build a Reasoning Model (From Scratch), acclaimed ML research engineer Sebastian Raschka takes you inside the black box of reasoning-e...
New
ManningBooks
Grokking AI Algorithms, Second Edition introduces the most important AI algorithms using relatable illustrations, interesting examples, a...
New
ManningBooks
The bestselling book on Python deep learning, now covering generative AI, Keras 3, PyTorch, and JAX! François Chollet and Matthew ...
New
pragdave
Build robust LLM-powered apps, chatbots, and agents while mastering AI engineering principles that will help you outlast the tools and th...
New
ManningBooks
In Build a DeepSeek Model (From Scratch) you’ll build your own DeepSeek clone from the ground up. First, you’ll quickly review LLM fundam...
New
ManningBooks
AI Governance: Secure, privacy-preserving, ethical systems presents a structured playbook for safely harnessing the potential of Generati...
New
ManningBooks
Hugging Face in Action reveals how to get the absolute best out of everything Hugging Face, from accessing state-of-the-art models to bui...
New
ManningBooks
Build AI-Enhanced Web Apps guides you through AI development using only JavaScript and other common web dev skills–no Python or Machine L...
New
New
ManningBooks
AI applications need much more than a connection to a model. To work well in the real world, they need memory, access to company knowledg...
New

Other popular topics Top

New
AstonJ
We have a thread about the keyboards we have, but what about nice keyboards we come across that we want? If you have seen any that look n...
New
New
Exadra37
Oh just spent so much time on this to discover now that RancherOS is in end of life but Rancher is refusing to mark the Github repo as su...
New
PragmaticBookshelf
Create efficient, elegant software tests in pytest, Python's most powerful testing framework. Brian Okken @brianokken Edited by Kat...
New
PragmaticBookshelf
Use WebRTC to build web applications that stream media and data in real time directly from one user to another, all in the browser. ...
New
New
New
New
sir.laksmana_wenk
I’m able to do the “artistic” part of game-development; character designing/modeling, music, environment modeling, etc. However, I don’t...
New