ManningBooks

ManningBooks

Devtalk Sponsor

Rearchitecting LLMs (Manning)

Rearchitecting LLMs: Structural techniques for efficient models turns research from the latest AI papers into production-ready practices for domain-specific model optimization. As you work through this practical book, you’ll perform hands-on surgery on popular open-source models like Llama-3, Gemma, and Qwen to create cost-effective local Small Language Models (SLMs).

Pere Marta

The premise is simple: most general-purpose LLMs weren’t built for your domain, constraints, or budget. Instead of treating models as black boxes, this book walks through how to open them up and reshape them. Not at the prompt level, but structurally.

Pere goes deep into hands-on work with open-source models like Llama-3, Gemma, and Qwen, showing how to:

  • Remove parts of a model that don’t pull their weight

  • Use pruning and distillation in ways that actually survive contact with production

  • Combine behavioral analysis with architectural changes, instead of guessing

  • Build smaller, local SLMs that make sense for specific tasks

  • Apply “fair pruning” to reduce bias at the neuron level (this part surprised me)

This is very much a keyboard-on-desk book. You’re not just reading about recent research papers—you’re translating that research into workflows you can run, test, and reason about. If you’ve ever wondered why a model is slow, expensive, or oddly confident about the wrong things, this book tries to answer that by showing you where to cut and where not to.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

Where Next?

Popular Ai topics Top

ManningBooks
Before deploying an AI model into production, you need to know more than just its accuracy. Will it be fast enough for your users? Will i...
New
New
ManningBooks
Grokking AI Algorithms, Second Edition introduces the most important AI algorithms using relatable illustrations, interesting examples, a...
New
ManningBooks
The bestselling book on Python deep learning, now covering generative AI, Keras 3, PyTorch, and JAX! François Chollet and Matthew ...
New
ManningBooks
Retrieval Augmented Generation, The Seminal Papers explores 12 foundational research papers that explain why RAG works, how it’s built, a...
New
ManningBooks
AI applications need much more than a connection to a model. To work well in the real world, they need memory, access to company knowledg...
New
ManningBooks
AI is changing how offensive security workflows are designed, executed, and analyzed. AI Agents for Offensive Security: AI-powered attack...
New
ManningBooks
Today’s AI models demand a lot of memory, compute, and server horsepower–which quickly translates into cost. Quantization and Fast Infere...
New
ManningBooks
AI tools like ChatGPT, Claude Code, and OpenClaw produce impressive results that can be shockingly human-like. But are they really thinki...
New
ManningBooks
Building LLM Applications with DSPy introduces DSPy best practices you can adopt to create reliable, production-ready systems through pro...
New

Other popular topics Top

PragmaticBookshelf
Stop developing web apps with yesterday’s tools. Today, developers are increasingly adopting Clojure as a web-development platform. See f...
New
siddhant3030
I’m thinking of buying a monitor that I can rotate to use as a vertical monitor? Also, I want to know if someone is using it for program...
New
AstonJ
There’s a whole world of custom keycaps out there that I didn’t know existed! Check out all of our Keycaps threads here: https://forum....
New
Margaret
Hello everyone! This thread is to tell you about what authors from The Pragmatic Bookshelf are writing on Medium.
1147 29994 760
New
PragmaticBookshelf
Build efficient applications that exploit the unique benefits of a pure functional language, learning from an engineer who uses Haskell t...
New
First poster: bot
zig/http.zig at 7cf2cbb33ef34c1d211135f56d30fe23b6cacd42 · ziglang/zig. General-purpose programming language and toolchain for maintaini...
New
New
PragmaticBookshelf
Get the comprehensive, insider information you need for Rails 8 with the new edition of this award-winning classic. Sam Ruby @rubys ...
New
NewsBot
Node.js v22.14.0 has been released. Link: Release 2025-02-11, Version 22.14.0 'Jod' (LTS), @aduh95 · nodejs/node · GitHub
New
PragmaticBookshelf
A concise guide to MySQL 9 database administration, covering fundamental concepts, techniques, and best practices. Neil Smyth MySQL...
New