ManningBooks

Rearchitecting LLMs (Manning)

Rearchitecting LLMs: Structural techniques for efficient models turns research from the latest AI papers into production-ready practices for domain-specific model optimization. As you work through this practical book, you’ll perform hands-on surgery on popular open-source models like Llama-3, Gemma, and Qwen to create cost-effective local Small Language Models (SLMs).

Pere Martra

The premise is simple: most general-purpose LLMs weren’t built for your domain, constraints, or budget. Instead of treating models as black boxes, this book walks through how to open them up and reshape them. Not at the prompt level, but structurally.

Pere goes deep into hands-on work with open-source models like Llama-3, Gemma, and Qwen, showing how to:

  • Remove parts of a model that don’t pull their weight

  • Use pruning and distillation in ways that actually survive contact with production

  • Combine behavioral analysis with architectural changes, instead of guessing

  • Build smaller, local SLMs that make sense for specific tasks

  • Apply “fair pruning” to reduce bias at the neuron level (this part surprised me)

This is very much a keyboard-on-desk book. You’re not just reading about recent research papers—you’re translating that research into workflows you can run, test, and reason about. If you’ve ever wondered why a model is slow, expensive, or oddly confident about the wrong things, this book tries to answer that by showing you where to cut and where not to.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout.
