ManningBooks

Devtalk Sponsor

AI Model Evaluation (Manning)

Before deploying an AI model into production, you need to know more than just its accuracy. Will it be fast enough for your users? Will it scale under real-world traffic? Can you trust its decisions in critical scenarios? AI Model Evaluation (Manning Publications) gives you the practical tools and strategies to answer these questions—and more—so you can ship AI systems that actually work in the real world.

Leemay Nassery

What you’ll learn in AI Model Evaluation:

  • Build diagnostic offline evaluations to uncover hidden model behaviors
  • Use shadow traffic to simulate production conditions safely
  • Design A/B tests to measure real business and product impact
  • Spot nuanced failures with human-in-the-loop feedback
  • Scale evaluations with LLMs as automated judges

Author Leemay Nassery (Spotify, Comcast, Dropbox, Etsy) shares real-world insights on what it takes to prepare models for production. You’ll go beyond standard accuracy metrics to evaluate latency, user experience, and long-term impact on product goals.

Inside the book:
Each chapter explores a different evaluation method, from offline testing and A/B experiments to shadow deployments and qualitative analysis. Hands-on examples, including a movie recommendation engine, make it easy to apply these techniques to your own AI projects.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

peterchancc

We started exploring AI apps with LLMs, so this book should be a good reference for evaluating the open-source LLMs that we plan to use.
