ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Lakehouse (Manning)

Apache Iceberg is an open data format that lets data lake files work like database tables. It helps turn a data lake into a more reliable and capable lakehouse.

Alex Merced

A quick update on a book we shared here earlier while it was still in MEAP—Architecting an Apache Iceberg Lakehouse by Alex Merced is now out in print.

If you followed along during early access, this is the finished version, tightened up and expanded based on reader feedback. And if you skipped it the first time around, it’s a solid, end-to-end look at how to design a lakehouse that stays flexible as your data and tooling evolve.

The book walks through building a lakehouse from scratch using Apache Iceberg, showing how pieces like Spark, Flink, and Dremio fit into a larger system. It doesn’t stop at diagrams—you actually build a working setup, starting with data ingestion from PostgreSQL and ending with analytics dashboards. Along the way, it gets into the decisions that matter in practice: handling schema changes, mixing batch and streaming pipelines, and keeping performance predictable as things scale.

Iceberg itself is getting a lot of traction as an open table format that brings database-like behavior to data lakes. This book gives you a clear picture of how to put it to work without relying on a single vendor stack.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

Where Next?

Popular Other Fields topics Top

AstonJ
China used facial recognition quite extensively: And now Russia is too: https://www.bbc.co.uk/news/av/world-europe-52157131/coronaviru...
New
PragmaticBookshelf
From finance to artificial intelligence, genetic algorithms are a powerful tool with a wide array of applications. But you don't need an ...
New
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
CommunityNews
“Markpainting” is a clever technique to watermark photos in such a way that makes it easier to detect ML-based manipulation: An image o...
New
First poster: bot
The Future of Deep Learning Is Photonic. Computing with light could slash the energy needs of neural networks
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
First poster: bot
Two recent collaborations between mathematicians and DeepMind demonstrate the potential of machine learning to help researchers generate ...
New
ManningBooks
The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of rel...
New
ManningBooks
DAX Reimagined isn’t just another beginner’s guide to the powerful DAX language. This unique book teaches you how to work with the engine...
New

Other popular topics Top

PragmaticBookshelf
Take your Go skills to the next level by learning how to design, develop, and deploy a distributed service. Start from the bare essential...
New
DevotionGeo
I know that -t flag is used along with -i flag for getting an interactive shell. But I cannot digest what the man page for docker run com...
New
AstonJ
We have a thread about the keyboards we have, but what about nice keyboards we come across that we want? If you have seen any that look n...
New
foxtrottwist
A few weeks ago I started using Warp a terminal written in rust. Though in it’s current state of development there are a few caveats (tab...
New
PragmaticBookshelf
Programming Ruby is the most complete book on Ruby, covering both the language itself and the standard library as well as commonly used t...
New
New
PragmaticBookshelf
Explore the power of Ash Framework by modeling and building the domain for a real-world web application. Rebecca Le @sevenseacat and ...
New
AstonJ
This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...
New
AstonJ
This is a very quick guide, you just need to: Download LM Studio: https://lmstudio.ai/ Click on search Type DeepSeek, then select the o...
New
Fl4m3Ph03n1x
Background Lately I am in a quest to find a good quality TTS ai generation tool to run locally in order to create audio for some videos I...
New