ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Lakehouse (Manning)

Apache Iceberg is an open data format that lets data lake files work like database tables. It helps turn a data lake into a more reliable and capable lakehouse.

Alex Merced

A quick update on a book we shared here earlier while it was still in MEAP—Architecting an Apache Iceberg Lakehouse by Alex Merced is now out in print.

If you followed along during early access, this is the finished version, tightened up and expanded based on reader feedback. And if you skipped it the first time around, it’s a solid, end-to-end look at how to design a lakehouse that stays flexible as your data and tooling evolve.

The book walks through building a lakehouse from scratch using Apache Iceberg, showing how pieces like Spark, Flink, and Dremio fit into a larger system. It doesn’t stop at diagrams—you actually build a working setup, starting with data ingestion from PostgreSQL and ending with analytics dashboards. Along the way, it gets into the decisions that matter in practice: handling schema changes, mixing batch and streaming pipelines, and keeping performance predictable as things scale.

Iceberg itself is getting a lot of traction as an open table format that brings database-like behavior to data lakes. This book gives you a clear picture of how to put it to work without relying on a single vendor stack.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

Where Next?

Popular Other Fields topics Top

AstonJ
China used facial recognition quite extensively: And now Russia is too: https://www.bbc.co.uk/news/av/world-europe-52157131/coronaviru...
New
PragmaticBookshelf
Self-driving cars, natural language recognition, and online recommendation engines are all possible thanks to Machine Learning. Discover...
New
AstonJ
Probably only a delay of the inevitable - with facial-recognition systems already part of daily lives in countries such as China, how lon...
New
PragmaticBookshelf
From finance to artificial intelligence, genetic algorithms are a powerful tool with a wide array of applications. But you don't need an ...
New
First poster: bot
One of the first things to note about this book is that it is not an academic textbook. The authors of this book are not academics but a ...
New
First poster: bot
We introduce the problem of perpetual view generation —long-range generation of novel views corresponding to an arbitrarily long camera t...
New
First poster: bot
Summary In this project, we Added an OpenGL backend for MXNet/TVM - a general-purpose tensor computation framework, so that it automat...
New
First poster: bot
What is Logica? Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, ...
New
CommunityNews
“Markpainting” is a clever technique to watermark photos in such a way that makes it easier to detect ML-based manipulation: An image o...
New
ManningBooks
Timeless Algorithms: The Seminal Papers explains both the how and the why of the most important data science algorithms. Along with the t...
New

Other popular topics Top

Devtalk
Reading something? Working on something? Planning something? Changing jobs even!? If you’re up for sharing, please let us know what you’...
1063 23050 405
New
PragmaticBookshelf
Write Elixir tests that you can be proud of. Dive into Elixir’s test philosophy and gain mastery over the terminology and concepts that u...
New
DevotionGeo
I know that -t flag is used along with -i flag for getting an interactive shell. But I cannot digest what the man page for docker run com...
New
AstonJ
This looks like a stunning keycap set :orange_heart: A LEGENDARY KEYBOARD LIVES ON When you bought an Apple Macintosh computer in the e...
New
AstonJ
Was just curious to see if any were around, found this one: I got 51/100: Not sure if it was meant to buy I am sure at times the b...
New
husaindevelop
Inside our android webview app, we are trying to paste the copied content from another app eg (notes) using navigator.clipboard.readtext ...
New
First poster: bot
zig/http.zig at 7cf2cbb33ef34c1d211135f56d30fe23b6cacd42 · ziglang/zig. General-purpose programming language and toolchain for maintaini...
New
New
First poster: AstonJ
Jan | Rethink the Computer. Jan turns your computer into an AI machine by running LLMs locally on your computer. It’s a privacy-focus, l...
New
mindriot
Ok, well here are some thoughts and opinions on some of the ergonomic keyboards I have, I guess like mini review of each that I use enoug...
New