ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Lakehouse (Manning)

Apache Iceberg is an open data format that lets data lake files work like database tables. It helps turn a data lake into a more reliable and capable lakehouse.

Alex Merced

A quick update on a book we shared here earlier while it was still in MEAP—Architecting an Apache Iceberg Lakehouse by Alex Merced is now out in print.

If you followed along during early access, this is the finished version, tightened up and expanded based on reader feedback. And if you skipped it the first time around, it’s a solid, end-to-end look at how to design a lakehouse that stays flexible as your data and tooling evolve.

The book walks through building a lakehouse from scratch using Apache Iceberg, showing how pieces like Spark, Flink, and Dremio fit into a larger system. It doesn’t stop at diagrams—you actually build a working setup, starting with data ingestion from PostgreSQL and ending with analytics dashboards. Along the way, it gets into the decisions that matter in practice: handling schema changes, mixing batch and streaming pipelines, and keeping performance predictable as things scale.

Iceberg itself is getting a lot of traction as an open table format that brings database-like behavior to data lakes. This book gives you a clear picture of how to put it to work without relying on a single vendor stack.


Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:

Where Next?

Popular Other Fields topics Top

New
CommunityNews
Data. It’s everywhere and we’re only getting more of it. For the last 5-10 years, data science has attracted newcomers near and far tryin...
New
AstonJ
Just listening now… details revealed in Thinking Elixir’s podcast: José Valim visits and finally publicly reveals what Project Nx is! H...
New
First poster: bot
Summary In this project, we Added an OpenGL backend for MXNet/TVM - a general-purpose tensor computation framework, so that it automat...
New
CommunityNews
“Markpainting” is a clever technique to watermark photos in such a way that makes it easier to detect ML-based manipulation: An image o...
New
First poster: bot
The Future of Deep Learning Is Photonic. Computing with light could slash the energy needs of neural networks
New
First poster: bot
Intro I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the la...
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
First poster: bot
Two recent collaborations between mathematicians and DeepMind demonstrate the potential of machine learning to help researchers generate ...
New
ManningBooks
Kafka for Architects teaches you how to incorporate Kafka into enterprise applications. This book stays above the code-level details, foc...
New

Other popular topics Top

PragmaticBookshelf
Machine learning can be intimidating, with its reliance on math and algorithms that most programmers don't encounter in their regular wor...
New
PragmaticBookshelf
Write Elixir tests that you can be proud of. Dive into Elixir’s test philosophy and gain mastery over the terminology and concepts that u...
New
PragmaticBookshelf
Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...
New
AstonJ
Continuing the discussion from Thinking about learning Crystal, let’s discuss - I was wondering which languages don’t GC - maybe we can c...
New
rustkas
Intensively researching Erlang books and additional resources on it, I have found that the topic of using Regular Expressions is either c...
New
New
PragmaticBookshelf
Leverage Elixir and the Nx ecosystem to build intelligent applications that solve real-world problems in computer vision, natural languag...
New
First poster: AstonJ
Jan | Rethink the Computer. Jan turns your computer into an AI machine by running LLMs locally on your computer. It’s a privacy-focus, l...
New
PragmaticBookshelf
Fight complexity and reclaim the original spirit of agility by learning to simplify how you develop software. The result: a more humane a...
New
PragmaticBookshelf
A concise guide to MySQL 9 database administration, covering fundamental concepts, techniques, and best practices. Neil Smyth MySQL...
New