ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Data Lakehouse (Manning)

The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of relying on closed, vendor-specific platforms.

Highlights:

  • End-to-end guidance on building an Iceberg-based lakehouse from storage to BI
  • Integrates tools like Spark, Flink, Dremio, and Polaris
  • Covers ingestion pipelines (batch & streaming), schema evolution, governance, and security
  • Hands-on examples using PostgreSQL, Apache Superset, and more
  • Focus on real-world tradeoffs and implementation decisions at scale

The “lakehouse” data architecture is a powerful way to combine the flexibility of data lakes with the management features of data warehouses. The open source Apache Iceberg framework delivers the scalability, reliability, and performance you want from a lakehouse without the expense and vendor lock-in of platforms like Snowflake, BigQuery, and Redshift.

In Architecting an Apache Iceberg Data Lakehouse , data guru Alex Merced shows you:

  • How to create a modular, scalable Iceberg lakehouse architecture
  • Where Spark, Flink, Dremio, Polaris fit into your design
  • Reliable batch and streaming ingestion pipelines
  • Strategies for governance, security, and performance at scale

Apache Iceberg is an open source table format perfect for massive analytic datasets. Iceberg enables ACID transactions, schema evolution, and high-performance queries on data lakes using multiple compute engines like Spark, Trino, Flink, Presto, and Hive. An Iceberg data lakehouse enables fast, reliable analytics at scale while retaining the observability you need for compliance audits, governance, and provable data security.


If you’re exploring Iceberg as an alternative to platforms like Snowflake or BigQuery—or already using it and want to deepen your understanding—this could be a useful resource. The Early Access format also means readers can give feedback as the book evolves.

Full details: Architecting an Apache Iceberg Lakehouse - Alex Merced

Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com" at checkout :+1:

Where Next?

Popular Other Fields topics Top

PragmaticBookshelf
Self-driving cars, natural language recognition, and online recommendation engines are all possible thanks to Machine Learning. Discover...
New
AstonJ
Probably only a delay of the inevitable - with facial-recognition systems already part of daily lives in countries such as China, how lon...
New
First poster: bot
What is Logica? Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, ...
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
First poster: bot
The Modern Mathematics of Deep Learning. We describe the new field of mathematical analysis of deep learning. This field emerged around...
New
First poster: bot
Intro I finally escaped from (grad) school in 2019, spent two months interning as an assistant trader at FTX, and have since spent the la...
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
ManningBooks
With Grokking Statistics, you’ll build a strong foundation in statistical analysis by working through engaging mini projects that put eac...
New
ManningBooks
DAX Reimagined isn’t just another beginner’s guide to the powerful DAX language. This unique book teaches you how to work with the engine...
New
New

Other popular topics Top

Devtalk
Reading something? Working on something? Planning something? Changing jobs even!? If you’re up for sharing, please let us know what you’...
1063 23050 405
New
PragmaticBookshelf
Brace yourself for a fun challenge: build a photorealistic 3D renderer from scratch! In just a couple of weeks, build a ray tracer that r...
New
PragmaticBookshelf
Free and open source software is the default choice for the technologies that run our world, and it’s built and maintained by people like...
New
PragmaticBookshelf
Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...
New
DevotionGeo
I know that -t flag is used along with -i flag for getting an interactive shell. But I cannot digest what the man page for docker run com...
New
DevotionGeo
The V Programming Language Simple language for building maintainable programs V is already mentioned couple of times in the forum, but I...
New
PragmaticBookshelf
Author Spotlight Mike Riley @mriley This month, we turn the spotlight on Mike Riley, author of Portable Python Projects. Mike’s book ...
New
DevotionGeo
I have always used antique keyboards like Cherry MX 1800 or Cherry MX 8100 and almost always have modified the switches in some way, like...
New
PragmaticBookshelf
Develop, deploy, and debug BEAM applications using BEAMOps: a new paradigm that focuses on scalability, fault tolerance, and owning each ...
New
CommunityNews
Open-source implementation of the classic GTA engine now running directly in your browser. Experience the reVC technology demo on DOS.Zon...
New