ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Data Lakehouse (Manning)

The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of relying on closed, vendor-specific platforms.

Highlights:

  • End-to-end guidance on building an Iceberg-based lakehouse from storage to BI
  • Integrates tools like Spark, Flink, Dremio, and Polaris
  • Covers ingestion pipelines (batch & streaming), schema evolution, governance, and security
  • Hands-on examples using PostgreSQL, Apache Superset, and more
  • Focus on real-world tradeoffs and implementation decisions at scale

The “lakehouse” data architecture is a powerful way to combine the flexibility of data lakes with the management features of data warehouses. The open source Apache Iceberg framework delivers the scalability, reliability, and performance you want from a lakehouse without the expense and vendor lock-in of platforms like Snowflake, BigQuery, and Redshift.

In Architecting an Apache Iceberg Data Lakehouse , data guru Alex Merced shows you:

  • How to create a modular, scalable Iceberg lakehouse architecture
  • Where Spark, Flink, Dremio, Polaris fit into your design
  • Reliable batch and streaming ingestion pipelines
  • Strategies for governance, security, and performance at scale

Apache Iceberg is an open source table format perfect for massive analytic datasets. Iceberg enables ACID transactions, schema evolution, and high-performance queries on data lakes using multiple compute engines like Spark, Trino, Flink, Presto, and Hive. An Iceberg data lakehouse enables fast, reliable analytics at scale while retaining the observability you need for compliance audits, governance, and provable data security.


If you’re exploring Iceberg as an alternative to platforms like Snowflake or BigQuery—or already using it and want to deepen your understanding—this could be a useful resource. The Early Access format also means readers can give feedback as the book evolves.

Full details: Architecting an Apache Iceberg Lakehouse - Alex Merced

Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com" at checkout :+1:

Where Next?

Popular Other Fields topics Top

AstonJ
Is this the future of photography - or spying even :joy: Creating realistic 3D photo-models from a series of photographs - in this case ...
New
First poster: bot
As a student, when I was starting to seriously consider Data Science (DS) as a career option, the first thing that came to mind was where...
New
First poster: bot
Adversarial.io is an easy-to-use webapp for altering image material, in order to make it machine-unreadable. It works best with 299 x 29...
New
First poster: bot
Summary In this project, we Added an OpenGL backend for MXNet/TVM - a general-purpose tensor computation framework, so that it automat...
New
First poster: bot
What is Logica? Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, ...
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
First poster: bot
The Modern Mathematics of Deep Learning. We describe the new field of mathematical analysis of deep learning. This field emerged around...
New
First poster: bot
The Future of Deep Learning Is Photonic. Computing with light could slash the energy needs of neural networks
New
ManningBooks
With Grokking Statistics, you’ll build a strong foundation in statistical analysis by working through engaging mini projects that put eac...
New
ManningBooks
Kafka for Architects teaches you how to incorporate Kafka into enterprise applications. This book stays above the code-level details, foc...
New

Other popular topics Top

PragmaticBookshelf
Take your Go skills to the next level by learning how to design, develop, and deploy a distributed service. Start from the bare essential...
New
PragmaticBookshelf
Free and open source software is the default choice for the technologies that run our world, and it’s built and maintained by people like...
New
AstonJ
I ended up cancelling my Moonlander order as I think it’s just going to be a bit too bulky for me. I think the Planck and the Preonic (o...
New
AstonJ
This looks like a stunning keycap set :orange_heart: A LEGENDARY KEYBOARD LIVES ON When you bought an Apple Macintosh computer in the e...
New
New
New
PragmaticBookshelf
Build modern server-driven web applications using htmx. Whatever programming language you use, you’ll write less (and cleaner) code. ...
New
sir.laksmana_wenk
I’m able to do the “artistic” part of game-development; character designing/modeling, music, environment modeling, etc. However, I don’t...
New
PragmaticBookshelf
Get the comprehensive, insider information you need for Rails 8 with the new edition of this award-winning classic. Sam Ruby @rubys ...
New
PragmaticBookshelf
Use advanced functional programming principles, practical Domain-Driven Design techniques, and production-ready Elixir code to build scal...
New