ManningBooks

ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Data Lakehouse (Manning)

The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of relying on closed, vendor-specific platforms.

Highlights:

  • End-to-end guidance on building an Iceberg-based lakehouse from storage to BI
  • Integrates tools like Spark, Flink, Dremio, and Polaris
  • Covers ingestion pipelines (batch & streaming), schema evolution, governance, and security
  • Hands-on examples using PostgreSQL, Apache Superset, and more
  • Focus on real-world tradeoffs and implementation decisions at scale

The “lakehouse” data architecture is a powerful way to combine the flexibility of data lakes with the management features of data warehouses. The open source Apache Iceberg framework delivers the scalability, reliability, and performance you want from a lakehouse without the expense and vendor lock-in of platforms like Snowflake, BigQuery, and Redshift.

In Architecting an Apache Iceberg Data Lakehouse , data guru Alex Merced shows you:

  • How to create a modular, scalable Iceberg lakehouse architecture
  • Where Spark, Flink, Dremio, Polaris fit into your design
  • Reliable batch and streaming ingestion pipelines
  • Strategies for governance, security, and performance at scale

Apache Iceberg is an open source table format perfect for massive analytic datasets. Iceberg enables ACID transactions, schema evolution, and high-performance queries on data lakes using multiple compute engines like Spark, Trino, Flink, Presto, and Hive. An Iceberg data lakehouse enables fast, reliable analytics at scale while retaining the observability you need for compliance audits, governance, and provable data security.


If you’re exploring Iceberg as an alternative to platforms like Snowflake or BigQuery—or already using it and want to deepen your understanding—this could be a useful resource. The Early Access format also means readers can give feedback as the book evolves.

Full details: Architecting an Apache Iceberg Lakehouse - Alex Merced

Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com" at checkout :+1:

Where Next?

Popular Other Fields topics Top

AstonJ
Probably only a delay of the inevitable - with facial-recognition systems already part of daily lives in countries such as China, how lon...
New
AstonJ
Is this the future of photography - or spying even :joy: Creating realistic 3D photo-models from a series of photographs - in this case ...
New
CommunityNews
Data. It’s everywhere and we’re only getting more of it. For the last 5-10 years, data science has attracted newcomers near and far tryin...
New
First poster: davearonson
Deep learning may transform health care, but model development has largely been dependent on availability of advanced technical expertise...
New
First poster: bot
The Future of Deep Learning Is Photonic. Computing with light could slash the energy needs of neural networks
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
First poster: bot
Two recent collaborations between mathematicians and DeepMind demonstrate the potential of machine learning to help researchers generate ...
New
ManningBooks
The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of rel...
New
ManningBooks
DAX Reimagined isn’t just another beginner’s guide to the powerful DAX language. This unique book teaches you how to work with the engine...
New
ManningBooks
Timeless Algorithms: The Seminal Papers explains both the how and the why of the most important data science algorithms. Along with the t...
New

Other popular topics Top

AstonJ
Just done a fresh install of macOS Big Sur and on installing Erlang I am getting: asdf install erlang 23.1.2 Configure failed. checking ...
New
mafinar
This is going to be a long an frequently posted thread. While talking to a friend of mine who has taken data structure and algorithm cou...
New
PragmaticBookshelf
Author Spotlight Mike Riley @mriley This month, we turn the spotlight on Mike Riley, author of Portable Python Projects. Mike’s book ...
New
New
hilfordjames
There appears to have been an update that has changed the terminology for what has previously been known as the Taskbar Overflow - this h...
New
New
New
PragmaticBookshelf
Build modern server-driven web applications using htmx. Whatever programming language you use, you’ll write less (and cleaner) code. ...
New
sir.laksmana_wenk
I’m able to do the “artistic” part of game-development; character designing/modeling, music, environment modeling, etc. However, I don’t...
New
Fl4m3Ph03n1x
Background Lately I am in a quest to find a good quality TTS ai generation tool to run locally in order to create audio for some videos I...
New