ManningBooks

Devtalk Sponsor

Architecting an Apache Iceberg Data Lakehouse (Manning)

The book focuses on designing a complete, modular lakehouse architecture using Apache Iceberg—leveraging open source tools instead of relying on closed, vendor-specific platforms.

Highlights:

  • End-to-end guidance on building an Iceberg-based lakehouse from storage to BI
  • Integrates tools like Spark, Flink, Dremio, and Polaris
  • Covers ingestion pipelines (batch & streaming), schema evolution, governance, and security
  • Hands-on examples using PostgreSQL, Apache Superset, and more
  • Focus on real-world tradeoffs and implementation decisions at scale

The “lakehouse” data architecture is a powerful way to combine the flexibility of data lakes with the management features of data warehouses. The open source Apache Iceberg framework delivers the scalability, reliability, and performance you want from a lakehouse without the expense and vendor lock-in of platforms like Snowflake, BigQuery, and Redshift.

In Architecting an Apache Iceberg Data Lakehouse, data guru Alex Merced shows you:

  • How to create a modular, scalable Iceberg lakehouse architecture
  • Where Spark, Flink, Dremio, and Polaris fit into your design
  • Reliable batch and streaming ingestion pipelines
  • Strategies for governance, security, and performance at scale

Apache Iceberg is an open source table format perfect for massive analytic datasets. Iceberg enables ACID transactions, schema evolution, and high-performance queries on data lakes using multiple compute engines like Spark, Trino, Flink, Presto, and Hive. An Iceberg data lakehouse enables fast, reliable analytics at scale while retaining the observability you need for compliance audits, governance, and provable data security.


If you’re exploring Iceberg as an alternative to platforms like Snowflake or BigQuery—or already using it and want to deepen your understanding—this could be a useful resource. The Early Access format also means readers can give feedback as the book evolves.

Full details: Architecting an Apache Iceberg Lakehouse - Alex Merced

Don’t forget you can get 45% off with your Devtalk discount! Just use the coupon code “devtalk.com” at checkout :+1:
