Apache Hudi - The Definitive Guide, 9781098173838
Paperback
Build data lakehouses with Apache Hudi for faster insights and guaranteed transactions.

Apache Hudi - The Definitive Guide

Building Robust, Open, and High-Performing Data Lakehouses

$131.76

  • Paperback: 350 pages

  • Release Date: 7 November 2025

Summary

Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice.

Authors Shiyan Xu, Prashant Wason, Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full p…

Book Details

ISBN-13: 9781098173838
ISBN-10: 109817383X
Author: Shiyan Xu
Publisher: O'Reilly Media
Imprint: O'Reilly Media
Format: Paperback
Number of Pages: 350
Release Date: 7 November 2025
Weight: 506g
Dimensions: 232mm x 178mm

About The Author

Shiyan Xu

Shiyan Xu is a Founding Engineer at Onehouse, where he currently works as an Open Source Engineer. He has been an active contributor to Apache Hudi since 2019 and has served as a PMC member of the project since 2021. Prior to joining Onehouse, Shiyan worked as a tech lead manager at Zendesk, leading the development of a large-scale data lake platform using Apache Hudi. He is passionate about open source development and engaging with community users.

Prashant Wason is a Staff Software Engineer at Uber Technologies and a PMC member of the Apache Hudi project. He has been an active contributor to the Hudi project since 2019, working on features such as the Metadata Table and Record Index. Prashant has worked in the storage and data infrastructure space for over 15 years.

Sudha Saktheeswaran is a Software Engineer at Onehouse and a PMC member of the Apache Hudi project. She brings extensive experience in real-time and distributed data systems from her work on the data infrastructure teams at Moveworks, Uber, and LinkedIn. Sudha is also a key contributor to the early Presto integrations of Hudi. She is passionate about engaging with and driving the Hudi community.

Dr. Rebecca Bilbro is a data scientist, Python programmer, and author in Washington, DC. She specializes in data visualization for machine learning, from feature analysis to model selection and hyperparameter tuning. Rebecca is an active contributor to the open source community and has conducted research on natural language processing, semantic network extraction, entity resolution, and high-dimensional information visualization. She earned her doctorate from the University of Illinois, Urbana-Champaign, where her research centered on communication and visualization practices in engineering. Rebecca is co-founder and CTO of Rotational Labs.
