7 Best Practices for Implementing Apache Iceberg
ฝัง
- เผยแพร่เมื่อ 20 ต.ค. 2024
- The Iceberg table format brings data warehouse characteristics to cloud object storage - including consistent SQL behavior, hidden partitioning and schema evolution. However, as with any new technology, there are new techniques you’ll need to master in order to succeed.
In this webinar Dan Weeks, Tabular CTO and Apache Iceberg PMC member, will cover the most important practices you need to develop to ensure your Iceberg deployment exceeds your expectations for performance, cost, security and simple operation.
After a short explanation of how Apache Iceberg works, the problems it solves and the levers and controls it provides, Dan will cover best practices across several areas including:
Selecting a Catalog
Ingesting Data
Connecting Compute
Maintaining Tables
Optimizing Performance
Enforcing Security
Data Privacy and Compliance
awesome talk!
Thanks Dan! This is one of the best talks I have listened to on Iceberg implementation. Automated table maintenance is the real deal.
Super candid to call out the “undifferentiated work”
There should be a huge asterisk next to the aforementioned REST catalog. It's not free or open source. The only good production ready catalog out there is nessie. Which Daniel doesn't mention (I guess because dremio are tabular's competitors).