Metadata
- Author: Thomas Cardenas
- Full Title:: Scaling Ancestry.com: How to Optimize Updates for Iceberg Tables With 100 Billion Rows
- Category:: 🗞️Articles
- URL:: https://medium.com/ancestry-product-and-technology/scaling-ancestry-com-how-to-optimize-updates-for-iceberg-tables-with-100-billion-rows-860285922316
- Finished date:: 2023-03-05
Highlights
Every hour, there is an Apache Spark job that performs a merge function of the recently changed rows into the Iceberg table (View Highlight)
Being able to partition data in this way is one of the key reasons Iceberg was chosen for the table (View Highlight)
merge-on-read (View Highlight)