Dr. Mario's 2nd 🧠

❯

❯

❯

Scaling Ancestry.com How to Optimize Updates for Iceberg Tables with 100 Billion Rows

Scaling Ancestry.com How to Optimize Updates for Iceberg Tables with 100 Billion Rows

1 min read

rw-book-cover

Metadata

Author: Thomas Cardenas
Full Title:: Scaling Ancestry.com: How to Optimize Updates for Iceberg Tables With 100 Billion Rows
Category:: 🗞️Articles
URL:: https://medium.com/ancestry-product-and-technology/scaling-ancestry-com-how-to-optimize-updates-for-iceberg-tables-with-100-billion-rows-860285922316
Finished date:: 2023-03-05

Highlights

Every hour, there is an Apache Spark job that performs a merge function of the recently changed rows into the Iceberg table (View Highlight)

Being able to partition data in this way is one of the key reasons Iceberg was chosen for the table (View Highlight)

merge-on-read (View Highlight)

Webmentions

Loading webmentions...

Unable to load webmentions. Please try again later.

❤️ Likes

🔄 Reposts

💬 Replies

🔗 Mentions

No webmentions found for this post yet. Be the first to mention it!

Graph View

Metadata
Highlights

Created with Quartz v4.5.1 © 2025