Pages tagged: se

Iceberg, The Right Idea - The Wrong Spec - Part 2 of 2: The Spec

Reading Time: 21 min, Date: 7/10/2025

Let us finally look at what is so wrong with the Iceberg spec and why this simply isn't a serious attempt at solving the metadata problem of large Data Lakes. In the first part of this I took...

Iceberg, The Right Idea - The Wrong Spec - Part 1 of 2: History

Reading Time: 17 min, Date: 7/6/2025

Iceberg: The great unifying vision finally allowing us to escape the vendor lock-in of our database engines. One table and metadata format to find them ... And in the darkness bind I the...

Why are Databases so Hard to Make? - Digging up Graves

Reading Time: 9 min, Date: 10/4/2024

In my last post about high speed DML, I talked how it is possible to modify tables at the kind of speeds that a modern SSD can deliver. I sketched an outline of an algorithm that can easily us...

Why are Databases so Hard to Make? - High Speed DML

Reading Time: 14 min, Date: 9/24/2024

After a brief intermezzo about testing (read about my thoughts here: Testing is Hard and we often use the wrong Incentives) - it is time to continue our journey together to where we will A...

Row or Column based Storage?

Reading Time: 4 min, Date: 4/7/2023

These days, columnar storage formats are getting a lot more attention in relational databases. Parquet, with its superior compression, is quickly taking over from CSV formats. SAP a...

Tag: se