Hi, in case you have not found an answer yet, here is my humble opinion:
- Choose Iceberg if you have several compute/query engines besides Spark, such as Presto or Flink. Iceberg has a great abstraction and design as an engine-independent table format, but its learning cost is relatively high.
- Choose Delta if you only use Spark and would like to be tightly coupled with Databricks.
- Choose Hudi if you would like a data lake that works out of the box and is quite easy to use.
- If your data is updated frequently, as in streaming, check https://paimon.apache.org/ if you would like to be tightly coupled with Flink.
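
If it helps, the rules of thumb above can be written down as a small decision helper. This is only a sketch of my opinion, not an official guideline: the function name `suggest_table_format`, its parameters, and the order in which the rules are checked are all my own assumptions.

```python
def suggest_table_format(engines, databricks=False, frequent_updates=False):
    """Suggest a table format from the rules of thumb above.

    engines: names of the compute/query engines in use, e.g. ["Spark", "Flink"].
    databricks: True if you want to be tightly coupled with Databricks.
    frequent_updates: True if the data is updated frequently (e.g. streaming).

    Hypothetical helper; the rule priority is an assumption.
    """
    engines = {name.lower() for name in engines}

    # Frequently updated data on Flink: have a look at Paimon.
    if frequent_updates and "flink" in engines:
        return "Paimon"

    # Engines other than Spark: Iceberg's engine-independent design fits.
    if engines - {"spark"}:
        return "Iceberg"

    # Spark only, tightly coupled with Databricks: Delta.
    if engines == {"spark"} and databricks:
        return "Delta"

    # Otherwise go for the easy, out-of-the-box option.
    return "Hudi"


if __name__ == "__main__":
    print(suggest_table_format(["Spark", "Presto", "Flink"]))
    print(suggest_table_format(["Spark"], databricks=True))
```

In practice the cases overlap (for example, Flink also works with Iceberg), so treat the result as a starting point for evaluation rather than a final answer.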