Fixing this is a difficult and extremely deep engineering and computer science problem, which is why there has been so little progress. Most improvements have been driven by users, but they aren’t able to make the investments to fix the problems underneath their problem, they have other work to do. Any fundamental improvements would materially break existing workflows, so adoption would be slow at best.
The entire spatial domain is stuck in a deep local minima.
1. Geoparquet for metadata (iceberg catalog could be added later if scale is extremely large) 2. Cloud Optimized Geotiff (COG) for image data (which is what NASA and ESA have been pumping for last 5 years and still do) 3. and an efficient lightweight library that can quickly grab pieces of data from 100s or 1000s of raster files parallely.
I did not want to attempt creating another format, coz I felt that we as geocommunity are at a stage where such tools and files put together forms a pretty good first level foundation to build on. No need for heavy legacy tools, closer to regular data (with parquet) and closer to duckdb world.
I feel adoption may be good, if this is the thesis / guiding principles.
Baby steps surely, but good steps I believe, and this is all thanks to the open source community.
Thanks for the question! Happy to answer more.
In short, even though I believe STAC adoption is what we should aim for, in reality we usually end up building workarounds.
It was a totally valid question but one that's practically impossible to answer. As a result there's just so much variation between STACs and you never really know what you're going to get.
What do you think of attempts like Source Cooperative?
And yes, having a parquet will add overhead of needing some form of catalog. But I believe we are very close to having Iceberg with native geo types being that catalog. at the same time, it opens another can of worms (databricks and other catalogs etc).
silver lining is that parquet (geoparquet) makes geo data closer to regular data.
Not going to discourage what you are doing but just reading the blog, my immediate instinct is to not try to check out what you build.