On November 25, 2025, from 11 a.m. to 12 p.m., CASD is organizing a webinar on technologies dedicated to large-scale spatial data processing: we will send the registration link in our next newsletter.
Apache Sedona will be presented, which extends the capabilities of Apache Spark (as well as Apache Flink and Snowflake) to large-scale spatial data processing. It provides distributed spatial datasets and a spatial SQL query engine that efficiently loads, processes, and analyzes large-scale spatial data on one or more machines.
GeoParquet is an extension of Apache Parquet designed to store geospatial vector data (points, lines, polygons) in a compressed, metadata-rich, column-organized format. It is already supported by many tools (e.g., Apache Sedona, GeoPandas, QGIS, DuckDB, Kepler GL, etc.).