This article details BASF Digital Farming's architecture for a scalable geospatial data platform on AWS, leveraging Amazon EKS, S3, and RDS. It focuses on managing vast amounts of satellite imagery and other geospatial assets using the SpatioTemporal Asset Catalog (STAC) specification and the open-source eoAPI ecosystem. The solution emphasizes dynamic scaling, optimized storage, and efficient metadata management to process and deliver actionable insights to farmers globally.
Read original on AWS Architecture BlogBASF Digital Farming's xarvio FIELD MANAGER platform processes hundreds of millions of satellite images, generating billions of geospatial artifacts. Unlike traditional static data providers, the agricultural environment demands ingesting near-daily imagery from diverse sensors and providers. This necessitates a robust, cloud-based infrastructure capable of handling massive data velocity and volume, including advanced quality assurance and machine learning pipelines.
The core of the solution is the SpatioTemporal Asset Catalog (STAC) specification, an open standard for describing and cataloging raster and vector datasets. STAC standardizes metadata across diverse sources (satellite imagery, UAV data, prescription maps), simplifying search, filtering, and retrieval. The platform is built on the eoAPI ecosystem, an integrated suite of open-source tools for cloud-based geospatial data lifecycle management.
The solution is built on Amazon EKS for core computing, Amazon S3 for storage, and Amazon RDS with PostgreSQL for metadata management. It comprises four main layers: Core Services, Storage, Database, and Ingestion.
Core Services Layer
An EKS cluster hosts three key containerized services: stac-service (STAC API implementation), raster-service (TiTiler for COG rendering), and vector-service (TiPG for MVT serving). These are orchestrated for high availability and modularity, supporting CI/CD workflows. Kubernetes Event-Driven Autoscaling (KEDA) is used to dynamically scale pods based on metrics like STAC ingestion queue depth or visualization request load, ensuring responsive performance and cost efficiency.
Amazon API Gateway provides secure public access, acting as a unified entry point for browser-based and mobile clients, applying rate limiting, authorization, and routing policies.