Menu
Hacker News·July 1, 2026

LTAP: Disaggregating PostgreSQL for Scalable Transactional and Analytical Workloads

This article introduces the Lakebase architecture, which re-architects PostgreSQL by disaggregating its WAL and data files into independent, cloud-native services (SafeKeeper and PageServer). This stateless compute approach enhances scalability, durability, and availability. It then extends to LTAP (Lakehouse Transactional and Analytical Processing), unifying transactional and analytical workloads on a single, open columnar data copy, avoiding complex CDC pipelines and maintaining performance for both.

Read original on Hacker News

The Monolithic Database Challenge

Traditional databases like PostgreSQL store the Write-Ahead Log (WAL) and data files on a single machine's local disk. While efficient for write durability (sequential WAL appends) and read performance (direct data file access), this monolithic architecture introduces significant challenges. These include risks of data loss due to misconfiguration or node failure, high costs and complexity for scaling reads (requiring full physical clones), and contention between analytical and transactional workloads on shared hardware resources. All these issues stem from the tight coupling of compute and storage.

Lakebase Architecture: Disaggregating PostgreSQL

Lakebase addresses the monolithic database limitations by making PostgreSQL compute instances stateless. It achieves this by externalizing the WAL and data files into purpose-built, independently scalable cloud services, leveraging durable cloud object storage. This fundamental shift allows PostgreSQL compute to be started, stopped, and replicated freely without data ownership.

  • SafeKeeper (WAL Externalization): Instead of local disk flushes, commit durability is achieved by replicating WAL records across a quorum of SafeKeeper nodes using Paxos-based network replication. This eliminates data loss risks from local disk failures or misconfigured flushes.
  • PageServer (Data File Externalization): Data files are managed by PageServer, allowing read replicas to access data directly from externalized storage without needing a full physical clone. This enables instant provisioning, elastic scaling of read capacity, and simpler high availability.

LTAP: Unifying Transactional and Analytical Workloads

LTAP (Lakehouse Transactional and Analytical Processing) extends Lakebase by storing operational data once in open columnar formats (like Parquet on S3) that can be read by both PostgreSQL (via Lakebase) and Lakehouse engines. This allows analytics to run directly on the same fresh data that transactions just wrote, eliminating the need for Change Data Capture (CDC) pipelines, maintaining a second data copy, and preventing analytical queries from slowing down transactional workloads. Unlike traditional HTAP, which attempts to unify both workloads within a single engine, LTAP unifies at the storage layer while keeping specialized, optimized engines for each workload.

💡

Key System Design Takeaway

The core principle of Lakebase and LTAP is disaggregation of compute and storage. This pattern is crucial for achieving elastic scalability, enhanced durability, and improved cost efficiency in cloud-native database architectures. By separating concerns, each component can be optimized and scaled independently, providing greater flexibility than traditional monolithic designs.

Advantages of LTAP Architecture

  • Single Copy for OLTP & OLAP: Eliminates data duplication, reducing storage costs and simplifying data pipelines.
  • Real-time Analytics: Analytics run on the freshest transactional data without latency from CDC or ETL processes.
  • Workload Isolation: OLTP and OLAP workloads do not contend for the same compute resources, ensuring stable performance for transactions.
  • Open Data Formats: Data stored in open columnar formats like Parquet, promoting interoperability with various analytical tools and engines.
  • Simplified Operations: Streamlined HA, instant branching, and elastic scaling without manual cloning.
PostgreSQLDisaggregated DatabaseWALOLTPOLAPLakehouseS3Parquet

Comments

Loading comments...

Architecture Design

Design this yourself
Design a high-scale e-commerce platform's backend that leverages a disaggregated PostgreSQL architecture (Lakebase/LTAP) for its primary database. Focus on how the separation of compute and storage, externalized WAL (SafeKeeper), and externalized data files (PageServer) contribute to achieving elastic scalability for both transactional and analytical queries, ensuring data durability, instant read replica provisioning, and real-time analytics without impacting transactional performance. Consider how the system integrates with open columnar storage for efficient analytics.
Practice Interview
Focus: disaggregated PostgreSQL database architecture for transactional and analytical workloads (LTAP)
LTAP: Disaggregating PostgreSQL for Scalable Transactional and Analytical Workloads | SysDesAi