Netflix's GenPage redesigns homepage construction using a single generative transformer model, moving away from a traditional multi-stage recommender pipeline. This architectural shift enables end-to-end optimization for page-level user satisfaction, better scaling, and increased flexibility for new product experiences. The system processes user context and autoregressively generates the entire homepage layout as a tokenized sequence.
Read original on Netflix Tech BlogTraditionally, systems like Netflix's homepage recommendations rely on complex, multi-stage pipelines involving separate components for candidate generation and ranking at both row and entity levels. GenPage represents a significant architectural departure, adopting a single generative transformer model inspired by Large Language Models (LLMs). This model treats user history and request context as a prompt and autoregressively generates the entire homepage as a response, including rows, entities, and layout simultaneously. This contrasts with most generative recommenders that produce flat ranked lists, allowing for holistic page construction rather than isolated scoring.
System Design Insight
This case study demonstrates a powerful trend in system design: leveraging advanced AI models to simplify complex, multi-stage pipelines. By moving from a cascade of specialized components to a single, end-to-end generative model, Netflix aims to achieve greater coherence, reduce operational complexity, and unlock new optimization possibilities that were previously challenging due to distributed decision-making.