This article discusses OpenWalrus's architectural principle of 'less code, more skills' for designing extensible AI agent runtimes. It advocates for a compact core handling fundamental operations and an open surface for community-contributed 'skills' and external 'MCP servers' to extend functionality without bloating the framework. This approach tackles the common problem of monolithic agent frameworks becoming hard to maintain and scale.
Read original on Dev.to · #architecture

The article introduces OpenWalrus's architectural philosophy for building AI agent runtimes: 'less code, more skills'. This principle addresses the common pitfall of agent frameworks growing into unmanageable monoliths under the weight of ever-increasing feature requests. Instead of baking every capability into the core, OpenWalrus opts for a minimal, auditable core and an extensible surface for adding functionality.
Agent frameworks often fall into a 'bloat trap' where new features (web browsing, memory, RAG, customization) are implemented directly within the framework's codebase. This leads to a heavier binary, an increased maintenance burden, and a diluted system prompt, which research shows degrades an LLM's instruction adherence. The core idea is that every feature injected into the prompt makes the agent worse at everything else, highlighting a critical design trade-off in AI system architecture.
OpenWalrus's solution is a small core, open surface architecture. The core handles essential functions: LLM inference, agent lifecycle, tool dispatch, and a hybrid graph-vector memory layer (using LanceDB and lance-graph). This core is designed to be compact, correct, and easily auditable. All other functionality is offloaded to 'skills' and Model Context Protocol (MCP) servers, which are developed and managed by the community, extending the agent's capabilities without modifying the core framework.
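The shape of this "small core, open surface" split can be sketched in a few lines. The names below (`Skill`, `AgentCore`, `register`, `dispatch`) are hypothetical illustrations, not OpenWalrus's actual API: the core knows only how to register and dispatch skills, while every capability lives in an externally contributed skill object.

```python
# Hypothetical sketch of a small-core agent runtime: the core only
# registers and dispatches skills; capabilities live outside it.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Skill:
    name: str                   # identifier used for tool dispatch
    description: str            # surfaced to the LLM as a tool description
    run: Callable[[str], str]   # the capability itself

class AgentCore:
    """Compact, auditable core: a registry plus dispatch-by-name."""
    def __init__(self) -> None:
        self._skills: dict[str, Skill] = {}

    def register(self, skill: Skill) -> None:
        # Community skills plug in here without touching core code.
        self._skills[skill.name] = skill

    def dispatch(self, name: str, payload: str) -> str:
        if name not in self._skills:
            raise KeyError(f"unknown skill: {name}")
        return self._skills[name].run(payload)

core = AgentCore()
core.register(Skill("shout", "Uppercase the input text", str.upper))
print(core.dispatch("shout", "hello"))  # -> HELLO
```

The key property is that `AgentCore` never grows when a new capability is added; only the set of registered skills does.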
Unix Philosophy in Agent Runtimes
This design echoes the Unix philosophy of 'small tools that compose, not monolithic systems that configure.' It enables a vibrant ecosystem where the agent's capabilities grow through community contributions rather than framework modifications, keeping the core lean and focused.
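The composition idea can be made concrete with a pipe-style combinator, in the spirit of Unix pipelines. This is an illustrative sketch, not OpenWalrus code: each skill does one thing, and the caller chains them rather than configuring a monolith.

```python
# Illustrative only: single-purpose skills composed like Unix pipes.
from functools import reduce
from typing import Callable

def pipe(*stages: Callable[[str], str]) -> Callable[[str], str]:
    """Compose skills left to right, pipeline-style."""
    return lambda text: reduce(lambda acc, stage: stage(acc), stages, text)

# Three tiny skills, each with one job.
strip_ws = str.strip
lower = str.lower
def dedupe_spaces(s: str) -> str:
    return " ".join(s.split())

clean = pipe(strip_ws, lower, dedupe_spaces)
print(clean("  Hello   WORLD  "))  # -> hello world
```

Replacing or reordering a stage changes behavior without touching any other stage, which is exactly the ecosystem property the article attributes to skills.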
While offering significant benefits in maintainability and extensibility, this architectural choice comes with trade-offs: it demands excellent core tools to ensure reliability, accepts that community skill quality will vary, creates a skill-discovery problem, and requires robust documentation for skills to be effective. The article argues these costs are preferable to a bloated, fragile framework.