Architecture

This page serves as a placeholder for our upcoming detailed architecture documentation. We will soon provide a comprehensive overview of our design approach for agent-to-agent communication and the protocol architecture.

In the meantime, we invite the community to actively participate in shaping key architectural decisions through focused discussions. The following discussion topics cover some of the most challenging and critical aspects of agent communication and protocol establishment:

Community Discussions

Join the conversations on GitHub:

Pending topics:

Natural Language as an Agent Interface (pending creation)
Choosing between REST and JSON-RPC for Communication (pending creation)
WebSockets vs HTTP vs Peer-to-Peer (Full Duplex Communication) (pending creation)
Integration of Legacy Software with Agent Protocols (pending creation)
Streaming Data between Agents (pending creation)
Handling Request Cancellation and Ensuring Persistency (pending creation)
Agent and Tool Providers versus MCP Servers: Roles and Responsibilities (pending creation)
Deployment Strategies for Agents (pending creation)
Configuration and Model Dependency Management for Agents (pending creation)
Namespacing and name uniqueness (pending creation)
Improved Installation Process (pending creation)

Manifest-Based Agent Offline Discoverability

💬 GitHub Discussion

Problem

In the Pre-alpha version of ACP, we introduced a tight coupling between agent providers and agents, meaning they run in the same runtime. To be discoverable, an agent must first be instantiated (dependencies installed, main process running). This presents several challenges:

Agents are only discoverable if the provider is running.
Agents require instantiation to become discoverable, consuming significant resources.
Agents become discoverable only if successfully executed, but sometimes agents depend on external configurations that aren’t available until discovery occurs.
Agents built using different languages or frameworks cannot coexist within the same provider.
Environment management currently occurs on the agent provider’s side, yet should be coupled directly to the agent.
Agents might depend on local resources (e.g., locally installed models) that must be prepared before runtime. The current design doesn’t support this.
Agent installation currently happens implicitly—it should instead occur explicitly at user request (to handle failures transparently).
A single agent should be deployable independently rather than deploying an entire provider.
Fatal errors in the provider or agent currently affect both, impacting stability.

Current Implementation

Currently, the platform functions as a service registry. Providers register via the beeai provider add command, pointing to a manifest file specifying deployment configurations. Providers then notify the platform through an JSON-RPC API about available agents embedded within their runtime.

Discoverability Process

Request
{ "id": 1, "jsonrpc": "2.0", "method": "agents/list" }

Response:

Response
{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "agents": [
      {
        "name": "gpt-researcher",
        "description": "The agent conducts in-depth local and web research using a language model to generate comprehensive reports with citations, aimed at delivering factual, unbiased information.",
        "inputSchema": {},
        "outputSchema": {},
        "fullDescription": "...",
        "framework": "...",
        "license": "...",
        "languages": ["Python"],
        "githubUrl": "...",
        "examples": {
          "cli": [
            {
              "command": "beeai run gpt-researcher \"Impact of climate change on global agriculture\"",
              "name": null,
              "description": "Conducting Research on Climate Change",
              "output": null,
              "processingSteps": [
                "Initializes task-specific agents to interpret the query",
                "Generates a series of questions to form an objective opinion on the topic",
                "Uses a crawler agent to gather and summarize information for each question",
                "Aggregates and filters these summaries into a final comprehensive report"
              ]
            }
          ]
        },
        "avgRunTimeSeconds": 2.1,
        "avgRunTokens": 111,
        "ui": {
          "type": "hands-off",
          "userGreeting": "What topic do you want to research?"
        },
        "provider": "git+https://github.com/i-am-bee/beeai@agents-v0.0.47#path=agents/community/gpt-researcher-agent/beeai-provider.yaml"
      }
    ]
  }
}

Provider Manifest Format

manifestVersion: int # manifest version, incremented on breaking changes

# canonical name incl. domain
# ex. "github.com/i-am-bee/beeai", "github.com/i-am-bee/beeai/nested/folder", "example.com", "example.com/something/else"
# for remotely loaded manifests, the domain of the manifest must match prefix -- e.g. "github.com/i-am-bee/beeai/nested/folder" can be loaded from "https://github.com/i-am-bee/beeai" but not from "https://example.com"
name: str

# how to run the provider server
driver: nodejs | python | container | unmanaged # can later contain new versions like "nodejs@2"

# valid for driver: nodejs | python
# either a NPM/PyPI package name, or "git:..." URL, or (not allowed in remote manifests) "file:..." URL
# can include version for pinning
package: str

# valid for driver: container
# container image identifier (e.g. "docker.io/something/here:latest")
# include version for pinning
image: str

# valid for driver: nodejs | python | container
# command with arguments to run
command: str[]

# whether the server communicates through stdin/stdout, or a HTTP server spawned at $PORT
# for unmanaged provider, only http is available
serverType: stdio | http

mcpTransport: ws | wss | sse | sses | none

# for serverType=http, an array of endpoints
ui: str[] # absolute for "remote" driver, relative for managed drivers

# manifest files can contain multiple entries separated by ---

Example:

manifestVersion: 1
name: github.com/i-am-bee/beeai/agents/community/something
driver: nodejs
package: git+https://github.com/i-am-bee/beeai/agents/community/something
command: ["something", "--transport", "http"]
serverType: http
mcpTransport: sse

Goal

Reevaluate the agent manifest and provider concepts. While grouping agents under providers has usability advantages, it introduces complexities outlined above. We propose shifting the provider details directly into individual agent manifests, enabling offline discovery and simplifying deployment. The new agent manifest should clearly describe:

Deployment and consumption (as a service or deployable artifact).
Interface definition (input/output schemas).
Configuration requirements (secrets, environment variables).
Dependencies (including other agents or external resources)
Capabilities (sessions, interruptions) and metadata (license, description, skills, etc.)

Open Questions

Should the agent manifest be compatible or inspired by Kubernetes manifests, or do we need a completely different schema due to ACP-specific requirements?
Should MCP resources (e.g., tools, prompts) be combined in a single manifest or maintain separate manifests for clarity and modularity?

Additional Resources

Support for Stateful Agents

💬 GitHub Discussion

Motivation

Supporting both stateless and stateful agents enables ACP to handle a broader range of use cases, particularly complex or long-lived interactions requiring persistent context. Such agents can offer more natural, efficient, and sophisticated user experiences. The following are some agent examples:

aider: Agent designed for collaborative AI-assisted programming with integrated file system support.
supervisor: A long-lived agent managing multi-step tasks.
Interactive agents involving continuous human-agent interactions while maintaining conversational context.

Stateless vs Stateful Agents

Area	Stateless Agents	Stateful Agents
Scalability and load balancing	✅ Easy	❌ Difficult
Complexity of interaction	❌ Low	✅ High
Communication efficiency	❌ Low	✅ High (incremental changes)
Failure, interruption recovery	✅ Easy	❌ Difficult
Agent complexity	✅ Low	❌ High (state management)

Problem

ACP in Pre-alpha supports both stateful and stateless agents. Stateful agents utilize agent templates, but this feature is currently undocumented. The transport mechanisms (e.g., SSE, WebSockets) inherently maintain stateful connections, raising questions about the adequacy and clarity of the existing implementation. This implementation must be clearly evaluated and documented.

Selecting appropriate transport mechanisms is critical. Incorporating stateful behavior should not overly complicate ACP nor compromise its scalability and performance.

Current Stateless Implementation

The agent author provides a single method to instantiate, run, and destroy an agent:

{
  "id": 1,
  "jsonrpc": "2.0",
  "method": "agents/run",
  "params": {
    "name": "agent-name",
    "input": { "prompt": "Hello world" }
  }
}

Current Stateful Implementation

The agent author provides three separate methods: create, run and destroy of an agent. The create method returns an instance ID, generated by the agent provider. The run method can be called repeatedly.

Create:

{
  "id": 1,
  "jsonrpc": "2.0",
  "method": "agents/create",
  "params": { "templateName": "agent-name", "config": {} }
}

Run:

{
  "id": 2,
  "jsonrpc": "2.0",
  "method": "agents/run",
  "params": {
    "name": "agent-name-001",
    "input": { "prompt": "Hello world" }
  }
}

Destroy:

{
  "id": 3,
  "jsonrpc": "2.0",
  "method": "agents/destroy",
  "params": { "name": "agent-name-001" }
}

Open Questions

Is the current stateful implementation (session = agent instance) sufficient for all use cases?
Is the current stateless implementation well-documented, can statelessness be enforced by design?
Transport consequences - REST (stateless) vs long-lived SSE (stateful) vs websocket (stateful)? Consider progressive enhancement?
Should standardized headers or metadata be adopted?
Mechanisms for restartable or recoverable connections?
Strategies for automated resource cleanup when stateful agents are no longer in use?
Provide OOTB support for key-value storage?

Alternative Proposal

Passing sessionId through _meta object.

{
  "id": 1,
  "jsonrpc": "2.0",
  "method": "agents/run",
  "params": {
    "name": "agent-name",
    "input": { "prompt": "Hello world" },
    "_meta": {
      "sessionId": "session-123"
    }
  }
}

Goal

ACP should flexibly support stateful and stateless agents while maintaining simplicity and scalability.

Specifically, ACP should:

Keep protocol-level statefulness minimal, without requiring persistent state management within the protocol itself.
Provide standardized mechanisms for agents to optionally manage context and state.
Delegate responsibility for persistent state management and resource cleanup explicitly to agent implementations.

Get Started

Agents

Concepts

Agent Communication Protocol

Observability & Telemetry

Architecture

Community Discussions

Manifest-Based Agent Offline Discoverability

Problem

Current Implementation

Discoverability Process

Provider Manifest Format

Goal

Open Questions

Additional Resources

Support for Stateful Agents

Motivation

Stateless vs Stateful Agents

Problem

Current Stateless Implementation

Current Stateful Implementation

Open Questions

Alternative Proposal

Goal

Additional Resources

Get Started

Agents

Concepts

Agent Communication Protocol

Observability & Telemetry

​Community Discussions

​Manifest-Based Agent Offline Discoverability

​Problem

​Current Implementation

​Discoverability Process

​Provider Manifest Format

​Goal

​Open Questions

​Additional Resources

​Support for Stateful Agents

​Motivation

​Stateless vs Stateful Agents

​Problem

​Current Stateless Implementation

​Current Stateful Implementation

​Open Questions

​Alternative Proposal

​Goal

​Additional Resources

Community Discussions

Manifest-Based Agent Offline Discoverability

Problem

Current Implementation

Discoverability Process

Provider Manifest Format

Goal

Open Questions

Additional Resources

Support for Stateful Agents

Motivation

Stateless vs Stateful Agents

Problem

Current Stateless Implementation

Current Stateful Implementation

Open Questions

Alternative Proposal

Goal

Additional Resources