Fleet backbone

The relayyour fleet runs on

cmdop server is the agent relay hub — the foundation a fleet of machines connects to. Agents dial out (no inbound ports, NAT-friendly); clients reach them over REST and WebSocket. Multi-tenant by design, and yours to self-host.

Get the agent Read the fleet docs

Architecture

Six processes, one relay

The relay is a handful of focused processes you run together. Agents speak gRPC; clients speak REST and WebSocket; everything is fleet-scoped and audited.

api_server

REST :8000

The client-facing REST surface — auth, presence, machine and fleet reads, the control plane your apps and SDK call.

grpc_server

gRPC :50051

The agent transport. Machines dial out and hold a bidirectional stream — terminal, exec and file ops flow here.

ws_gateway

WS :8765

The push channel. Streams live output, presence and events to connected clients over WebSocket.

tunnel_server

Secure tunnels to services behind a machine — reach a private port without opening one to the internet.

worker

arq jobs

The background worker — schedules, deferred jobs and housekeeping run off the request path.

jarvis_server

virtual peer

A server-side AI agent that joins the relay as a virtual machine and coordinates the fleet.

The gRPC contract lives in proto/ (cmdop.agent.v1) and is the single source of truth — the Go agent stubs are generated from it.

How it works

The operator console

See the relayfrom the inside

Every screenshot below is the real cmdop server admin console — the operator surface you get when you self-host the relay.

One console for the whole plane

System health for all six processes, machines, active sessions, LLM spend and errors — the whole relay at a glance.

cmdop server overview — system health (six processes), machines, sessions, LLM spend, errors and recent activity

Your fleet, as a table

Every connected machine with presence, OS and last-seen — including Jarvis as an always-on server-side agent.

cmdop server machines — the fleet as a table with Jarvis as an always-on server agent, presence and OS

Live gRPC metrics

Active streams, RPC totals, latency percentiles and success/error rates — streamed live over WebSocket.

cmdop server gRPC monitoring — live streams, RPC totals, latency percentiles and success/error charts

The LLM gateway

The OpenAI-compatible /v1 gateway: providers, keys, models and per-fleet spend, all governed in one place.

cmdop server model routing — the OpenAI-compatible LLM gateway: providers, keys, models and per-fleet spend

A server-side fleet agent

Jarvis joins the relay as a virtual machine and coordinates the fleet — ask it to run commands or check logs across every machine.

cmdop server Jarvis — the server-side fleet coordinator agent that joins the relay as a virtual machine

Issue and revoke access

Mint the relay CLI tokens machines connect with, and revoke them the moment a machine should drop off the network.

cmdop server agent tokens — issue and revoke the relay CLI tokens machines connect with

Connectivity

Agents dial out.No inbound ports.

Every machine opens an outbound gRPC stream to the relay and stays reachable through it — there is no listening port to expose, no firewall hole, no NAT traversal to babysit. The relay routes messages between agents and clients; the machines never accept inbound connections.

Security model

Multi-tenancy

Multi-tenantby design

Fleets are isolated at the database, not just in app code. Postgres row-level security partitions every table by fleet_id, so one relay can host many fleets with hard tenant boundaries — presence, audit and history never cross fleets.

Multi-fleet setup

Jarvis

A coordinatorthat lives in the relay

Jarvis is a server-side AI agent that joins the relay as a virtual machine peer. Fleet-scoped and always on, it can reach the other agents — ask one, broadcast to many — and coordinate work across the fleet without anything running on your own box.

About agents

LLM gateway

An OpenAI-compatiblegateway at /v1

The relay exposes inference at /v1 — chat completions, embeddings and models — speaking the OpenAI wire format. The server holds the provider keys and routes per fleet with spend and quota, so agents get one bearer-authenticated endpoint instead of a pile of vendor credentials.

Bash

curl https://your-relay.example.com/v1/chat/completions \
  -H "Authorization: Bearer $CMDOP_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"model":"@balanced","messages":[{"role":"user","content":"hi"}]}'

Loading code...

Custom providers

Self-host

Run your own backbone

cmdop server ships as a self-hostable product. Bring it up with the Docker compose relay group, or run it on Kubernetes — you own the data, the keys and the fleet. No machine ever leaves your network unless you route it there.

Deployment guide Fleet concepts View on GitHub

Bash

# Bring up the relay (api · grpc · ws · tunnel · worker · jarvis)
git clone https://github.com/cmdop/cmdop_server && cd cmdop_server
cp .env.example .env.prod
make up-relay

Loading code...

Production runs the relay compose group (api · grpc · ws · tunnel · worker · jarvis) behind a single edge; the same processes deploy to Kubernetes when you want autoscaling and rolling upgrades.

Run the fleetbackbone yourself

Spin up the relay, point your agents at it, and own the whole plane end to end.

See pricing Read the deploy guide