SCAR: Orchestrating Teams of Specialist Agents

For the last year, Athena has delegated work between a primary agent and specialized sub-agents. That model works well for focused questions, but enterprise conversations rarely stay focused. A single user message can braid together a research question, a compliance concern, and a scheduling request — each belonging to a different team of specialists.

To handle that, we built SCAR: the Synaptic Cluster Agentic Relay. It is the orchestration layer that now sits above Athena's agent roster, and it is what makes a cluster of agents behave like a coherent team instead of a collection of endpoints. The specifics of how SCAR coordinates that team are the subject of a forthcoming paper; this post focuses on what it feels like to use.

How SCAR Thinks About a Query

When a message arrives, SCAR reads it for intent and context and then quietly engages whichever specialist clusters the situation calls for — sometimes one, sometimes several working in concert. What comes back is a single, coherent response. From the user's perspective, they are having a conversation with Athena. The cluster network beneath that conversation is invisible, and that is intentional.

SCAR is not invoked for every turn. Short acknowledgments and simple one-off questions take the fast path and skip the orchestration layer entirely. The system chooses its own weight class based on what the message demands, so short interactions stay short and the machinery only spins up when it adds real value.

Sticky Routing and Cross-Cluster Mode

Conversations have momentum. If a user has been working with the finance cluster for the last ten turns, the next message is probably still about finance. SCAR recognizes that continuity and respects it — avoiding unnecessary handoffs and the token cost that comes with them. Administrators can tune how strong that continuity is on a per-cluster basis.

For genuinely multi-domain questions, SCAR draws on several clusters at once and produces one answer that reflects all of them. The user never sees the seams. The balance between focused-and-fast and broad-and-thorough is an administrative dial — tighten it for cost, loosen it for depth.

Neural Path Analytics

One of the risks of cluster orchestration is opacity — when something goes wrong, you need to know why the system chose the route it did. Athena's Neural Path Analytics dashboard exposes the complete routing trace for every turn: which cluster was selected, the signals that drove the decision, and how deep the delegation chain ran.

Administrators can rank agents by frequency, drill into specific sub-agent runs, and see task and output payloads with full provenance. Thumbs-up and thumbs-down feedback on neural pathways feeds a quality signal that surfaces underperforming clusters before users notice them.

SCAR vs. Direct Cost Comparison

Cluster orchestration is not free. Every layer in the system consumes tokens, and specialist work accumulates quickly. To make the tradeoff visible, Athena now tracks SCAR-routed turn cost side by side with the counterfactual direct-execution cost. The analytics dashboard surfaces which clusters routinely pay for themselves, which ones are over-engineered for their typical traffic, and where routing behavior is worth tuning.

The Team Wizard

Creating a cluster used to mean authoring agent prompts by hand and wiring them together in configuration. The Team Wizard replaces that with a guided flow: describe the team in plain language, pick a visibility scope (private to a user, corporate-wide, or shared with a group), and Athena stands up the full cluster for you — every role configured, every piece of internal wiring handled automatically. Cluster descriptions are editable after creation; change the description and the cluster adapts.

Per-agent tool selection is now fine-grained too: administrators can choose which MCP tools each specialist has access to, preventing a single rogue capability from leaking across an entire cluster.

Why It Matters

SCAR is the difference between an AI that answers one question at a time and an AI that operates like a staffed team. Intent is read. The right specialists are put to work. A single coherent answer comes back. Every routing decision is auditable, priced, and tuneable. And the Team Wizard means any administrator — not just the engineers who built the platform — can assemble a new team in minutes.

For enterprises deploying AI at scale, this is the shape of responsible automation: specialized, orchestrated, observable, and priced. The full architecture will be the subject of a separate technical paper; for now, the point is that this is what Athena does, not how.