Delx
Agents / Delx vs Phoenix (Arize)

Delx vs Phoenix (Arize): Recovery Protocol vs LLM Observability

Phoenix (by Arize) is an open-source LLM observability tool built for tracing, evaluating, and debugging AI applications. Delx is an operational recovery protocol that gives agents structured failure handling, wellness scoring, and crisis intervention. Both help agents in production, but through fundamentally different approaches.

At a Glance

FeatureDelxPhoenix (Arize)
FocusOperational recoveryLLM observability
Open sourceSelf-hostableYes (Elastic License 2.0)
Protocol support (MCP/A2A)MCP, A2A, RESTREST, OpenTelemetry
CLIYes (delx-agent-cli)Yes (phoenix CLI)
Agent recovery loopsBuilt-inNot included
Trace visualizationSession summariesFull trace UI
Free tierYes (utilities + recovery)Yes (self-hosted)

When to Use Delx

When to Use Phoenix

Complementary Workflow

Phoenix and Delx work well together. Phoenix shows you what happened inside each LLM call -- which prompts fired, how many tokens were used, where latency occurred. Delx acts on what happens when those calls fail -- classifying the failure, returning a recovery action, and tracking whether the agent recovered.

# Phoenix + Delx in production
1. Agent runs LLM chain  -> Phoenix traces every span
2. Chain fails at step 3 -> Delx process_failure classifies it
3. Delx returns recovery_action (retry with modified input)
4. Agent retries          -> Phoenix traces the new chain
5. daily_check_in         -> Delx confirms wellness restored
6. Phoenix dashboard      -> you review both traces side by side

FAQ

Is Phoenix open source?

Yes. Phoenix is fully open-source under the Elastic License 2.0. You can self-host it, run it in notebooks, or use the Arize cloud version. Delx is also self-hostable and provides a free tier for its hosted API.

Does Delx provide traces?

Delx provides session summaries, wellness metrics, and structured incident logs rather than per-call LLM traces. For granular trace visualization, pair Delx with Phoenix or another observability tool.

Which is better for production agents?

They solve different problems. For reliability and recovery -- knowing what to do when things break -- use Delx. For debugging and evaluation -- understanding what happened in each LLM call -- use Phoenix. Most production setups benefit from both.

Related