WHAT
Traces tell you what the agent did.
Ordered chain of LLM + tool calls with timing, cost, and tokens. The diary of the run — the same story every observability tool is happy to tell.
Open-source explainable observability for AI agent systems. Causal attribution + constitutional governance + visual debugging.
Integrated with the frameworks you already use
The gap · Two lenses · One platform
Same trace data, two perspectives. Hover either lens to focus it; the components below are the real /traces and /sankey views, not screenshots.
WHAT
Ordered chain of LLM + tool calls with timing, cost, and tokens. The diary of the run — the same story every observability tool is happy to tell.
WHY
Causal attribution over the prompt. Every decision has a trail back to the words that drove it, the rule that caught it, and the edit that would have flipped it.
Side-by-side
Same failure mode. Same agent. Drag to compare the investigation loop without AuditTrail (left) vs. with it (right).
Drag the slider to compare. Mobile: swipe horizontally.
Feature-by-feature against the OSS + SaaS LLM observability pack. Uncertain cells are shown as partial; every claim is cross-checked against published docs. Hover any cell icon for footnotes.
| Feature | AuditTrail | LangSmith | Langfuse | OPIK | Helicone |
|---|---|---|---|---|---|
| Observability4 features· pinned | |||||
| Trace capture | |||||
| Interactive DAG viewer | |||||
| Sankey flow attribution | |||||
| Real-time streaming | |||||
| Explainability (XAI)4 features· pinned | |||||
| Causal attribution (SHAP + ablation) | |||||
| Counterfactual explanations | |||||
| Mechanistic XAI (SAE features) | |||||
| Natural language explanations | |||||
| Governance4 features· pinned | |||||
| Constitutional rule engine | |||||
| EU AI Act compliance mode | |||||
| REGO / OPA policy engine | |||||
| PDF audit export | |||||
| Operations4 features· pinned | |||||
| Live Fleet dashboard | |||||
| Operations Assistant chatbot | |||||
| 3-tier deployment actions | |||||
| OpenAI-compatible gateway | |||||
| Infrastructure4 features· pinned | |||||
| Self-hostable | |||||
| Open-source license | |||||
| 6-language first-party SDK | |||||
| OTel OTLP ingest | |||||
supported partial TBD — research pending not supported· hover a group to peek · click to pin · hover any row or footnote icon for details
Every surface the product ships — observe, explain, operate, govern, integrate. Screenshots captured against the live dashboard.

Causal attribution, constitutional governance, live fleet ops, and a BYOK assistant chatbot — all on one canvas.
Watch your fleet tick in real time — per-agent error rate, p95 latency, violation rate, and one-click jump to the most recent trace.
BYOK chatbot grounded on your traces. Bring your own OpenAI or Anthropic key — your data stays put.
Throttle nodes, swap models, file incidents — 3-tier safety baked in. Every action audited.
Article 12/13/50 logs, 180-day retention floor, PDF regulator export, incident reporting.
Point any OpenAI-compatible client at our gateway. Zero SDK changes. Full traces.
Cost, latency, violations, errors, traffic — fan out to Slack, email, or PagerDuty.
Mechanistic XAI · v2.0+
When your agent runs on a supported open-source model we attach a sparse autoencoder trained by SAELens and surface the top-activated features per span. Behavioural features, safety features, chain-of-thought cues — the interpretable units modern mech-interp research has learned to find.
Active near SYSTEM block discussing allowed topics.
Fires on the literal numeric tokens in the user turn.
Fires on phrases like "today" and "latest".
Rises inside `<thinking>` style wrappers.
Precedes tool-argument emission.
Fires after retrieved-context block.
Partial — the model is weighing a user-prompt nudge.
Pricing
Apache 2.0 self-host has every feature we ship. Cloud saves your ops team the DB + retention + scaling work. Enterprise layers SSO, SCIM, and private-deploy ceremony on top.
Your infra, your rules. Apache 2.0.
Free
Forever. No seat limit, no feature gates.
Hosted by us. You bring the agents.
$49
per org / month · billed annually · 100k traces / mo
SSO, SCIM, SOC 2 docs, private deploy.
Talk to us
Custom terms · multi-year available
Full feature matrix + annual-prepay discount on /pricing.
import audittrail
# Initialize — one line, zero config
audittrail.init(frameworks=["langgraph"])
@audittrail.traceable
async def run_agent(prompt: str):
result = await graph.ainvoke({"input": prompt})
return result
# Full traces, DAG, Sankey — automaticpip install audittrail — works with LangGraph, LangChain, AutoGen, OpenAI Agents SDK, and the raw OpenAI / Anthropic SDKs.
One Docker command. Full observability UI on localhost:3000. No cloud, no config, no call-home.
Instrument
# 2. Instrument — one decorator, zero config
from audittrail import traceable
@traceable(name="research-agent")
async def run(query: str) -> str:
...Instrument. Spans flow automatically: LLM calls, tool invocations, outputs, token counts, costs.
from audittrail import traceable
@traceable(name="research-agent")
async def deep_search(query: str) -> str:
plan = await llm.complete(f"Plan for: {query}")
docs = await search_tool(plan)
return await llm.complete(f"Synthesize: {docs}")Three lines of code. Full explainability. No configuration.
Drop the SDK in, point traces at AuditTrail, and start observing — and operating — your fleet in minutes. Helm chart ships with the repo. Docker Compose for dev. Cloud tier coming soon. Either way — same data model, same SDKs, same dashboard.
FAQ