How It Works

workledger sits between raw trace inputs and higher-level reasoning about work.

flowchart LR
  A["JSON / JSONL / OTEL / OpenInference / CloudEvents / supported HF datasets"] --> B["Normalize to ObservationSpan"]
  B --> C["Store in local DuckDB project"]
  C --> D["Roll up to WorkUnit"]
  D --> E["Optionally classify to ClassificationTrace"]
  E --> F["Reports / review queue / exports / cost comparison"]

The Actual Pipeline

wl init creates a local project directory, copies built-in policy packs, and exports the schema bundle.
wl ingest or wl ingest-hf normalizes input payloads into ObservationSpan records.
Those spans are stored in a local DuckDB database.
wl rollup groups related spans into WorkUnit records using work_unit_key, common issue/task identifiers, or, as a fallback, the trace ID.
wl classify optionally applies a YAML policy pack to each WorkUnit and writes a ClassificationTrace plus the underlying PolicyDecision records.
wl report, wl review-queue, wl export, wl compare-costs, and wl benchmark read from that local store.
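
The grouping rule in the rollup step can be illustrated with a minimal sketch. It is not the project's implementation: the span field names (issue_id, task_id, span_id, trace_id), the helper names, and the strict precedence order shown here are assumptions made for illustration. Only work_unit_key and the trace ID fallback are named in the text above.

```python
from collections import defaultdict
from typing import Iterable


def rollup_key(span: dict) -> str:
    """Pick a grouping key for one span (hypothetical field names)."""
    # 1. Prefer an explicit work_unit_key when the span carries one.
    if span.get("work_unit_key"):
        return f"key:{span['work_unit_key']}"
    # 2. Otherwise use a shared issue or task identifier (assumed fields).
    for field in ("issue_id", "task_id"):
        if span.get(field):
            return f"{field}:{span[field]}"
    # 3. Fall back to the trace ID.
    return f"trace:{span['trace_id']}"


def group_spans(spans: Iterable[dict]) -> dict[str, list[str]]:
    """Group span IDs under the key chosen by rollup_key."""
    groups = defaultdict(list)
    for span in spans:
        groups[rollup_key(span)].append(span["span_id"])
    return dict(groups)


spans = [
    {"span_id": "s1", "trace_id": "t1", "work_unit_key": "WU-7"},
    {"span_id": "s2", "trace_id": "t2", "work_unit_key": "WU-7"},
    {"span_id": "s3", "trace_id": "t3", "issue_id": "ISSUE-12"},
    {"span_id": "s4", "trace_id": "t4"},
]
groups = group_spans(spans)
```

In this sketch, spans from two different traces land in the same group because they share a work_unit_key, while a span with no key and no identifier forms its own group under its trace ID.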

What Rollup Actually Adds

Rollup is the step where the repository becomes more than a trace normalizer. A WorkUnit carries:

title, summary, and objective
rolled direct and allocated cost
evidence bundle and lineage refs
review and trust state
source span IDs and compression ratio

The WorkUnit is the main primitive in this codebase.
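
As a rough picture of the record described above, the sketch below models those fields as a Python dataclass. The class name, field names, types, and defaults are hypothetical and are not taken from the actual schema bundle. The compression ratio is assumed here to be the number of source spans folded into one record; the real definition may differ.

```python
from dataclasses import dataclass, field


@dataclass
class WorkUnitSketch:
    """Illustrative shape only; not the project's real WorkUnit schema."""

    title: str
    summary: str
    objective: str
    direct_cost: float      # rolled direct cost
    allocated_cost: float   # rolled allocated cost
    evidence_refs: list[str] = field(default_factory=list)
    lineage_refs: list[str] = field(default_factory=list)
    review_state: str = "unreviewed"   # placeholder value
    trust_state: str = "unknown"       # placeholder value
    source_span_ids: list[str] = field(default_factory=list)

    @property
    def compression_ratio(self) -> float:
        # Assumption: spans per work unit, i.e. how many spans this
        # single record stands in for.
        return float(len(self.source_span_ids))


unit = WorkUnitSketch(
    title="Fix flaky retry in ingest worker",
    summary="Three spans covering diagnosis, patch, and verification.",
    objective="Stop duplicate ingest retries",
    direct_cost=0.25,
    allocated_cost=0.5,
    source_span_ids=["s1", "s2", "s3"],
)
```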

What Is Optional

Policy classification is optional. The Hugging Face demos do not run it automatically.
Comparative economics is optional. wl report includes it only when you pass --include-economics.
The FastAPI server is a wrapper around the same local pipeline, not a separate execution path.
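
To make the optional classification step more concrete, the sketch below shows one way a single trace could be backed by per-rule decisions. Everything in it is an assumption for illustration: the rule shape (a plain Python list standing in for a parsed YAML policy pack), the keyword-matching logic, the first-match-wins rule, and all class and field names. It does not reflect the real policy pack format.

```python
from dataclasses import dataclass


@dataclass
class PolicyDecisionSketch:
    """One rule's outcome for one work unit (hypothetical shape)."""
    rule_id: str
    matched: bool


@dataclass
class ClassificationTraceSketch:
    """Final label plus the decisions that led to it (hypothetical shape)."""
    work_unit_title: str
    label: str
    decisions: list[PolicyDecisionSketch]


# Stand-in for a parsed YAML policy pack; this rule format is invented.
POLICY_RULES = [
    {"id": "bugfix", "keyword": "fix", "label": "maintenance"},
    {"id": "feature", "keyword": "add", "label": "new-work"},
]


def classify(title: str, rules: list[dict]) -> ClassificationTraceSketch:
    decisions = []
    label = "unclassified"
    for rule in rules:
        matched = rule["keyword"] in title.lower()
        decisions.append(PolicyDecisionSketch(rule["id"], matched))
        # The first matching rule sets the label; later rules are
        # still evaluated so every decision is recorded.
        if matched and label == "unclassified":
            label = rule["label"]
    return ClassificationTraceSketch(title, label, decisions)


trace = classify("Fix flaky retry in ingest worker", POLICY_RULES)
```

Recording a decision for every rule, including the ones that did not match, is what lets a trace explain its own label.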