workledger¶
workledger is a Python CLI and local pipeline for turning AI trace inputs into normalized observations, rolled units of work, and optional downstream policy/reporting outputs.
In plain English: it helps you turn messy agent traces into human-reviewable units of work so you can understand what happened, what it cost, and what may need review.
The codebase is centered on three objects:
ObservationSpan: normalized trace or span recordWorkUnit: rolled work item with evidence, lineage, cost, and review stateClassificationTrace: optional policy-backed interpretation of a work unit
This repository is primarily a local DuckDB-backed pipeline exposed through the wl CLI. It also ships a FastAPI server, built-in YAML policy packs, benchmark fixtures, schema artifacts, synthetic demos, and two Hugging Face dataset adapters.
Start with the network-free local demo:
# prerequisites: Python 3.11+ and uv (https://docs.astral.sh/uv/getting-started/installation/)
git clone https://github.com/couscous18/workledger.git
cd workledger
uv sync --all-extras
uv run wl demo coding --project-dir .workledger/coding --open-report
Optional public-trace demos:
uv run wl demo hf-gaia --project-dir .workledger/hf-gaia --open-report
uv run wl demo hf-smoltrace --project-dir .workledger/hf-smoltrace --open-report
The Hugging Face demos exercise ingest, rollup, and reporting. Policy classification remains explicit.
Start here: