Transcript Donation Lab

Build a sanitized, reviewable donation bundle without ever uploading raw logs. Everything runs in your browser until you click submit.

Note: One-click submission requires a HuggingFace account with write access to the target dataset repository. Alternatively, use the manual download option and submit via git. Anything merged into the dataset may be copied or mirrored, so removing it later may not fully retract it from downstream use.

Prefer the command line? This tool is also available as a CLI (npm run cli) for scripting and as a TUI (npm run tui) for interactive terminal use. See the README for details.

Your progress and preferences are saved locally in this browser. Nothing is sent to any server until you submit.

Why review your transcripts?

How this can help you

See which prompts led to better outcomes
Find where you or the agent went in circles
Understand how you actually use coding agents
Catch secrets or sensitive data you didn't notice

How this can help others

Real usage data helps train better models
Your tricky bugs help others avoid them
Contribute to open research on AI tools
Your data influences how agents evolve

Start small: We recommend uploading just 2-3 transcripts first. Check how they look on HuggingFace, make sure the redaction caught everything, and see if there's anything you didn't think of. Then come back for more.

Work in batches: Your review progress is saved in this browser's localStorage, so you can come back anytime and continue where you left off. It is stored in this browser only: if you clear browser data or switch devices, you'll start fresh.

Step 0 — Export locally

Run the export script on your machine to create raw_export.zip. The export stays on your computer; nothing is uploaded until you have reviewed and sanitized it.

Download exporter

Python exporter Bash helper PowerShell

python export_transcripts.py --source claude

python export_transcripts.py --source codex

python export_transcripts.py --source opencode

What gets exported?

Claude Code

Session transcripts from ~/.claude/projects/
Excludes: settings, todos, commands, MCP config
How Claude Code stores sessions →

Codex CLI

History from ~/.codex/history.jsonl
Excludes: config.toml, auth.json (credentials), AGENTS.md
Codex CLI config docs →

OpenCode

Sessions from SQLite database in ~/.opencode/
Automatically converted to JSON format
OpenCode documentation →

New options: Use --dry-run to preview what will be exported, --verbose to see excluded files, or --show-rules to view filtering rules.

Step 1 — Import your exports

Upload one or more export files. Everything is processed locally in your browser.

Claude Code

From export_transcripts.py --source claude

Codex CLI

From export_transcripts.py --source codex

OpenCode

From export_transcripts.py --source opencode

Loaded sources

No files loaded yet.

View file tree

Awaiting import.

Understanding your data

How Claude Code stores data

~/.claude/ contains your session history:

history.jsonl — Index of all sessions (prompts, timestamps, project paths)
projects/ — Full conversation transcripts as JSONL files

Entry types in session files:

`user`	Your messages to Claude
`assistant`	Claude's responses (text, thinking, tool calls)
`system`	System prompts and context
`summary`	Compressed context from long conversations
`file-history-snapshot`	Snapshots of file state during edits
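
To get a feel for what a session file contains before importing it, the entry types listed above can be tallied with a short script. This is a minimal sketch, not the exporter's own code: it assumes each line of the JSONL file is a JSON object whose entry type sits in a top-level "type" field, and the function name count_entry_types is hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def count_entry_types(session_path: Path) -> Counter:
    """Count entries per type in one JSONL session file."""
    counts: Counter = Counter()
    with session_path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # ignore blank lines
            try:
                entry = json.loads(line)
            except json.JSONDecodeError:
                counts["<unparseable>"] += 1
                continue
            # Assumption: the entry type is stored in a top-level "type" field.
            if isinstance(entry, dict):
                counts[entry.get("type", "<missing>")] += 1
            else:
                counts["<missing>"] += 1
    return counts
```

Calling count_entry_types on a session file returns a Counter keyed by entry type, which makes it easy to spot sessions that are mostly snapshots or summaries rather than conversation.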

How Codex CLI stores data

~/.codex/ contains:

history.jsonl — Prompt history (session_id, timestamp, text)
sessions/YYYY/MM/DD/ — Full rollout files per session

Entry types in rollout files:

`session_meta`	Session metadata (cwd, git info, model)
`response_item`	Messages with role (user/assistant) and content
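
The prompt history described above can be grouped by session to see how many prompts each session contains. The sketch below is illustrative only: it assumes every line of history.jsonl is a JSON object with keys named exactly session_id and text, as listed above, and the helper name prompts_by_session is hypothetical.

```python
import json
from collections import defaultdict
from pathlib import Path


def prompts_by_session(history_path: Path) -> dict[str, list[str]]:
    """Group prompt text by session, preserving file order."""
    sessions: dict[str, list[str]] = defaultdict(list)
    with history_path.open(encoding="utf-8") as handle:
        for line in handle:
            if not line.strip():
                continue  # ignore blank lines
            record = json.loads(line)
            # Assumption: keys are named "session_id" and "text".
            session_id = str(record.get("session_id", "<unknown>"))
            sessions[session_id].append(record.get("text", ""))
    return dict(sessions)
```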

How OpenCode stores data

~/.opencode/ uses SQLite for persistence:

*.db / *.sqlite — SQLite database with sessions table
Sessions contain messages, metadata, and file changes

Our exporter converts SQLite to JSON for processing.
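
The SQLite-to-JSON conversion can be pictured roughly as follows. This is a sketch under stated assumptions rather than the exporter's actual implementation: it assumes the table is named sessions, as noted above, and makes no assumption about column names. The function name sessions_to_json is hypothetical.

```python
import json
import sqlite3


def sessions_to_json(db_path: str, table: str = "sessions") -> str:
    """Dump every row of one table as a JSON array of objects."""
    # Table names cannot be bound as SQL parameters, so validate before quoting.
    if not table.isidentifier():
        raise ValueError(f"unexpected table name: {table!r}")
    conn = sqlite3.connect(db_path)
    conn.row_factory = sqlite3.Row  # rows become mapping-like objects
    try:
        rows = conn.execute(f'SELECT * FROM "{table}"').fetchall()
    finally:
        conn.close()
    # default=str keeps non-JSON values (for example BLOB bytes) from raising.
    return json.dumps([dict(row) for row in rows], indent=2, default=str)
```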

Step 2 — Sessions

Pick the sessions worth donating. Click any session to view its full content. Scoring happens locally in your browser.

Tip: For your first donation, try selecting just 2-3 sessions. You can always come back for more after you've seen how they look on HuggingFace.

How scoring works

Sessions are scored locally to help identify potentially valuable transcripts for donation. Higher scores suggest more useful content.

Signal	Points	Why it matters
Keywords: `error`, `traceback`, `stack`	+1.5 each	Debugging conversations
Keywords: `diff`, `patch`, `git`, `commit`	+1.5 each	Code changes
Keywords: `test`, `pytest`, `npm`, `yarn`, `pip`	+1.5 each	Build/test tooling
Keywords: `tool call`, `function`, `stderr`, `stdout`	+1.5 each	Tool use patterns
Length: 400–8000 chars	+2	Substantive but focused
Length: >8000 chars	-1	May be too verbose
Length: <120 chars	-1	Too short to be useful

Scores are heuristic only. Review content before donating — a low-scored session may still be valuable, and high-scored ones may contain sensitive data.
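
The table above can be approximated in a few lines. This sketch is one reading of the rules, not the tool's actual scorer: it assumes "+1.5 each" means once per distinct keyword present (not per occurrence), uses case-insensitive substring matching, and treats the 400 and 8000 boundaries as inclusive. The name score_session is hypothetical.

```python
KEYWORDS = (
    "error", "traceback", "stack",              # debugging conversations
    "diff", "patch", "git", "commit",           # code changes
    "test", "pytest", "npm", "yarn", "pip",     # build/test tooling
    "tool call", "function", "stderr", "stdout",  # tool use patterns
)


def score_session(text: str) -> float:
    """Heuristic score for one session transcript."""
    lowered = text.lower()
    # Assumption: each distinct keyword counts once. Substring matching means
    # "pytest" also counts as "test".
    score = 1.5 * sum(1 for keyword in KEYWORDS if keyword in lowered)
    length = len(text)
    if length < 120:
        score -= 1  # too short to be useful
    elif 400 <= length <= 8000:
        score += 2  # substantive but focused
    elif length > 8000:
        score -= 1  # may be too verbose
    return score
```

For example, score_session("error") returns 0.5: one keyword hit (+1.5) minus the short-length penalty (-1). Lengths from 120 to 399 characters receive no length adjustment under this reading.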

Step 3 — Redact & Filter

Click through each session to review how it will look after redaction. Adjust settings on the left — changes apply instantly.
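
As a rough picture of what rule-based redaction does, the sketch below replaces matches with labeled placeholders and counts them. The two patterns (email addresses and home-directory paths) and the placeholder format are assumptions chosen for illustration; they are not the tool's actual redaction rules, which may differ and cover more cases.

```python
import re

# Hypothetical example rules; the real tool's rules are configured in the UI.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "HOME_PATH": re.compile(r"/(?:home|Users)/[^/\s]+"),
}


def redact(text: str) -> tuple[str, int]:
    """Return the redacted text and the total number of replacements."""
    total = 0
    for label, pattern in PATTERNS.items():
        text, replaced = pattern.subn(f"[REDACTED_{label}]", text)
        total += replaced
    return text, total
```

Pattern-based redaction only catches what its rules anticipate, which is why reviewing each session by hand still matters.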

No sessions selected. Go back to Step 2 and select sessions to review.

Step 4 — Confirm & Build

Summary

Sessions to donate: 0

Total redactions: 0

Fields included: 0

I have the rights to share this content and it does not violate any confidentiality obligations.

I have reviewed the sanitized output and confirm it is safe to share.

Step 5 — Submit

Contributor Info

This info is included in your donation bundle and will be publicly associated with your contribution.

Username: Required. Sign in with HuggingFace below to auto-fill.

License: How others may use your donated transcripts.

AI Training Preference: Whether AI models may train on your data.

Sign in & Submit

Opens Hugging Face login in a new window

Target dataset repository

Or download for manual upload

Only needed if you want to keep a local copy or prefer to upload via git/web UI.

Bundle not built yet. Go to Step 4 to build your bundle first.

Manual upload instructions

Option 1: Git workflow

Option 2: HuggingFace Web UI

Transcript Donation Lab

Why review your transcripts?

How this can help you

How this can help others

Step 0 — Export locally

Download exporter

What gets exported?

Step 1 — Import your exports

Loaded sources

Understanding your data

Step 2 — Sessions

Step 3 — Redact & Filter

Session Navigation

Redaction Rules

Field Selection

Redaction Stats

Annotate Session

Step 4 — Confirm & Build

Summary

Step 5 — Submit

Contributor Info

Sign in & Submit

Redaction Playground

Field Dictionary