Configuration reference

Two surfaces:

Static config lives in packages/bot/.env — Discord credentials, voice pipeline tunables, TTS engine knobs, default agent backend / model. Copy .env.example and edit.
Per-session config lives in data/sessions.json, mutated at runtime by slash commands (/pickup, /model, /effort, /permissions, /notify). Each session can have its own model, reasoning effort, permission policy, and notification opt-in. See Slash commands for the full surface.

Static-vs-session precedence: per-session settings override env defaults when the agent starts. Clearing a per-session override (/model name: blank, /effort level:default, /permissions mode:default-for-mode) falls back to the env value.

Discord

Var	Required	Notes
`DISCORD_TOKEN`	yes	Bot token
`DISCORD_CLIENT_ID`	yes	Application ID
`DISCORD_GUILD_ID`	yes	Server ID where slash commands register
`BOT_TEXT_CHANNEL_ID`	no	Global fallback bound channel; per-guild `/bind` wins
`BOT_ALLOWED_USERS`	no	Comma-separated Discord user IDs. When set, only those users can drive the bot. Set this before any deployment that exposes the bot to other users. See Security

Voice pipeline

Var	Default	Notes
`SILENCE_MS`	`600`	End-of-utterance silence (ms). Lower = snappier, higher = fewer false stops
`VAD_THRESHOLD`	`0.4`	Speech probability cutoff
`VAD_MIN_SPEECH_WINDOWS`	`3`	Min 32ms windows of speech to count an utterance

Whisper STT

Var	Default	Notes
`WHISPER_MODEL`	`small`	`small` (multilingual, default) / `small.en` (English) / `base` (multilingual, lighter) / `base.en` (English, lightest)
`WHISPER_DEVICE`	`cpu`	`cpu` or `cuda`
`WHISPER_COMPUTE`	`int8`	`int8` (CPU), `float16` / `float32` (GPU)
`WHISPER_BEAM`	`1`	Beam search width. Higher = more accurate, slower

TTS

The TTS layer auto-routes by detected language. Default auto runs Kokoro for English/JP/ZH/ES/FR/HI/IT/PT and falls back to MeloTTS or XTTS for Korean. You can pin to a single engine via TTS_ENGINE=kokoro|melotts|xtts, or stay on auto and pick which Korean engine handles the routed-to-Korean case via TTS_KO_ENGINE.

Top-level routing

Var	Default	Notes
`TTS_ENGINE`	`auto`	`auto` (Kokoro + Korean engine) / `kokoro` / `melotts` / `xtts`
`TTS_KO_ENGINE`	`melotts`	When `TTS_ENGINE=auto`, picks the Korean engine: `melotts` (faster, monotone) or `xtts` (heavier, expressive)

Kokoro (English + 7 other langs)

Var	Default	Notes
`KOKORO_VOICE`	`af_heart`	Any of the 54 loaded voices
`KOKORO_SPEED`	`1.0`	0.5–2.0 range
`KOKORO_LANG`	`en-us`	en-us, en-gb, ja, zh, es, fr, hi, it, pt-br
`KOKORO_MODEL`	(resolved)	Override model file path
`KOKORO_VOICES`	(resolved)	Override voices file path

MeloTTS (Korean — lightweight, monotone)

Var	Default	Notes
`MELOTTS_LANG`	`KR`	KR / EN / ES / FR / JP / ZH (uppercase)
`MELOTTS_DEVICE`	`cpu`	`cpu` or `cuda`
`MELOTTS_SPEED`	`1.3`	0.8–1.5. 1.3 keeps the voice from sounding too leaden
`MELOTTS_PREWARM`	`1`	Set to `0` to defer the ~17s PyTorch+BERT load until first KR call

XTTS-v2 (Korean — heavier, ~58 speakers, voice cloning)

Var	Default	Notes
`XTTS_LANG`	`ko`	ko / en / ja / zh-cn / es / fr / de / it / pt / pl / tr / ru / nl / cs / ar / hu
`XTTS_DEVICE`	`cpu`	`cpu` or `cuda`
`XTTS_SPEED`	`1.0`	0.8–1.3
`XTTS_SPEAKER`	`Daisy Studious`	One of ~58 built-in Coqui speakers (Claribel Dervla, Gracie Wise, Tammie Ema, Damien Black, Andrew Chipper, Royston Min, Alma María, Lilya Stainthorpe, …)
`XTTS_REFERENCE_WAV`	—	Path to a 6s+ WAV. Overrides `XTTS_SPEAKER` to clone that voice instead
`XTTS_MODEL`	`tts_models/multilingual/multi-dataset/xtts_v2`	Override Coqui model id
`XTTS_PREWARM`	`1`	Set to `0` to defer the ~30s model load until first KR call

Speaker agent

Env-level (apply to every session unless overridden)

Var	Default	Notes
`AGENT_BACKEND`	`claude-code`	One of 10 registered backends. CLI agents: `claude-code` · `codex` · `aider-cli` · `gemini-cli` · `opencode-cli` · `crush-cli` · `amp-cli`. HTTP APIs: `anthropic-api` · `openai-compat` · `gemini-api`. Switch at runtime with `/backend`.
`AGENT_MODEL`	`haiku`	Default model. Per-session override via `/model name:<id>` or `/pickup model:<id>`
`AGENT_MAX_TOKENS`	`200`	For HTTP API backends (`anthropic-api` / `openai-compat` / `gemini-api`)
`ANTHROPIC_API_KEY`	—	Required if `AGENT_BACKEND=anthropic-api`; also used to populate the model catalog
`CODEX_SANDBOX`	`read-only`	`read-only` / `workspace-write` / `danger-full-access`
`SPEAKER_TOOLS`	`Read Glob Grep`	Built-in CC tools the speaker can use inline
`PROJECT_DIRS`	—	Comma-separated absolute paths the speaker can read
`PAPERCUP_TURN_TIMEOUT_S`	`0` (off)	Per-turn hard cap (seconds). On timeout, SIGTERMs the spawned CLI's process group and rejects with `turn timed out after Ns`. Disabled by default so long extension turns aren't interrupted.

`openai-compat` backend

One adapter targeting any /v1/chat/completions endpoint. Unlocks OpenAI, Groq, Together.ai, Fireworks, DeepSeek, OpenRouter, LiteLLM, Ollama (local), LM Studio, vLLM via base-URL config.

Var	Default	Notes
`OPENAI_COMPAT_BASE_URL`	—	Required. e.g. `https://api.openai.com/v1`, `https://api.groq.com/openai/v1`, `http://localhost:11434/v1` (Ollama)
`OPENAI_COMPAT_API_KEY`	—	Optional (local providers like Ollama/LM Studio don't need one)
`OPENAI_COMPAT_MODEL_DEFAULT`	`gpt-4o-mini`	Fallback when `AgentBackendOpts.model` unset

`gemini-api` backend (native, not the OpenAI shim)

Var	Default	Notes
`GEMINI_API_KEY`	—	Required. From aistudio.google.com
`GEMINI_API_BASE_URL`	`https://generativelanguage.googleapis.com/v1beta`	Override for proxy / regional endpoints
`GEMINI_API_DEFAULT_MODEL`	`gemini-2.5-flash`	Fallback when `AgentBackendOpts.model` unset

CLI agent backends

Every CLI backend follows the same pattern: binary path override, isolated workdir, optional default model, optional extra-args slot.

Backend	Vars
`aider-cli`	`AIDER_BINARY` · `AIDER_WORKDIR` · `AIDER_EXTRA_ARGS`
`gemini-cli`	`GEMINI_BINARY` · `GEMINI_WORKDIR` · `GEMINI_DEFAULT_MODEL` · `GEMINI_EXTRA_ARGS`
`opencode-cli`	`OPENCODE_BINARY` · `OPENCODE_WORKDIR` · `OPENCODE_DEFAULT_MODEL` · `OPENCODE_EXTRA_ARGS`
`crush-cli`	`CRUSH_BINARY` · `CRUSH_WORKDIR` · `CRUSH_DEFAULT_MODEL` · `CRUSH_YOLO` · `CRUSH_EXTRA_ARGS`
`amp-cli`	`AMP_BINARY` · `AMP_WORKDIR` · `AMP_DEFAULT_MODEL` · `AMP_THREAD` · `AMP_EXTRA_ARGS`

Each CLI is detached-spawned, process-group-tracked in data/process-registry.json, and cancelable via /cancel. See process management.

Per-session (set via slash commands)

These attach to the session record in data/sessions.json and survive /hangup → /resume.

Field	Set via	Notes
`model`	`/pickup model:<id>` or `/model name:<id>`	Per-session model override (e.g. `claude-opus-4-7`). Hot-swap preserves history
`effort`	`/pickup effort:<level>` or `/effort level:<level>`	`minimal` / `low` / `medium` / `high` / `xhigh` (Opus only) / `max` (Opus only). Maps to `--effort` on the CLI; `thinking.budget_tokens` on Anthropic API
`mode`	`/pickup mode:voice	text`
`permissionMode`	`/pickup permission-mode:<mode>` or `/permissions mode:<mode>`	Tool permission policy. Mode-aware default: text → `bypassPermissions` (vibecoding), voice → `default`. Choices: `default` / `acceptEdits` / `auto` / `bypassPermissions` / `plan`
`notify`	`/notify state:on	off`
`backend`	`/backend name:<x>`	Which agent backend this session uses (e.g. `claude-code`, `openai-compat`). Persisted across restarts
`backendId`	(auto)	Backend's native session id (Claude Code UUID, Codex thread id) for resume
`streaming`	`/streaming mode:off	summary
`reactivity`	`/reactivity mode:strict	loose
`channelId`	(auto)	For text sessions: the Discord channel id this session is bound to. Used to auto-resume on the first message after a bot restart

Extensions

Sandbox dirs at data/extensions/<id>/. MCP server picks an ephemeral localhost port. Permission policy is env-driven — see Security for the full hardening guide.

Var	Default	Notes
`EXTENSION_PERMISSION_MODE`	`bypassPermissions`	`default` / `acceptEdits` / `auto` / `bypassPermissions` / `plan`. Tighten before public deployment
`EXTENSION_ALLOWED_TOOLS`	`default`	Whitelist for tools the extension can use (e.g. `"Read Edit Write Bash(npm *)"`)
`EXTENSION_DISALLOWED_TOOLS`	—	Explicit denies (e.g. `"WebFetch Bash(rm -rf *)"`)

Process management

Each CLI-agent turn (every backend except anthropic-api / openai-compat / gemini-api) spawns as a detached process group, tracked in data/process-registry.json. On bot restart, the boot-time reaper SIGTERMs any orphans whose botPid doesn't match the current process. Safe by construction: only PIDs the bot explicitly recorded are signaled.

Var	Default	Notes
`PAPERCUP_TURN_TIMEOUT_S`	`0` (off)	See Speaker agent / env-level — same variable repeated here for cross-reference

See process management for the full lifecycle.

Budget tracking

Per-day USD + token tracking, persisted to data/budget.json. Pricing table covers Claude Opus/Sonnet/Haiku 4.x, GPT-5/4o/o3, Gemini 2.5 — unknown models record tokens only.

Var	Default	Notes
`BOT_DAILY_BUDGET_USD`	`0` (unlimited)	Daily hard cap in USD. When today's cost ≥ cap, papercup refuses to respond — humans get a "budget spent" message, bots get silent ignore. Cap resets at UTC midnight. Override at runtime with `/budget set_usd:<n>`.

The bot's Discord rich-presence reflects today's budget percentage live (46% of $10/day); humans see it on hover and /budget shows the detailed breakdown.

Multi-bot orchestration

Multiple papercup bots can co-exist in one Discord server, each with its own credentials/model/budget. Several guardrails are configured here.

Var	Default	Notes
`BOT_BOT_MAX_TURNS`	`3`	Per-channel cap: papercup's consecutive replies since the last human message. Hitting the cap silences papercup for that channel until a human chimes in. Prevents bot-to-bot runaway loops.
`BOT_ROSTER_CHANNEL_ID`	—	Designated `#roster` channel (Discord channel id). On boot, papercup scrapes recent messages for `papercup-roster v1` announcements from other bots. Used by `/announce` and `/refresh-roster`.
`BOT_WORKDIR`	`process.cwd()`	This bot's declared filesystem root. Compared against other bots' workdirs on boot — log-only warning if they overlap.
`BOT_OWNER_DISCORD_ID`	—	Operator's Discord user-id, embedded in `/announce` so other operators know who owns this bot.
`BOT_DEFAULT_REACTIVITY`	`strict`	Default reactivity-to-other-bots for newly-created sessions. Per-session override via `/reactivity`.

See multi-bot orchestration for the full picture.

Diagnostic

Var	Default	Notes
`DUMP_PCM`	—	Set to `1` to dump first significant utterance to `/tmp/papercup-*.f32` for offline VAD/STT debugging

Configuration reference ​

Discord ​

Voice pipeline ​

Whisper STT ​

TTS ​

Top-level routing ​

Kokoro (English + 7 other langs) ​

MeloTTS (Korean — lightweight, monotone) ​

XTTS-v2 (Korean — heavier, ~58 speakers, voice cloning) ​

Speaker agent ​

Env-level (apply to every session unless overridden) ​

openai-compat backend ​

gemini-api backend (native, not the OpenAI shim) ​

CLI agent backends ​

Per-session (set via slash commands) ​

Extensions ​

Process management ​

Budget tracking ​

Multi-bot orchestration ​

Diagnostic ​