bot-bottle

Author	SHA1	Message	Date
didericis	933d8cf6c3	feat(dashboard): route stop output into right tmux pane PRD 0021 follow-up. Mirrors the bringup-into-right-pane fix on the explicit-stop path: when `$TMUX` is set, the stop flow respawns the right pane with `tail -F state/<slug>/teardown.log` (via `_ensure_right_pane` — reuses the existing right pane if it's the agent's claude session) and redirects fd 2 to that log for the duration of `capture_session_state` + `cm.__exit__`. compose-down + network-remove messages stream into the right pane. After `settle_state` removes the state dir, the tail keeps its buffered output visible (tail -F handles file removal gracefully); the next attach respawns the pane with claude. Falls back to the existing curses-endwin path on tmux failure, or when the dashboard isn't in tmux at all.	2026-05-26 15:08:49 -04:00
didericis	e90d7dba76	fix(dashboard): repaint stdscr immediately after modal closes After the operator pressed `y` on the preflight modal (or picked an agent in the picker), the modal's curses sub-window stayed on screen until the dashboard's main loop ticked again — which during a 5-10s launch made it look like the confirmation never registered. Add `_erase_modal` (touchwin + refresh on stdscr) and call it at every exit from `_preflight_modal` and `_picker_modal`. The pre-modal frame buffered on stdscr immediately overwrites the sub-window's area; the launch proceeds with a clean dashboard underneath.	2026-05-26 15:01:56 -04:00
didericis	0936c40428	fix(dashboard): reuse existing right pane on new-agent start PRD 0021 follow-up. The new-agent flow was calling a dedicated `_tmux_split_pane_tail` that ALWAYS created a new pane — so every `n` start spawned a fresh right pane next to any existing one, accumulating panes instead of reusing them. Replace with a generic `_ensure_right_pane(tmux_state, argv)` that respawns the dashboard's tracked right pane if one is alive, splits a new one only when none is tracked or the tracked pane was closed. Both the new-agent tail-during-bringup path AND the existing claude-attach path now route through this helper. Net effect: starting a second agent reuses the same right pane — bringup tail replaces the prior claude session, then claude (for the new agent) replaces the tail. Closing the right pane manually via `C-b x` still triggers a fresh split on the next attach.	2026-05-26 14:50:56 -04:00
didericis	83ec9669c9	feat(dashboard): route launch output into right tmux pane PRD 0021 follow-up. When starting a new agent via `n` while in tmux, the dashboard now: 1. Pre-creates the right pane with `tail -F state/<slug>/bringup.log`. 2. Redirects fd 2 (stderr) to that log file via dup2 — affects both Python `info()` calls AND subprocess inheritors' stderr (docker compose up, network creates, provision). 3. Runs `backend.launch().__enter__()` with the redirect in place; everything streams into the right pane via tail. 4. Restores stderr. 5. Respawns the right pane (tail → claude session). Net effect: dashboard pane stays uncluttered during bringup, and the operator watches the compose-up + provision output in the same pane that's about to hold the claude session — no visual handoff between "starting" and "started." Curses never needs to come down on the tmux path (the pane is already created in the dashboard's neighbor pane, and stderr is redirected away from the terminal entirely). If `_tmux_split_pane_tail` fails (tmux missing, server died), falls through to the existing curses-endwin handoff so the operator still gets a session.	2026-05-26 14:41:53 -04:00
didericis	2ba84c5ba0	feat(dashboard): stop hook clears tmux state + right-pane row marker PRD 0021 chunk 4 (final). Two adjustments to close the split-pane loop: 1. `_stop_bottle_flow` clears `tmux_state['slug']` when the stopped bottle was the right-pane occupant. The pane itself stays in place (claude exits with "container not found"); the operator presses Enter on a different agent to repurpose it via respawn-pane. 2. `_render` accepts `right_pane_slug` and marks the matching agents-pane row with a `*` prefix + A_BOLD (when it's not also the focused row — focused selection still wins for visibility). Gives the operator a clear visual link between which agent the dashboard says is "active right now" and which one is visible to their right. Wired through `_main_loop`: passes `tmux_state` to `_stop_bottle_flow` on `x`, and `tmux_state.get('slug')` to `_render` on every tick. 479 unit tests pass (1 new for the tmux_state-preservation on non-owned stop). PRD 0021 implementation complete pending merge.	2026-05-26 14:29:59 -04:00
didericis	4991d5b3ee	feat(dashboard): new-agent flow spawns into right tmux pane PRD 0021 chunk 3. The `n` flow (PRD 0020 chunk 2) now routes the first claude session of a freshly-started bottle into the right tmux pane when `$TMUX` is set — same `_attach_in_tmux` state machine the Enter re-attach uses, just with `resume=False` so claude starts fresh. Outside tmux the existing foreground handoff is unchanged. The compose-up phase (`backend.launch.__enter__`) still drops curses for its stderr output; we restore curses BEFORE spawning into the right pane so the dashboard re-renders alongside the new claude session instead of waiting for attach to return.	2026-05-26 14:27:37 -04:00
didericis	9944878277	feat(dashboard): tmux split-pane helpers + Enter dispatch PRD 0021 chunk 2. New tmux integration: when `$TMUX` is set and the operator presses Enter on a focused agent row, the dashboard spawns / respawns the right pane with that bottle's claude session instead of taking over the terminal via curses.endwin. Mechanics: - `_in_tmux()` — true when `$TMUX` is set. - `_tmux_split_pane_create` — first attach: `tmux split-window -h -P -F '#{pane_id}'` opens a right pane and prints its id for tracking. - `_tmux_respawn_pane` — subsequent attaches: `tmux respawn-pane -k -t <id>` swaps the content without re-splitting. - `_tmux_pane_exists` — `tmux list-panes` check before respawn so a manually-closed pane gracefully falls back to a fresh split. - `_attach_in_tmux` — owns the create-or-respawn state machine, mutates `tmux_state` ({pane_id, slug}) so the main loop tracks the right-pane occupant. - `_attach_via_handoff` — the previous curses-endwin path, extracted as the fallback when tmux is missing or fails. - `_attach_to_bottle` dispatches: in tmux + state available → `_attach_in_tmux`; otherwise → handoff. Main loop gets `tmux_state: dict = {"pane_id": None, "slug": None}`. Chunks 3 + 4 wire it through the new-agent flow and the stop hook. `FileNotFoundError`-safe `subprocess.run` calls around every tmux invocation — a missing tmux binary cleanly falls back to the handoff for that keypress. 478 unit tests pass (10 new for the pure argv builders + `_claude_runtime_args`).	2026-05-26 14:26:40 -04:00
didericis	2303cbc0be	refactor(bottle): extract claude_docker_argv from exec_claude PRD 0021 chunk 1. The tmux split-pane helpers (chunk 2+) need the same docker-exec argv that `exec_claude` builds — including the `--append-system-prompt-file <path>` flag the bottle's provisioner copies into place. Extract the argv construction into a pure `claude_docker_argv(argv, *, tty)` method so both foreground (`subprocess.run`) and tmux paths (`tmux respawn-pane …`) build from the same source. `exec_claude` becomes a one-liner that runs subprocess.run on the argv. No behavior change; 472 unit tests pass (7 new for the pure builder).	2026-05-26 14:21:04 -04:00
didericis	e5316be454	docs(prd-0021): rewrite as standalone — no references to closed PR #48 PR #48 closed; treat the implementation as starting from main, where no tmux integration exists yet. The PRD now describes the full design (including the `_in_tmux` detection + helper scaffolding) as fresh work. Sized into 4 chunks: `claude_docker_argv` refactor → tmux helpers + pane state + `_attach_to_bottle` dispatch → new-agent flow → stop + indicator. Same design as before — opt-in by `$TMUX`, split-window-then-respawn, falls back to handoff on tmux failure or missing binary. No external references to PR #48.	2026-05-26 14:18:24 -04:00
didericis	8b8d668602	docs(prd-0021): dashboard as left tmux pane, selected agent as right pane Draft a PRD that tightens PR #48's tmux integration from "one new window per attach" to "one persistent right pane that the dashboard's selection drives." Inside tmux (`$TMUX` set): dashboard in the left pane; pressing Enter or `n` spawns claude in the right pane via `tmux split-window` on first attach, then `tmux respawn-pane` on subsequent attaches so the operator-focused agent is always the visible one. Outside tmux: falls back to today's handoff. Opt-in by environment; no flag. Sized into 4 chunks (pane state + create → respawn → stop integration → supersede PR #48's new-window). Seven open questions called out, the biggest being whether the dashboard should auto-exec into a fresh tmux session when launched outside one (v1 says no — operators start tmux themselves).	2026-05-26 14:14:02 -04:00
didericis	c8c72debff	Merge pull request 'feat(attach): --continue on re-attach + keep bottles on dashboard quit' (#47) from reattach-resume-flag into main	2026-05-26 14:04:32 -04:00
didericis	ae6d11f09d	fix(dashboard): use os._exit on quit so bottles survive the dashboard The `bottles` dict held `@contextmanager`-wrapped launch contexts. On normal Python interpreter shutdown those context managers' generators got GC'd, which raised GeneratorExit at the yield point and ran the `finally` block — invoking each bottle's teardown and tearing down the compose project. Net effect: `q` WAS implicitly stopping every dashboard-launched bottle even though the keypress handler just `return`'d. `os._exit(0)` skips all Python-level cleanup (GC, atexit, etc.), so the docker compose projects survive the dashboard exit untouched. Curses gets explicit `endwin()` first because the brutal exit skips curses.wrapper's normal terminal restoration. Matches PRD 0020's resolved-question answer (`q` does NOT tear down bottles; teardown is always explicit via `x` or `./cli.py cleanup`).	2026-05-26 13:48:16 -04:00
didericis	14d5c78370	fix(attach): use --continue (no picker) instead of --resume `--resume` alone surfaces claude's session picker even when only one session exists. `--continue` jumps to the most recent session non-interactively, which is the actual behavior the dashboard's Enter re-attach wants for typical bottle-with-one-session cases.	2026-05-26 13:48:16 -04:00
didericis	832e92c7a6	feat(attach): pass --resume on dashboard re-attach Re-entering a running bottle from the dashboard (Enter on the agents pane) now invokes claude with `--resume` so the session picks up the prior conversation history rather than starting a fresh transcript. The first-attach paths (`./cli.py start` and the dashboard's new-agent `n` flow) leave it off — the transcript doesn't exist yet there. `attach_claude` gains a `resume: bool = False` kwarg; `_attach_to_bottle` in the dashboard passes `True`.	2026-05-26 13:48:16 -04:00
didericis	3d179f18fc	Merge pull request 'feat(dashboard): `x` stops a dashboard-owned bottle' (#46) from chunk-4-explicit-stop into main	2026-05-26 13:48:03 -04:00
didericis	3ed3745982	feat(dashboard): `x` stops a dashboard-owned bottle (PRD 0020 chunk 4) Final PRD 0020 chunk. `x` on a focused agents-pane row tears down the selected bottle if the dashboard owns it (started via the chunk-2 `n` flow): pops `(cm, bottle, identity)` from the main loop's bottles map, snapshots the transcript best-effort, calls `cm.__exit__(None, None, None)` to drive the existing compose-down + network-remove sequence, then `settle_state` to honor any pre-existing preserve marker. On a non-owned slug (discovered via `list_active_slugs` but not in the dashboard's bottles dict — i.e., previous-dashboard or external `./cli.py start` bottle), `x` is a no-op with a status hint pointing at `./cli.py cleanup`. Matches the PRD's cross-dashboard re-attach model: the dashboard can re-attach either kind, but can only tear down its own. The PRD's chunk 5 ("quit-cleanup") is satisfied by the existing no-op behavior of `q` — per the user's resolved-question answer, quit leaves bottles running unchanged. No code change needed for that. Footer surfaces `[x] stop`. 465 unit tests pass (1 new for the non-owned no-op path; the owned path is integration territory because it drives a real compose-down).	2026-05-26 03:46:57 -04:00
didericis	fc8be2e418	Merge pull request 'feat(dashboard): Enter on agents pane re-attaches to bottle' (#45) from chunk-3-reattach into main	2026-05-26 03:40:42 -04:00
didericis	572306ddb6	feat(dashboard): Enter on agents pane re-attaches to bottle PRD 0020 chunk 3. Enter on a focused agents-pane row drops to a claude session inside the selected bottle. Works for both dashboard-owned bottles (looks up the stored Bottle handle in the main loop's `bottles` dict) and externally-discovered ones (synthesizes a DockerBottle from the slug → `claude-bottle-<slug>` container name). For the synthesized path, the `--append-system-prompt-file` target resolves via metadata.json + the manifest's agent prompt if both can be read; otherwise the re-attach runs without the flag (claude defaults to no system prompt, the bottle's other state is untouched). Shares the curses.endwin → attach → refresh handoff with the chunk-2 new-agent flow via a new `_attach_to_bottle` helper. Footer reshuffled to advertise `[Enter] view/attach`. 464 unit tests pass (3 new for `_bottle_for_slug`).	2026-05-26 03:39:58 -04:00
didericis	5f2b40e679	Merge pull request 'docs(prd-0020): start + attach to agents from the dashboard' (#44) from dashboard-start-attach-agents into main	2026-05-26 03:27:01 -04:00
didericis	309ffaa4ab	feat(dashboard): agent picker modal + new-agent (`n`) flow PRD 0020 chunk 2. Pressing `n` opens a modal that lists every agent from the manifest with `(N running)` suffixes for ones that already have bottles up. Type to filter (substring, case-insensitive); j/k or arrows to navigate; Enter to confirm; Esc clears the filter on first press, exits the picker on the second. On confirmation, the dashboard runs: - `prepare_with_preflight` from chunk 1 with curses-modal render + prompt callables (the preflight modal centers the plan summary + captures [y/N]). - `backend.launch(plan).__enter__()` — enters but doesn't bind the context to a `with`. The (cm, bottle, identity) tuple lands in the main loop's `bottles` dict keyed by slug. - `curses.endwin()` → `attach_claude(bottle)` → `stdscr.refresh()` handoff. The agent's claude session takes over the terminal; on exit the dashboard re-renders with the bottle now visible in the agents pane. Crucially the context manager is held alive in `bottles` — never `__exit__`'d at quit. Chunk 4 will wire `x` to that exit; for now bottles started from the dashboard stay running until explicit cleanup. Matches the PRD's "q does not tear down" decision. Footer surfaces `[n] new agent`. 461 unit tests pass (8 new for `_filter_agents` and `_running_counts`).	2026-05-26 03:22:44 -04:00
didericis	a56be6beb5	refactor(start): extract prepare_with_preflight + attach_claude PRD 0020 chunk 1. `cli/start.py`'s `_launch_bottle` did three things in one function: prepare + preflight, attach claude, and settle state on teardown. Split them so the dashboard (PRD 0020 chunk 2+) can reuse the prepare + attach pieces piecewise without going through the CLI's one-shot orchestrator: - `prepare_with_preflight(spec, *, stage_dir, render_preflight, prompt_yes, dry_run)` — injects render + prompt callables so the CLI binds them to stderr/stdin while the dashboard binds them to a curses modal. Returns `(plan, identity)`; identity is set after `backend.prepare` returns so callers can reap the prepare-time state dir on abort via `settle_state` in their finally — preserving today's preflight-N cleanup. - `attach_claude(bottle, *, remote_control)` — runs claude inside the bottle and returns its exit code. The dashboard calls this from inside a `curses.endwin` → … → `stdscr.refresh()` handoff. - `capture_session_state` / `settle_state` lose their leading underscore; the dashboard will call them on session-end + explicit-stop respectively. `_launch_bottle` becomes a thin orchestrator over those helpers. No behavior change; all 453 unit tests pass and `./cli.py start implementer --dry-run` produces identical preflight output.	2026-05-26 03:12:29 -04:00
didericis	26322bdfd5	docs(prd-0020): record answers to open questions, switch to no-teardown-on-quit	2026-05-26 03:10:26 -04:00
didericis	ec20293c0a	docs(prd-0020): start + attach to agents from the dashboard Draft a PRD that turns the dashboard into the operator's single surface — collapses today's two-terminal workflow (one for `./cli.py start`, one for `./cli.py dashboard`) into a single dashboard invocation that can spin up new agents, re-attach to ones it already spun up, and explicitly stop them. Picks the "handoff" mechanism from `docs/research/claude-code-pane-in-dashboard.md` (curses.endwin → docker exec -it claude → stdscr.refresh) and crucially decouples the bottle's lifetime from any single claude session: exit claude → back to dashboard with the bottle still running; quit dashboard → tear down every bottle the dashboard owns. Sized into 5 chunks (refactor → picker + new-agent → re-attach → explicit stop → quit-cleanup). Seven open questions called out, the biggest being modal-vs-drop-and-resume for the preflight Y/N inside curses.	2026-05-26 02:59:42 -04:00
didericis	8cd867f3d2	docs(research): claude-code pane in the dashboard Survey the three realistic ways to surface a claude-code session inside the dashboard TUI: 1. Handoff — drop curses, foreground claude, restore on exit (the existing `e`/`p` pattern, extended). Minimal code, side-by-time rather than side-by-side. 2. Embedded emulator — own a PTY, parse claude-code's ANSI stream via `pyte`, paint it into a curses pane. Real "pane in the dashboard" but a six-week build with one new dep and several integration trap-doors (alt-screen, resize, input routing, multi-PTY state). 3. External multiplexer — delegate pane creation to tmux / iTerm / wezterm when detected. Tiny code, but splits the operator's mental model and gives up layout control. Recommendation: ship Option 1 first; defer Option 2 to "only if Option 1 is observably insufficient"; treat Option 3 as a niche augmentation for power users. Calls out four followups worth verifying before committing (PTY behavior at small sizes, attach-to-existing-exec, SIGWINCH handling, `-it` vs `-i` for the embedded path).	2026-05-26 02:51:08 -04:00
didericis	942d3a387a	Merge pull request 'refactor(egress): write routes.yaml as actual YAML, not JSON-in-yml' (#42) from egress-routes-yaml into main	2026-05-26 02:38:44 -04:00
didericis	3c2585cb98	fix(apply): write routes/pipelock yaml in place, not via rename PRD 0018 chunk 3's atomicity fix used write-temp-then-rename to update bind-mounted config files. POSIX rename atomically swaps the inode at the host path — but Docker single-file bind mounts on Linux pin the source inode at mount time, so post-rename the container's mount points at the now-orphaned old inode and never sees the new content. The egress sidecar's SIGHUP-driven reload re-reads the same stale file → "egress route updates aren't updatable via the supervisor anymore". Switch egress_apply + pipelock_apply to write in place (same inode, truncated + rewritten). Lose file-level POSIX atomicity, but: - egress: SIGHUP fires only AFTER the write returns; the addon's `load_routes` raises `ValueError` on a partial read and keeps the previous in-memory routes, so the in-process race window (already narrow) is non-disruptive. - pipelock: applies via `docker restart` rather than SIGHUP; restart serializes after the host write completes, so the container reads the fully-written file on next boot. macOS Docker Desktop's file-sharing layer (virtiofs / osxfs) silently re-resolves the path on rename, which is why this bug didn't surface in dev tests on macOS. Linux native Docker is the strict reading; the fix works on both.	2026-05-26 02:31:46 -04:00
didericis	c9825cf701	refactor(egress): write routes.yaml as actual YAML, not JSON-in-yml `egress_render_routes` now emits hand-rolled YAML in the same style as `pipelock_render_yaml`. The egress addon parses it via `yaml_subset.parse_yaml_subset` — the same parser the manifest loader + pipelock_apply use. Why bother: routes.yaml is bind-mounted into the egress sidecar AND surfaced to operators through `routes edit` (PRD 0019). JSON-in-yml renders ugly in $EDITOR and signals "this is data" rather than "this is config you can read at a glance". Real YAML reads cleanly. Mechanics: - `yaml_subset.py` drops its `claude_bottle.log` dependency. Errors now raise `YamlSubsetError` (a `ValueError`); the manifest loader + pipelock_apply catch it at the boundary and forward to `die` / `PipelockApplyError` so callers see the same behavior they did before. - `Dockerfile.egress` adds one COPY line for `yaml_subset.py` so it sits flat in `/app/` next to the addon. The addon uses an absolute-import-with-fallback shim so the same file works inside the container AND from the host's unit tests. - `egress_apply._merge_single_route` round-trips current routes.yaml through `parse_yaml_subset` + a new `_render_routes_payload` helper instead of `json.loads` + `json.dumps`. End-to-end: rebuilt the egress image, ran `./cli.py start` to a full bring-up, confirmed the addon's boot log shows `egress: loaded 9 route(s)` — i.e., the YAML parses inside the container. 453 unit + 3 integration tests pass.	2026-05-26 02:17:42 -04:00
didericis	11d5bf1489	Merge pull request 'feat(dashboard): agent-scoped e/p, drop discover-and-prompt path' (#41) from chunk-4-agent-scoped-edits into main	2026-05-26 01:52:40 -04:00
didericis	7b29c81f27	feat(dashboard): agent-scoped e/p, drop discover-and-prompt path PRD 0019 chunk 4 (final). The `e` (routes edit) and `p` (pipelock edit) keys now require an agent selection in the agents pane. Pressing them with the proposals pane focused, with no active agents, or with an out-of-range selection is a no-op with a status hint ("no agent selected; Tab into the agents pane first"). The discover-and-prompt scaffolding inside `_operator_edit_routes_flow` / `_operator_edit_allowlist_flow` / `_operator_edit_flow` is gone. The flows now take an `ActiveAgent` + required-service name; they refuse with a clear message when the bottle lacks the requested sidecar (e.g., `routes edit` against a bottle with no `bottle.egress.routes` declared). The `discover_egress_slugs` + `discover_pipelock_slugs` + `_discover_active_with_service` helpers come out — they had no remaining callers. Footer now reads `[e/p] edit selected agent`.	2026-05-26 01:50:28 -04:00
didericis	39e69f0bda	Merge pull request 'feat(dashboard): Tab toggle + per-pane selection state' (#40) from chunk-3-pane-selection into main	2026-05-26 01:44:24 -04:00
didericis	0abffc4d90	feat(dashboard): Tab toggle + per-pane selection state PRD 0019 chunk 3. The TUI now has two focusable panes — proposals and agents — and `Tab` toggles which one the `j/k`/arrow keys move through. Each pane keeps its own selection index. Switching panes doesn't lose the position in the other; the cursor (`>` + reverse-video row) appears only in the focused pane. The label line on each pane shows "(focused)" when active. Footer reshuffled: `[Tab] switch pane [j/k] move [Enter] view [a/m/r] proposal [e/p] edit [q] quit`. When the agents pane is focused and there's no status message to display, the idle status line surfaces the currently-selected agent (or "[no active agents]" / "[no agent selected]" fallbacks) so the operator knows what an agent-scoped edit verb will target after chunk 4 wires them up. Proposal action keys (a/m/r/Enter) are gated on the proposals pane being focused — pressing them with the agents pane focused is a no-op. e/p still use the global discover-and-prompt flow for one more chunk; chunk 4 swaps them to read the agents-pane selection.	2026-05-26 01:37:23 -04:00
didericis	897172fcc2	Merge pull request 'feat(dashboard): render active agents pane below proposals' (#39) from chunk-2-render-agents-pane into main	2026-05-26 01:34:28 -04:00
didericis	cfd8f269ba	feat(dashboard): render active agents pane below proposals PRD 0019 chunk 2. The TUI's main render now draws two panes: proposals on top (existing), active agents on the bottom (new). Header counts both totals. The agents pane refreshes on the same 1s tick — agents starting/stopping reflect without operator action. Each agent row shows slug, agent name, started-time (HH:MM:SS of the metadata.json timestamp), and the bracketed list of sidecars currently up. The `agent` service is filtered out of the displayed list — it's always present so it'd be noise; the sidecars are the differentiator. A bottle whose only running service is `agent` (sidecars still warming up) renders as `(starting)`. No selection model yet — that's chunk 3. The cursor stays in the proposals pane; `j/k`/arrow nav and the proposal action keys are unchanged.	2026-05-26 01:23:59 -04:00
didericis	8636982e80	Merge pull request 'docs(prd-0019): active agents in dashboard + agent-scoped edit verbs' (#38) from dashboard-active-agents into main	2026-05-26 01:14:15 -04:00
didericis	6e4a9f606f	feat(dashboard): discover_active_agents helper + ActiveAgent dataclass PRD 0019 chunk 1. New `discover_active_agents()` in dashboard.py returns one `ActiveAgent(slug, agent_name, started_at, services)` per currently-running compose project: - Slugs come from `list_active_slugs()` (chunk-5 shared helper). - The service set per project comes from ONE label-filtered `docker ps` call (PRD open question #1: avoids N per-bottle `compose ps` invocations on each 1s refresh tick). - agent_name + started_at come from each bottle's metadata.json; "?" / "" fallbacks when the file is missing so the row renders rather than vanishes. Not wired into the TUI yet — chunk 2 renders the agents pane. The parser (`_parse_services_by_project`) is split out as a pure function so the conditional-input shape can be unit-tested without docker.	2026-05-26 01:11:54 -04:00
didericis	9c9c32a941	docs(prd-0019): drop e/p fallback — selection-only, no-op otherwise When no agent is selected, `e` / `p` do nothing (status line shows "no agent selected") rather than falling back to today's global discover-and-prompt. The discover-and-prompt scaffolding in `_operator_edit_routes_flow` / `_operator_edit_allowlist_flow` comes out entirely — selection in the agents pane is now the only way to scope an edit. Old open-question #4 (single-bottle shortcut behavior in proposals-pane mode) is moot and removed.	2026-05-26 01:03:23 -04:00
didericis	9539982d3f	docs(prd-0019): active agents in dashboard + agent-scoped edit verbs Draft a PRD that adds an "active agents" pane to the dashboard TUI (below the existing proposals pane) and reshapes the operator `routes edit` (e) / `pipelock edit` (p) verbs to be agent-scoped when the cursor is in the agents pane — no more global discover + disambiguation prompt on every press. Tab toggles which pane nav keys move through. Sized into 4 chunks (discovery helper → render pane → selection state → agent-scoped verbs). Six open questions called out, the biggest being whether per-bottle `compose ps` on every 1s tick scales for hosts with many bottles (answer leans toward one label-filtered `docker ps`).	2026-05-26 00:58:34 -04:00
didericis	6babfcc656	Merge pull request 'refactor(dashboard): discover via docker compose ls' (#37) from chunk-5-dashboard into main	2026-05-26 00:24:43 -04:00
didericis	1fa3745832	refactor(dashboard): discover via docker compose ls PRD 0018 chunk 5. The dashboard's operator-edit verbs (`routes edit`, `pipelock edit`) enumerated running sidecars via `docker ps --filter name=...` prefix scans. Switch to `docker compose ls`-based discovery so the dashboard, cleanup CLI, and launch step all agree on what's running. Mechanics: - `claude_bottle/backend/docker/compose.py` grows three shared helpers: `list_compose_projects` (the JSON parse moved out of cleanup), `slug_from_compose_project` (inverse of `compose_project_name`), and `list_active_slugs` (sugar over the first two for the common "what's running?" question). - cleanup.py drops its private `_list_compose_projects` + `_PROJECT_PREFIX` in favor of the shared ones; `list_active` simplifies (one compose-ls call, not two). - dashboard.py's `_discover_sidecar_slugs` becomes `_discover_active_with_service`: cross-references the active slug list with a label-filtered `docker ps` so only bottles whose given service container is actually up surface in the edit menu. Bottles without an egress sidecar (no bottle.egress.routes) no longer appear for `routes edit`. 3 new unit tests cover the slug ↔ compose-project naming contract; manual probe with a fake compose project confirms both `discover_egress_slugs` and `discover_pipelock_slugs` return the expected slug.	2026-05-26 00:14:16 -04:00
didericis	0ae544d2a6	Merge pull request 'refactor(cleanup): compose-ls driven + drop pipelock CIDR allowlist' (#36) from chunk-4-cleanup-cli into main test / unit (push) Successful in 19s Details test / integration (push) Successful in 1m3s Details	2026-05-26 00:04:29 -04:00
didericis	aee249f119	refactor(cleanup): compose-ls driven, plus orphan state-dir reaping test / unit (pull_request) Successful in 17s Details test / integration (pull_request) Successful in 1m9s Details PRD 0018 chunk 4. `claude-bottle cleanup` now derives its work from `docker compose ls --all --format json`, filtered to projects whose name starts with `claude-bottle-`. Per project: one `compose down --volumes` removes the containers + the compose-managed networks atomically. The plan also enumerates three fallback buckets: - Stray containers — `claude-bottle-` containers with no `com.docker.compose.project` label (left over from pre-compose code paths). Cleared via `docker rm -f`. - Stray networks — `claude-bottle-` networks with no compose project label. Cleared via `docker network rm`. - Orphan state dirs — per-bottle `~/.claude-bottle/state/<id>/` dirs with no live project AND no `.preserve` marker. The `.preserve` marker (capability-block or auto-preserve-on-crash) explicitly opts-out of reaping; manual `rm -rf` is the only path for preserved state. cli/cleanup.py collapses to a single y/N prompt — backend.prepare_cleanup returns everything in one plan, backend.cleanup processes everything, no more double-prompt for state. The CLI-side state-dir enumeration + `_state_summary` flags from PR #25 are gone; the backend's orphan-detection rules subsume them.	2026-05-25 23:48:02 -04:00
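The orphan state-dir rule from the commit above (no live project AND no `.preserve` marker) might look like the following minimal sketch. The function name `find_orphan_state_dirs` and its parameters are hypothetical, and it assumes each state directory is named after the slug of its compose project.

```python
from pathlib import Path
from typing import List, Set


def find_orphan_state_dirs(state_root: Path, live_slugs: Set[str]) -> List[Path]:
    """Return state dirs that have no live project and no .preserve marker.

    Hypothetical helper: assumes each child dir of state_root is named
    after the slug of the compose project it belongs to.
    """
    orphans: List[Path] = []
    if not state_root.is_dir():
        return orphans
    for child in sorted(state_root.iterdir()):
        if not child.is_dir():
            continue  # ignore stray files at the top level
        if child.name in live_slugs:
            continue  # a live project still owns this dir
        if (child / ".preserve").exists():
            continue  # explicitly opted out of reaping
        orphans.append(child)
    return orphans
```

Returning the list rather than deleting in place matches the plan-then-confirm shape the commit describes, where one y/N prompt covers the whole plan.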
didericis	f1c5816d1f	refactor(compose): drop pre-create networks + pipelock CIDR allowlist PRD 0018 chunk 4 spike: empirically verified that pipelock's SSRF guard checks proxied-request destinations (e.g. api.anthropic.com → public IP) and not source IPs of incoming connections. The bottle's own internal CIDR was being added to ssrf.ip_allowlist defensively, but that defense isn't load-bearing — direct pipelock probe (`curl --proxy http://pipelock https://api.anthropic.com/`) returns 404 from upstream rather than blocking on SSRF. So: - Networks become compose-managed (`internal: true` on the internal network; the egress one is a normal user-defined bridge). Compose creates + removes them via up/down. - launch.py drops the `docker network create` + `network_inspect_cidr` + pipelock yaml re-render dance. - The pre-create/external scaffolding from chunk 3 goes with it. End-to-end `./cli.py start` still works; cleanup leaves no orphans. If real-world use surfaces an SSRF block we hadn't predicted, the allowlist can come back via subnet-pinning rather than pre-create.	2026-05-25 23:48:02 -04:00
didericis	6927a7ba4b	Merge pull request 'feat(launch): switch start to docker compose project per bottle' (#35) from chunk-3-compose-lifecycle into main test / unit (push) Successful in 18s Details test / integration (push) Successful in 1m10s Details	2026-05-25 23:47:47 -04:00
didericis	cefdc8c6e9	feat(launch): switch start to docker compose project per bottle test / unit (pull_request) Successful in 18s Details test / integration (pull_request) Successful in 1m5s Details PRD 0018 chunk 3. Each instance is now one `docker compose` project: - launch.py renders the compose spec via chunk-1's bottle_plan_to_compose, writes it to state/<slug>/docker-compose.yml, `docker compose up -d`s, and (on teardown) dumps `docker compose logs --no-color --timestamps` to state/<slug>/compose.log before `docker compose down`. - Networks are pre-created (`docker network create --internal` + user-defined bridge) so pipelock yaml can know the internal CIDR before compose-up. Compose references them with `external: true`; the launch step's ExitStack still owns network removal. - Agent still runs `sleep infinity`; claude reaches it via `docker exec -it` exactly like before (per the PRD's resolved TTY question). - metadata.json grows a `compose_project` field so dashboard / cleanup tooling can derive compose invocations without re-deriving the slug. Security follow-ups from chunk-2 review: (b) CA private keys: pipelock + egress ca-key.pem land at 0o600 explicitly. The mitmproxy cert+key concat stays 0o644 because the egress container's uid-1000 user reads it through the bind mount; parent dir at 0o700 still restricts host-side reach. (c) Apply atomicity: egress_apply + pipelock_apply switch from `docker cp` to host-side write-temp-then-rename on the bind-mount source. POSIX rename is atomic on the same filesystem, so a sidecar SIGHUP racing the apply can't see a half-written routes.yaml / pipelock.yaml. Per-sidecar Docker{Sidecar}.start/stop methods stay in place — the integration test suite drives them directly to validate each image in isolation, which is still useful. launch.py no longer calls them; a follow-up chunk can prune if the integration tests move to the compose lifecycle. 
git-gate entrypoint's chmod 600 on the keyfile + known_hosts now tolerates EROFS (`|| true`) — the host SSH key is already 0600 (SSH refuses to load otherwise), so the inside-container chmod was already a no-op in the docker-cp path and now just needs to not error on the read-only bind mount. 422 unit tests pass; supervise integration test passes; end-to-end `./cli.py start implementer` brings up the project, attaches, captures full merged logs on teardown, and reaps all containers + networks.	2026-05-25 23:16:40 -04:00
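The write-temp-then-rename approach in item (c) of the commit above can be illustrated with a short sketch. The helper name `atomic_write_text` and its signature are hypothetical; the repository's `egress_apply` and `pipelock_apply` may be structured differently.

```python
import os
import tempfile
from pathlib import Path


def atomic_write_text(target: Path, content: str, mode: int = 0o644) -> None:
    """Replace target's contents so readers see the old or new file, never a partial one.

    Hypothetical helper. The temp file is created in the target's own
    directory so the final rename stays on one filesystem.
    """
    fd, tmp_name = tempfile.mkstemp(
        dir=target.parent, prefix=f".{target.name}.", suffix=".tmp"
    )
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as fh:
            fh.write(content)
            fh.flush()
            os.fsync(fh.fileno())  # push the data to disk before the rename
        os.chmod(tmp_name, mode)
        os.replace(tmp_name, target)  # atomic rename on POSIX
    except BaseException:
        # Clean up the temp file if anything failed before the rename.
        try:
            os.unlink(tmp_name)
        except FileNotFoundError:
            pass
        raise
```

One caveat worth hedging: a rename gives the file a new inode, so this pattern is only visible inside the container when the parent directory is bind-mounted, not when the single file is.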
didericis	b9f6889d09	Merge pull request 'refactor(state): write prepare-time scratch files under state/<slug>/' (#34) from chunk-2-state-bind-mount into main test / unit (push) Successful in 17s Details test / integration (push) Successful in 1m5s Details	2026-05-25 23:01:19 -04:00
didericis	cd82a48399	refactor(state): write prepare-time scratch files under state/<slug>/ test / unit (pull_request) Successful in 17s Details test / integration (pull_request) Successful in 1m5s Details PRD 0018 chunk 2. Each sidecar's prepare-time output (pipelock yaml + CAs, egress routes.yaml + CAs, git-gate entrypoint + hooks, supervise current-config, agent env + prompt) now lands in ~/.claude-bottle/state/<slug>/<service>/ instead of an ephemeral mktemp dir. The state subdirs become the stable bind-mount sources that chunk 3's docker compose project will reference. The SDK launch path is unchanged — `docker cp` still copies from the plan-held paths into containers, just from new locations. start.py's session-end cleanup is now in `finally`, which also reaps state dirs left behind by dry-run / preflight-N / prepare-exception paths (previously only the post-launch path settled state).	2026-05-25 22:53:47 -04:00
didericis	c8c302e50e	Merge pull request 'docs(prd-0018): one compose project per bottle instance' (#33) from compose-per-instance into main test / unit (push) Successful in 18s Details test / integration (push) Successful in 1m3s Details	2026-05-25 22:42:35 -04:00
didericis	3386cabe62	docs(prd-0018): resolve TTY open question — keep exec -it test / unit (pull_request) Successful in 18s Details test / integration (pull_request) Successful in 1m3s Details	2026-05-25 22:34:26 -04:00
didericis	4760a09263	feat(compose): pure renderer for bottle plan -> compose dict test / unit (pull_request) Successful in 17s Details test / integration (pull_request) Successful in 1m5s Details PRD 0018 chunk 1. New module `claude_bottle/backend/docker/compose.py` exposing `bottle_plan_to_compose(plan) -> dict` — a pure function that translates a fully-resolved DockerBottlePlan into a Compose v2 spec. Not wired in yet. Tests cover the conditional-service matrix (git on/off × egress on/off × supervise on/off) plus per-service shape (images vs builds, network aliases, bind mounts, env vars, depends_on).	2026-05-25 22:28:50 -04:00
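To make the "pure renderer" idea in the commit above concrete, here is a heavily simplified sketch of the conditional-service matrix. Only the function name `bottle_plan_to_compose` comes from the commit; the `BottlePlan` dataclass, its fields, the image names, and the service layout are all assumptions for illustration. The real `DockerBottlePlan` is certainly richer.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass(frozen=True)
class BottlePlan:
    """Hypothetical stand-in for DockerBottlePlan; every field is an assumption."""
    slug: str
    git: bool = False
    egress: bool = False
    supervise: bool = False


def bottle_plan_to_compose(plan: BottlePlan) -> Dict[str, Any]:
    """Translate a plan into a compose-style dict with no I/O or side effects."""
    services: Dict[str, Any] = {}
    # Image names below are placeholders, not the project's real images.
    if plan.git:
        services["git-gate"] = {"image": "example/git-gate"}
    if plan.egress:
        services["egress"] = {"image": "example/egress"}
    if plan.supervise:
        services["supervise"] = {"image": "example/supervise"}

    agent: Dict[str, Any] = {
        "image": "example/agent",
        "command": ["sleep", "infinity"],
    }
    if services:
        # The agent waits for whichever optional sidecars are enabled.
        agent["depends_on"] = sorted(services)
    services["agent"] = agent
    return {"name": f"claude-bottle-{plan.slug}", "services": services}
```

Because the function only maps one value to another, each cell of the git × egress × supervise matrix can be checked with a plain dict comparison, without Docker.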
didericis	3251ee1394	docs(prd-0018): one compose project per bottle instance test / unit (pull_request) Successful in 16s Details test / integration (pull_request) Successful in 1m3s Details Draft a PRD that replaces the chain of per-sidecar docker SDK calls in `claude-bottle start` with a single `docker compose` project per instance. Each `state/<slug>/` dir gets a self-describing set of artifacts: metadata.json, docker-compose.yml, compose.log, and the existing transcript/ + live-config/.	2026-05-25 22:15:32 -04:00

352 Commits