bot-bottle

Author	SHA1	Message	Date
didericis-codex	a5d83bdcdc	fix(codex): trust launch home directory	2026-06-01 22:24:17 -04:00
didericis-codex	8e6583fcb7	fix(codex): trust bottle workspace on launch	2026-06-01 22:24:17 -04:00
didericis-codex	ac1aa197d4	fix(smolmachines): reset codex runtime db before auth check	2026-06-01 22:24:17 -04:00
didericis-codex	68e5097534	fix(codex): make host-credential bottles actually authenticate Debugging a live codex smolmachines bottle surfaced three independent failures past the sign-in screen; fix each so forward_host_credentials works end to end: - codex_auth: dummy access/id tokens now inherit the real host token's exp instead of now+1h. Codex (0.135) refreshes when its local token's JWT exp lapses; with a placeholder refresh_token that refresh fails and drops to the sign-in screen. Aligning exp tracks the real token's life. - prepare: set CODEX_CA_CERTIFICATE to the agent CA bundle for codex bottles. Codex is rustls and ignores the system store / NODE_EXTRA_CA_ CERTS; it reads CODEX_CA_CERTIFICATE (fallback SSL_CERT_FILE) for custom roots across HTTPS + wss, so it must be pointed at the egress MITM CA or injection can't work without tls_passthrough. - pipelock: auto tls_passthrough the Codex API hosts when forward_host_credentials is on. Egress injects the bearer before pipelock, whose header DLP then flags the JWT ("request header contains secret") and the retry storm trips its 429. passthrough host-gates the CONNECT but skips decrypt+rescan of egress-owned auth. The auto-added routes aren't in bottle.egress.routes, so the hosts are added explicitly. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 22:24:17 -04:00
didericis-codex	f8a4e6f40b	fix(codex): include account claims in dummy auth	2026-06-01 22:24:17 -04:00
didericis-codex	a6332b9535	fix(codex): provision dummy user auth state	2026-06-01 22:24:17 -04:00
didericis-codex	62dd7b2aa5	fix(codex): forward host credentials to api route	2026-06-01 22:24:17 -04:00
didericis-codex	711cb9c194	feat(codex): inject host credentials via egress	2026-06-01 22:24:17 -04:00
didericis-codex	6ea19a8d53	fix(git-gate): use smart http for smolmachines pushes test / unit (pull_request) Successful in 40s Details test / integration (pull_request) Successful in 54s Details test / unit (push) Successful in 37s Details test / integration (push) Successful in 44s Details	2026-05-29 23:21:50 -04:00
didericis-codex	630e65e9a4	test(egress): cover blocked git push fail-fast test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 42s Details	2026-05-29 22:13:17 -04:00
didericis-codex	7bffaa791c	fix(git-gate): shorten daemon client timeout test / unit (pull_request) Successful in 30s Details test / integration (pull_request) Successful in 42s Details	2026-05-29 22:02:17 -04:00
didericis-codex	de2267d1b4	fix(git-gate): bound daemon client sessions test / unit (pull_request) Successful in 34s Details test / integration (pull_request) Successful in 44s Details	2026-05-29 21:57:31 -04:00
didericis-codex	cea832b21d	fix(codex): stop injecting api key placeholder test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 41s Details	2026-05-29 02:39:37 -04:00
didericis-claude	6c673bece6	fix(git-gate): scope new-branch scan to incoming commits test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 40s Details A new ref made the pre-receive hook scan the full ancestry (`log_opts="$new"`), so historical test-fixture findings rejected every new-branch push (#106). Scope it to `$new --not --all` — only commits new to the gate, which (since the bare repo is populated solely by upstream mirror-fetch and gitleaks-gated pushes) loses no coverage on what a push actually brings to the upstream. Also add BatchMode=yes + ConnectTimeout=10 to both the forward and access-hook ssh so an unreachable upstream fails fast instead of hanging. Refs #106 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 01:59:20 -04:00
didericis-claude	847baa84be	refactor(manifest): raise ManifestError instead of die() test / unit (pull_request) Successful in 36s Details test / integration (pull_request) Successful in 59s Details Review feedback on #102: a manifest that can't be read should raise an exception, not call die() (a SystemExit). That SystemExit was the whole reason the dashboard had to special-case Die. manifest.py now raises ManifestError (a plain Exception) for every validation failure. The CLI dispatcher catches it and prints+exits 1 (same UX as before); the dashboard catches it with a normal `except ManifestError` and degrades to a status-line warning. Manifest tests assert on ManifestError + its message. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 00:15:15 -04:00
didericis-claude	99ec267c74	fix(dashboard): surface launch/crash failures (#100 ) test / unit (pull_request) Successful in 29s Details test / integration (pull_request) Successful in 43s Details The dashboard runs under curses.wrapper and cmd_dashboard only caught KeyboardInterrupt, so failures vanished: - die() prints to stderr, but under curses that lands on the alternate screen and is wiped on exit, so config errors gave no reason. - Die is a SystemExit, so the new-agent flow's `except Exception` never caught config errors; they crashed the TUI. - the startup manifest probe was unguarded. Now: Die carries its message (+ log.error()); cmd_dashboard re-surfaces a Die's reason once the terminal is restored and writes any other crash's traceback to ~/.bot-bottle/logs/dashboard-crash.log; the startup probe and the new-agent flow degrade a bad config to a status-line warning instead of crashing. Closes #100 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-28 23:49:21 -04:00
didericis	0708e99e4e	feat(manifest): lift git.user to the agent layer Agents may declare git.user (name/email); it overlays the referenced bottle's git.user per-field at Manifest.bottle_for (agent wins on non-empty), mirroring the extends: merge. git.remotes is rejected on agents — it carries credentials and host trust and stays bottle-only. The overlay lives at bottle_for, the single chokepoint both backends use, so the docker/smolmachines git provisioners are unchanged. Adds Manifest.git_identity_summary with per-field (agent)/(bottle) provenance, printed in both preflights and `info`. Refs #94 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-28 21:10:47 -04:00
didericis-codex	c854db87c6	fix(git): mount git-gate known hosts test / unit (push) Successful in 36s Details test / integration (push) Successful in 57s Details test / unit (pull_request) Successful in 32s Details test / integration (pull_request) Successful in 59s Details	2026-05-28 19:59:37 -04:00
didericis-codex	f86349ca92	fix(git): rewrite logical ip upstream aliases test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 41s Details	2026-05-28 19:50:50 -04:00
didericis-codex	1f0434bffc	fix(manifest): allow ip git upstreams test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 42s Details	2026-05-28 19:44:51 -04:00
didericis-codex	fed006441d	fix(pipelock): allow route ssrf ip policy test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 44s Details	2026-05-28 19:32:31 -04:00
didericis-codex	bcadc07d09	feat(pipelock): allow route tls passthrough policy test / unit (pull_request) Successful in 37s Details test / integration (pull_request) Successful in 58s Details	2026-05-28 19:19:40 -04:00
didericis-codex	3299674c30	fix(pipelock): disable bip39 detector by default test / unit (pull_request) Successful in 35s Details test / integration (pull_request) Successful in 57s Details	2026-05-28 19:08:34 -04:00
didericis-codex	c31845a5b8	fix(egress): remove implicit provider routes test / unit (pull_request) Successful in 33s Details test / integration (pull_request) Successful in 58s Details	2026-05-28 19:04:49 -04:00
didericis-codex	9399626ba6	fix(agent): hide auth placeholder env in preflight test / unit (pull_request) Successful in 31s Details test / integration (pull_request) Successful in 55s Details	2026-05-28 19:00:39 -04:00
didericis-codex	43cd83d77b	fix(smolmachines): build sidecar image before launch test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 39s Details	2026-05-28 18:49:28 -04:00
didericis-codex	c4449001d1	fix(dashboard): tolerate missing manifest test / unit (pull_request) Successful in 25s Details test / integration (pull_request) Successful in 44s Details	2026-05-28 18:44:42 -04:00
didericis-codex	7f3998e79e	fix(dashboard): quiet docker polling errors test / unit (pull_request) Successful in 29s Details test / integration (pull_request) Successful in 41s Details	2026-05-28 18:33:13 -04:00
didericis-codex	1cbedc91c0	refactor(agent): use agent-neutral runtime names Assisted-by: Codex	2026-05-28 17:59:24 -04:00
didericis-codex	c08b09dc9f	refactor!: rename project to bot-bottle Assisted-by: Codex	2026-05-28 17:56:14 -04:00
didericis-codex	500fd910c4	feat(agent): add provider templates test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 40s Details Assisted-by: Codex	2026-05-28 02:18:53 -04:00
didericis-codex	59ee32cc8d	refactor(manifest): key git config by host test / unit (pull_request) Successful in 33s Details test / integration (pull_request) Successful in 42s Details	2026-05-28 00:49:34 -04:00
didericis-claude	a5c8b4e7b2	feat(manifest): bottle composition via \`extends:\` resolver (PRD 0025, #88 ) test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 39s Details Add an optional `extends: <bottle-name>` field to bottle frontmatter. Two-pass load: 1. Collect raw frontmatter for every bottle file. 2. Recursively resolve each name into a merged Bottle via `_resolve_one_bottle` + `_merge_bottles`. Merge rules (per PRD 0025): - env: dict merge, child wins on key collision - git: full replace if child declares `git:` - git_user: per-field overlay (child's non-empty fields win) - egress: full replace if child declares `egress:` - supervise: full replace if child declares `supervise:` List-valued fields full-replace because partial merge is ambiguous (ordering matters, name collisions ambiguous); env is dict-merge because dict-keyed override is the natural shape. git_user overlays per-field so a parent can declare just the name and a child can add just the email. Cycles / self-extends / missing-parent / non-string `extends:` all die at parse with a pointer that includes the chain (cycles) or the available names (missing parent). Resolution is cached per-name so a diamond reference graph doesn't reparse the same parent N times. Both load paths threaded: - `_load_bottles_from_dir` (md files) — collect raws, then resolve. - `Manifest.from_json_obj` (JSON / test fixtures) — same. Tests (24, in `test_manifest_extends.py`): - Leaf without extends parses unchanged - Child inherits parent unchanged when child only declares `extends:` - env: disjoint union, collision (child wins), child-omits - git: replace, omit, explicit-empty-clears-parent - egress: same shape (replace, inherit) - git_user: parent-only, child-overrides-both, partial fields - 3-step chain (grandparent → parent → child) - Errors: missing parent, self-extends, 2-node cycle, 3-node cycle, non-string extends 685 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 23:30:40 -04:00
didericis-claude	c9cdd41110	feat(smolmachines): apply git_user via git config --global on provision (issue #86 ) Mirror the docker backend's third provisioning subcase in `backend/smolmachines/provision/git.py`: _provision_git_user(plan, target) Runs `smolvm machine exec --name <M> -e HOME=/home/node -e USER=node -- runuser -u node -- git config --global user.<X> <value>` for each git_user field. No-op when `git_user.is_empty()`. `runuser -u node --` switches the UID without invoking a login shell (matching the existing `Bottle.exec_claude` pattern). HOME / USER are forced via `smolvm -e` because bare runuser inherits root's HOME=/root, which would put --global in /root/.gitconfig instead of /home/node/.gitconfig (where the existing `_provision_git_gate_config` writes). 4 unit tests in test_smolmachines_provision.TestProvisionGitUser: no-op, both-set (asserts runuser prefix + HOME/USER env), name-only, email-only. 661 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 23:00:21 -04:00
didericis-claude	9e69aaa99a	feat(docker): apply git_user via git config --global on provision (issue #86 ) Add a third provisioning subcase to `backend/docker/provision/git.py`: _provision_git_user(plan, target) Runs `docker exec -u node <container> git config --global user.{name,email} <value>` for each field the bottle's `git_user` declares. No-op when `git_user.is_empty()`. `-u node` so `--global` lands in /home/node/.gitconfig (matching the existing `_provision_git_gate_config` write location, so agent-side `git` reads both configs from the same dotfile). Name and email apply independently — a bottle declaring only name runs just the user.name line, etc. 4 unit tests in `test_docker_provision_git_user.py`: no-op, both-set, name-only, email-only. 657 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 22:58:37 -04:00
didericis-claude	689675160a	feat(manifest): add git_user bottle field (issue #86 ) Per-bottle `git config --global user.name` / `user.email` pair so the agent's commits inside the bottle land with a known identity rather than the agent image's default (no user, or whatever the image dropped in). Schema: git_user: name: "Eric Bauerfeld" email: "eric+claude@dideric.is" Either field can be set independently — name-only / email-only configs are valid and apply just the field that's set. An explicit `git_user:` block with both fields empty dies at parse time rather than silently no-op'ing; an omitted block is the no-op path (default GitUser is empty, provisioner skips). Parse-time validation: - Unknown sub-keys die (e.g., typo of `username`). - Non-string name/email dies. - Both-empty dies (half-finished edit hint). 11 unit tests in `test_manifest_git_user.py`; 653 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 22:56:37 -04:00
didericis-claude	574551e2eb	fix(sidecar_init): strip EGRESS_TOKEN_* from non-egress daemons' env (issue #84 ) test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 41s Details test / unit (push) Successful in 26s Details test / integration (push) Successful in 41s Details Pipelock was 403-blocking legitimate egress cred-injected traffic with 'blocked: request header contains secret'. The chain is `agent → egress → pipelock → internet`: egress injects `Authorization: Bearer <token>` for routes with an `auth_scheme`, then forwards upstream to pipelock. Pipelock has `scan_env: true` + `scan_headers: true` + `header_mode: all`, and the bundle supervisor spawned every daemon (egress, pipelock, git-gate, supervise) inheriting the bundle container's full env — including the `EGRESS_TOKEN_<n>` slots set via `docker run -e`. So pipelock had the token value egress injected sitting in its own env, matched it in the request headers, and blocked. The agent itself runs in a different machine and never sees `EGRESS_TOKEN_`, so stripping these from non-egress daemons' env loses no DLP coverage — pipelock can't catch the exfil of a value the agent doesn't have in the first place. New helper `_env_for_daemon(name, base_env)` returns the unchanged base for `egress` and a copy with `EGRESS_TOKEN_` filtered for everyone else. `_spawn` now passes the scoped env to `subprocess.Popen`. Prefix-based filter (not exact-match) so future egress-only env slots don't have to update this code. Tests: - `TestEnvForDaemon`: egress gets full env, pipelock / git-gate / supervise lose `EGRESS_TOKEN_0` + `EGRESS_TOKEN_1` but keep `PATH`, `EGRESS_UPSTREAM_PROXY`, `SUPERVISE_PORT`. - Independent-dict invariant locked so callers can't accidentally mutate the supervisor's env. 642 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 21:14:48 -04:00
didericis-claude	aa5aa1f031	fix(smolmachines): defer pty_resize startup sync to dodge libkrun's bringup race test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 41s Details test / unit (push) Successful in 27s Details test / integration (push) Successful in 45s Details The `b9853ae` stdin=DEVNULL fix wasn't sufficient. End-to-end testing against a live VM in tmux revealed a second crash path: libkrun spits "load \`config.json\`: parse error: trailing garbage { \"ociVersion\": \"1.0.2\", ... }" and the main exec dies (rc=1 or SIGKILL/rc=137, depending on race scheduling). Root cause: each `smolvm machine exec` writes a per-invocation OCI config.json to the same smolvm state dir during its bringup. The wrapper's startup sync() fires within 1ms of Popen-ing the main exec — both invocations write config.json concurrently, libkrun loads one mid-write, and gets garbage. Trivial inner commands (`sh -c "echo hi"`) finished before the overlap mattered, masking the race in earlier tests. claude's slower startup hits the race every time, and only inside tmux because the outside-tmux foreground-handoff path takes a different bringup sequence that happens to dodge the window. Fix: schedule the initial sync on a 2-second `threading.Timer` instead of calling it synchronously. By 2s the main exec is past its bringup window, so the side-channel's config.json write doesn't collide. Daemon thread so the timer doesn't block exit when the child finishes quickly. Trade-off: the in-VM PTY uses smolvm's default size for the first ~2s, then snaps to the host pane size when the timer fires. Verified end-to-end against a live VM in tmux: claude renders at the default size during bringup, then redraws at full pane width once the deferred sync lands. Operator-driven resizes (SIGWINCH) still bridge in real time via the already-installed signal handler. Also drop the diagnostic log added in `9c83ea6` — we have the fix. Regression test: `TestStartupSyncDeferred.test_main_schedules_timer_does_not_ call_sync_synchronously` mocks Popen + Timer + _push_size and asserts `main()` schedules the timer with the documented delay constant and never invokes _push_size synchronously. Catches a "let's just inline the sync() call" regression immediately. 638 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 20:55:00 -04:00
didericis-claude	b9853ae0c7	fix(smolmachines): give pty_resize side-channel DEVNULL stdin so it survives under tmux test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 40s Details Inside tmux the dashboard's smolmachines launch crashed within ~100ms of the wrapper Popen-ing the main smolvm exec child — sometimes with rc=137 (SIGKILL), sometimes with smolvm spitting a runc-style "load `config.json`: cannot parse the data: parse error: trailing garbage" and exiting 1. The same wrapper ran fine outside tmux. Diagnostic logs showed the SIGKILL landed ~100ms after the wrapper kicked off its initial `sync()` (which fires the side-channel smolvm exec). Root cause: the side-channel `subprocess.run([smolvm, machine, exec, --, sh, -c, ...])` did not specify `stdin=`, so it inherited the wrapper's stdin — the tmux pane PTY. The main smolvm child (the agent session) also had that PTY as stdin. Two concurrent smolvm processes sharing the PTY's foreground-process-group / input plumbing caused smolvm to abort one of them. iTerm's PTY plumbing apparently tolerated this; tmux's didn't. Fix is one line in `_push_size`: `stdin=subprocess.DEVNULL`. The side-channel never needs stdin — it runs a fire-and-forget `stty` and exits. Verified end-to-end: pre-fix the wrapper crashed under `tmux respawn-pane` against a live VM; post-fix the same invocation completes cleanly. Also drop the diagnostic log added in `37bd11b` — we have the fix. Regression test: `test_side_channel_uses_devnull_stdin` locks the `stdin=DEVNULL` invariant so a future "let's simplify the subprocess.run kwargs" refactor surfaces this immediately. 637 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 20:43:59 -04:00
didericis-claude	794e8666e1	fix(smolmachines): invoke pty_resize by absolute path, not python -m test / unit (pull_request) Successful in 25s Details test / integration (pull_request) Successful in 40s Details The dashboard's launch path crashed inside tmux but worked outside it. Root cause: `python -m claude_bottle.backend.smolmachines.pty_resize` needs the `claude_bottle` package on `sys.path`, which by default comes from cwd. The outside-tmux path is `subprocess.run(...)` — inherits the dashboard process's cwd (the repo root, where `claude_bottle/` lives), so the import resolves. The inside-tmux path is `tmux split-window / respawn-pane <argv>`, and tmux opens the new pane with the pane's OWN cwd, not the cwd of the process invoking split-window. If the operator started their tmux pane anywhere outside the repo (typical: `$HOME`), the wrapper hit `ModuleNotFoundError: No module named 'claude_bottle'` and tmux closed the pane immediately. Sidestep the cwd dependence by invoking the wrapper as `python <absolute-path-to-pty_resize.py>` instead of `python -m <dotted-path>`. The wrapper has no `claude_bottle.*` imports — it's stdlib-only — so it runs as a standalone script anywhere on the filesystem. The absolute path comes from `pty_resize.__file__` at module-load time. Tests: - `test_pty_resize_wrapper_prefix`: updated to assert the absolute-script-path shape rather than the `-m <dotted>` shape. - `test_no_wrapper_when_tty_false`: the substring check now uses `any("pty_resize" in a for a in argv)` instead of string-joining (so the absolute path's "pty_resize.py" filename match still catches a regression). 636 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 20:26:42 -04:00
didericis-claude	3fb305f654	fix(smolmachines): bridge host SIGWINCH into the VM PTY (issue #82 ) test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 41s Details `smolvm 0.8.0 machine exec -t` allocates an in-VM PTY but never forwards the host terminal's window size — the PTY starts at `0 0` and host resizes (tmux pane resize, terminal window resize) go unnoticed, so the claude TUI inside a smolmachines bottle renders for whatever tiny box it last saw and ignores operator resizes. `docker exec -it` propagates window-size changes automatically; smolvm doesn't. Workaround: a small Python wrapper (`backend/smolmachines/pty_resize.py`) that interposes between the operator's terminal and `smolvm machine exec`. It spawns smolvm as a child, traps host SIGWINCH, and on every resize (plus once at startup) runs a side-channel `smolvm machine exec --name <M> -- sh -c 'for f in /dev/pts/; do stty -F $f cols X rows Y; done'`. The kernel delivers SIGWINCH to the in-VM foreground process group when the slave PTY's size changes, so claude picks up the new dimensions without extra signalling. `SmolmachinesBottle.claude_argv` prepends `[sys.executable, -m, claude_bottle.backend.smolmachines. pty_resize, <machine>, --, ...]` to the existing smolvm argv in TTY mode. Non-TTY mode (provisioning shell-outs) skips the wrapper — no PTY to resize. The wrapper survives the dashboard's `_build_resume_argv_with_fallback` shell-wrap because the split-at-`claude` token still finds the right position — the wrapper's prefix wraps the entire smolvm-exec framing. Tests: - `test_smolmachines_pty_resize.py` (new): argv parsing, the side-channel command shape (cols/rows / for-loop over /dev/pts/), and `_read_winsize`'s fallback across stdin/stdout/stderr including the smolvm-allocated-PTY- reports-`0 0` ironic case. - `test_smolmachines_bottle.py`: updated TTY-mode assertions to unwrap the pty_resize prefix; added `TestClaudeArgvNoTTY` to lock the non-TTY skip. 636 unit tests pass. Removable when smolvm grows native SIGWINCH forwarding. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 20:15:11 -04:00
didericis-claude	a3a9ec065e	feat(cleanup): walk every backend, reap smolmachines orphans too test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 43s Details test / unit (push) Successful in 29s Details test / integration (push) Successful in 38s Details `./cli.py cleanup` previously called only the env-var-selected backend's `prepare_cleanup` / `cleanup` — so a leftover smolvm machine + bundle container + bundle network from a crashed smolmachines bottle would survive a default `docker`-mode cleanup indefinitely. Smolmachines now has a real `cleanup` module (alongside `enumerate.py` from issue #77) that walks: - smolvm machines named `claude-bottle-` (via `smolvm machine ls --json`) - bundle containers `claude-bottle-sidecars-` - bundle networks `claude-bottle-bundle-*` Cleanup runs stop+delete on the machines, force-rm on the containers, network rm on the networks. Each step is best-effort so a failed rm doesn't block the rest. `cli.py cleanup` walks every backend in `known_backend_names()` and runs each backend's `cleanup` after a single y/N prompt that shows a combined plan. State dirs (`~/.claude-bottle/state/<slug>/`) are shared layout with the docker backend, which still owns the orphan-state-dir bucket. It now consults `enumerate_active_bottles()` for the cross-backend live identity set so a running smolmachines bottle's state dir isn't reaped during a cleanup. Tests: smolmachines cleanup (prepare + cleanup ordering + failure handling); cross-backend orphan protection on the docker state-dir check; CLI cmd_cleanup walks both backends, short- circuits on all-empty, aborts on N. 617 unit tests pass. End-to-end verified on this host: $ smolvm machine ls --json \| jq '.[].name' "claude-bottle-researcher-m3hxd" $ ./cli.py cleanup --- smolmachines backend --- smolvm machine: claude-bottle-researcher-m3hxd remove all of the above? [y/N] Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 19:56:41 -04:00
didericis-claude	3103266053	fix(dashboard): hoist claude_argv to Bottle ABC so smolmachines pane attach works test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 42s Details test / unit (push) Successful in 26s Details test / integration (push) Successful in 45s Details Launching a smolmachines agent from the dashboard inside tmux crashed with AttributeError: 'SmolmachinesBottle' object has no attribute 'claude_docker_argv' because the tmux pane-respawn path called `bottle.claude_docker_argv(...)` directly — a method that only existed on DockerBottle. The foreground-handoff path (curses endwin → subprocess.run → restore) doesn't hit it; it goes through `bottle.exec_claude` which is on the ABC. - Move the argv builder onto the `Bottle` ABC as `claude_argv(argv, *, tty=True) -> list[str]`. Both backends implement it; both `exec_claude` impls collapse to `subprocess.run(self.claude_argv(argv, tty=tty), check=False)`. - DockerBottle: rename `claude_docker_argv` → `claude_argv`, body unchanged. - SmolmachinesBottle: extract the argv-building from `exec_claude` into `claude_argv`; the new method returns the full `smolvm machine exec --name … -- runuser -u node -- claude …` argv. The `runuser` switch lives on the exec-framing prefix so the dashboard's `_build_resume_argv_with_fallback` split-at-"claude" trick keeps the UID switch when wrapping the claude tail in `sh -c "… --continue \|\| …"`. - Dashboard: drop the docker-specific wording — local + helper arg names `docker_argv` → `claude_argv`; docstrings on `_build_resume_argv_with_fallback`, `_build_split_pane_argv`, `_build_respawn_pane_argv` now say "backend-exec argv". The shell-fallback wrap is unchanged; the existing logic works for smolmachines because `claude` is still the marker token. Tests: - `tests/unit/test_smolmachines_bottle.py` (new): locks down the smolmachines argv shape — prompt-file flag injection, guest-env `-e K=V` forwarding, TTY toggle, runuser-precedes- claude invariant. - `test_docker_bottle.py`: TestClaudeDockerArgv → TestClaudeArgv; method renames follow. - `test_dashboard_active_agents.py`: docstring follow. 615 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 19:52:02 -04:00
didericis-claude	5e0130b56f	fix(smolmachines): build agent image in launch, not prepare test / unit (push) Successful in 26s Details test / integration (push) Successful in 43s Details When starting a smolmachines agent from the dashboard the docker-build output rendered on top of the curses preflight modal — the build was kicked off before the operator had confirmed launch. The docker backend's `prepare` is pure resolution (no docker calls); smolmachines was inconsistent because `prepare` called `_ensure_smolmachine` which ran `docker build` → `docker save` → `crane push` → `smolvm pack create`, several seconds of stderr noise rendered before the y/N prompt. Move the pipeline: - `_ensure_smolmachine` (+ `_SMOLMACHINE_CACHE_DIR` + `_REPO_DIR` + the local-registry / smolvm imports) moves from `backend/smolmachines/prepare.py` to `backend/smolmachines/launch.py`. Called right before `_smolvm.machine_create` so the resulting `.smolmachine` sidecar path lands as a local in `launch`, not on the plan. - `SmolmachinesBottlePlan.agent_from_path: Path` becomes `agent_image_ref: str`. `prepare` stashes only the docker tag (`$CLAUDE_BOTTLE_IMAGE` \|\| `claude-bottle:latest`); `launch` resolves it into the artifact at bringup. This puts smolmachines on the same prepare-vs-launch boundary the docker backend uses: the preflight summary in the dashboard prints, the operator confirms, then `launch` runs — and its stderr is routed via `_route_op_to_right_pane` (in tmux) or via `curses.endwin` (foreground handoff) so the build output lands cleanly. Tests: - `tests/unit/test_smolmachines_prepare_image.py` → `tests/unit/test_smolmachines_launch_image.py`, updated to import `_ensure_smolmachine` from `launch` rather than `prepare`. - `test_smolmachines_provision.py`: plan fixture switches `agent_from_path` → `agent_image_ref`. 593 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 19:44:53 -04:00
didericis-claude	5d740a6948	style(backend): drop stale "moved/removed" pointer comments test / unit (pull_request) Successful in 25s Details test / integration (pull_request) Successful in 41s Details test / unit (push) Successful in 27s Details test / integration (push) Successful in 41s Details PR #78 review comments 580, 582, 584. Each was a comment describing what the previous refactor removed or relocated — information that's in git history, not load-bearing for a reader of the file as-is. - claude_bottle/backend/docker/cleanup.py: drop trailing "enumerate_active moved to enumerate.py" note. - tests/unit/test_dashboard_active_agents.py: drop module docstring paragraph about which tests moved where. - tests/unit/test_docker_enumerate_active.py: drop "noop-when-docker-missing lives at the cross-backend gate now" trailing comment. 607 tests still pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 19:16:28 -04:00
didericis-claude	3b418580a9	refactor(backend): has_backend() helper + docker/enumerate split + ActiveAgent rename test / unit (pull_request) Successful in 28s Details test / integration (pull_request) Successful in 42s Details Addresses PR #78 review feedback: - New `has_backend(name)` on the backend package + abstract `BottleBackend.is_available()` on each concrete subclass. Replaces inline `shutil.which("docker") is None` checks in docker/cleanup.py:178 and smolmachines/enumerate.py:73. Docker → `shutil.which("docker") is not None`; smolmachines → `smolvm.is_available()`. Cross-backend `enumerate_active_ agents()` skips backends whose `is_available()` is False so a docker-only host doesn't fail when iterating past smolmachines (and vice versa). - Move docker `enumerate_active` + parser helpers out of cleanup.py into a new `backend/docker/enumerate.py`, mirroring the smolmachines/enumerate.py layout. cleanup.py is now purely about prepare_cleanup / cleanup; the active-listing concern owns its own file. - Drop the `ActiveAgent = ActiveBottle` alias in dashboard.py. The canonical name is `ActiveAgent` (the thing running inside a bottle is always called "agent" in this codebase; the bottle is the container). Renamed `enumerate_active_bottles` → `enumerate_active_agents` to match. Tests: - `test_backend_selection.TestEnumerateActiveAgents .test_skips_unavailable_backends` locks down the `is_available()`-gated iteration. - New `TestHasBackend` covers `has_backend("docker")` consulting the backend's `is_available`, and unknown-name → False. - Existing tests follow the rename; the docker-availability- side-effect test in `test_docker_enumerate_active` moves up to the cross-backend layer (where the gate lives now). 607 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 19:03:16 -04:00
didericis-claude	adff1263d8	feat(cli): cross-backend list active + --backend flag + dashboard picker (issue #77 ) test / unit (pull_request) Successful in 29s Details test / integration (pull_request) Successful in 41s Details CLI and dashboard now share one cross-backend abstraction for listing + launching bottles, so adding a backend (docker / smolmachines) lights up in both places without separate wiring. Backend abstraction: - New `ActiveBottle` dataclass (`backend_name`, `slug`, `agent_name`, `started_at`, `services`) replaces the docker-specific `ActiveAgent`. Same field surface for the existing dashboard consumers; `ActiveAgent` becomes a typed alias for source-compat. - New `BottleBackend.enumerate_active() -> Sequence[ActiveBottle]` replaces the old `list_active() -> None` (which printed and returned nothing). Docker implements it via the existing compose query; smolmachines implements it via `smolvm machine ls --json` cross-referenced with each bundle container's `CLAUDE_BOTTLE_SIDECAR_DAEMONS` env (`backend/smolmachines/ enumerate.py`). - New `enumerate_active_bottles()` and `known_backend_names()` module-level helpers fold every backend into one call. - `get_bottle_backend(name=None)` takes an optional explicit name (precedence: arg > $CLAUDE_BOTTLE_BACKEND > "docker"). CLI: - `./cli.py list active` enumerates every backend, prints tab-separated `<backend>\t<slug>\t<agent>\t<services>`. The smolmachines bottle the user was looking for now shows up. - `./cli.py start` grows `--backend=<docker\|smolmachines>` (choices pulled live from `known_backend_names()`). Threaded through `prepare_with_preflight(backend_name=...)` so the resume path picks up the flag too. Dashboard: - Active agents pane lists both backends (the row formatter now prefixes `[docker]` / `[smolmachines]`). - New-agent flow inserts a backend picker modal between agent pick and preflight (`_backend_picker_modal`). Short-circuits when only one backend is registered. - `discover_active_agents()` collapses to `enumerate_active_bottles()`; `_parse_services_by_project` and `_query_services_by_project` move to `backend/docker/cleanup.py` where the docker enumerator owns them. Tests: parser + enumerate-active tests relocated to `test_docker_enumerate_active.py`. New `test_backend_selection.py` covers `get_bottle_backend`, `known_backend_names`, `enumerate_active_bottles`. New `test_cli_start_backend_flag.py` covers `--backend`'s argparse shape + the explicit-over-env precedence. 605 unit tests pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 18:27:12 -04:00
didericis-claude	7eda2a66ec	feat(smolmachines): patch smolvm state DB to actually enforce per-bottle allowlist test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 44s Details Earlier commit framed this PR as "infrastructure landed, TSI enforcement blocked on upstream smolvm 0.8.0." Found a clean workaround that lets us enforce now. Smolvm persists each machine's config (including `allowed_cidrs`) as a JSON BLOB in `~/Library/Application Support/smolvm/server/smolvm.db`, `vms.data`. `machine create --allow-cidr X/32` silently writes `allowed_cidrs: null` to that row when combined with `--from`, but smolvm reads the row at `machine start` — so patching the row between create and start sets the allowlist for real. New `loopback_alias.force_allowlist(machine_name, cidrs)` opens the SQLite DB, JSON-decodes the row, sets `allowed_cidrs`, and writes back as BLOB (Text type silently corrupts smolvm's later reads). launch.py calls it immediately after `machine_create` and before `machine_start`. Verified end-to-end on macOS / Docker Desktop: VM allowlist after start: ["127.0.0.16/32"] VM → 127.0.0.1:3000 → BLOCKED (Permission denied) VM → 8.8.8.8:53 → BLOCKED (Permission denied) VM → 127.0.0.16:<bundle> → CONNECTED The DB-patch hack is correct only because smolvm reads `allowed_cidrs` from the row at start time (not derived in- process). When upstream honors `--allow-cidr` with `--from`, the call becomes redundant — drop the call and the workaround is gone. Tests: 4 new for `force_allowlist` (BLOB round-trip; Linux no-op; missing DB; missing row). Total 593 unit tests pass. README + PRD updated to reflect the fix landed (no longer "infrastructure pending upstream"). gitea#75 can close. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 16:55:03 -04:00
didericis-claude	2edc1abb9a	feat(smolmachines): per-bottle loopback alias scopes TSI to single /32 test / unit (pull_request) Successful in 27s Details test / integration (pull_request) Successful in 41s Details PR #74's Docker-Desktop fix routed the agent through `127.0.0.1:<random>` loopback forwards, but TSI filters by IP only — so the allowlist `127.0.0.1/32` let the agent VM reach any host service on macOS loopback (postgres, dev servers, other bottles' published ports, mDNSResponder, ...). Real downgrade vs the docker backend's `--internal` network. Resolution: per-bottle loopback alias. - New `loopback_alias` module manages a pool of `127.0.0.16` .. `127.0.0.31` on `lo0`. macOS only routes `127.0.0.1` by default; the extras need `sudo ifconfig lo0 alias`. `ensure_pool()` lazily adds the missing entries via one sudo prompt on first launch per reboot — aliases persist on `lo0` until reboot, so subsequent launches skip the prompt entirely. - `allocate(slug)` picks the lowest-numbered unused alias by inspecting running bundle containers' port-binding HostIps. No on-disk reservation — docker is the source of truth. - Bundle bringup binds published ports to the allocated alias (`docker run -p <alias>::<port>`) instead of `127.0.0.1`. - TSI allowlist becomes the alias's /32 — narrows reachability to this bottle's bundle only. - Linux native daemons share the host's network namespace; `127.0.0.0/8` works without aliases, so the module no-ops on non-Darwin and returns `127.0.0.1` from `allocate`. Tracking issue closed: gitea/issues/75. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 16:23:17 -04:00
didericis-claude	d7cef27584	feat(smolmachines): PRD 0022 sandbox-escape suite green under smolmachines (PRD 0023 chunk 5) test / unit (pull_request) Successful in 26s Details test / integration (pull_request) Successful in 43s Details Final PRD 0023 chunk. The PRD 0022 attack suite was already backend-agnostic — it goes through get_bottle_backend(), so the right dispatch happens based on CLAUDE_BOTTLE_BACKEND. Two cleanups to make it actually run cleanly under CLAUDE_BOTTLE_BACKEND=smolmachines: - setUpClass raises unittest.SkipTest with a useful message when CLAUDE_BOTTLE_BACKEND=smolmachines but smolvm isn't on PATH, or when the host isn't macOS (libkrun + TSI single-IP allowlist is macOS-only in v1). Without this, the test would die deep inside backend.prepare's smolmachines_preflight rather than skipping. - test_5_readme_push_blocked switches from a hardcoded `git://git-gate/...` remote URL (only resolvable on docker via the bundle's short alias) to the bottle's declared upstream URL (`ssh://git@unreachable.invalid:22/throwaway.git`). The agent's ~/.gitconfig insteadOf rewrite — set up by provision_git on both backends — transparently redirects to the gate, so the same test exercises docker's `git://git-gate/...` and smolmachines's `git://<bundle_ip>:9418/...` URLs without branching on backend. README gets a "Backend selection" subsection under Quickstart documenting CLAUDE_BOTTLE_BACKEND, the macOS-only v1 scope for smolmachines, and the `curl -sSL .../install.sh \| sh` install prerequisite — per PRD 0023's acceptance criteria. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 16:12:10 -04:00

1 2 3 4 5

224 Commits