bot-bottle

Author	SHA1	Message	Date
didericis	ef5d2f9a4d	feat(state): preserve on crash + always snapshot transcript test / unit (pull_request) Successful in 17s Details test / integration (pull_request) Successful in 1m31s Details Extends the preserve-on-capability-block design to also preserve state on agent crash, and snapshots the transcript on every teardown so any resume (crash or capability-block) gets a warm claude session — not a cold start. - capability_apply: rename _snapshot_transcript → snapshot_transcript (public; reused below). No behavior change in the capability path. - cli/start.py: capture bottle.exec_claude's exit code; while the container is still alive (inside the launch context): * always snapshot_transcript(identity) * if exit_code != 0, mark_preserved(identity) Then the existing _settle_state runs after teardown. Now the preservation matrix is: exit 0 (clean) → snapshot + cleanup state exit ≠0 (crash, Ctrl-C) → snapshot + preserve + show resume hint capability-block → (already snapshotted/preserved by apply before teardown; this path is a no-op because the container is already gone by the time exec_claude returns) snapshot_transcript is best-effort — capability-block's earlier snapshot is not clobbered when the container is already torn down, and a missing /home/node/.claude is a warn + skip. Tested behavior: clean exit doesn't preserve, non-zero exit (including SIGINT/130 and SIGKILL/137) preserves; empty identity no-ops both helpers. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 07:05:23 -04:00
didericis	9dbd20398e	feat(state): clean up per-bottle state on session end (except capability-block) test / unit (pull_request) Successful in 19s Details test / integration (pull_request) Successful in 1m35s Details Previously every bottle launch left ~/.claude-bottle/state/<identity>/ behind forever — metadata.json on every run, plus per-bottle Dockerfile + transcript snapshot on capability-block rebuilds. The metadata accumulated debris across launches; the only state worth keeping was the capability-block rebuild bundle. Make cleanup the default; preserve only on capability-block. - bottle_state.py: .preserve marker helpers (mark_preserved, is_preserved, clear_preserve_marker, preserve_marker_path) + cleanup_state(identity) that rm -rf's the per-bottle dir. - capability_apply.apply_capability_change writes mark_preserved before teardown so cli.py's session-end cleanup keeps the dir. - prepare.py clears any leftover marker at launch (start or resume), so a marker from a prior capability-block doesn't keep state alive past a subsequent normal session-end. - cli/start.py runs the cleanup decision AFTER the launch context closes: if is_preserved → print resume hint; else cleanup_state. The resume hint moves out of the launch with-block (was previously printed unconditionally — would have misled the operator about whether state was actually kept). Future-proof: cli.py never persists state speculatively. If the agent wants to be resumable, it has to go through capability-block. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 06:51:13 -04:00
didericis	4032e04a9c	feat(bottle): random-suffix identity + cli.py resume <identity> test / unit (pull_request) Successful in 18s Details test / integration (pull_request) Successful in 1m30s Details Replaces the cwd-hash identity with a random 5-char base36 suffix per launch, so two simultaneous `start <agent>` invocations against the same cwd no longer collide on container names. Each launch is its own bottle. State carries metadata: every prepare step writes ~/.claude-bottle/state/<identity>/metadata.json with the (agent_name, cwd, copy_cwd, started_at) the bottle was launched with. The new `cli.py resume <identity>` reads this metadata and re-launches a bottle pinned to the same identity — picking up the per-bottle Dockerfile (from a prior capability-block apply) and the transcript snapshot under the same state dir. - bottle_state.py: bottle_identity(agent_name) drops the cwd param and gains a random suffix; BottleMetadata dataclass + read/write/metadata_path helpers. - BottleSpec gains an optional identity field — resume sets it to pin the identity; start leaves it empty so prepare mints fresh. - prepare.py: writes metadata at launch time; uses spec.identity if provided (resume) else bottle_identity(agent_name) (fresh start). - start.py: extracted _launch_bottle from cmd_start so resume can share the launch core; prints `./cli.py resume <identity>` hint at session end. - cli/resume.py (new): reads metadata, reconstructs BottleSpec with the recorded identity + cwd, delegates to _launch_bottle. Errors clearly when no state exists for the given identity. - cli/__init__.py: registers `resume` in COMMANDS + usage. - dashboard.py: capability-block approval status line now appends the `resume <identity>` hint so the operator can copy-paste the rebuild command without leaving the TUI. Closes the rebuild loop in PRD 0016: agent calls capability-block → operator approves → bottle torn down with state preserved → status line shows resume command → operator runs it → replacement bottle boots with the new Dockerfile and prior transcript. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 06:09:45 -04:00
didericis	32b62cbacc	feat(cred_proxy)!: cred-proxy is the only Anthropic auth path test / unit (pull_request) Successful in 13s Details test / integration (pull_request) Successful in 23s Details Removes the legacy `CLAUDE_BOTTLE_OAUTH_TOKEN` -> `CLAUDE_CODE_OAUTH_TOKEN` forward in prepare.py. Bottles that need claude-code to authenticate must declare a cred_proxy route with role: "anthropic-base-url" — there is no fallback that hands the token to the agent directly. Drops the now-dead BottleSpec.forward_oauth_token field, the CLI setter that read CLAUDE_BOTTLE_OAUTH_TOKEN from the host env at prepare time, and the forward_oauth_token=False arg in the six pipelock integration tests. PRD 0010 and README updated; the dev ~/claude-bottle.json gains an anthropic-base-url route so the implementer/researcher agents keep working. BREAKING: bottles previously relying on the implicit OAuth forward will now produce an agent environ without any Anthropic credential. Verified with --dry-run: a bottle with no anthropic-base-url route yields env_names: [] (no token at all); a bottle that declares the route yields ANTHROPIC_BASE_URL plus a non-secret placeholder for CLAUDE_CODE_OAUTH_TOKEN.	2026-05-24 12:56:09 -04:00
didericis	beb0c9d58f	feat(cli): add --format=json to start --dry-run for machine-readable plan BottlePlan gains a to_dict method (abstract on the base, implemented on DockerBottlePlan) returning a JSON-serializable view of the resolved plan. `cli.py start --dry-run --format=json` prints it to stdout and exits zero. --format=json without --dry-run is rejected — emitting JSON during a real launch would race the y/N prompt. The dry-run integration test now parses the JSON and asserts on structured fields (agent, bottle, runtime, hosts sorted+deduped, etc.) instead of regex-matching the human-readable preflight stdout. That kills the magic-"8 hosts allowed" coupling — adding a new baked default doesn't break the test. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:23:24 -04:00
didericis	70a22fa210	refactor: rename platform abstraction to backend test / run tests/run_tests.py (pull_request) Successful in 21s Details Across the package: - claude_bottle/platform/ -> claude_bottle/backend/ - platform/docker/platform.py -> backend/docker/backend.py - class BottlePlatform -> BottleBackend - class DockerBottlePlatform -> DockerBottleBackend - get_bottle_platform() -> get_bottle_backend() - env var CLAUDE_BOTTLE_PLATFORM -> CLAUDE_BOTTLE_BACKEND - dict _PLATFORMS -> _BACKENDS "Backend" is shorter and more established as the term for a pluggable strategy-pattern implementation. "Platform" was vague (could mean OS, hardware, cloud) and mildly redundant — Docker is itself a platform. The previous PRD section claiming "the Backend protocol was rejected" referred to a low-level run/exec/cp/network_connect protocol; the name was never the reason. The PRD is updated to describe that rejected design by shape rather than by name. The bottle/agent concepts and the manifest schema are unchanged.	2026-05-10 23:59:38 -04:00
didericis	1d2c18eaae	refactor(platform): rename claude_bottle/bottles -> claude_bottle/platform test / run tests/run_tests.py (pull_request) Successful in 13s Details 'bottles' was the package name when it held a single Bottle Protocol; since we added BottlePlatform / BottlePlan / BottleCleanupPlan and made it the home of platform dispatch, 'platform' describes the package better. The 'bottle' concept (and the manifest field) stays. CLI imports update from ..bottles to ..platform; internal relative imports inside the package survive the rename unchanged. Git detected all 7 file renames.	2026-05-10 23:37:28 -04:00
didericis	2827d9b899	refactor(bottles): introduce BottlePlan base + move print onto plan test / run tests/run_tests.py (pull_request) Successful in 19s Details - Add BottlePlan (frozen dataclass + ABC) with spec, stage_dir, and an abstract `print(*, remote_control)` method. - DockerBottlePlan now inherits from BottlePlan; spec/stage_dir come from the base, Docker-specific fields stay on the subclass. - Move BottleSpec from bottles/docker.py to bottles/__init__.py so the cross-platform types live together. docker.py pulls them via `from . import ...`. - Move show_plan from cli/start.py to `DockerBottlePlan.print`. Caller becomes `plan.print(remote_control=...)`. The CLI no longer reads any Docker-specific fields. - BottlePlatform.prepare is now typed `Callable[..., BottlePlan]`. cmd_start drops ~46 more lines.	2026-05-10 22:49:57 -04:00
didericis	236c4fa50c	refactor(bottles): rename DockerBottleSpec to BottleSpec test / run tests/run_tests.py (pull_request) Successful in 13s Details The spec is intent-only and platform-agnostic — only the plan carries Docker-specific fields. Drop the 'Docker' prefix and re-export from claude_bottle.bottles so callers see it as cross-platform.	2026-05-10 22:40:19 -04:00
didericis	4f16b3a9e1	refactor(bottles): split factory into prepare + launch phases test / run tests/run_tests.py (pull_request) Successful in 15s Details The Docker factory had absorbed live container ops but left the host-side prep (image-name resolution, container-name collision retry, pipelock yaml generation, env_resolve writes, host validation) in cli/start.py. That kept ~half the Docker-specific logic outside the abstraction. Split the factory into two phases: prepare_docker_bottle(spec, stage_dir=...) -> DockerBottlePlan Resolves names, validates skills/SSH, writes scratch files. No Docker resources created yet. launch_docker_bottle(plan) -> ContextManager[Bottle] Builds image, creates networks, boots pipelock, runs the agent container, provisions files. Teardown on exit. DockerBottleSpec shrinks to intent-only inputs (manifest, agent name, --cwd flag, user_cwd, forward_oauth_token). The CLI no longer references docker_mod, pipelock, skills, ssh, or env_resolve. get_bottle_factory becomes get_bottle_platform returning a BottlePlatform with .prepare and .launch — one selectable thing per platform. The Bottle handle now remembers the in-container prompt path and adds --append-system-prompt-file to claude's argv when present, so the CLI no longer needs to know the path. cmd_start: ~148 lines down from 229. Tests pass; dry-run output byte-identical.	2026-05-10 22:36:26 -04:00
didericis	a284d85296	refactor(start): show_plan now takes DockerBottleSpec test / run tests/run_tests.py (pull_request) Successful in 15s Details	2026-05-10 22:23:40 -04:00
didericis	7500ba230c	refactor(start): extract show_plan from cmd_start test / run tests/run_tests.py (pull_request) Successful in 15s Details	2026-05-10 22:20:33 -04:00
didericis	d75cc9325f	feat(bottles): implement bottle factory abstraction per PRD 0003 test / run tests/run_tests.py (pull_request) Successful in 16s Details Introduce claude_bottle/bottles/ with a Bottle Protocol and a get_bottle_factory() that dispatches on CLAUDE_BOTTLE_PLATFORM (default "docker"). Move every Docker-specific subprocess.run call from cli/start.py, plus the orchestration of build, networks, the pipelock sidecar, container launch, and per-container provisioning (prompt, skills, ssh, .git), into create_docker_bottle. Drop bottles[].runtime from the manifest schema. Auto-detect whether gVisor is registered with the daemon and pass --runtime=runsc when it is; the preflight shows the resolved runtime so the choice is visible. Manifests still carrying 'runtime' get a clear error pointing at the auto-detect behavior, rather than silent ignore. Out of scope: cli/cleanup.py and cli/list.py still call docker directly. They enumerate active bottles across the host, which is a separate concern from "create a bottle" and is left for a follow-up that introduces a list_active/cleanup primitive on the factory.	2026-05-10 22:15:05 -04:00
didericis	1f36d53f7b	refactor(manifest): convert TypedDict to frozen dataclasses test / run tests/run_tests.py (pull_request) Successful in 14s Details Replace the TypedDict + 14 manifest_* free functions with frozen dataclasses (SshEntry, BottleEgress, Bottle, Agent, Manifest) carrying their own validators and constructors. Call sites import Manifest and chain attribute access; the manifest_* helpers and manifest_validate are gone. Behavior changes worth flagging: - Agent.bottle is now required (was optional with a "(none)" fallback). Manifest.from_json_obj dies if any agent lacks a 'bottle' field or references an undefined bottle, where previously start.py raised the error lazily for the specific agent being launched. - ssh.py now takes SshEntry instances; Host/IdentityFile shape checks moved upstream into Manifest construction, leaving only the IdentityFile filesystem-existence check in ssh_validate_entries. - pipelock_bottle_allowlist's per-element string check is dropped — the Manifest validator enforces it at load. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 21:20:15 -04:00
didericis	e3f5a5907a	feat(bottle): opt-in gVisor runtime per bottle test / run tests/run_tests.py (push) Successful in 19s Details Bottles can now set "runtime": "runsc" to launch the agent container under gVisor instead of runc, adding a userspace syscall barrier between the agent and the host kernel. Default is runc (Docker default). Pipelock stays on the default runtime per the research doc's minimum-diff prescription. The launcher verifies runsc is registered with the daemon before launch, surfaces the runtime in the preflight plan, and dies with an install pointer (and a macOS-not-supported note) when runsc is requested but unavailable. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 00:48:11 -04:00
didericis	f817847dff	refactor(cli): split claude_bottle/cli.py into a package test / run tests/run_tests.py (push) Successful in 20s Details One file per subcommand under claude_bottle/cli/, with shared constants and the tty helper in _common.py and dispatch in __init__.py. The public import (from claude_bottle.cli import main) is unchanged, so the root cli.py entrypoint and the test suite see no surface change. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 00:15:16 -04:00

16 Commits