bot-bottle

Author	SHA1	Message	Date
didericis	f943e14891	refactor(pipelock): take stage_dir, derive yaml_path internally [CI: test / unit (pull_request) successful in 11s; test / integration (pull_request) failing after 12s] PipelockProxy.prepare now accepts (bottle, slug, stage_dir) and derives the yaml_path itself, so callers don't need to know the filename. DockerBottleBackend.prepare_proxy becomes a one-line wrapper whose only caller already has bottle and slug in scope, so it's inlined and deleted.	2026-05-11 16:50:22 -04:00
didericis	479adc625a	test(pipelock): collapse over-decomposed allowlist helper tests [CI: test / unit (pull_request) successful in 11s; test / integration (pull_request) successful in 21s] The four lower-level helpers (pipelock_bottle_allowlist, pipelock_bottle_ssh_hostnames, pipelock_bottle_ssh_ip_cidrs, pipelock_bottle_ssh_trusted_domains) are one-line filters; testing each in isolation duplicates coverage that pipelock_effective_allowlist already provides end-to-end. The /32 CIDR suffix is the only behavior beyond filtering, so it keeps a tiny dedicated test. Drops the misplaced test_rejects_non_string_entry: that's manifest validation, not allowlist resolution. It belongs in a manifest-validation test file (which doesn't exist yet); leaving for a separate PR rather than adding a one-branch sample here. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:36:04 -04:00
didericis	757e76add7	test(cli): tighten and relocate --format=json validation test. Move the --format=json-requires-dry-run check out of the integration suite (it doesn't need Docker; argparse fails before any backend runs) and tighten the assertion: the test previously asserted only that the exit code was nonzero, so any unrelated breakage (manifest resolution failure, bad agent name, etc.) silently passed. Now asserts stderr contains the actual flag-conflict message. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:35:55 -04:00
didericis	8f5e07af7f	test(pipelock): drive sidecar smoke through production prepare/start [CI: test / unit (pull_request) successful in 14s; test / integration (pull_request) successful in 23s] The old smoke test hand-rolled the docker create/cp/start sequence in parallel with what DockerPipelockProxy.start already does, so any divergence in production code wouldn't trip it. Rewritten to call .prepare and .start directly and probe /health from a sibling curl container on the same internal network, the same access topology the agent container uses in production. In-network probing means the test no longer depends on a published port, so it can run under act_runner (where host-loopback port publishing isn't reachable from the job container). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:23:43 -04:00
didericis	beb0c9d58f	feat(cli): add --format=json to start --dry-run for machine-readable plan. BottlePlan gains a to_dict method (abstract on the base, implemented on DockerBottlePlan) returning a JSON-serializable view of the resolved plan. `cli.py start --dry-run --format=json` prints it to stdout and exits zero. --format=json without --dry-run is rejected, because emitting JSON during a real launch would race the y/N prompt. The dry-run integration test now parses the JSON and asserts on structured fields (agent, bottle, runtime, hosts sorted+deduped, etc.) instead of regex-matching the human-readable preflight stdout. That kills the magic "8 hosts allowed" coupling: adding a new baked default doesn't break the test. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:23:24 -04:00
didericis	30b4f12288	refactor(pipelock): expose structured config; assert on dict in tests. Split pipelock config building from YAML rendering: pipelock_build_config returns a dict, pipelock_render_yaml serializes it, and _build_pipelock_yaml chains the two onto disk. Behavior is unchanged: pipelock loads the same YAML. The yaml test now asserts on the structured config dict, which is robust to cosmetic YAML changes (key order, quoting). The two checks that only make sense on the rendered output (file mode 0600 and no-secret-leakage) stay against the on-disk content. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:23:12 -04:00
didericis	4462863d56	test: reorganize suite into unit/integration/canaries directories. Replace the hand-maintained INTEGRATION_NAMES classifier (and the bespoke run_tests.py around it) with a directory-driven split: tests/unit/ (unit tests, always run); tests/integration/ (Docker-dependent, skip cleanly without Docker); tests/canaries/ (upstream-regression checks, opt-in via CLAUDE_BOTTLE_RUN_CANARIES=1). The pinned-pipelock-image check moves to the canary suite: it tests upstream packaging, not our code, so it shouldn't gate every dev push. A scheduled canaries.yml workflow runs it weekly. The manifest-runtime tests collapse the four assertRaises cases for distinct 'runtime' values into one subTest loop and drop the error-message-wording assertions; the contract is "any value is rejected", not "the error literally contains 'auto-detect'". Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:23:02 -04:00
didericis	1269edf311	refactor(pipelock): PipelockProxy.prepare takes a Bottle, not (manifest, name) [CI: test / run tests/run_tests.py (pull_request) successful in 14s] Matches the allowlist-resolution helpers' shape: the caller resolves the bottle once and passes it in. Signature drops from (manifest, bottle_name, slug, yaml_path) to (bottle, slug, yaml_path). DockerBottleBackend.prepare_proxy uses manifest.bottle_for(agent_name) to get the bottle directly. Tests pass fixture.bottles[name]. prepare's docstring also explains what `slug` is: the lowercased, hyphen-normalized agent identifier used as the suffix in every per-agent resource name (agent container, pipelock container, the internal/egress networks). It's stored on the plan so start can derive the sidecar's container name. Top-level pipelock.py drops the Manifest import, which is no longer used.	2026-05-11 14:05:48 -04:00
didericis	1b3254bf37	refactor(pipelock): move PIPELOCK_IMAGE and PIPELOCK_PORT to docker/pipelock.py [CI: test / run tests/run_tests.py (pull_request) successful in 14s] Both constants were already only used by Docker-specific code (the sidecar boot, the proxy_url/host_port naming helpers, the image contract test). Move them next to DockerPipelockProxy. Top-level pipelock.py drops the 'os' import along with the constants; the two test files that pulled PIPELOCK_IMAGE retarget at the new location.	2026-05-11 13:59:43 -04:00
didericis	b49281800a	refactor(pipelock): move Docker-specific naming helpers to docker/pipelock.py [CI: test / run tests/run_tests.py (pull_request) successful in 16s] The three slug-based naming helpers were nominally on pipelock.py but each assumed a Docker container topology (the container name is 'claude-bottle-pipelock-<slug>', the proxy URL uses that container name). Move them next to DockerPipelockProxy: pipelock_container_name -> claude-bottle-pipelock-<slug>; pipelock_proxy_url -> http://<container>:<port>; pipelock_proxy_host_port -> <container>:<port>. backend.py imports them directly from .pipelock; the orphan-cleanup test imports container_name from the same place.	2026-05-11 13:57:18 -04:00
didericis	edd8b444a6	refactor(pipelock): split sidecar lifecycle into DockerPipelockProxy [CI: test / run tests/run_tests.py (pull_request) successful in 18s] PipelockProxy becomes an ABC with the platform-agnostic prepare/_build_pipelock_yaml as concrete methods and start/stop as abstract. Docker-specific sidecar lifecycle moves to a new sibling file, claude_bottle/backend/docker/pipelock.py, where DockerPipelockProxy(PipelockProxy) implements start (docker create/cp/network connect/start) and stop (docker inspect/rm -f). DockerBottleBackend._proxy is now a DockerPipelockProxy instance. Tests that previously instantiated PipelockProxy() directly switch to DockerPipelockProxy() (the base is no longer constructable).	2026-05-11 13:53:45 -04:00
didericis	25e67137f2	refactor(pipelock): allowlist-resolution helpers take a Bottle, not (manifest, name) [CI: test / run tests/run_tests.py (pull_request) successful in 17s] Every function in the 'Allowlist resolution' section was doing `manifest.bottles[bottle_name].X` as its first move. Push the lookup to the caller and have each helper take a resolved Bottle: pipelock_bottle_allowlist, pipelock_bottle_ssh_hostnames, pipelock_bottle_ssh_trusted_domains, pipelock_bottle_ssh_ip_cidrs, pipelock_effective_allowlist, pipelock_allowlist_summary. PipelockProxy._build_pipelock_yaml resolves bottle once at the top and passes it through; DockerBottleBackend.prepare already had the bottle in scope and now uses it directly. Tests pass the resolved bottle from each fixture.	2026-05-11 13:44:58 -04:00
didericis	c62b3204a8	refactor(util): move is_ipv4_literal out of pipelock.py into util.py [CI: test / run tests/run_tests.py (pull_request) successful in 25s] The classifier is a pure dotted-quad regex check; nothing about it is pipelock-specific. Pipelock now imports it from util. test_pipelock_classify.py retargets at the new location. Two manifest-accessor functions in pipelock.py (pipelock_bottle_allowlist, pipelock_bottle_ssh_hostnames) look generic but are 1-line wrappers used only internally; they stay for now.	2026-05-11 13:37:31 -04:00
didericis	ff962d2893	refactor(pipelock): start/stop become methods on PipelockProxy [CI: test / run tests/run_tests.py (pull_request) successful in 31s] ProxyPlan -> PipelockProxyPlan, with two additional fields populated in launch: internal_network, egress_network (default ""). prepare fills yaml_path + slug; launch uses dataclasses.replace to populate the networks before calling start. pipelock_start -> PipelockProxy.start(plan). Reads yaml_path, slug, internal_network, egress_network off the plan. Returns the resolved container name. pipelock_stop -> PipelockProxy.stop(proxy_target). Takes the resolved container name directly (the value that start returned); no longer needs to know about slugs or naming conventions. Backend launch passes the running container name (state["pipelock"]) to stop. Test for stop's idempotency uses pipelock_container_name to construct the proxy_target.	2026-05-11 10:57:07 -04:00
didericis	30ead9102a	refactor(pipelock): introduce PipelockProxy class housing the yaml body [CI: test / run tests/run_tests.py (pull_request) successful in 14s] The YAML generation now lives on PipelockProxy.prepare(manifest, bottle_name, yaml_path) in claude_bottle/pipelock.py. The class is the natural home for any future proxy-level state. DockerBottleBackend keeps an instance as a class attribute (_proxy = PipelockProxy()) and its prepare_proxy becomes a thin delegation. A future backend that wants a different egress proxy (or none) plugs in its own strategy. Tests retarget at the new home: PipelockProxy.prepare gets the content-shape assertions; the sidecar smoke test uses the class directly too. Same coverage.	2026-05-11 01:18:53 -04:00
didericis	f344c8cd9d	test(pipelock): cut low-value tests (naming + entrypoint/cmd inspection) [CI: test / run tests/run_tests.py (pull_request) successful in 14s] Drops 6 tests with no real coverage loss: tests/test_pipelock_naming.py (4 tests asserting that f-string format helpers return their f-string; shape locks, not behavior gates) and tests/test_pipelock_image.py:test_entrypoint_contains_pipelock and :test_cmd_contains_run (Docker image metadata inspection). The remaining test_binary_runs already covers 'does the pinned image actually work,' which is the only scenario these were really guarding against. 31 tests -> 25.	2026-05-11 01:11:59 -04:00
didericis	11f17d7927	refactor(docker): inline pipelock_write_yaml body into prepare_proxy [CI: test / run tests/run_tests.py (pull_request) successful in 16s] The yaml generation logic moves wholesale onto DockerBottleBackend where it's used. pipelock_write_yaml is deleted; pipelock.py keeps the allowlist resolution helpers (still called by prepare_proxy and by pipelock_allowlist_summary). The pipelock_start error message that referenced "pipelock_write_yaml must run first" now says "backend.prepare_proxy must run first." tests/test_pipelock_yaml.py rewritten to drive DockerBottleBackend().prepare_proxy(spec, yaml_path); test_pipelock_sidecar_smoke.py call site updated similarly. Same coverage at the new location.	2026-05-11 01:04:47 -04:00
didericis	70a22fa210	refactor: rename platform abstraction to backend [CI: test / run tests/run_tests.py (pull_request) successful in 21s] Across the package: claude_bottle/platform/ -> claude_bottle/backend/; platform/docker/platform.py -> backend/docker/backend.py; class BottlePlatform -> BottleBackend; class DockerBottlePlatform -> DockerBottleBackend; get_bottle_platform() -> get_bottle_backend(); env var CLAUDE_BOTTLE_PLATFORM -> CLAUDE_BOTTLE_BACKEND; dict _PLATFORMS -> _BACKENDS. "Backend" is shorter and more established as the term for a pluggable strategy-pattern implementation. "Platform" was vague (could mean OS, hardware, cloud) and mildly redundant, since Docker is itself a platform. The previous PRD section claiming "the Backend protocol was rejected" referred to a low-level run/exec/cp/network_connect protocol; the name was never the reason. The PRD is updated to describe that rejected design by shape rather than by name. The bottle/agent concepts and the manifest schema are unchanged.	2026-05-10 23:59:38 -04:00
didericis	c79966731c	refactor(docker): move network.py into platform/docker/ [CI: test / run tests/run_tests.py (pull_request) successful in 14s] The Docker bridge / internal network primitives are Docker-specific; they belong inside the Docker platform package alongside util.py and the rest. Same logic the earlier top-level docker.py move followed. Imports: platform.py: `from ... import network as network_mod` -> `from . import network as network_mod`; network.py: `from .log import ...` -> `from ...log import ...`; tests/test_orphan_cleanup.py: from claude_bottle.network -> from claude_bottle.platform.docker.network	2026-05-10 23:40:58 -04:00
didericis	d75cc9325f	feat(bottles): implement bottle factory abstraction per PRD 0003 [CI: test / run tests/run_tests.py (pull_request) successful in 16s] Introduce claude_bottle/bottles/ with a Bottle Protocol and a get_bottle_factory() that dispatches on CLAUDE_BOTTLE_PLATFORM (default "docker"). Move every Docker-specific subprocess.run call from cli/start.py, plus the orchestration of build, networks, the pipelock sidecar, container launch, and per-container provisioning (prompt, skills, ssh, .git), into create_docker_bottle. Drop bottles[].runtime from the manifest schema. Auto-detect whether gVisor is registered with the daemon and pass --runtime=runsc when it is; the preflight shows the resolved runtime so the choice is visible. Manifests still carrying 'runtime' get a clear error pointing at the auto-detect behavior, rather than silent ignore. Out of scope: cli/cleanup.py and cli/list.py still call docker directly. They enumerate active bottles across the host, which is a separate concern from "create a bottle" and is left for a follow-up that introduces a list_active/cleanup primitive on the factory.	2026-05-10 22:15:05 -04:00
didericis	1f36d53f7b	refactor(manifest): convert TypedDict to frozen dataclasses [CI: test / run tests/run_tests.py (pull_request) successful in 14s] Replace the TypedDict + 14 manifest_* free functions with frozen dataclasses (SshEntry, BottleEgress, Bottle, Agent, Manifest) carrying their own validators and constructors. Call sites import Manifest and chain attribute access; the manifest_* helpers and manifest_validate are gone. Behavior changes worth flagging: (1) Agent.bottle is now required (was optional with a "(none)" fallback). Manifest.from_json_obj dies if any agent lacks a 'bottle' field or references an undefined bottle, where previously start.py raised the error lazily for the specific agent being launched. (2) ssh.py now takes SshEntry instances; Host/IdentityFile shape checks moved upstream into Manifest construction, leaving only the IdentityFile filesystem-existence check in ssh_validate_entries. (3) pipelock_bottle_allowlist's per-element string check is dropped, since the Manifest validator enforces it at load. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 21:20:15 -04:00
didericis	36cb0c53bf	refactor(manifest): add TypedDict schema and eager validation [CI: test / run tests/run_tests.py (pull_request) successful in 20s] Move schema checks out of per-access getters into a single manifest_validate pass invoked by manifest_resolve. Getters can now assume bottles/agents are well-typed dicts and every agent has a defined bottle, so the .get(...) or {} chains collapse. Behavior change: a bad runtime / shape error anywhere in the manifest now fails at load instead of on the N-th read. Intermediate step toward replacing TypedDict with a dataclass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 21:08:54 -04:00
didericis	e3f5a5907a	feat(bottle): opt-in gVisor runtime per bottle [CI: test / run tests/run_tests.py (push) successful in 19s] Bottles can now set "runtime": "runsc" to launch the agent container under gVisor instead of runc, adding a userspace syscall barrier between the agent and the host kernel. Default is runc (Docker default). Pipelock stays on the default runtime per the research doc's minimum-diff prescription. The launcher verifies runsc is registered with the daemon before launch, surfaces the runtime in the preflight plan, and dies with an install pointer (and a macOS-not-supported note) when runsc is requested but unavailable. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 00:48:11 -04:00
didericis	4694db1201	PRD 0002: Test pipeline on Gitea Actions (#3) [CI: test / run tests/run_tests.py (push) successful in 20s]	2026-05-09 02:48:03 -04:00
didericis	399ed93dc8	refactor: convert project from bash to Python. Replaces cli.sh + lib/*.sh with a claude_bottle/ Python package and a cli.py entry point. No external dependencies; uses only Python's stdlib (json, subprocess, getpass, tempfile, argparse, re, etc.). claude_bottle/{log,docker,manifest,env_resolve,network,pipelock,skills,ssh,cli}.py mirror the previous lib/*.sh modules. Tests converted to unittest under tests/test_*.py with a stdlib runner at tests/run_tests.py (unit | integration | path). .githooks/commit-msg ported to Python; same Conventional Commits rules. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-08 15:26:58 +00:00
didericis	ba7616a4ae	PRD 0001: Per-agent egress proxy via pipelock (#1)	2026-05-08 01:56:43 -04:00

26 Commits
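
The slug-based naming helpers described in commits b49281800a and 1269edf311 can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the log gives only the output shapes (`claude-bottle-pipelock-<slug>`, `http://<container>:<port>`, `<container>:<port>`) and describes the slug as a lowercased, hyphen-normalized agent identifier. The `slugify` helper, the exact normalization rule, the function signatures, and the port value `8888` are all assumptions made for the example.

```python
import re

# Assumption: the log never states the real PIPELOCK_PORT value; 8888 is a placeholder.
PIPELOCK_PORT = 8888


def slugify(agent_name: str) -> str:
    """Hypothetical helper: lowercase the name and collapse runs of
    non-alphanumeric characters into single hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", agent_name.lower()).strip("-")


def pipelock_container_name(slug: str) -> str:
    # Shape taken from the commit message: claude-bottle-pipelock-<slug>
    return f"claude-bottle-pipelock-{slug}"


def pipelock_proxy_host_port(slug: str) -> str:
    # Shape taken from the commit message: <container>:<port>
    return f"{pipelock_container_name(slug)}:{PIPELOCK_PORT}"


def pipelock_proxy_url(slug: str) -> str:
    # Shape taken from the commit message: http://<container>:<port>
    return f"http://{pipelock_proxy_host_port(slug)}"


if __name__ == "__main__":
    slug = slugify("My_Agent 01")
    print(pipelock_container_name(slug))
    print(pipelock_proxy_url(slug))
```

Deriving every per-agent resource name from one slug is what lets `stop` take the resolved container name directly, as commit ff962d2893 describes, without knowing the naming convention itself.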