bot-bottle

Author	SHA1	Message	Date
didericis-claude	3d557beeee	refactor(backend): move per-provider provisioning onto AgentProvider BottleBackend.provision now resolves the provider plugin from the plan and dispatches prompt / skills / declarative-apply / supervise-mcp through it. The four hooks the docker + smolmachines backends used to override (provision_skills, provision_prompt, provision_provider_auth, provision_supervise) are gone — the duplicated 50-line implementations under backend/{docker,smolmachines}/provision/{skills,prompt, provider_auth,supervise}.py are deleted. Each backend gains a small supervise_mcp_url(plan) override so the provider plugin can run `claude mcp add` / `codex mcp add` against the right URL: docker returns http://{SUPERVISE_HOSTNAME}:{SUPERVISE_PORT}/ on the compose network alias; smolmachines returns plan.agent_supervise_url which launch.py already pins to a host-loopback port. Removes tests/unit/test_provision_supervise.py — the URL it asserted on now lives on the backend, with no equivalent standalone surface to test against (it's covered by the broader plan / launch integration tests).	2026-06-03 21:38:13 -04:00
didericis-claude	0efc07ba67	refactor(backend): pass Bottle to provisioners instead of target string test / unit (pull_request) Successful in 50s Details test / integration (pull_request) Successful in 59s Details test / unit (push) Successful in 43s Details test / integration (push) Successful in 1m3s Details Closes #178. The backend provision functions now receive a Bottle handle with exec / cp_in methods instead of a raw target string. Provisioner modules use bottle.exec and bottle.cp_in in place of inlined subprocess.run(["docker", "exec"/"cp", ...]) and direct _smolvm.machine_cp / machine_exec calls. This decouples the provisioners from backend-specific runtime primitives so future refactors (e.g. the supervise rework) can swap the bottle's exec implementation without touching every provisioner. Each launch.py constructs the Bottle handle before calling provision so it can be passed in; provision_prompt's return value is wired back onto the bottle's prompt path attribute after the fact.	2026-06-03 20:47:37 +00:00
didericis-codex	d3bc463295	fix(cli): remove supervise queue highlight test / unit (pull_request) Successful in 43s Details test / integration (pull_request) Successful in 52s Details	2026-06-03 17:31:19 +00:00
didericis-codex	63a7e63ce9	test(cli): clean up supervise test naming test / unit (pull_request) Successful in 40s Details test / integration (pull_request) Successful in 48s Details	2026-06-03 17:26:15 +00:00
didericis-codex	41570e04c0	test(cli): update supervise triage coverage test / unit (pull_request) Successful in 40s Details test / integration (pull_request) Successful in 51s Details	2026-06-03 17:25:09 +00:00
didericis-claude	0b5d59cf9e	feat(prd-0048): implement SSH deploy-key provisioning with contrib/gitea - manifest_git.py: add ProvisionedKeyConfig dataclass; extend GitEntry with ProvisionedKey field (optional); make IdentityFile default to "" so provisioned_key entries can be constructed without a static path; add _parse_provisioned_key_config; update from_repos_entry to accept provisioned_key as an alternative to identity (mutually exclusive, parser rejects both-or-neither) - deploy_key_provisioner.py (new): DeployKeyProvisioner ABC with create() and delete() abstract methods; get_provisioner() factory with lazy contrib import for gitea - contrib/gitea/deploy_key_provisioner.py (new): GiteaDeployKeyProvisioner generating ed25519 keypairs via ssh-keygen and managing them through the Gitea deploy-key API (POST/DELETE); 404 on delete is success; all other errors raise RuntimeError - git_gate.py: add _provision_dynamic_key() called in GitGate.prepare() for entries with ProvisionedKey — generates key, writes private key and key ID files to stage_dir, patches GitGateUpstream.identity_file; add revoke_git_gate_provisioned_keys() for teardown — raises on failure - docker/launch.py: call revoke_git_gate_provisioned_keys() in teardown() after stack.close() so revocation runs after containers stop and failures propagate (not suppressed) - smolmachines/launch.py: extract _teardown_smolmachines() helper that catches stack.close() errors (warn + re-raise) then calls revocation; same fatal-on-failure contract as docker backend - test_manifest_git.py: 9 new cases for provisioned_key parsing - test_deploy_key_provisioner.py (new): factory smoke tests - test_contrib_gitea_deploy_key.py (new): create/delete/error/split tests Closes #169	2026-06-03 11:58:36 -04:00
didericis-claude	f0ca4e3527	refactor: extract dashboard state/model layer into dashboard_model.py test / unit (pull_request) Successful in 41s Details test / integration (pull_request) Successful in 47s Details test / unit (push) Successful in 35s Details test / integration (push) Successful in 44s Details Splits the 2103-line dashboard.py into two modules. Pure data structures (QueuedProposal), discovery helpers (discover_pending, discover_active_agents), derived-value helpers (_is_recent, _approval_status, _format_agent_row, _detail_lines, etc.), and argv-builder helpers (_build_split_pane_argv, _build_respawn_pane_argv, _build_resume_argv_with_fallback, _agent_runtime_args) all move to dashboard_model.py. The curses TUI, $EDITOR integration, tmux subprocess flows, and action handlers (approve, reject, operator_edit_routes, operator_edit_allowlist) remain in dashboard.py, which re-imports everything from dashboard_model so existing callers and tests are unaffected. Adds tests/unit/test_dashboard_model.py covering _approval_status, _proposed_payload_label, and _suffix_for_tool — three helpers that had no prior coverage. All 894 unit tests pass. Closes #158	2026-06-03 15:52:27 +00:00
didericis-claude	ca6d257f30	test(git-gate): add shell-escaping regression tests (issue #159 ) test / unit (pull_request) Successful in 36s Details test / integration (pull_request) Successful in 44s Details test / unit (push) Successful in 35s Details test / integration (push) Successful in 42s Details Cover all six pathological character classes (single-quote, double-quote, space, semicolon, newline, backtick) in both upstream URL and name positions. Each case validates rendered output via `sh -n` and asserts the original value is preserved verbatim after shlex.quote encoding. Also add `sh -n` smoke tests for the static pre-receive and access-hook scripts.	2026-06-03 14:51:23 +00:00
didericis-claude	cc0c952d0b	fix(security): harden git_gate.py shell rendering with shlex.quote and name validation test / unit (pull_request) Successful in 35s Details test / integration (pull_request) Successful in 44s Details test / unit (push) Successful in 32s Details test / integration (push) Successful in 41s Details Use shlex.quote() on name and upstream_url in git_gate_render_entrypoint() so special characters (single quotes, spaces, semicolons) cannot break or inject into the generated sh script. Add _GIT_NAME_RE validation in GitEntry.from_repos_entry() to restrict repo names to [A-Za-z0-9._-]+, making the manifest the first line of defence and shlex.quote() the belt-and-suspenders backstop. Closes #155	2026-06-03 04:40:21 +00:00
didericis-claude	9282bceaf8	fix: emit WARNING when Docker teardown ExitStack raises (issue #156 ) test / unit (pull_request) Successful in 37s Details test / integration (pull_request) Successful in 40s Details test / unit (push) Successful in 32s Details test / integration (push) Successful in 43s Details Replace the bare `except BaseException: pass` in the `teardown` closure with a `warn()` call that includes the container name and operation type ("compose-down"), so cleanup failures are visible in the log rather than silently discarded. Non-blocking: the exception is consumed and teardown continues, preserving the original error-propagation contract. Add test_docker_launch_teardown.py to lock the new behaviour: it injects a RuntimeError via a mocked `compose_down` callback and asserts the WARNING message contains the container name and operation label.	2026-06-03 04:13:53 +00:00
didericis-claude	4cf2cfc55d	test: update test suite for git-gate manifest redesign (PRD 0047) - fixtures.py: fixture_with_git_dict uses git-gate.repos + url/identity/host_key - test_manifest_git: rewrite to use git-gate.repos; replace duplicate-name test (names = dict keys, always unique) with two-repos-different-hosts test - test_manifest_git_user: _manifest → git-gate.user; update error message assertions - test_manifest_agent_git_user: git → git-gate throughout; repos rejection test - test_manifest_extends: git.remotes/git.user → git-gate.repos/git-gate.user - test_provision_git: IP test updated — no host alias, single insteadOf - test_compose: git.remotes → git-gate.repos + new field names - test_docker_provision_git_user: git.user → git-gate.user - test_git_gate: inline manifest dict updated to git-gate.repos - test_smolmachines_provision: git_json → git_gate_json; remove _remote_host	2026-06-02 23:59:34 -04:00
didericis	f427d35e72	fix(git-http): log access-hook denial detail to stdout test / unit (pull_request) Successful in 33s Details test / integration (pull_request) Successful in 39s Details test / unit (push) Successful in 43s Details test / integration (push) Successful in 59s Details Previously when the access-hook returned non-zero, git-http would pipe the hook's stderr into the 403 body sent back to the agent's git client but never log it locally, so docker logs just showed `"GET ... 403 -"` with no explanation. Operators had to shell into the sidecar and re-run the hook by hand to find out why a clone was being refused (e.g. upstream SSH unreachable, missing credentials). Route the hook's stderr/stdout through the existing log_message channel before sending the 403, one log line per output line so the default request-log format stays readable. When the hook exits non-zero with no output, log the exit code so the line is still informative. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-02 23:29:39 -04:00
didericis-codex	941f316462	feat(git-gate): remove git remote host override plumbing	2026-06-02 18:17:24 +00:00
didericis	3885e2f5ad	fix(workspace): include hidden cwd files in docker layer test / unit (pull_request) Successful in 32s Details test / integration (pull_request) Successful in 42s Details test / unit (push) Successful in 33s Details test / integration (push) Successful in 40s Details	2026-06-02 13:12:40 -04:00
didericis-codex	d5fcbe53ef	feat(workspace): port cwd across backends	2026-06-02 17:01:19 +00:00
didericis-codex	6150497b47	feat(workspace): trust resolved project path	2026-06-02 16:57:52 +00:00
didericis-codex	5308d53288	feat(workspace): add shared workspace plan	2026-06-02 16:56:57 +00:00
didericis	44273be9eb	fix(dashboard): stop agents in dashboard from moving during selection test / unit (push) Successful in 38s Details test / integration (push) Successful in 58s Details	2026-06-02 12:47:00 -04:00
didericis	0432a5d3ff	fix(codex): keep dummy auth refresh timestamp valid test / unit (push) Successful in 49s Details test / integration (push) Successful in 1m0s Details	2026-06-02 12:40:14 -04:00
didericis-claude	a0762ac2d3	test: add cross-backend print parity tests (PRD 0044) Shared fixtures build DockerBottlePlan and SmolmachinesBottlePlan from identical git_gate_plan and egress_plan inputs and assert that both backends render the same git gate lines (name → host:port) and egress lines (host [auth:scheme] when authenticated, host alone otherwise).	2026-06-02 12:12:08 -04:00
didericis-claude	5a2011c48f	fix: close child stdout pipes on restart and loop convergence (PRD 0043) Closes #140. In restart_daemon, the old process's stdout pipe was never explicitly closed after p.wait() returned, leaking the fd until the supervisor object was GC'd. Similarly, when the watch loop converged (all children dead), no pipe was closed. Both paths now call p.stdout.close() immediately after the process is confirmed exited. Tests enforce this with warnings.simplefilter("error", ResourceWarning) in TestSupervisor.setUp.	2026-06-02 11:48:24 -04:00
didericis-claude	cceb300d58	test: add cross-backend parity tests (PRD 0042) Closes #139. Adds tests/unit/test_backend_parity.py which verifies that DockerBottle and SmolmachinesBottle expose identical observable contracts for agent_argv shape, env injection, exec user-switching, ExecResult fields, and close() idempotency. All assertions use mock subprocess layers — no live Docker daemon or VM required.	2026-06-02 11:30:54 -04:00
didericis-claude	96b0c3f1fa	fix(git-http): validate Content-Length and cap body size (PRD 0041) Before this change, int() on a non-numeric Content-Length raised an unhandled ValueError, crashing the request handler. There was also no upper bound on how much memory a POST body could consume. After this change: - Non-numeric or missing Content-Length returns HTTP 400. - Negative Content-Length returns HTTP 400. - Bodies declared larger than 1 MiB (_MAX_BODY_BYTES) return HTTP 413, matching the cap already in supervise_server.py. Closes #138	2026-06-02 11:23:19 -04:00
didericis-claude	a3d9ac9605	feat: persist backend in BottleMetadata; use it in resume and dashboard reattach (PRD 0040) BottleMetadata gains a backend field (default ""). Docker prepare writes "docker"; smolmachines prepare writes "smolmachines". read_metadata deserialises it with "" as the backward-compatible default. resume now passes metadata.backend to _launch_bottle so a preserved smolmachines bottle is resumed on the right backend without requiring BOT_BOTTLE_BACKEND to be set manually. _bottle_for_slug now reads metadata.backend and constructs a SmolmachinesBottle for smolmachines slugs instead of always defaulting to DockerBottle. No-metadata slugs still fall back to Docker. Closes #137	2026-06-02 11:16:17 -04:00
didericis-claude	e5b5dd16f1	feat(dashboard): guard capability-block approval for smolmachines bottles (PRD 0039) apply_capability_change is Docker-only teardown/apply code. Before this change it was called regardless of backend, so approving a capability-block proposal from a smolmachines agent would run Docker commands against a slug that has no Docker container. After this change approve() reads the bottle's metadata: if compose_project is empty (the smolmachines indicator) it raises CapabilityApplyError with a clear operator message before any teardown runs. Docker bottles (non-empty compose_project) and unknown bottles (no metadata) fall through to the existing Docker path unchanged. Closes #136	2026-06-02 11:15:27 -04:00
didericis-claude	8830306101	feat(smolmachines): resolve manifest env through resolve_env() (PRD 0038) Before this change smolmachines prepare.py spliced bottle.env directly into guest_env, so ?prompt and ${HOST_VAR} entries reached the VM as raw sentinels rather than being prompted or interpolated. After this change prepare.py calls resolve_env(), matching the Docker backend's contract. Forwarded (secret/interpolated) values still flow through smolvm -e K=V argv — the known exposure gap documented in PRD 0038's open question. Closes #135	2026-06-02 14:38:36 +00:00
didericis-codex	6e954da9b7	fix(pipelock): validate yaml render config	2026-06-02 08:15:20 +00:00
didericis-codex	0a8bba58c7	fix(codex): harden auth redaction	2026-06-02 08:10:34 +00:00
didericis-codex	82ce5d3034	fix(supervise): bound response waits	2026-06-02 08:06:45 +00:00
didericis-codex	31708abfad	fix(sidecar): queue restart signals	2026-06-02 07:52:19 +00:00
didericis-codex	8f28bd81a7	refactor(manifest): split schema boundaries	2026-06-02 07:32:06 +00:00
didericis-claude	a81f0ffa49	fix(smolmachines): raise SmolvmError instead of die() on wait_exec_ready timeout test / unit (pull_request) Successful in 39s Details test / integration (pull_request) Successful in 58s Details test / unit (push) Successful in 38s Details test / integration (push) Successful in 55s Details die() raises Die(SystemExit), which implies a process exit. A timeout in wait_exec_ready is a bringup failure — raising SmolvmError lets the caller decide whether it's fatal, consistent with how machine_start failures propagate.	2026-06-02 06:29:05 +00:00
didericis-claude	0d922371b0	refactor(smolmachines): decompose launch(), add wait_exec_ready, file-lock allocate() (PRD 0032) Decompose the 207-line launch() into six named helpers: _allocate_resources, _mint_certs, _start_bundle, _discover_urls, _launch_vm, _init_vm. Each has explicit inputs/outputs and is independently testable. Replace time.sleep(1.5) with smolvm.wait_exec_ready(), which polls `machine exec true` with exponential backoff. Exits as soon as the exec channel is ready; dies loudly with a timeout message instead of silently leaving the VM in an unknown state. File-lock loopback_alias.allocate() with fcntl.flock(LOCK_EX) so concurrent bottle launches can't race on docker state and claim the same alias.	2026-06-02 06:23:39 +00:00
didericis-claude	10d0872043	refactor(egress): provisioned-wins merge + _route_to_yaml_fields (PRD 0031) Replace _merge_provider_route's five-case nested conditional with a flat provisioned-wins merge: provider routes claim their hosts outright, manifest routes for unclaimed hosts append unchanged. Token slot assignment moves to a single _assign_token_slots pass over the merged list. Add _route_to_yaml_fields as the single authoritative EgressRoute→YAML mapping, eliminating the risk of EgressRoute and egress_addon_core.Route silently drifting apart when new fields are added. egress_manifest_routes is now a pure lifter with no slot assignment. _merge_provider_route and _find_or_alloc_token_env are removed. Tests updated: conflict-die case removed, upgrade-bare replaced with provider-wins semantics, slot-assignment tests moved to TestSlotAssignment.	2026-06-02 05:45:20 +00:00
didericis-claude	0e29bcc829	refactor(egress): use provisioned_env instead of sentinel for Codex token (PRD 0030) test / unit (pull_request) Successful in 39s Details test / integration (pull_request) Successful in 45s Details Add `provisioned_env: dict[str, str]` to `AgentProvisionPlan`. When `forward_host_credentials=True`, `agent_provision_plan` reads the host Codex access token at prepare time and stores it under `CODEX_HOST_CREDENTIAL_TOKEN_REF`. Both backends merge `provisioned_env` over `os.environ` before calling `egress_resolve_token_values`, so the token slot resolves like any other manifest-declared token ref. Removes `egress_resolve_token_values_with_provider` and the sentinel `continue` skip from `egress_resolve_token_values`. The function is now fully generic — it neither knows nor cares about provider identity.	2026-06-02 04:53:23 +00:00
didericis-claude	75f0f9d907	refactor(egress): deduplicate token resolution across backends (PRD 0030) Extract egress_resolve_token_values_with_provider into bot_bottle/egress.py. Both docker and smolmachines launch paths now call the shared function instead of duplicating the forward_host_credentials / CODEX_HOST_CREDENTIAL_TOKEN_REF resolution block. Also fixes the host_env: object annotation on smolmachines._resolve_token_env to the correct dict[str, str]. Closes #118.	2026-06-02 04:22:43 +00:00
didericis	2dd8113f7c	fix(smolmachines): retry CA install after exec SIGKILL test / unit (push) Successful in 38s Details test / integration (push) Successful in 54s Details	2026-06-01 23:28:33 -04:00
didericis-codex	36e3443d2e	fix(codex): defer workspace trust handling test / unit (pull_request) Successful in 29s Details test / integration (pull_request) Successful in 42s Details test / unit (push) Successful in 30s Details test / integration (push) Successful in 44s Details	2026-06-02 03:11:51 +00:00
didericis-codex	d6ebd0d2eb	fix(egress): skip token slots for unauth provider routes test / unit (pull_request) Successful in 30s Details test / integration (pull_request) Successful in 43s Details	2026-06-02 03:06:10 +00:00
didericis-claude	938a0e05d6	refactor(manifest): remove codex_auth egress role Both provider-owned roles are now gone. Provider auth routes are provisioner-owned (claude: auth_token, codex: forward_host_credentials); the role field and validation plumbing stay for future use but EGRESS_ROLES is empty. Any manifest declaring a role now fails at parse time. Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-claude	f32b7eb299	fix(agent): always emit passthrough egress route for api.anthropic.com Mirrors the Codex pattern: Claude always gets a tls_passthrough route for api.anthropic.com so user-set tokens aren't stripped by pipelock, whether or not auth_token is declared. Auth injection (scheme + token_ref) and the placeholder env only apply when auth_token is set. Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-claude	de9bd7eb83	feat(manifest): add agent_provider.auth_token for Claude OAuth via egress Operators can now declare: agent_provider: template: claude auth_token: BOT_BOTTLE_CLAUDE_OAUTH_TOKEN and the provisioner injects a provider-owned api.anthropic.com egress route (Bearer, tls_passthrough) rather than requiring a manually declared route with the former claude_code_oauth role. Changes: - Add auth_token field to AgentProvider; validate claude-only. - Remove claude_code_oauth from EGRESS_ROLES / PROVIDER_EGRESS_ROLES. Manifests that declare the role now fail at parse time with "unknown role" — the provisioner owns the route. - agent_provision_plan: replace manifest_egress_routes/has_provider_auth with auth_token; Claude branch injects the api.anthropic.com route, placeholder env, and nonessential-traffic flags when auth_token is set. - Add hidden_env_names: frozenset[str] to AgentProvisionPlan; Claude branch populates it with CLAUDE_CODE_OAUTH_TOKEN. - Remove auth_role from AgentProviderRuntime and placeholder_env_for(). - print_util.visible_agent_env_names: accept hidden_env_names from the plan instead of dispatching on agent_provider_template. - Both backends: drop manifest_egress_routes call, pass auth_token. - PRD 0029 rescoped to cover both Codex and Claude provider auth. Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-claude	952dcd7eec	refactor(agent): move placeholder env injection into agent_provision_plan The has_provider_auth check and egress-placeholder injection were duplicated in both backends. Move them into agent_provision_plan so the provisioner owns that decision entirely: - Replace has_provider_auth: bool param with manifest_egress_routes, compute has_provider_auth internally from the route roles. - Inject CLAUDE_CODE_OAUTH_TOKEN=egress-placeholder inside the plan when has_provider_auth, alongside the existing nonessential-traffic vars. Backends no longer touch the placeholder env. - Remove placeholder_env from AgentProviderRuntime; expose placeholder_env_for() for print_util's hide-from-summary logic. Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-claude	59df0b0f0f	fix(codex): emit passthrough egress routes when not forwarding host credentials When forward_host_credentials is false, Codex bottles should still get tls_passthrough routes for the OpenAI/ChatGPT hosts so that tokens a user sets via `codex login` after launch aren't stripped by pipelock's header DLP. Previously no routes were emitted, which would have blocked those requests entirely once pipelock enforcement tightens. Rename the test to reflect the new expected behavior. Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-claude	884cedc160	refactor: provision egress routes via AgentProvisionPlan Remove provider-specific branching from egress.py and pipelock.py. Previously, `egress_routes_for_bottle` and `pipelock_effective_tls_passthrough` both contained `template == "codex"` checks — the same pattern the rest of the PR moved out of the backends. Root cause: `EgressRoute` had no `tls_passthrough` field, so pipelock couldn't learn from the synthesised Codex routes that they needed passthrough. Fix: - Add `EgressRoute.tls_passthrough: bool`. `egress_manifest_routes` lifts the existing `pipelock.tls_passthrough` manifest flag here; provider routes set it directly. - Add `AgentProvisionPlan.egress_routes`. `agent_provision_plan` populates it for Codex + `forward_host_credentials`, including `tls_passthrough=True`. - Replace Codex-specific `egress_routes_for_bottle` logic with a generic `_merge_provider_route` helper. Backends call `egress_routes_for_bottle(bottle, plan.egress_routes)`; no provider type checks inside egress or pipelock. - Rewrite `pipelock_effective_tls_passthrough` to read `route.tls_passthrough` from the merged route set instead of re-implementing the provider check. - Both backends now call `agent_provision_plan` before `Egress.prepare` and `PipelockProxy.prepare`, threading `plan.egress_routes` to both. `has_provider_auth` is derived from `egress_manifest_routes` (manifest routes only — provider routes carry no auth roles, so the result is identical). Assisted-by: Claude Code	2026-06-01 22:24:17 -04:00
didericis-codex	76a7921ae6	refactor(agent): move claude env defaults into plan	2026-06-01 22:24:17 -04:00
didericis-codex	c8ab0c67a8	refactor(agent): surface provider env defaults	2026-06-01 22:24:17 -04:00
didericis-codex	e808e81b87	refactor(agent): group provider provisioning into plan	2026-06-01 22:24:17 -04:00
didericis-codex	36ce7aed4f	refactor(codex): derive trusted paths from guest home	2026-06-01 22:24:17 -04:00
didericis-codex	a5d83bdcdc	fix(codex): trust launch home directory	2026-06-01 22:24:17 -04:00

1 2 3 4 5 ...

273 Commits