bot-bottle

didericis/bot-bottle

Fork 0

Commit Graph

Author	SHA1	Message	Date
didericis	73939861f9	docs(prd-0022): resolve open Qs 2, 4, 5 (DNS, gitleaks order, CI) test / unit (pull_request) Successful in 18s Details test / integration (pull_request) Successful in 1m7s Details User feedback: - Q2 (direct DNS resolver test): yes — test 4 grows a second sub-assertion verifying `dig @8.8.8.8` from the agent has no path out, alongside the existing crafted-subdomain check. - Q4 (gitleaks ordering): test 5 grows an ordering check — asserts the rejection mentions `gitleaks` AND does NOT mention upstream-network-phase phrases (resolve / refused / unreachable / upstream). Confirms gitleaks rejects BEFORE git-gate tries any upstream push. - Q5 (CI): try it, accept fallback. New chunk 6 adds a Gitea Actions job marked `continue-on-error: true` — runs the suite if the runner can host compose, doesn't block the workflow if docker-in-docker prevents it. Three open questions remain (1: pipelock's actual DLP coverage for non-body shapes; 3: realistic fake secret shape vs. gitleaks regex; 6+7: backend-agnostic invocation + required tools — for the smolmachines work).	2026-05-26 22:04:46 -04:00
didericis	62f6716e8d	docs(prd-0022): end-to-end sandbox-escape integration test test / unit (pull_request) Successful in 19s Details test / integration (pull_request) Successful in 1m9s Details Draft a PRD for a composite integration test that brings up a real bottle with a known allowlist + planted secret and runs five attacks from inside the agent container: 1. Request to non-allowlisted hostname 2. Request to non-allowlisted IP (incl. host-header spoof) 3. Secret exfil via HTTP — path / query / body / headers 4. Secret exfil via crafted DNS subdomain 5. Secret exfil via README link pushed through git-gate Each attack passes only when blocked with a permissions error. The suite is backend-agnostic — runs against whatever CLAUDE_BOTTLE_BACKEND selects — so it becomes the gate the upcoming smolmachines spike has to pass before that backend can substitute for Docker. Sized into 5 chunks (fixture → attacks 1+2 → attack 3 → attack 4 → attack 5). Seven open questions called out, biggest being: today's pipelock probably leaks via header / path / query because DLP only scans bodies — the test will expose this as a real gap (chunk 3 lands with `expectedFailure` markers if so).	2026-05-26 21:52:24 -04:00

Author

SHA1

Message

Date

didericis

73939861f9

docs(prd-0022): resolve open Qs 2, 4, 5 (DNS, gitleaks order, CI)

test / unit (pull_request) Successful in 18s

Details

test / integration (pull_request) Successful in 1m7s

Details

User feedback:

  - Q2 (direct DNS resolver test): yes — test 4 grows a
    second sub-assertion verifying `dig @8.8.8.8` from the
    agent has no path out, alongside the existing
    crafted-subdomain check.
  - Q4 (gitleaks ordering): test 5 grows an ordering check
    — asserts the rejection mentions `gitleaks` AND does
    NOT mention upstream-network-phase phrases (resolve /
    refused / unreachable / upstream). Confirms gitleaks
    rejects BEFORE git-gate tries any upstream push.
  - Q5 (CI): try it, accept fallback. New chunk 6 adds a
    Gitea Actions job marked `continue-on-error: true` —
    runs the suite if the runner can host compose, doesn't
    block the workflow if docker-in-docker prevents it.

Three open questions remain (1: pipelock's actual DLP
coverage for non-body shapes; 3: realistic fake secret
shape vs. gitleaks regex; 6+7: backend-agnostic invocation
+ required tools — for the smolmachines work).

2026-05-26 22:04:46 -04:00

didericis

62f6716e8d

docs(prd-0022): end-to-end sandbox-escape integration test

test / unit (pull_request) Successful in 19s

Details

test / integration (pull_request) Successful in 1m9s

Details

Draft a PRD for a composite integration test that brings up
a real bottle with a known allowlist + planted secret and
runs five attacks from inside the agent container:

  1. Request to non-allowlisted hostname
  2. Request to non-allowlisted IP (incl. host-header spoof)
  3. Secret exfil via HTTP — path / query / body / headers
  4. Secret exfil via crafted DNS subdomain
  5. Secret exfil via README link pushed through git-gate

Each attack passes only when blocked with a permissions
error. The suite is backend-agnostic — runs against
whatever CLAUDE_BOTTLE_BACKEND selects — so it becomes the
gate the upcoming smolmachines spike has to pass before that
backend can substitute for Docker.

Sized into 5 chunks (fixture → attacks 1+2 → attack 3 →
attack 4 → attack 5). Seven open questions called out,
biggest being: today's pipelock probably leaks via header /
path / query because DLP only scans bodies — the test will
expose this as a real gap (chunk 3 lands with
`expectedFailure` markers if so).

2026-05-26 21:52:24 -04:00

2 Commits