T

test / run tests/run_tests.py (pull_request) Failing after 31s

Details

docs: document CI status check and main branch protection

Add a Gitea Actions test-status badge plus a short README "CI" section
covering how to read the check and what to do when it's red. Capture
the (out-of-tree) branch-protection rule on `main` in docs/ci.md so
the gate that requires the test check is reproducible from the repo
alone — covers both the Gitea UI path and the equivalent API call.

Refs: PRD 0002

Assisted-by: Claude Code

2026-05-08 20:21:54 -04:00

.claude/skills/init-entry

Initial commit

2026-05-07 22:45:36 -04:00

.gitea/workflows

ci: add Gitea Actions workflow running the full test suite

2026-05-08 20:17:12 -04:00

.githooks

refactor: convert project from bash to Python

2026-05-08 15:26:58 +00:00

claude_bottle

chore: tidy gitignore, Dockerfile indent, unused import

2026-05-08 14:51:59 -04:00

docs

docs: document CI status check and main branch protection

2026-05-08 20:21:54 -04:00

tests

refactor: convert project from bash to Python

2026-05-08 15:26:58 +00:00

.dockerignore

Initial commit

2026-05-07 22:45:36 -04:00

.gitignore

chore: tidy gitignore, Dockerfile indent, unused import

2026-05-08 14:51:59 -04:00

CLAUDE.md

docs: trim CLAUDE.md to minimal orientation

2026-05-08 14:58:46 -04:00

cli.py

refactor: convert project from bash to Python

2026-05-08 15:26:58 +00:00

Dockerfile

chore: tidy gitignore, Dockerfile indent, unused import

2026-05-08 14:51:59 -04:00

README.md

docs: document CI status check and main branch protection

2026-05-08 20:21:54 -04:00

README.md

claude-bottle

Spins up an isolated container for running Claude Code with a curated set of skills and env vars.

Why "claude-bottle"?

Each container is a bottle; Claude is the genie inside. The genie has broad powers within the bottle — read, write, run anything — but it cannot escape to the host. You uncork one bottle per agent (./cli.sh start <agent>), many bottles run in parallel, and each one's powers are scoped to what the manifest grants it: a curated set of skills, env vars, and a starting prompt. When the session ends the bottle is destroyed and the genie does not persist.

Goals

Minimize risk of running claude with full permissions
Allow me to easily spin up agent tasks in parallel
Create isolated, well defined, easily updated, shareable agents

Non-goals

Communicating between agents directly
Self hosted VMs (v1 uses local Docker containers, not VMs)
Advanced agent auditing (lean on git history for auditing)

Quickstart

Requires Docker on the host and a long-lived Claude Code OAuth token in your shell env.

./cli.sh start <agent>   # builds the image on first run, drops you into claude

The container is removed automatically when the session ends. If the script is killed with SIGKILL the exit trap won't fire and the container may be left running; remove it with docker rm -f <container-name>.

CI

Every push to a PR (and every push to main) runs the full test suite on Gitea Actions via .gitea/workflows/test.yml. The status badge at the top of this README links to the workflow's run history; the same check appears on each PR.

Reading the check:

Green — tests/run_tests.py exited 0 on the runner.
Red — at least one test failed (or the workflow itself errored). Click through to the run, expand the failing step, and read the unittest output. Reproduce locally with tests/run_tests.py (or tests/run_tests.py unit if you don't have Docker handy); integration tests skip cleanly when Docker isn't reachable.
Skipped tests — expected when the runner has no Docker daemon. They still leave the job green; if you actually want them executed, ensure Docker is available on the runner host.

Pushing a fix to a red PR re-triggers the workflow automatically — no manual rerun needed. Branch protection on main requires this check to be green before the merge button is enabled; see docs/ci.md for how those rules are configured.

Egress

Agent containers route HTTP / HTTPS traffic through a per-agent pipelock sidecar attached to a Docker --internal network. The sidecar enforces a hostname allowlist, runs DLP scanning (48 default credential patterns), and detects URL-embedded high-entropy secret leaks. Without the proxy the agent has no route off-box at all — the internal network has no default gateway. The sidecar and network are torn down with the agent on session exit.

The effective allowlist is the union of a baked-in default for Claude Code's required hosts (api.anthropic.com, claude.ai, ...) and the optional bottles.<name>.egress.allowlist field in claude-bottle.json:

{
  "bottles": {
    "default": {
      "env":    { },
      "ssh":    [ ],
      "egress": { "allowlist": ["github.com"] }
    }
  }
}

The resolved allowlist is shown in the y/N preflight before launch. See docs/prds/0001-per-agent-egress-proxy-via-pipelock.md for the design and docs/research/pipelock-assessment.md for the rationale.

Auth: OAuth token, not API key

claude-bottle authenticates claude inside the container with the same Pro/Max subscription you already use on the host, via a long-lived OAuth token. No ANTHROPIC_API_KEY is needed.

Why a token instead of mounting ~/.claude.json: on macOS, Claude Code stores OAuth credentials in the encrypted Keychain, not in ~/.claude.json. Mounting that file into a Linux container does not carry the credentials with it. Linux hosts keep credentials in ~/.claude/.credentials.json, but to keep the launcher portable claude-bottle uses the env-var path on every host.

One-time setup on the host:

claude setup-token   # browser login, prints a ~1-year OAuth token

Stash the token in your shell env (e.g. ~/.zshrc or a secret manager) as CLAUDE_BOTTLE_OAUTH_TOKEN:

export CLAUDE_BOTTLE_OAUTH_TOKEN="<token>"

cli.sh automatically forwards it to every container as CLAUDE_CODE_OAUTH_TOKEN via docker run -e — no manifest wiring required, and the value is never written to disk or placed on argv.

Inside the container, claude picks up CLAUDE_CODE_OAUTH_TOKEN and authenticates against your subscription. Caveats: the token is bound to your subscription tier (Pro/Max/Team/Enterprise), it does not work with claude --bare (which only reads ANTHROPIC_API_KEY), and if it leaks, regenerate via claude setup-token again. Reference: https://code.claude.com/docs/en/authentication.

Description

Lightweight, self-hosted sandbox for AI coding agents that protects against prompt-injected or misbehaving agents: all egress traffic is TLS-inspected and secret-scanned, and credentials are injected at the proxy so the agent never sees them. No third-party platform in the loop, no trust required.

Readme Apache-2.0 30 MiB