Community-contributed kits for Docker Sandboxes.
Each top-level directory is a kit — a declarative artifact containing a spec.yaml and optional files/ directory that extends sandbox agents with additional capabilities.
- Kits overview — what kits are and how to use them
- Kit examples — reference examples for common kit patterns
- Build your own agent kit — step-by-step tutorial using the
ampkit in this repo
Contributing a kit or a fix? Read CONTRIBUTING.md first — this repo enforces verified commit signatures, so you'll need GPG or SSH signing set up before your PR can be merged.
Note
Kits are experimental. The kit file format, CLI commands, and experience for creating, loading, and managing kits are subject to change as the feature evolves. Bugs and feature requests for the kits in this repo belong in its issue tracker; general feedback on the kit feature itself goes to docker/sbx-releases.
Kits are passed to sbx run (or sbx create) via --kit. The flag accepts a local path, an OCI registry reference, a ZIP archive, or a git+... URL.
The most common form is a git URL targeting this repo:
$ sbx run --kit "git+https://github.com/docker/sbx-kits-contrib.git#dir=code-server" claudeThe fragment after # accepts two parameters, both optional:
| Parameter | Purpose | Example |
|---|---|---|
dir |
Subdirectory inside the repo containing the kit | #dir=code-server |
ref |
Git ref to check out — branch, tag, or commit SHA | #ref=v1.0.0 |
Combine them with &:
# Pin to a tag — the recommended form for production use
$ sbx run --kit "git+https://github.com/docker/sbx-kits-contrib.git#ref=v0.2.0&dir=code-server" claude
# Track a branch (less stable; the kit may change under you)
$ sbx run --kit "git+https://github.com/docker/sbx-kits-contrib.git#ref=main&dir=code-server" claude
# Pin to an exact commit SHA — fully reproducible
$ sbx run --kit "git+https://github.com/docker/sbx-kits-contrib.git#ref=abc1234&dir=code-server" claudeWithout ref, sbx clones the default branch shallowly. With a branch or tag, sbx clones at that ref shallowly. With a commit SHA, sbx clones fully and checks out the commit.
You can also use SSH instead of HTTPS for private repos:
$ sbx run --kit "git+ssh://git@github.com/docker/sbx-kits-contrib.git#dir=code-server" claudeFor local development, point --kit at a directory:
$ sbx run --kit ./code-server/ claudesbx-kits-contrib/
├── spec/ # Kit artifact types, loading, and validation (importable library)
├── tck/ # Technology Compatibility Kit — test suite using testcontainers-go
├── pi/ # Pi coding agent kit
├── nanobot/ # Nanobot assistant kit
├── openclaw/ # OpenClaw assistant kit
├── nanoclaw/ # NanoClaw WhatsApp bridge kit
└── .github/ # CI workflows
- Create a directory at the repo root with your kit name (lowercase, alphanumeric + hyphens):
my-kit/
├── spec.yaml
└── files/
└── home/ # Files copied to /home/agent/ in the container
└── config.json
There's no per-kit test file to write — the shared TestKitTCK in tck/kit_test.go reads a KIT env var pointing at the kit directory and runs the full TCK suite against it.
- Write your
spec.yaml:
schemaVersion: "1"
kind: mixin
name: my-kit
displayName: My Kit
description: "Short description of what this kit does"
network:
allowedDomains:
- example.com
deniedDomains:
- tracker.example.com
environment:
variables:
MY_CONFIG: "/home/agent/config.json"
commands:
install:
- command: "pip install my-tool"
user: "1000"
description: Install my-tool
startup:
- command: ["my-tool", "serve"]
user: "1000"
background: true
description: Start my-tool- Run the TCK locally — from inside the kit's directory:
cd my-kit
../scripts/test-kit.shOr from the repo root, naming the kit:
./scripts/test-kit.sh my-kitExtra flags are forwarded to go test, so ../scripts/test-kit.sh -v -run …
works as expected. If you'd rather invoke go test directly, the equivalent is:
KIT="$PWD/my-kit" go test -v -count=1 -timeout 10m -run TestKitTCK ./tck/...KIT must be an absolute path because go test runs the binary with its
working directory set to the package directory (./tck/).
Windows users: the wrapper is a bash script — run it from Git Bash (ships with Git for Windows) or WSL, not from cmd.exe or PowerShell. If you'd rather skip the wrapper, the direct go test invocation above works in PowerShell too — just substitute $env:KIT = "$PWD\my-kit" for the env-var syntax.
A kit's network.allowedDomains is its complete outbound network contract. The CI e2e job runs with a deny-all default policy, so anything not in your allowedDomains is blocked at request time — and any failed request inside an install hook surfaces as sbx create failing.
The non-obvious trap is package managers refreshing every configured source, not just the one you added:
apt-get updatere-fetches metadata for every file in/etc/apt/sources.list[.d/]— including sources the base template added. If any of those returns non-2xx,apt-getexits non-zero even if the package you want is in a different source. For kits built onshell-docker/*-dockertemplates that meansdownload.docker.com(Docker's apt repo, pre-added by the template) needs to be in yourallowedDomainseven if you're only installing something from Ubuntu's main archive.- Ubuntu hosts amd64 packages on
archive.ubuntu.com+security.ubuntu.comand arm64 packages onports.ubuntu.com. List all three for cross-arch coverage; CI is amd64, your Mac is likely arm64. npm install,pip install,cargo,go get, etc. each have their own registry/mirror hosts — declare them too.
If you're not sure what your install hooks reach, probe locally under deny-all and read sbx policy log to see exactly what the proxy blocked. The recipe is cross-platform (no daemon-log greping, no OS-specific paths):
# 1. Switch to the strict baseline. `sbx policy reset` drops any local
# rules you've added — if you have customisations, list them first
# with `sbx policy ls` so you can restore them later.
sbx policy reset -f
sbx policy set-default deny-all
# 2. Run your kit. The install hooks fire during `sbx create`.
sbx create --name probe-my-kit --kit "$PWD/my-kit" <agent> /tmp/sbx-kit-debug || true
# 3. See what the proxy blocked (and what got through). Filter by the
# sandbox name so you only see this kit's requests.
sbx policy log probe-my-kit
# 4. Clean up the probe sandbox and restore your previous default policy.
sbx rm -f probe-my-kit
sbx policy reset -f
sbx policy set-default balanced # or whichever preset you were onEvery Blocked requests row is a domain your install or startup hook reached for under deny-all. Add the host (column HOST, e.g. download.docker.com:443) to allowedDomains and re-probe until the block list is empty.
The TCK validates your kit automatically:
- Validation —
spec.yamlparses correctly with required fields - Network policy — allowed domains and service auth are well-formed
- Credential policy — credential sources are properly defined
- Commands — install/startup commands are well-formed
- Environment variables — declared env vars are set in the container
- Container files — files from
files/are injected at the correct paths - Security — tmpfs mounts (e.g.,
/run/secrets) are present
The default TCK runs every kit assertion against a fabricated testcontainers-go container — fast, deterministic, no sbx needed. The optional e2e layer goes further: it boots a real sbx sandbox from the kit, then verifies the kit's content actually landed inside the running container. It catches things the default TCK can't — install commands that fail under the non-root agent user, ${WORKDIR} placeholders that resolve differently than expected, agent-kit name mismatches, or memory blocks the engine never writes out.
tck/e2e_test.go (build-tag e2e, function TestE2ECreateSandbox) drives one kit per run:
- Loads the kit at
$KIT_UNDER_TESTand picks the agent argument — kit name forkind: agent,claudeforkind: mixin. - Runs
sbx create --kit <kit> --name <unique> <agent> <tmpdir>against a temporary workspace. - Verifies, via
sbx exec, that the running sandbox contains:- every
environment.variablesentry, - every file under
files/homeand everycommands.initFiles(with${WORKDIR}resolved to/home/agent/workspace, the real sandbox workdir), - every declared
tmpfsmount (plus the implicit/run/secrets), - the rendered memory file —
Manifest.AIFilenamefor agent kits (inlined memory) orkits-memory/<kit-name>.mdfor mixin kits.
- every
- Cleans up with
sbx rm -f <name>.
sbxonPATH. Install the latest release fromdocker/sbx-releases.- An authenticated
sbxsession against Docker Hub. The non-interactive form:printf '%s' "$DOCKERHUB_TOKEN" | sbx login --username "$DOCKERHUB_USERNAME" --password-stdin
- Linux with
/dev/kvmaccessible (for the sailor microVM). On Linux runners and most workstations this is already the case; in CI the workflow doessudo chmod 666 /dev/kvmto relax permissions.
The test is hidden behind the e2e build tag so kit authors running go test ./... see no behavior change. Opt in via the wrapper:
# From inside the kit's directory:
cd my-kit
../scripts/test-kit-e2e.sh
# Or from the repo root, naming the kit:
./scripts/test-kit-e2e.sh my-kitExtra flags are forwarded to go test. The wrapper checks sbx is on PATH and validates the kit directory has a spec.yaml/spec.yml before invoking. If you'd rather drop to go test directly:
KIT_UNDER_TEST="$PWD/my-kit" \
go test -tags=e2e -v -timeout 25m -count=1 -run TestE2ECreateSandbox ./tck/...KIT_UNDER_TEST must be an absolute path: go test runs each binary with its working directory set to the package directory (./tck/), so a relative path resolves against ./tck/, not the repo root.
To run every kit locally:
for spec in $(find "$PWD" -mindepth 2 -maxdepth 2 \( -name spec.yaml -o -name spec.yml \)); do
./scripts/test-kit-e2e.sh "$(dirname "$spec")"
doneEach subtest (env, files/<path>, tmpfs/<path>, memory) reports independently, so a failure pinpoints which piece of kit content didn't make it into the container.
The test-kit-e2e job in .github/workflows/tck.yml runs alongside the default test-kit job. It downloads the latest sbx release, signs in to Docker Hub using DOCKERHUB_USERNAME / DOCKERHUB_TOKEN repo secrets, then runs the e2e test once per detected kit. The job is skipped on fork PRs because the secrets aren't exposed there.
By default, mixins use the shell template image. To extend a specific agent (e.g., Claude, Gemini), add the extends field:
schemaVersion: "1"
kind: mixin
name: my-claude-extension
extends: claude
# ...The TCK resolves the parent's template image automatically for well-known agents (shell, claude, codex, copilot, cursor, docker-agent, droid, gemini, kiro, opencode). For other parents, use WithImage:
suite, err := tck.NewSuiteFromDir(".", tck.WithImage("my-custom/template:latest"))Importable library for parsing, validating, and working with kit artifacts:
import "github.com/docker/sbx-kits-contrib/spec"
artifact, err := spec.LoadFromDirectory("./my-kit")Test framework that validates kit artifacts against real containers:
import "github.com/docker/sbx-kits-contrib/tck"
suite, err := tck.NewSuiteFromDir(".")
suite.RunAll(t)Pull requests trigger TCK tests automatically:
- Kit changes: only the modified kit is tested
- TCK/spec changes: all kits are tested
- Each kit runs in a separate CI runner on Linux
- The optional
test-kit-e2ejob exercises every detected kit against a realsbxCLI — see End-to-end (e2e) Tests. Skipped on fork PRs (no Docker Hub secrets).
- Go 1.23+
- Docker (for container-based TCK tests)