Files
openclaw/docs/reference/test.md
T

91 lines
3.9 KiB
Markdown
Raw Normal View History

2025-12-13 13:25:49 +00:00
---
summary: "How to run tests locally (vitest) and when to use force/coverage modes"
read_when:
- Running or fixing tests
2026-01-31 16:04:03 -05:00
title: "Tests"
2025-12-13 13:25:49 +00:00
---
2026-01-31 21:13:13 +09:00
2025-12-13 13:25:49 +00:00
# Tests
2025-12-10 01:00:29 +00:00
- Full testing kit (suites, live, Docker): [Testing](/help/testing)
2026-01-10 01:15:42 +00:00
2025-12-11 15:17:40 +00:00
- `pnpm test:force`: Kills any lingering gateway process holding the default control port, then runs the full Vitest suite with an isolated gateway port so server tests dont collide with a running instance. Use this when a prior gateway run left port 18789 occupied.
2026-02-15 04:20:08 +00:00
- `pnpm test:coverage`: Runs the unit suite with V8 coverage (via `vitest.unit.config.ts`). Global thresholds are 70% lines/branches/functions/statements. Coverage excludes integration-heavy entrypoints (CLI wiring, gateway/telegram bridges, webchat static server) to keep the target focused on unit-testable logic.
2026-03-18 16:57:27 +00:00
- `pnpm test`: runs the full wrapper. It keeps only a small behavioral override manifest in git, then uses a checked-in timing snapshot to peel the heaviest measured unit files into dedicated lanes.
2026-03-22 16:22:04 -07:00
- Unit files default to `threads` in the wrapper; keep fork-only exceptions documented in `test/fixtures/test-parallel.behavior.json`.
2026-03-22 12:36:14 -07:00
- `pnpm test:extensions` now defaults to `threads` via `vitest.extensions.config.ts`; the March 22, 2026 direct full-suite control run passed clean without extension-specific fork exceptions.
- `pnpm test:channels`: runs channel-heavy suites.
- `pnpm test:extensions`: runs extension/plugin suites.
2026-03-18 16:57:27 +00:00
- `pnpm test:perf:update-timings`: refreshes the checked-in slow-file timing snapshot used by `scripts/test-parallel.mjs`.
- Gateway integration: opt-in via `OPENCLAW_TEST_INCLUDE_GATEWAY=1 pnpm test` or `pnpm test:gateway`.
2026-03-22 16:22:04 -07:00
- `pnpm test:e2e`: Runs gateway end-to-end smoke tests (multi-instance WS/HTTP/node pairing). Defaults to `forks` + adaptive workers in `vitest.e2e.config.ts`; tune with `OPENCLAW_E2E_WORKERS=<n>` and set `OPENCLAW_E2E_VERBOSE=1` for verbose logs.
2026-01-08 02:00:11 +01:00
- `pnpm test:live`: Runs provider live tests (minimax/zai). Requires API keys and `LIVE=1` (or provider-specific `*_LIVE_TEST=1`) to unskip.
2025-12-31 22:39:42 +01:00
## Local PR gate
For local PR land/gate checks, run:
- `pnpm check`
- `pnpm build`
- `pnpm test`
- `pnpm check:docs`
If `pnpm test` flakes on a loaded host, rerun once before treating it as a regression, then isolate with `pnpm vitest run <path/to/test>`. For memory-constrained hosts, use:
- `OPENCLAW_TEST_PROFILE=low OPENCLAW_TEST_SERIAL_GATEWAY=1 pnpm test`
2025-12-31 22:39:42 +01:00
## Model latency bench (local keys)
2026-01-30 03:15:10 +01:00
Script: [`scripts/bench-model.ts`](https://github.com/openclaw/openclaw/blob/main/scripts/bench-model.ts)
2025-12-31 22:39:42 +01:00
Usage:
2026-01-31 21:13:13 +09:00
2026-01-06 23:48:22 +00:00
- `source ~/.profile && pnpm tsx scripts/bench-model.ts --runs 10`
2025-12-31 22:39:42 +01:00
- Optional env: `MINIMAX_API_KEY`, `MINIMAX_BASE_URL`, `MINIMAX_MODEL`, `ANTHROPIC_API_KEY`
- Default prompt: “Reply with a single word: ok. No punctuation or extra text.”
Last run (2025-12-31, 20 runs):
2026-01-31 21:13:13 +09:00
2025-12-31 22:39:42 +01:00
- minimax median 1279ms (min 1114, max 2431)
- opus median 2454ms (min 1224, max 3170)
2026-01-01 18:01:42 +01:00
2026-03-01 12:37:23 -08:00
## CLI startup bench
Script: [`scripts/bench-cli-startup.ts`](https://github.com/openclaw/openclaw/blob/main/scripts/bench-cli-startup.ts)
Usage:
- `pnpm tsx scripts/bench-cli-startup.ts`
- `pnpm tsx scripts/bench-cli-startup.ts --runs 12`
- `pnpm tsx scripts/bench-cli-startup.ts --entry dist/entry.js --timeout-ms 45000`
This benchmarks these commands:
- `--version`
- `--help`
- `health --json`
- `status --json`
- `status`
Output includes avg, p50, p95, min/max, and exit-code/signal distribution for each command.
2026-01-01 18:01:42 +01:00
## Onboarding E2E (Docker)
2026-01-02 20:58:50 +01:00
Docker is optional; this is only needed for containerized onboarding smoke tests.
2026-01-01 18:01:42 +01:00
Full cold-start flow in a clean Linux container:
```bash
scripts/e2e/onboard-docker.sh
```
2026-01-01 19:14:14 +01:00
2026-01-30 03:15:10 +01:00
This script drives the interactive wizard via a pseudo-tty, verifies config/workspace/session files, then starts the gateway and runs `openclaw health`.
## QR import smoke (Docker)
Ensures `qrcode-terminal` loads under the supported Docker Node runtimes (Node 24 default, Node 22 compatible):
```bash
pnpm test:docker:qr
```