Commit Graph

396 Commits

Author SHA1 Message Date
Josh Avant 3d07dfbb65 feat(qa-lab): add Convex credential broker and admin CLI (#65596)
* QA Lab: add Convex credential source for Telegram lane

* QA Lab: scaffold Convex credential broker

* QA Lab: add Convex credential admin CLI

* QA Lab: harden Convex credential security paths

* QA Broker: validate Telegram payloads on admin add

* fix: note QA Convex credential broker in changelog (#65596) (thanks @joshavant)
2026-04-12 22:03:42 -05:00
Tak Hoffman c37e49f275 Add /trace toggle and fix Active Memory diagnostics 2026-04-12 13:20:22 -05:00
Marcus Castro 000fc7f233 refactor(qa): add shared QA channel contract and harden worker startup (#64562)
* refactor(qa): add shared transport contract and suite migration

* refactor(qa): harden worker gateway startup

* fix(qa): scope waits and sanitize shutdown artifacts

* fix(qa): confine artifacts and redact preserved logs

* fix(qa): block symlink escapes in artifact paths

* fix(gateway): clear shutdown race timers

* fix(qa): harden shutdown cleanup paths

* fix(qa): sanitize gateway logs in thrown errors

* fix(qa): harden suite startup and artifact paths

* fix(qa): stage bundled plugins from mutated config

* fix(qa): broaden gateway log bearer redaction

* fix(qa-channel): restore runtime export

* fix(qa): stop failed gateway startups as a process tree

* fix(qa-channel): load runtime hook from api surface
2026-04-12 15:02:57 -03:00
Eva fd45ea2bf1 test(qa): add compaction retry parity scenario 2026-04-11 14:22:48 +01:00
Eva 55df6f11a4 fix: harden parity gate review findings 2026-04-11 14:22:48 +01:00
Eva c73d005c7a docs: clarify parity verdict interpretation 2026-04-11 14:22:48 +01:00
Eva db09edacfc qa-lab: gate parity on shared scenario coverage 2026-04-11 14:22:48 +01:00
Eva 67fdd3b4df benchmarks: add agentic parity report gate 2026-04-11 14:22:48 +01:00
Eva 79f539d9ce docs: clarify GPT-5.4 parity harness and review flow 2026-04-11 14:22:48 +01:00
Gustavo Madeira Santana 25445a9f2e qa-lab: add Matrix live transport QA lane (#64489)
Merged via squash.

Prepared head SHA: ae9bb37751156432d0ee3ae2d976520ec79a717a
Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com>
Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com>
Reviewed-by: @gumadeiras
2026-04-10 19:35:08 -04:00
Peter Steinberger 46a6746bca docs: clarify codex harness validation 2026-04-11 00:13:08 +01:00
Peter Steinberger 6281dd7379 perf: reduce test import overhead 2026-04-10 23:09:37 +01:00
Peter Steinberger c59fc764db docs(codex): document harness command smoke 2026-04-10 23:07:25 +01:00
Peter Steinberger 972ed139a7 fix: make docs anchor audit use Mintlify CLI 2026-04-10 21:39:52 +01:00
Peter Steinberger c16b1b7433 docs: document harness fallback policy 2026-04-10 21:27:26 +01:00
Peter Steinberger bfc0889776 docs: document Codex harness plugin workflow 2026-04-10 21:22:16 +01:00
Peter Steinberger 3027efaf21 test: raise QA suite default concurrency 2026-04-10 13:45:57 +01:00
Peter Steinberger 4c14f55c62 test: parallelize QA suite scenarios 2026-04-10 13:45:57 +01:00
Peter Steinberger 0b0c062e97 fix: avoid Claude CLI subscription prompt classifier 2026-04-10 10:52:35 +01:00
Shakker 655cfb477a docs: clarify multipass live auth support 2026-04-09 23:53:13 +01:00
Shakker a04c331cc1 docs: document qa multipass runner 2026-04-09 23:53:13 +01:00
Mason 164287f056 docs-i18n: avoid ambiguous body-only wrapper unwrap (#63808)
* docs-i18n: avoid ambiguous body-only wrapper unwrap

* docs: clarify targeted testing tip

* changelog: include docs-i18n follow-up thanks
2026-04-10 00:01:17 +08:00
Peter Steinberger cca9e5b914 test: cap broad live model sweeps 2026-04-09 01:37:55 +01:00
Peter Steinberger d51f527cca feat: add gh-read GitHub app helper 2026-04-08 00:09:07 +01:00
Bruce MacDonald 86f35a9bc0 chore(ollama): update suggested onboarding models (#62626)
Merged via squash.

Prepared head SHA: 48c083b88ae26d76ad9a6086efe43629ec124fb2
Co-authored-by: BruceMacD <5853428+BruceMacD@users.noreply.github.com>
Co-authored-by: BruceMacD <5853428+BruceMacD@users.noreply.github.com>
Reviewed-by: @BruceMacD
2026-04-07 11:42:29 -07:00
Peter Steinberger e973275fd0 fix: harden claude-cli live switch smoke 2026-04-07 16:05:54 +01:00
Peter Steinberger 9d4b0d551d fix: support inferrs string-only completions 2026-04-07 15:55:20 +01:00
Peter Steinberger 1cec37184c fix: harden qa memory dreaming sweep 2026-04-07 12:57:33 +01:00
Peter Steinberger 4d4dbe8e15 test: share live probes with acp bind 2026-04-07 10:35:24 +01:00
Peter Steinberger c2f9de3935 feat: unify live cli backend probes 2026-04-07 10:35:24 +01:00
Peter Steinberger 65f9fc397e perf(test): split support boundary shard 2026-04-07 09:12:26 +01:00
Peter Steinberger 0af808b457 test: add cli backend live matrix metadata 2026-04-07 09:06:09 +01:00
Peter Steinberger 37dccb52ed test: add gemini acp bind docker coverage 2026-04-07 07:59:45 +01:00
Peter Steinberger ce1d2c1004 test: cover claude and codex acp bind docker smoke 2026-04-07 06:06:29 +01:00
Peter Steinberger 191f867ef6 perf(test): split ui and bundled full-suite shards 2026-04-07 00:39:05 +01:00
Peter Steinberger f9c721d5bf fix: add vydra kling live lane 2026-04-06 19:47:43 +01:00
Peter Steinberger 7a736bff90 perf(test): split reply queue seams and unit shards 2026-04-06 19:31:20 +01:00
Peter Steinberger ec55902989 perf(test): tighten reply fast paths and split unit shards 2026-04-06 19:23:17 +01:00
Peter Steinberger 41ea5316aa test: add shared media live harness 2026-04-06 19:15:31 +01:00
Peter Steinberger b4e1747391 feat: add one-command qa lab docker launcher 2026-04-06 17:47:17 +01:00
Peter Steinberger 0335a8783c perf(test): shard full vitest runs 2026-04-06 17:34:11 +01:00
Peter Steinberger 8a33a8d607 perf(test): trim runtime lookups and add changed bench 2026-04-06 16:49:28 +01:00
Peter Steinberger 0c5e6037b0 fix(openai): clarify auth routes in picker and docs 2026-04-06 16:14:51 +01:00
Peter Steinberger d7e3df5eaa perf(test): expand light lane routing 2026-04-06 16:13:21 +01:00
Peter Steinberger a010ce462f perf(test): split light vitest lanes and restore hooks 2026-04-06 15:51:00 +01:00
Peter Steinberger ac38f332c5 fix(anthropic): prefer claude cli over setup-token 2026-04-06 15:31:07 +01:00
Peter Steinberger cd5b1653f6 feat: declare explicit media provider capabilities 2026-04-06 15:24:38 +01:00
Peter Steinberger 878c208844 perf(test): restore scoped vitest routing 2026-04-06 15:16:17 +01:00
Peter Steinberger 9afcbbec5e refactor(auth): extract persisted auth store helpers 2026-04-06 14:25:06 +01:00
Peter Steinberger d378a504ac fix: restore claude cli guidance and doctor behavior 2026-04-06 14:21:11 +01:00