Skip to content

E2E status report — 2026-06-22 (gh-aw matrix) #3369

Description

@dsyme

Run: https://github.com/githubnext/gh-aw-test/actions/runs/27969882871  ·  Trigger: workflow_dispatch  ·  Generated: 2026-06-22T20:53:59Z  ·  Outcome: failure

❌ Test errors — samples mode

test v0.79.8 v0.80.9 main
test-copilot-assign-to-agent
test-copilot-call-workflow
test-copilot-mcp-code-quality
test-copilot-reply-to-pull-request-review-comment
test-copilot-set-issue-field
test-copilot-siderepo-create-pull-request
test-copilot-siderepo-push-to-pull-request-branch-using-dispatch
test-copilot-siderepo-sparse-create-pull-request
test-copilot-siderepo-sparse-push-to-pull-request-branch-using-dispatch
test-copilot-update-release

✅ Test successes — samples mode

test v0.79.8 v0.80.9 main
test-claude-create-issue
test-codex-create-issue
test-copilot-add-comment
test-copilot-add-discussion-comment
test-copilot-add-labels
test-copilot-add-reviewer
test-copilot-assign-milestone
test-copilot-assign-to-user
test-copilot-close-discussion
test-copilot-close-issue
test-copilot-close-pull-request
test-copilot-command
test-copilot-create-check-run
test-copilot-create-discussion
test-copilot-create-issue
test-copilot-create-pull-request
test-copilot-create-pull-request-review-comment
test-copilot-create-repository-code-scanning-alert
test-copilot-create-two-pull-requests
test-copilot-custom-safe-outputs
test-copilot-dispatch-workflow
test-copilot-gh-steps
test-copilot-hide-comment
test-copilot-link-sub-issue
test-copilot-mark-pull-request-as-ready-for-review
test-copilot-mcp
test-copilot-noop
test-copilot-nosandbox-add-comment
test-copilot-nosandbox-add-discussion-comment
test-copilot-nosandbox-add-labels
test-copilot-nosandbox-command
test-copilot-nosandbox-create-discussion
test-copilot-nosandbox-create-issue
test-copilot-nosandbox-create-pull-request
test-copilot-nosandbox-create-pull-request-review-comment
test-copilot-nosandbox-create-repository-code-scanning-alert
test-copilot-nosandbox-create-two-pull-requests
test-copilot-nosandbox-custom-safe-outputs
test-copilot-nosandbox-mcp
test-copilot-nosandbox-push-to-pull-request-branch
test-copilot-nosandbox-update-issue
test-copilot-push-to-pull-request-branch-using-dispatch
test-copilot-push-to-pull-request-branch-using-slash-command
test-copilot-remove-labels
test-copilot-report-incomplete
test-copilot-resolve-pull-request-review-thread
test-copilot-set-issue-type
test-copilot-siderepo-add-comment
test-copilot-siderepo-add-discussion-comment
test-copilot-siderepo-add-labels
test-copilot-siderepo-create-discussion
test-copilot-siderepo-create-issue
test-copilot-siderepo-create-pull-request-review-comment
test-copilot-siderepo-create-two-pull-requests
test-copilot-siderepo-mcp
test-copilot-siderepo-subdir-create-pull-request
test-copilot-siderepo-update-issue
test-copilot-submit-pull-request-review
test-copilot-unassign-from-user
test-copilot-update-discussion
test-copilot-update-issue
test-copilot-update-pull-request
test-copilot-upload-asset

🤖 Agent triage prompt

Copy the block below into a coding agent (e.g. Copilot) to investigate the failures:

You are triaging failures from the gh-aw-test E2E suite.
Run: https://github.com/githubnext/gh-aw-test/actions/runs/27969882871
Repository under test: github/gh-aw (the gh-aw CLI/compiler).
Test harness repository: githubnext/gh-aw-test (this repo; runner is e2e.sh).

Goal: for EACH failed test listed in this status report, access the GitHub
Actions logs for the run above (and the per-entry artifacts e2e-<label>-samples-<bool>,
which contain e2e-test-*.log, e2e-output.log and fails.txt), determine the root
cause, and categorize the failure as exactly one of:

  1. TRANSIENT — flaky/infra/network/rate-limit/timing; not a real defect.
     Action: note it and recommend a re-run (./e2e.sh rerun).
  2. TEST-FRAMEWORK BUG — a defect in this repo's harness (e2e.sh), a workflow
     source file (.github/workflows/test-*.md), a sample, or CI config.
     Action: propose a concrete fix (file + change) in githubnext/gh-aw-test.
  3. GH-AW BUG — a defect in github/gh-aw itself (compiler output, runtime
     engine behaviour, safe-output handling, etc.).
     Action: open an issue in github/gh-aw with a minimal repro, the failing
     test name, the gh-aw ref/mode/samples combination, and links to the
     relevant log lines. Check for an existing open issue first and link it
     instead of filing a duplicate.

Steps:
  - Use 'gh run view <run-id> --log' and 'gh run download <run-id>' to fetch logs/artifacts.
  - Read AGENTS.md in githubnext/gh-aw-test for harness conventions before proposing fixes.
  - Group failures by suspected root cause; the same gh-aw bug may explain several.
  - Produce a table: test | category | root cause | recommended action | issue/PR link.
  - Only open github/gh-aw issues for category 3, and only after confirming no duplicate exists.

Previous status reports (closed by this run)

Metadata

Metadata

Assignees

No one assigned

    Labels

    e2e-status-reportAutomated cross-ref E2E status report

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions