Files

GitHub Actions 641588367b chore(diagnostics): Add comprehensive diagnostic tools for E2E testing

- Create phase1_diagnostics.md to document findings from test interruptions
- Introduce phase1_validation_checklist.md for pre-deployment validation
- Implement diagnostic-helpers.ts for enhanced logging and state capture
- Enable browser console logging, error tracking, and dialog lifecycle monitoring
- Establish performance monitoring for test execution times
- Document actionable recommendations for Phase 2 remediation

2026-02-03 00:02:45 +00:00

3.7 KiB

Raw Blame History

Phase 1.1: Test Execution Order Analysis

Date: February 2, 2026 Phase: Analyze Test Execution Order Duration: 30 minutes

Current Configuration Analysis

Project Dependency Chain (playwright.config.js:195-223)

setup (auth)
   ↓
security-tests (sequential, 1 worker, headless chromium)
   ↓
security-teardown (cleanup)
   ↓
┌──────────┬──────────┬──────────┐
│ chromium │ firefox  │ webkit   │  ← Parallel execution (no inter-dependencies)
└──────────┴──────────┴──────────┘

Configuration Details:

Workers (CI): workers: 1 (Line 116) - Forces sequential execution
Retries (CI): retries: 2 (Line 114) - Tests retry twice on failure
Timeout: 90s per test (Line 108)
Dependencies: Browser projects depend on setup and security-tests, NOT on each other

Why Sequential Execution Amplifies Failure

The Problem:

With workers: 1 in CI, Playwright runs ALL projects sequentially in a single worker:

Worker 1: [setup] → [security-tests] → [security-teardown] → [chromium] → [firefox] → [webkit]

When Chromium encounters an interruption (not a normal failure):

Error: Target page, context or browser has been closed at test #263
This is an INTERRUPTION, not a normal test failure
The worker encounters an unrecoverable error (browser context closed unexpectedly)
Playwright terminates the worker to prevent cascading failures
Since there's only 1 worker, the entire test run terminates
Firefox and WebKit never start - marked as "did not run"

Root Cause: The interruption is treated as a fatal worker error, not a test failure.

Interruption vs Failure

Type	Behavior	Impact
Normal Failure	Test fails assertion, runner continues	Next test runs
Interruption	Browser/context closed unexpectedly	Worker terminates
Timeout	Test exceeds 90s, marked as timeout	Next test runs
Error	Uncaught exception, test marked as error	Next test runs

Interruptions are non-recoverable - they indicate the test environment is in an inconsistent state.

Current GitHub Actions Architecture

Current workflow uses matrix sharding:

strategy:
  matrix:
    shard: [1, 2, 3, 4]
    browser: [chromium, firefox, webkit]

This creates 12 jobs:

chromium-shard-1, chromium-shard-2, chromium-shard-3, chromium-shard-4
firefox-shard-1, firefox-shard-2, firefox-shard-3, firefox-shard-4
webkit-shard-1, webkit-shard-2, webkit-shard-3, webkit-shard-4

BUT: All jobs run in the same e2e-tests job definition. If one browser has issues, it affects that browser's shards only.

The issue: The sharding is already browser-isolated at the GitHub Actions level. The problem is likely in local testing or in how the interruption is being reported.

Analysis Conclusion

Finding: The GitHub Actions workflow is ALREADY browser-isolated via matrix strategy. Each browser runs in separate jobs.

The Real Problem:

The diagnostic report shows Chromium interrupted at test #263
Firefox and WebKit show "did not run" (0 tests executed)
This suggests the issue is in the Playwright CLI command or local testing, NOT GitHub Actions

Next Steps:

Verify if the issue is in local testing vs CI
Check if there's a project dependency issue in playwright.config.js
Implement Phase 1.2 hotfix to ensure complete browser isolation
Add diagnostic logging to capture the actual interruption error

Recommendation: Proceed with Phase 1.2 to add explicit browser job separation and enhanced logging.

3.7 KiB Raw Blame History