2026-01-29 20:32:38 +00:00

applyTo	description
**	Strict protocols for test execution, debugging, and coverage validation.

Testing Protocols

0. E2E Verification First (Playwright)

MANDATORY: Before running unit tests, verify the application UI/UX functions correctly end-to-end.

Testing Scope Clarification

Playwright E2E Tests (UI/UX):

Test user interactions with the React frontend
Verify UI state changes when settings are toggled
Ensure forms submit correctly
Check navigation and page rendering
Port: 8080 (Charon Management Interface)

Integration Tests (Middleware Enforcement):

Test Cerberus security module enforcement
Verify ACL, WAF, Rate Limiting, CrowdSec actually block/allow requests
Test requests routing through Caddy proxy with full middleware
Port: 80 (User Traffic via Caddy)
Location: backend/integration/ with //go:build integration tag
CI: Runs in separate workflows (cerberus-integration.yml, waf-integration.yml, etc.)

Two Modes: Docker vs Vite

Playwright E2E tests can run in two modes with different capabilities:

Mode	Base URL	Coverage Support	When to Use
Docker	`http://localhost:8080`	❌ No (0% reported)	Integration testing, CI validation
Vite Dev	`http://localhost:5173`	✅ Yes (real coverage)	Local development, coverage collection

Why? The @bgotink/playwright-coverage library uses V8 coverage, which requires access to source files. Only the Vite dev server exposes the source maps and raw source files needed for coverage instrumentation.
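The mode table can be sketched as a small lookup. This is a minimal illustration, not part of the project: the function name `e2e_base_url` and the mode keywords `docker` and `vite` are assumptions. Only the two URLs come from the table.

```shell
# Hypothetical helper: map a mode name to the base URL listed in the table.
e2e_base_url() {
  case "$1" in
    docker) echo "http://localhost:8080" ;;  # Docker container: no coverage
    vite)   echo "http://localhost:5173" ;;  # Vite dev server: real coverage
    *)      echo "unknown mode: $1" >&2; return 1 ;;
  esac
}

# Print the URL each mode would use.
for mode in docker vite; do
  echo "$mode -> $(e2e_base_url "$mode")"
done
```

A caller could then export the result, for example `PLAYWRIGHT_BASE_URL=$(e2e_base_url docker)`, before invoking the test command.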

Running E2E Tests (Integration Mode)

For general integration testing without coverage:

# Against Docker container (default)
npx playwright test --project=chromium

# With explicit base URL
PLAYWRIGHT_BASE_URL=http://localhost:8080 npx playwright test --project=chromium

Running E2E Tests with Coverage

IMPORTANT: Use the dedicated skill for coverage collection:

# Recommended: Uses skill that starts Vite and runs against localhost:5173
.github/skills/scripts/skill-runner.sh test-e2e-playwright-coverage

The coverage skill:

Starts Vite dev server on port 5173
Sets PLAYWRIGHT_BASE_URL=http://localhost:5173
Runs tests with V8 coverage collection
Generates reports in coverage/e2e/ (LCOV, HTML, JSON)

DO NOT expect coverage when running against Docker:

# ❌ WRONG: Coverage will show "Unknown% (0/0)"
PLAYWRIGHT_BASE_URL=http://localhost:8080 npx playwright test --coverage

# ✅ CORRECT: Use the coverage skill
.github/skills/scripts/skill-runner.sh test-e2e-playwright-coverage

Verifying Coverage Locally Before CI

Before pushing code, verify E2E coverage:

Run the coverage skill:

.github/skills/scripts/skill-runner.sh test-e2e-playwright-coverage

Check coverage output:

# View HTML report
open coverage/e2e/index.html

# Check LCOV file exists for Codecov
ls -la coverage/e2e/lcov.info

Verify non-zero coverage:

# Should show real percentages, not "0%"
head -20 coverage/e2e/lcov.info
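Reading the first lines of the LCOV file by eye works, but the check can also be computed. The sketch below sums the standard `LF` (lines found) and `LH` (lines hit) records of an LCOV tracefile. The function name `lcov_line_coverage` and the sample file contents are assumptions made for illustration.

```shell
# lcov_line_coverage FILE: print overall line coverage as a percentage.
# Returns non-zero when no lines were found at all (the "0/0" case).
lcov_line_coverage() {
  awk -F: '
    $1 == "LF" { found += $2 }   # LF:<number of instrumented lines>
    $1 == "LH" { hit += $2 }     # LH:<number of lines hit at least once>
    END {
      if (found == 0) { print "0.00"; exit 1 }
      printf "%.2f\n", 100 * hit / found
    }
  ' "$1"
}

# Demo on a made-up tracefile: 15 of 20 lines hit.
tmp=$(mktemp)
printf 'SF:src/App.tsx\nLF:10\nLH:8\nend_of_record\nSF:src/api.ts\nLF:10\nLH:7\nend_of_record\n' > "$tmp"
lcov_line_coverage "$tmp"   # prints 75.00
rm -f "$tmp"
```

Against the real report, the call would be `lcov_line_coverage coverage/e2e/lcov.info`. A non-zero exit status suggests the run was made against Docker rather than Vite.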

General Guidelines

No Truncation: Never pipe Playwright test output through head, tail, or other truncating commands. Playwright runs interactively and requires user input to quit when piped, causing the command to hang indefinitely.
Why First: If the application is broken at the E2E level, unit tests may need updates. Playwright catches integration issues early.
On Failure: Analyze failures, trace root cause through frontend → backend flow, then fix before proceeding to unit tests.
Scope: Run relevant test files for the feature being modified (e.g., tests/manual-dns-provider.spec.ts).

1. Execution Environment

No Truncation: Never use pipe commands (e.g., head, tail) or flags that limit stdout/stderr. If a test hangs, it is likely waiting for interactive input or caught in a loop; analyze the full output to identify what is blocking it.
Task-Based Execution: Do not hand-assemble test commands. Use existing project tasks (e.g., npm test, go test ./...). If a specific sub-module requires frequent testing, add a new task definition to the project's configuration file (e.g., .vscode/tasks.json) before proceeding.
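One way to keep the full output without piping through a truncating command is to redirect it to a log file and preserve the task's exit status. This is a sketch under assumptions: `run_logged` is a hypothetical helper, not an existing project task, and the demo uses a stand-in command.

```shell
# run_logged LOGFILE CMD...: run CMD, write its complete stdout and stderr
# to LOGFILE, and return CMD's own exit status. Nothing is dropped.
run_logged() {
  _log=$1; shift
  "$@" >"$_log" 2>&1
}

# Demo with a stand-in command that writes to both streams and fails.
log=$(mktemp)
status=0
run_logged "$log" sh -c 'echo out; echo err >&2; exit 3' || status=$?
echo "exit status: $status"
cat "$log"
rm -f "$log"
```

With a real task, this might look like `run_logged test.log npm test`, after which the whole log is available for analysis.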

2. Failure Analysis & Logic Integrity

Evidence-Based Debugging: When a test fails, you must quote the specific error message or stack trace before suggesting a fix.
Bug vs. Test Flaw: Treat the test as the "Source of Truth." If a test fails, assume the code is broken until proven otherwise. Research the original requirement or PR description to verify if the test logic itself is outdated before modifying it.
Zero-Hallucination Policy: Only use file paths and identifiers discovered via the ls or search tools. Never guess a path based on naming conventions.

3. Coverage & Completion

Coverage Gate: A task is not "Complete" until a coverage report is generated.
Threshold Compliance: You must compare the final coverage percentage against the project's threshold (Default: 85% unless specified otherwise). If coverage drops, you must identify the "uncovered lines" and add targeted tests.
Patch Coverage Gate (Codecov): If production code is modified, Codecov patch coverage must be 100% for the modified lines. Do not relax thresholds; add targeted tests.
Patch Triage Requirement: Plans must include the exact missing/partial patch line ranges copied from Codecov’s Patch view.
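The threshold comparison above can be sketched as a small check. The helper name `meets_threshold` is an assumption. The default of 85 and the patch requirement of 100 come from the rules above. The percentages in the demo are made up.

```shell
# meets_threshold PERCENT [THRESHOLD]: succeed when PERCENT >= THRESHOLD.
# THRESHOLD defaults to 85, the project default named above.
meets_threshold() {
  awk -v pct="$1" -v min="${2:-85}" 'BEGIN { exit !(pct + 0 >= min + 0) }'
}

# Demo with made-up percentages against the default threshold.
for pct in 91.4 85 84.99; do
  if meets_threshold "$pct"; then
    echo "$pct: ok"
  else
    echo "$pct: below threshold, add targeted tests"
  fi
done
```

For the Codecov patch gate, the same helper would be called with an explicit threshold, for example `meets_threshold "$patch_pct" 100`, where `patch_pct` is a hypothetical variable holding the patch coverage value.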

4. GORM Security Validation (Manual Stage)

Requirement: All backend changes involving GORM models or database interactions must pass the GORM Security Scanner.

When to Run

Before Committing: When modifying GORM models (files in backend/internal/models/)
Before Opening PR: Verify no security issues introduced
After Code Review: If model-related changes were requested
Definition of Done: Scanner must pass with zero CRITICAL/HIGH issues

Running the Scanner

Via VS Code (Recommended for Development):

Open Command Palette (Cmd/Ctrl+Shift+P)
Select "Tasks: Run Task"
Choose "Lint: GORM Security Scan"

Via Pre-commit (Manual Stage):

# Run on all Go files
pre-commit run --hook-stage manual gorm-security-scan --all-files

# Run on staged files only
pre-commit run --hook-stage manual gorm-security-scan

Direct Execution:

# Report mode - Show all issues, exit 0 (always)
./scripts/scan-gorm-security.sh --report

# Check mode - Exit 1 if issues found (use in CI)
./scripts/scan-gorm-security.sh --check

Expected Behavior

Pass (Exit Code 0):

No security issues detected
Proceed with commit/PR

Fail (Exit Code 1):

Issues detected (ID leaks, exposed secrets, DTO embedding, etc.)
Review scanner output for file:line references
Fix issues before committing
See GORM Security Scanner Documentation
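The pass/fail handling above can be sketched as a wrapper that interprets the scanner's exit code. The name `gorm_gate` and its messages are assumptions. The demo uses `true` and `false` as stand-ins for the scanner so the snippet runs anywhere.

```shell
# gorm_gate CMD...: run a scanner command and translate its exit code
# into the pass/fail actions described above.
gorm_gate() {
  if "$@"; then
    echo "PASS: no security issues detected; proceed with commit/PR"
  else
    code=$?
    echo "FAIL (exit $code): review file:line references and fix before committing"
    return 1
  fi
}

# Demo with stand-in commands.
gorm_gate true
gorm_gate false || echo "commit blocked"
```

In the repository, the call would presumably be `gorm_gate ./scripts/scan-gorm-security.sh --check`, since check mode is the one documented to exit 1 when issues are found.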

Common Issues Detected

🔴 CRITICAL: ID Leak — Numeric ID with json:"id" tag
- Fix: Change to json:"-", use UUID for external reference
🔴 CRITICAL: Exposed Secret — APIKey/Token/Password with JSON tag
- Fix: Change to json:"-" to hide sensitive field
🟡 HIGH: DTO Embedding — Response struct embeds model with exposed ID
- Fix: Use explicit field definitions instead of embedding
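To make the ID Leak pattern concrete, here is a rough grep that flags a numeric `ID` field serialized as `"id"`. It is not the project's scanner and makes no claim about how that scanner works. The function name `find_id_leaks`, the regular expression, and the sample struct are all assumptions for illustration.

```shell
# find_id_leaks FILE: print lines where a numeric ID field carries json:"id".
find_id_leaks() {
  grep -nE '^[[:space:]]*ID[[:space:]]+u?int(8|16|32|64)?[[:space:]]+`[^`]*json:"id[",]' "$1"
}

# Made-up Go model containing one leak.
sample=$(mktemp)
cat > "$sample" <<'EOF'
type User struct {
    ID   uint   `json:"id"`
    UUID string `json:"uuid"`
}
EOF

find_id_leaks "$sample" || echo "no ID leaks found"
rm -f "$sample"
```

Applying the fix listed above, changing the tag to `json:"-"` and exposing only the UUID, makes the same check come back empty.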

Integration Status

Current Stage: Manual (soft launch)

Scanner available for manual invocation
Does not block commits automatically
Developers should run proactively

Future Stage: Blocking (after remediation)

Scanner will block commits with CRITICAL/HIGH issues
CI integration will enforce on all PRs
See GORM Scanner Roadmap

Performance

Execution Time: ~2 seconds per full scan
Fast enough for pre-commit use
No impact on commit workflow when passing

Documentation

Implementation Details: docs/implementation/gorm_security_scanner_complete.md
Specification: docs/plans/gorm_security_scanner_spec.md
QA Report: docs/reports/gorm_scanner_qa_report.md
