Commit Graph

654 Commits

Author SHA1 Message Date
GitHub Actions
707c34b4d6 fix(ci): improve Playwright installation steps by removing redundant system dependency installs and enhancing exit code handling 2026-02-04 17:43:49 +00:00
GitHub Actions
1b66257868 fix(ci): enhance Playwright installation steps with system dependencies and cache checks 2026-02-04 17:27:35 +00:00
GitHub Actions
6e3fcf7824 fix: simplify Playwright browser installation steps
Remove overly complex verification logic that was causing all browser
jobs to fail. Browser installation should fail fast and clearly if
there are issues.

Changes:
- Remove multi-line verification scripts from all 3 browser install steps
- Simplify to single command: npx playwright install --with-deps {browser}
- Let install step show actual errors if it fails
- Let test execution show "browser not found" errors if install incomplete

Rationale:
- Previous complex verification (using grep/find) was the failure point
- Simpler approach provides clearer error messages for debugging
- Tests themselves will fail clearly if browsers aren't available

Expected outcome:
- Install steps show actual error messages if they fail
- If install succeeds, tests execute normally
- If install "succeeds" but browser is missing, test step shows clear error

Timeout remains at 45 minutes (accommodates 10-15 min install + execution)
2026-02-04 17:08:30 +00:00
GitHub Actions
3c0b9fa2b1 fix: resolve Playwright browser executable not found errors in CI
Root causes:
1. Browser cache was restoring corrupted/stale binaries from previous runs
2. 30-minute timeout insufficient for fresh Playwright installation (10-15 min)
   plus Docker/health checks and test execution

Changes:
- Remove browser caching from all 3 browser jobs (chromium, firefox, webkit)
- Increase timeout from 30 → 45 minutes for all jobs
- Add diagnostic logging to browser install steps:
  * Install start/completion timestamps
  * Exit code verification
  * Cache directory inspection on failure
  * Browser executable verification using 'npx playwright test --list'

Benefits:
- Fresh browser installations guaranteed (no cache pollution)
- 15-minute buffer prevents premature timeouts
- Detailed diagnostics to catch future installation issues early
- Consistent behavior across all browsers

Technical notes:
- Browser install with --with-deps takes 10-15 minutes per browser
- GitHub Actions cache was causing more harm than benefit (stale binaries)
- Sequential execution (1 shard per browser) combined with fresh installs
  ensures stable, reproducible CI behavior

Expected outcome:
- Firefox/WebKit failures from missing browser executables → resolved
- Chrome timeout at 30 minutes → resolved with 45 minute buffer
- Future installation issues → caught immediately via diagnostics

Refs: #hofix/ci
QA: YAML syntax validated, pre-commit hooks passed (12/12)
2026-02-04 16:44:47 +00:00
Jeremy
40a37f76ac Merge branch 'main' into hotfix/ci 2026-02-04 11:09:04 -05:00
GitHub Actions
e6c2f46475 fix(e2e): update E2E tests workflow to sequential execution and fix race conditions
- Changed workflow name to reflect sequential execution for stability.
- Reduced test sharding from 4 to 1 per browser, resulting in 3 total jobs.
- Updated job summaries and documentation to clarify execution model.
- Added new documentation file for E2E CI failure diagnosis.
- Adjusted job summary tables to reflect changes in shard counts and execution type.
2026-02-04 16:08:11 +00:00
Jeremy
a845b83ef7 fix: Merge branch 'development' 2026-02-04 16:01:22 +00:00
GitHub Actions
7bb88204d2 fix(ci): remove redundant Playwright browser cache cleanup from workflows 2026-02-04 14:42:17 +00:00
GitHub Actions
73f6d3d691 fix(ci): remove redundant image tag determination logic from multiple workflows 2026-02-04 14:24:11 +00:00
GitHub Actions
80c033b812 fix(ci): standardize image tag step ID across integration workflows 2026-02-04 14:16:02 +00:00
Jeremy
0519b4baed Merge branch 'main' into hotfix/ci 2026-02-04 09:10:32 -05:00
GitHub Actions
8edde88f95 fix(ci): add image_tag input for manual triggers in integration workflows 2026-02-04 14:08:36 +00:00
GitHub Actions
e1c7ed3a13 fix(ci): add manual trigger inputs for Cerberus integration workflow 2026-02-04 13:53:01 +00:00
Jeremy
83a695fbdc Merge branch 'feature/beta-release' into development 2026-02-04 05:26:47 -05:00
GitHub Actions
6938d4634c fix(ci): update workflows to support manual triggers and conditional execution based on Docker build success 2026-02-04 10:07:50 +00:00
GitHub Actions
1267b74ace fix(ci): add pull_request triggers to test workflows for PR coverage
workflow_run triggers only fire for push events, not pull_request events,
causing PRs to skip integration and E2E tests entirely. Add dual triggers
to all test workflows so they run for both push (via workflow_run) and
pull_request events, while maintaining single-build architecture.

All workflows still pull pre-built images from docker-build.yml - no
redundant builds introduced. This fixes PR test coverage while preserving
the "Build Once, Test Many" optimization for push events.

Fixes: Build Once architecture (commit 928033ec)
2026-02-04 05:51:58 +00:00
GitHub Actions
721b533e15 fix(docker-build): enhance feature branch tag generation with improved sanitization 2026-02-04 05:17:19 +00:00
GitHub Actions
1a8df0c732 refactor(docker-build): simplify feature branch tag generation in workflow 2026-02-04 05:00:46 +00:00
GitHub Actions
4a2c3b4631 refactor(docker-build): improve Docker build command handling with array arguments for tags and labels 2026-02-04 04:55:58 +00:00
GitHub Actions
ac39eb6866 refactor(docker-build): optimize Docker build command handling and improve readability 2026-02-04 04:50:48 +00:00
GitHub Actions
6b15aaad08 fix(workflow): enhance Docker build process for PRs and feature branches 2026-02-04 04:46:41 +00:00
GitHub Actions
928033ec37 chore(ci): implement "build once, test many" architecture
Restructures CI/CD pipeline to eliminate redundant Docker image builds
across parallel test workflows. Previously, every PR triggered 5 separate
builds of identical images, consuming compute resources unnecessarily and
contributing to registry storage bloat.

Registry storage was growing at 20GB/week due to unmanaged transient tags
from multiple parallel builds. While automated cleanup exists, preventing
the creation of redundant images is more efficient than cleaning them up.

Changes CI/CD orchestration so docker-build.yml is the single source of
truth for all Docker images. Integration tests (CrowdSec, Cerberus, WAF,
Rate Limiting) and E2E tests now wait for the build to complete via
workflow_run triggers, then pull the pre-built image from GHCR.

PR and feature branch images receive immutable tags that include commit
SHA (pr-123-abc1234, feature-dns-provider-def5678) to prevent race
conditions when branches are updated during test execution. Tag
sanitization handles special characters, slashes, and name length limits
to ensure Docker compatibility.

Adds retry logic for registry operations to handle transient GHCR
failures, with dual-source fallback to artifact downloads when registry
pulls fail. Preserves all existing functionality and backward
compatibility while reducing parallel build count from 5× to 1×.

Security scanning now covers all PR images (previously skipped),
blocking merges on CRITICAL/HIGH vulnerabilities. Concurrency groups
prevent stale test runs from consuming resources when PRs are updated
mid-execution.

Expected impact: 80% reduction in compute resources, 4× faster
total CI time (120min → 30min), prevention of uncontrolled registry
storage growth, and 100% consistency guarantee (all tests validate
the exact same image that would be deployed).

Closes #[issue-number-if-exists]
2026-02-04 04:42:42 +00:00
GitHub Actions
daef23118a test(crowdsec): add LAPI connectivity tests and enhance integration test reporting 2026-02-04 01:56:56 +00:00
Jeremy
da6682000e Merge branch 'feature/beta-release' into renovate/feature/beta-release-weekly-non-major-updates 2026-02-03 14:55:10 -05:00
GitHub Actions
b6a189c927 fix(security): add CrowdSec diagnostics script and E2E tests for console enrollment and diagnostics
- Implemented `diagnose-crowdsec.sh` script for checking CrowdSec connectivity and configuration.
- Added E2E tests for CrowdSec console enrollment, including API checks for enrollment status, diagnostics connectivity, and configuration validation.
- Created E2E tests for CrowdSec diagnostics, covering configuration file validation, connectivity checks, and configuration export.
2026-02-03 18:26:32 +00:00
renovate[bot]
6d746385c3 chore(deps): update actions/checkout digest to de0fac2 2026-02-03 17:20:33 +00:00
renovate[bot]
df80c49070 chore(deps): update github/codeql-action digest to 6bc82e0 2026-02-03 07:15:37 +00:00
GitHub Actions
4178910eac refactor: streamline supply chain workflows by removing Syft and Grype installations and utilizing official Anchore actions for SBOM generation and vulnerability scanning 2026-02-03 07:09:54 +00:00
GitHub Actions
de66689b79 fix: update SYFT and GRYPE versions to include SHA256 digests for improved security 2026-02-03 06:40:50 +00:00
GitHub Actions
07e8261ecb chore(e2e): update concurrency settings to prevent cancellation of in-progress E2E tests 2026-02-03 04:18:37 +00:00
GitHub Actions
3ecc4015a6 refactor(workflows): simplify E2E Tests workflow name by removing 'Split Browsers' suffix 2026-02-03 00:56:00 +00:00
GitHub Actions
19e74f2122 refactor(workflows): standardize workflow names by removing 'Tests' suffix 2026-02-03 00:51:06 +00:00
GitHub Actions
d6cbc407fd fix(e2e): update Docker build-push-action version in E2E tests workflow 2026-02-03 00:06:01 +00:00
GitHub Actions
641588367b chore(diagnostics): Add comprehensive diagnostic tools for E2E testing
- Create phase1_diagnostics.md to document findings from test interruptions
- Introduce phase1_validation_checklist.md for pre-deployment validation
- Implement diagnostic-helpers.ts for enhanced logging and state capture
- Enable browser console logging, error tracking, and dialog lifecycle monitoring
- Establish performance monitoring for test execution times
- Document actionable recommendations for Phase 2 remediation
2026-02-03 00:02:45 +00:00
renovate[bot]
a8b24eb8f9 chore(deps): update weekly-non-major-updates 2026-02-02 21:50:07 +00:00
Jeremy
5951a16984 Merge branch 'development' into renovate/development-actions-github-script-8.x 2026-02-02 16:47:26 -05:00
Jeremy
eb66cda0f4 Merge branch 'development' into renovate/development-weekly-non-major-updates 2026-02-02 16:46:46 -05:00
Jeremy
2d31c86d91 Merge branch 'development' into renovate/development-pin-dependencies 2026-02-02 16:46:22 -05:00
Jeremy
9c41c1f331 Merge branch 'development' into renovate/development-peter-evans-create-pull-request-8.x 2026-02-02 16:45:12 -05:00
Jeremy
3bb7098220 Merge branch 'feature/beta-release' into renovate/feature/beta-release-weekly-non-major-updates 2026-02-02 16:44:12 -05:00
GitHub Actions
3414576f60 fix(e2e): implement performance tracking for shard execution and API call metrics 2026-02-02 21:32:27 +00:00
renovate[bot]
dd28a0d819 chore(deps): update actions/github-script action to v8 2026-02-02 21:25:41 +00:00
renovate[bot]
ffcfb40919 chore(deps): update weekly-non-major-updates 2026-02-02 21:25:36 +00:00
renovate[bot]
e2562d27df chore(deps): pin dependencies 2026-02-02 21:25:31 +00:00
renovate[bot]
8908a37dbf chore(deps): update peter-evans/create-pull-request action to v8 2026-02-02 21:23:55 +00:00
renovate[bot]
38453169c5 chore(deps): update actions/checkout action to v6 2026-02-02 21:23:51 +00:00
renovate[bot]
22c2e10f64 chore(deps): update weekly-non-major-updates 2026-02-02 21:23:46 +00:00
Jeremy
44d425d51d Merge branch 'feature/beta-release' into renovate/feature/beta-release-peter-evans-create-pull-request-8.x 2026-02-02 09:55:06 -05:00
renovate[bot]
280e7b9c19 chore(deps): pin peter-evans/create-pull-request action to c5a7806 2026-02-02 14:53:28 +00:00
Jeremy
a92e49604f Merge branch 'feature/beta-release' into renovate/feature/beta-release-peter-evans-create-pull-request-8.x 2026-02-02 09:48:59 -05:00