akanealw/Charon

Fork 0

Files

T

GitHub Actions c43976f84a fix: add LAPI availability check for console enrollment and update UI warnings

2025-12-15 07:30:35 +00:00

17 KiB

Raw Blame History

Cerberus Technical Documentation

This document is for developers and advanced users who want to understand how Cerberus works under the hood.

Looking for the user guide? See Security Features instead.

What Is Cerberus?

Cerberus is the optional security suite built into Charon. It includes:

WAF (Web Application Firewall) — Inspects requests for malicious payloads
CrowdSec — Blocks IPs based on behavior and reputation
Access Lists — Static allow/deny rules (IP, CIDR, geo)
Rate Limiting — Volume-based abuse prevention (placeholder)

All components are disabled by default and can be enabled independently.

Architecture

Request Flow

When a request hits Charon:

Check if Cerberus is enabled (global setting + dynamic database flag)
WAF evaluation (if waf_mode != disabled)
- Increment charon_waf_requests_total metric
- Check payload against loaded rulesets
- If suspicious:
  - block mode: Return 403 + increment charon_waf_blocked_total
  - monitor mode: Log + increment charon_waf_monitored_total
ACL evaluation (if enabled)
- Test client IP against active access lists
- First denial = 403 response
CrowdSec check (placeholder for future)
Rate limit check (placeholder for future)
Pass to downstream handler (if not blocked)

Middleware Integration

Cerberus runs as Gin middleware on all /api/v1 routes:

r.Use(cerberusMiddleware.RequestLogger())

This means it protects the management API but does not directly inspect traffic to proxied websites (that happens in Caddy).

Threat Model & Protection Coverage

What Cerberus Protects

Threat Category	CrowdSec	ACL	WAF	Rate Limit
Known attackers (IP reputation)	✅	❌	❌	❌
Geo-based attacks	❌	✅	❌	❌
SQL Injection (SQLi)	❌	❌	✅	❌
Cross-Site Scripting (XSS)	❌	❌	✅	❌
Remote Code Execution (RCE)	❌	❌	✅	❌
Zero-Day Web Exploits	⚠️	❌	✅	❌
DDoS / Volume attacks	❌	❌	❌	✅
Brute-force login attempts	✅	❌	❌	✅
Credential stuffing	✅	❌	❌	✅

Legend:

✅ Full protection
⚠️ Partial protection (time-delayed)
❌ Not designed for this threat

Zero-Day Exploit Protection (WAF)

The WAF provides pattern-based detection for zero-day exploits:

How It Works:

Attacker discovers new vulnerability (e.g., SQLi in your login form)
Attacker crafts exploit: ' OR 1=1--
WAF inspects request → matches SQL injection pattern → BLOCKED
Your application never sees the malicious input

Limitations:

Only protects HTTP/HTTPS traffic
Cannot detect completely novel attack patterns (rare)
Does not protect against logic bugs in application code

Effectiveness:

~90% of zero-day web exploits use known patterns (SQLi, XSS, RCE)
~10% are truly novel and may bypass WAF until rules are updated

Request Processing Pipeline

1. [CrowdSec]      Check IP reputation → Block if known attacker
2. [ACL]           Check IP/Geo rules → Block if not allowed
3. [WAF]           Inspect request payload → Block if malicious pattern
4. [Rate Limit]    Count requests → Block if too many
5. [Proxy]         Forward to upstream service

Configuration Model

Database Schema

SecurityConfig table:

type SecurityConfig struct {
    ID                   uint   `gorm:"primaryKey"`
    Name                 string `json:"name"`
    Enabled              bool   `json:"enabled"`
    AdminWhitelist       string `json:"admin_whitelist"`        // CSV of IPs/CIDRs
    CrowdsecMode         string `json:"crowdsec_mode"`          // disabled, local, external
    CrowdsecAPIURL       string `json:"crowdsec_api_url"`
    CrowdsecAPIKey       string `json:"crowdsec_api_key"`
    WafMode              string `json:"waf_mode"`               // disabled, monitor, block
    WafRulesSource       string `json:"waf_rules_source"`       // Ruleset identifier
    WafLearning          bool   `json:"waf_learning"`
    RateLimitEnable      bool   `json:"rate_limit_enable"`
    RateLimitBurst       int    `json:"rate_limit_burst"`
    RateLimitRequests    int    `json:"rate_limit_requests"`
    RateLimitWindowSec   int    `json:"rate_limit_window_sec"`
}

Environment Variables (Fallbacks)

If no database config exists, Charon reads from environment:

CERBERUS_SECURITY_WAF_MODE — disabled | monitor | block
🚨 DEPRECATED: CERBERUS_SECURITY_CROWDSEC_MODE — Use GUI toggle instead (see below)
🚨 DEPRECATED: CERBERUS_SECURITY_CROWDSEC_API_URL — External mode is no longer supported
🚨 DEPRECATED: CERBERUS_SECURITY_CROWDSEC_API_KEY — External mode is no longer supported
CERBERUS_SECURITY_ACL_ENABLED — true | false
CERBERUS_SECURITY_RATELIMIT_ENABLED — true | false

⚠️ IMPORTANT: The CHARON_SECURITY_CROWDSEC_MODE (and legacy CERBERUS_SECURITY_CROWDSEC_MODE, CPM_SECURITY_CROWDSEC_MODE) environment variables are DEPRECATED as of version 2.0. CrowdSec is now GUI-controlled through the Security dashboard, just like WAF, ACL, and Rate Limiting.

Why the change?

CrowdSec now works like all other security features (GUI-based)
No need to restart containers to enable/disable CrowdSec
Better integration with Charon's security orchestration
The import config feature replaced the need for external mode

Migration: If you have CHARON_SECURITY_CROWDSEC_MODE=local in your docker-compose.yml, remove it and use the GUI toggle instead. See Migration Guide for step-by-step instructions.

WAF (Web Application Firewall)

Current Implementation

Status: Prototype with placeholder detection

The current WAF checks for <script> tags as a proof-of-concept. Full OWASP CRS integration is planned.

func (w *WAF) EvaluateRequest(r *http.Request) (Decision, error) {
    if strings.Contains(r.URL.Query().Get("q"), "<script>") {
        return Decision{Action: "block", Reason: "XSS detected"}, nil
    }
    return Decision{Action: "allow"}, nil
}

Future: Coraza Integration

Planned integration with Coraza WAF and OWASP Core Rule Set:

waf, err := coraza.NewWAF(coraza.NewWAFConfig().
    WithDirectives(loadedRuleContent))

This will provide production-grade detection of:

SQL injection
Cross-site scripting (XSS)
Remote code execution
File inclusion attacks
And more

Rulesets

SecurityRuleSet table stores rule definitions:

type SecurityRuleSet struct {
    ID         uint   `gorm:"primaryKey"`
    Name       string `json:"name"`
    SourceURL  string `json:"source_url"`  // Optional URL for rule updates
    Mode       string `json:"mode"`        // owasp, custom
    Content    string `json:"content"`     // Raw rule text
}

Manage via /api/v1/security/rulesets.

Prometheus Metrics

charon_waf_requests_total{mode="block|monitor"} — Total requests evaluated
charon_waf_blocked_total{mode="block"} — Requests blocked
charon_waf_monitored_total{mode="monitor"} — Requests logged but not blocked

Scrape from /metrics endpoint (no auth required).

Structured Logging

WAF decisions emit JSON-like structured logs:

{
  "source": "waf",
  "decision": "block",
  "mode": "block",
  "path": "/api/v1/proxy-hosts",
  "query": "name=<script>alert(1)</script>",
  "ip": "203.0.113.50"
}

Use these for dashboard creation and alerting.

Access Control Lists (ACLs)

How They Work

Each AccessList defines:

Type: whitelist | blacklist | geo_whitelist | geo_blacklist | local_only
IPs: Comma-separated IPs or CIDR blocks
Countries: Comma-separated ISO country codes (US, GB, FR, etc.)

Evaluation logic:

Whitelist: If IP matches list → allow; else → deny
Blacklist: If IP matches list → deny; else → allow
Geo Whitelist: If country matches → allow; else → deny
Geo Blacklist: If country matches → deny; else → allow
Local Only: If RFC1918 private IP → allow; else → deny

Multiple ACLs can be assigned to a proxy host. The first denial wins.

GeoIP Database

Uses MaxMind GeoLite2-Country database:

Path configured via CHARON_GEOIP_DB_PATH
Default: /app/data/GeoLite2-Country.mmdb (Docker)
Update monthly from MaxMind for accuracy

CrowdSec Integration

GUI-Based Control (Current Architecture)

CrowdSec is now GUI-controlled, matching the pattern used by WAF, ACL, and Rate Limiting. The environment variable control (CHARON_SECURITY_CROWDSEC_MODE) is deprecated and will be removed in a future version.

How to Enable CrowdSec

Step 1: Access Security Dashboard

Navigate to Security in the sidebar
Find the CrowdSec card
Toggle the switch to ON
Wait 10-15 seconds for LAPI to start
Verify status shows "Active" with a running PID

Step 2: Verify LAPI is Running

docker exec charon cscli lapi status

Expected output:

✓ You can successfully interact with Local API (LAPI)

Step 3: (Optional) Enroll in CrowdSec Console

Once LAPI is running, you can enroll your instance:

Go to Cerberus → CrowdSec
Enable the Console enrollment feature flag (if not already enabled)
Click Enroll with CrowdSec Console
Paste your enrollment token from crowdsec.net
Submit

Prerequisites for Console Enrollment:

✅ CrowdSec must be enabled via GUI toggle
✅ LAPI must be running (verify with cscli lapi status)
✅ Feature flag feature.crowdsec.console_enrollment must be enabled
✅ Valid enrollment token from crowdsec.net

⚠️ Important: Console enrollment requires an active LAPI connection. If LAPI is not running, the enrollment will appear successful locally but won't register on crowdsec.net.

How CrowdSec Works in Charon

Startup Flow:

Container starts → CrowdSec config initialized (but agent NOT started)
User toggles CrowdSec switch in GUI → Frontend calls /api/v1/admin/crowdsec/start
Backend handler starts LAPI process → PID tracked in backend
User can verify status in Security dashboard
User toggles OFF → Backend calls /api/v1/admin/crowdsec/stop

This matches the pattern used by other security features:

Feature	Control Method	Status Endpoint	Lifecycle Handler
Cerberus	GUI Toggle	`/security/status`	N/A (master switch)
WAF	GUI Toggle	`/security/status`	Config regeneration
ACL	GUI Toggle	`/security/status`	Config regeneration
Rate Limit	GUI Toggle	`/security/status`	Config regeneration
CrowdSec	✅ GUI Toggle	`/security/status`	Start/Stop handlers

Import Config Feature

The import config feature (importCrowdsecConfig) allows you to:

Upload a complete CrowdSec configuration (tar.gz)
Import pre-configured settings, collections, and bouncers
Manage CrowdSec entirely through Charon's GUI

This replaced the need for "external" mode:

Old way (deprecated): Set CROWDSEC_MODE=external and point to external LAPI
New way: Import your existing config and let Charon manage it internally

Troubleshooting

Problem: Console enrollment shows "enrolled" locally but doesn't appear on crowdsec.net

Solution: LAPI must be running before enrollment. Check with:

docker exec charon cscli lapi status

If LAPI is not running:

Go to Security dashboard
Toggle CrowdSec OFF, then ON again
Wait 15 seconds
Verify LAPI is running
Re-submit enrollment token

Problem: CrowdSec won't start after toggling

Solution: Check logs:

docker logs charon

Common issues:

Config directory missing (should auto-create)
Permissions issues (should be handled by entrypoint)
Port 8085 already in use (check for conflicting services)

Security Decisions

The SecurityDecision table logs all security actions:

type SecurityDecision struct {
    ID        uint      `gorm:"primaryKey"`
    Source    string    `json:"source"`    // waf, crowdsec, acl, ratelimit, manual
    IPAddress string    `json:"ip_address"`
    Action    string    `json:"action"`    // allow, block, challenge
    Reason    string    `json:"reason"`
    Timestamp time.Time `json:"timestamp"`
}

Use cases:

Audit trail for compliance
UI visibility into recent blocks
Manual override tracking

Self-Lockout Prevention

Admin Whitelist

Purpose: Prevent admins from blocking themselves

Implementation:

Stored in SecurityConfig.admin_whitelist as CSV
Checked before applying any block decision
If requesting IP matches whitelist → always allow

Recommendation: Add your VPN IP, Tailscale IP, or home network before enabling Cerberus.

Break-Glass Token

Purpose: Emergency disable when locked out

How it works:

Generate via POST /api/v1/security/breakglass/generate
Returns one-time token (plaintext, never stored hashed)
Token can be used in POST /api/v1/security/disable to turn off Cerberus
Token expires after first use

Storage: Tokens are hashed in database using bcrypt.

Localhost Bypass

Requests from 127.0.0.1 or ::1 may bypass security checks (configurable). Allows local management access even when locked out.

API Reference

Status

GET /api/v1/security/status

Returns:

{
  "enabled": true,
  "waf_mode": "monitor",
  "crowdsec_mode": "local",
  "acl_enabled": true,
  "ratelimit_enabled": false
}

Enable Cerberus

POST /api/v1/security/enable
Content-Type: application/json

{
  "admin_whitelist": "198.51.100.10,203.0.113.0/24"
}

Requires either:

admin_whitelist with at least one IP/CIDR
OR valid break-glass token in header

Disable Cerberus

POST /api/v1/security/disable

Requires either:

Request from localhost
OR valid break-glass token in header

Get/Update Config

GET /api/v1/security/config
POST /api/v1/security/config

See SecurityConfig schema above.

Rulesets

GET /api/v1/security/rulesets
POST /api/v1/security/rulesets
DELETE /api/v1/security/rulesets/:id

Decisions (Audit Log)

GET /api/v1/security/decisions?limit=50
POST /api/v1/security/decisions  # Manual override

Testing

Integration Test

Run the Coraza integration test:

bash scripts/coraza_integration.sh

Or via Go:

cd backend
go test -tags=integration ./integration -run TestCorazaIntegration -v

Manual Testing

Enable WAF in monitor mode
Send request with <script> in query string
Check /api/v1/security/decisions for logged attempt
Switch to block mode
Repeat — should receive 403

Observability

Recommended Dashboards

Block Rate:

rate(charon_waf_blocked_total[5m]) / rate(charon_waf_requests_total[5m])

Monitor vs Block Comparison:

rate(charon_waf_monitored_total[5m])
rate(charon_waf_blocked_total[5m])

Alerting Rules

High block rate (potential attack):

alert: HighWAFBlockRate
expr: rate(charon_waf_blocked_total[5m]) > 0.3
for: 10m
annotations:
  summary: "WAF blocking >30% of requests"

No WAF evaluation (misconfiguration):

alert: WAFNotEvaluating
expr: rate(charon_waf_requests_total[10m]) == 0
for: 15m
annotations:
  summary: "WAF received zero requests, check middleware config"

Development Roadmap

Phase	Feature	Status
1	WAF placeholder + metrics	✅ Complete
2	ACL implementation	✅ Complete
3	Break-glass token	✅ Complete
4	Coraza CRS integration	📋 Planned
5	CrowdSec local agent	📋 Planned
6	Rate limiting enforcement	📋 Planned
7	Adaptive learning/tuning	🔮 Future

FAQ

Why is the WAF just a placeholder?

We wanted to ship the architecture and observability first. This lets you enable monitoring, see the metrics, and prepare dashboards before the full rule engine is integrated.

Can I use my own WAF rules?

Yes, via /api/v1/security/rulesets. Upload custom Coraza-compatible rules.

Does Cerberus protect Caddy's proxy traffic?

Not yet. Currently it only protects the management API (/api/v1). Future versions will integrate directly with Caddy's request pipeline to protect proxied traffic.

Why is monitor mode still blocking?

Known issue with the placeholder implementation. This will be fixed when Coraza integration is complete.

17 KiB Raw Blame History

Cerberus Technical Documentation

What Is Cerberus?

Architecture

Request Flow

Middleware Integration

Threat Model & Protection Coverage

What Cerberus Protects

Zero-Day Exploit Protection (WAF)

Request Processing Pipeline

Configuration Model

Database Schema

Environment Variables (Fallbacks)

WAF (Web Application Firewall)

Current Implementation

Future: Coraza Integration

Rulesets

Prometheus Metrics

Structured Logging

Access Control Lists (ACLs)

How They Work

GeoIP Database

CrowdSec Integration

GUI-Based Control (Current Architecture)

How to Enable CrowdSec

How CrowdSec Works in Charon

Import Config Feature

Troubleshooting

Security Decisions

Self-Lockout Prevention

Admin Whitelist

Break-Glass Token

Localhost Bypass

API Reference

Status

Enable Cerberus

Disable Cerberus

Get/Update Config

Rulesets

Decisions (Audit Log)

Testing

Integration Test

Manual Testing

Observability

Recommended Dashboards

Alerting Rules

Development Roadmap

FAQ

Why is the WAF just a placeholder?

Can I use my own WAF rules?

Does Cerberus protect Caddy's proxy traffic?

Why is monitor mode still blocking?

See Also

17 KiB

Raw Blame History