Security

Autonomous action constraints, data model, secrets, and self-hosting threat model.

Self-Hosted Workflow Boundary: founder workflows stay separated from operator controls and HELM-governed external actions.

Production-grade security configuration for Pilot.

Audience

Use this page if you are operating Pilot in production, reviewing autonomous-action constraints, hardening connectors, or checking the self-hosting threat model. It is for operators, platform engineers, and security reviewers.

Outcome

After this page you should be able to:

generate and rotate required secrets;
explain how sessions, API keys, connector tokens, and session snapshots are protected;
understand autonomous-action constraints and fail-closed behavior;
configure OAuth, webhooks, CORS, TLS, rate limits, backups, and audit logs;
distinguish Pilot application security from HELM governance responsibilities.

Security Boundary

flowchart TD
    subgraph Ingestion["1. Ingestion & Context Plane"]
        User["User or Telegram"]
        Workspace["Workspace tenancy"]
        Helm["HELM governance"]
        Secrets["Encrypted token/session storage"]
    end

    subgraph Evaluation["2. Evaluation & Policy Plane"]
        Gateway["Gateway auth and rate limits"]
        Policy["Trust boundary policy"]
        Approval["Approval gates"]
        Audit["Audit events"]
    end

    subgraph Ledger["4. Tamper-Evident Ledger Plane"]
        Receipts["Receipts and evidence packs"]
    end

    %% Operational Flow Edges
    User --> Gateway
    Gateway --> Workspace
    Workspace --> Policy
    Policy --> Approval
    Policy --> Helm
    Gateway --> Secrets
    Gateway --> Audit
    Helm --> Receipts

    %% Premium Styling Rules
    style Gateway fill:#2d3748,stroke:#4a5568,stroke-width:2px,color:#fff
    style Policy fill:#2d3748,stroke:#4a5568,stroke-width:2px,color:#fff
    style Approval fill:#2d3748,stroke:#4a5568,stroke-width:2px,color:#fff
    style Audit fill:#2d3748,stroke:#4a5568,stroke-width:2px,color:#fff
    style Receipts fill:#2f855a,stroke:#276749,stroke-width:2px,color:#fff

Source Truth

Security guidance is backed by:

services/gateway/src/middleware/
services/gateway/src/routes/auth.ts
services/gateway/src/routes/connectors*
services/orchestrator/src/trust.ts
packages/db/src/schema/
scripts/rotate-encryption-key.ts
scripts/rotate-master-key.ts
docs/helm-integration.md
docs/degradation-matrix.md

If code and this guide disagree, update the guide before publishing.

Secrets Management

Required Secrets

Every production deployment must generate unique values for:

# Generate all secrets at once
export SESSION_SECRET=$(openssl rand -hex 32)
export ENCRYPTION_KEY=$(openssl rand -hex 32)
export TELEGRAM_WEBHOOK_SECRET=$(openssl rand -hex 32)

Secret	Purpose	Risk if compromised
`SESSION_SECRET`	HMAC signing for session tokens and OAuth state	Session forgery, CSRF bypass
`ENCRYPTION_KEY`	AES-256-GCM encryption of connector OAuth tokens	Token theft for GitHub/Gmail/Drive
`TELEGRAM_WEBHOOK_SECRET`	HMAC validation of incoming Telegram webhooks	Spoofed bot commands

⚠️ Never use the default dev values in production. The launch-gate.sh script will flag this.
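
A startup guard can catch missing or placeholder secrets before the gateway accepts traffic. The following is a minimal sketch, not the check that launch-gate.sh performs: the function name findWeakSecrets and the placeholder list are hypothetical, and the 64-character rule simply mirrors the output of openssl rand -hex 32.

```typescript
// Hypothetical startup guard. The placeholder list is an assumption for
// illustration and is not taken from Pilot's launch-gate.sh.
const REQUIRED_SECRETS = [
  "SESSION_SECRET",
  "ENCRYPTION_KEY",
  "TELEGRAM_WEBHOOK_SECRET",
] as const;

const KNOWN_PLACEHOLDERS = new Set(["change-me-in-production"]);

function findWeakSecrets(env: Record<string, string | undefined>): string[] {
  const problems: string[] = [];
  for (const name of REQUIRED_SECRETS) {
    const value = env[name];
    if (!value) {
      problems.push(`${name} is not set`);
    } else if (KNOWN_PLACEHOLDERS.has(value)) {
      problems.push(`${name} uses a known placeholder value`);
    } else if (!/^[0-9a-f]{64}$/i.test(value)) {
      // `openssl rand -hex 32` yields exactly 64 hex characters.
      problems.push(`${name} is not a 64-character hex string`);
    }
  }
  return problems;
}
```

A caller would typically pass process.env and refuse to start when the returned list is non-empty.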

Secret Rotation

Rotating secrets requires careful coordination:

SESSION_SECRET rotation: All active sessions become invalid. Users must re-authenticate.
ENCRYPTION_KEY rotation: Use the rotation tool (below) to re-encrypt tokens without losing them.
TELEGRAM_WEBHOOK_SECRET rotation: After changing the secret, re-register the webhook through the Telegram API so that the webhook uses the new value.

ENCRYPTION_KEY Rotation Procedure

# 1. Generate the new key
NEW_KEY=$(openssl rand -hex 32)

# 2. Dry run to see how many rows will rotate
ENCRYPTION_KEY_OLD=$CURRENT_ENCRYPTION_KEY \
ENCRYPTION_KEY_NEW=$NEW_KEY \
DATABASE_URL=$PROD_DATABASE_URL \
  tsx scripts/rotate-encryption-key.ts --dry-run

# 3. Run the rotation (writes to DB)
ENCRYPTION_KEY_OLD=$CURRENT_ENCRYPTION_KEY \
ENCRYPTION_KEY_NEW=$NEW_KEY \
DATABASE_URL=$PROD_DATABASE_URL \
  tsx scripts/rotate-encryption-key.ts

# 4. Update .env.production.pilot, then redeploy through the DO script
DO_DROPLET_IP=<ip> bash infra/digitalocean/deploy.sh deploy

# 5. Verify — a subsequent agent run that uses a connector token should succeed

The rotation is idempotent per row; failed rows are logged and skipped so the rest continue.

Prompt Injection Defense

The agent loop treats all user-controlled and tool-output content as untrusted data, not instructions. Strategy:

Tagged context blocks. Untrusted content (task context, operator goal, role, tool outputs) is JSON-encoded and wrapped in <context tag="...">...</context> tags. The LLM sees explicit framing, not raw prose.
System-level instruction. The plan prompt begins with a SECURITY NOTICE that tells the model content inside <context> blocks is untrusted.
Tool allowlist. The tool registry presents only the tools available for the current mode; requests for any other tool are rejected by the trust boundary.
Trust boundary checks. Before executing any tool call, the TrustBoundary evaluates kill switches, blocklists, budget, connector scope, and approval requirements. Fail-closed.
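
The tagged-context step can be sketched as follows. This is an illustrative helper, not the orchestrator's code: the name wrapUntrusted is hypothetical, and escaping "<" after JSON encoding is an extra precaution assumed here so that a payload cannot contain a literal closing tag.

```typescript
// Hypothetical helper. The function name and the extra "<" escaping are
// assumptions for illustration, not Pilot's actual implementation.
function wrapUntrusted(tag: string, value: unknown): string {
  if (!/^[a-z0-9_-]+$/i.test(tag)) {
    throw new Error(`invalid context tag: ${tag}`);
  }
  // JSON-encode so the payload is a single data literal, then escape "<"
  // so the payload cannot close the <context> block early. JSON parsers
  // decode \u003c back to "<", so no information is lost.
  const encoded = (JSON.stringify(value) ?? "null").replace(/</g, "\\u003c");
  return `<context tag="${tag}">${encoded}</context>`;
}
```

With this shape, a tool output containing "</context>" still produces exactly one closing tag, at the end of the block.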

Known gaps:

LLMs can still be convinced to misuse allowed tools in unexpected ways. As defense in depth, sensitive tools (email send, financial actions, external posts) are approval-gated.
The model may leak short strings from context into its reply. Do not place credentials, other users' data, or raw secrets into agent-visible context.

Testing: See services/orchestrator/src/__tests__/agent-loop.test.ts for injection-resistance assertions.

Account Deletion

Authenticated users can delete their account via:

DELETE /api/users/me
Authorization: Bearer <session-token>

Behaviour:

The user row is deleted. FK cascades clean up sessions, api_keys, and workspace_members.
Founder profile rows are kept; their user reference is set to NULL (the FK policy is set null).
Any workspace where the user was the sole member is also deleted, cascading to its tasks, operators, audit_log, etc.
Workspaces with other members are left intact; the user is just unlinked.

Admins may execute the same deletion on behalf of a user via a direct DB query; follow the same sequence.

Authentication

Session Tokens

30-day expiry, stored in database
Transmitted via Authorization: Bearer <token> header
Session can be revoked via DELETE /api/auth/session

Email Magic Codes

15-minute expiry, stored as SESSION_SECRET-keyed HMAC digests rather than plaintext codes
Timing-safe comparison during verification
One-time use: successful verification deletes the pending code before creating the session
Failed verification increments the pending-code attempt counter and deletes the code after 5 attempts
Request and verification outcomes write best-effort audit entries
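
The magic-code rules above can be sketched with Node's crypto module. This is a simplified illustration under stated assumptions: the in-memory Map stands in for the pending-code table, and the names PendingCode, digestCode, and verifyCode are hypothetical.

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

const MAX_ATTEMPTS = 5;

// Hypothetical shape of a pending-code row; Pilot's schema may differ.
interface PendingCode {
  digest: Buffer; // HMAC-SHA256 of the code, never the plaintext code
  expiresAt: number; // epoch milliseconds
  attempts: number;
}

type VerifyResult = "ok" | "invalid" | "expired" | "locked";

function digestCode(secret: string, code: string): Buffer {
  return createHmac("sha256", secret).update(code).digest();
}

function verifyCode(
  store: Map<string, PendingCode>,
  secret: string,
  email: string,
  code: string,
  now: number = Date.now(),
): VerifyResult {
  const pending = store.get(email);
  if (!pending) return "invalid";
  if (now > pending.expiresAt) {
    store.delete(email);
    return "expired";
  }
  // Both digests are 32 bytes, so timingSafeEqual cannot throw on length.
  if (timingSafeEqual(digestCode(secret, code), pending.digest)) {
    store.delete(email); // one-time use: delete before creating the session
    return "ok";
  }
  pending.attempts += 1;
  if (pending.attempts >= MAX_ATTEMPTS) {
    store.delete(email); // too many failures: the code is discarded
    return "locked";
  }
  return "invalid";
}
```

Comparing digests rather than plaintext codes means a database read alone does not reveal a usable code.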

API Keys

365-day expiry, stored as SHA-256 hash (never plaintext)
Transmitted via X-API-Key: <key> header
One-time display on creation (hash is not reversible)
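
Hash-only storage for API keys might look like the sketch below. The key format (32 random bytes, base64url-encoded) and the function names are assumptions for illustration; only the SHA-256 hashing is taken from the description above.

```typescript
import { createHash, randomBytes } from "node:crypto";

// SHA-256 of the presented key; this hex digest is the only value persisted.
function hashApiKey(key: string): string {
  return createHash("sha256").update(key).digest("hex");
}

// Hypothetical key format: 32 random bytes, base64url-encoded.
function createApiKey(): { plaintext: string; hash: string } {
  const plaintext = randomBytes(32).toString("base64url");
  // Return the plaintext once to the caller; store only the hash.
  return { plaintext, hash: hashApiKey(plaintext) };
}
```

On each request, the server hashes the X-API-Key header value and looks up the resulting digest, so the plaintext never needs to be stored.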

Rate Limiting

Built-in rate limiting by endpoint category:

Endpoint	Limit	Window
`/api/auth/*`	5 req	1 min
`/api/connectors/*/grant`	10 req	1 min
`/api/connectors/*/token`	10 req	1 min
`/api/tasks`	30 req	1 min
`/api/*` (general)	100 req	1 min

For production, consider adding an external rate limiter (Cloudflare, nginx) for DDoS protection.
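
To illustrate how per-category limits like those in the table behave, here is a minimal fixed-window counter. It is a sketch only: the guide does not state which algorithm the gateway uses, and the rule patterns, the class name FixedWindowLimiter, and the first-match-wins ordering are assumptions.

```typescript
interface Rule {
  pattern: RegExp;
  limit: number;
  windowMs: number;
}

// Most specific rules first; the first matching rule wins (assumption).
const RULES: Rule[] = [
  { pattern: /^\/api\/auth\//, limit: 5, windowMs: 60_000 },
  { pattern: /^\/api\/connectors\/[^/]+\/grant$/, limit: 10, windowMs: 60_000 },
  { pattern: /^\/api\/connectors\/[^/]+\/token$/, limit: 10, windowMs: 60_000 },
  { pattern: /^\/api\/tasks(\/|$)/, limit: 30, windowMs: 60_000 },
  { pattern: /^\/api\//, limit: 100, windowMs: 60_000 },
];

class FixedWindowLimiter {
  private counters = new Map<string, { windowStart: number; count: number }>();

  // Returns true when the request is within the limit for its category.
  allow(clientId: string, path: string, now: number = Date.now()): boolean {
    const rule = RULES.find((r) => r.pattern.test(path));
    if (!rule) return true; // path is outside /api/*, not limited here
    const key = `${rule.pattern.source}:${clientId}`;
    const entry = this.counters.get(key);
    if (!entry || now - entry.windowStart >= rule.windowMs) {
      // Start a new window and count this request as the first one.
      this.counters.set(key, { windowStart: now, count: 1 });
      return true;
    }
    entry.count += 1;
    return entry.count <= rule.limit;
  }
}
```

An in-process counter like this does not share state across gateway instances, which is one reason an external limiter is still worth considering.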

OAuth Security

CSRF Protection

OAuth flows use HMAC-signed state parameters:

State = {connectorId}:{workspaceId}:{nonce}:{hmac}
HMAC computed with SESSION_SECRET
States expire after 10 minutes
Each state is single-use (deleted after callback)
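
The state format can be sketched as follows. This is an illustration, not the code in services/gateway/src/routes/auth.ts: the function names are hypothetical, HMAC-SHA256 with hex encoding is assumed, and the sketch assumes that IDs contain no colon. Expiry and single-use enforcement need server-side storage of the nonce and are omitted here.

```typescript
import { createHmac, randomBytes, timingSafeEqual } from "node:crypto";

interface OAuthState {
  connectorId: string;
  workspaceId: string;
  nonce: string;
}

function signPayload(secret: string, payload: string): string {
  return createHmac("sha256", secret).update(payload).digest("hex");
}

// Builds {connectorId}:{workspaceId}:{nonce}:{hmac}.
function createState(secret: string, connectorId: string, workspaceId: string): string {
  const nonce = randomBytes(16).toString("hex");
  const payload = `${connectorId}:${workspaceId}:${nonce}`;
  return `${payload}:${signPayload(secret, payload)}`;
}

// Returns the parsed state, or null when the signature does not verify.
function verifyState(secret: string, state: string): OAuthState | null {
  const parts = state.split(":");
  if (parts.length !== 4) return null;
  const [connectorId = "", workspaceId = "", nonce = "", mac = ""] = parts;
  const expected = Buffer.from(
    signPayload(secret, `${connectorId}:${workspaceId}:${nonce}`),
    "hex",
  );
  const actual = Buffer.from(mac, "hex");
  // timingSafeEqual throws on length mismatch, so check the length first.
  if (actual.length !== expected.length || !timingSafeEqual(actual, expected)) {
    return null;
  }
  return { connectorId, workspaceId, nonce };
}
```

Because the workspace ID is covered by the signature, a callback cannot be replayed against a different workspace without invalidating the state.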

Redirect URI Validation

Redirect URIs are registered per-provider at startup
Only exact-match redirect URIs are accepted
Production URIs must use HTTPS

Token Storage

OAuth access and refresh tokens are encrypted at rest using AES-256-GCM
Key derivation: scrypt(ENCRYPTION_KEY, 'pilot-salt', 32)
IV is randomly generated per encryption operation
Auth tag is stored alongside ciphertext for integrity verification
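
The scheme above maps onto Node's crypto module roughly as shown below. Treat this as a sketch: the 12-byte IV length and the dot-separated base64 storage format are assumptions, and the real column layout may differ.

```typescript
import { createCipheriv, createDecipheriv, randomBytes, scryptSync } from "node:crypto";

// Key derivation as described above: scrypt(ENCRYPTION_KEY, 'pilot-salt', 32).
function deriveKey(encryptionKey: string): Buffer {
  return scryptSync(encryptionKey, "pilot-salt", 32);
}

// Storage format "iv.tag.ciphertext" (base64 parts) is an assumption.
function encryptToken(plaintext: string, key: Buffer): string {
  const iv = randomBytes(12); // fresh random IV for every encryption
  const cipher = createCipheriv("aes-256-gcm", key, iv);
  const ciphertext = Buffer.concat([cipher.update(plaintext, "utf8"), cipher.final()]);
  const tag = cipher.getAuthTag();
  return [iv, tag, ciphertext].map((part) => part.toString("base64")).join(".");
}

function decryptToken(stored: string, key: Buffer): string {
  const parts = stored.split(".");
  if (parts.length !== 3) throw new Error("malformed ciphertext");
  const [iv, tag, ciphertext] = parts.map((part) => Buffer.from(part, "base64"));
  if (!iv || !tag || !ciphertext) throw new Error("malformed ciphertext");
  const decipher = createDecipheriv("aes-256-gcm", key, iv);
  decipher.setAuthTag(tag);
  // final() throws if the auth tag does not match (wrong key or tampering).
  return Buffer.concat([decipher.update(ciphertext), decipher.final()]).toString("utf8");
}
```

Decryption failing loudly on a wrong key is what surfaces a skipped key rotation as a "cannot decrypt" error rather than silently returning garbage.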

Network Security

HTTPS / TLS

Pilot does not terminate TLS directly. Use a reverse proxy:

Caddy (recommended — automatic HTTPS):

your-domain.com {
    reverse_proxy localhost:3100
}

nginx:

server {
    listen 443 ssl;
    server_name your-domain.com;
    ssl_certificate /path/to/cert.pem;
    ssl_certificate_key /path/to/key.pem;

    location / {
        proxy_pass http://127.0.0.1:3100;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;
    }
}

Cloudflare Tunnel (zero-config):

cloudflared tunnel --url http://localhost:3100

CORS

In production, explicitly set ALLOWED_ORIGINS:

ALLOWED_ORIGINS=https://your-domain.com,https://app.your-domain.com

The wildcard (*) is only allowed in development mode.
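
One way to enforce that rule at startup is sketched below. The function names and the development fallback to "*" are assumptions; only the ALLOWED_ORIGINS format and the production restriction come from this section.

```typescript
// Hypothetical startup parsing of ALLOWED_ORIGINS.
function resolveAllowedOrigins(env: Record<string, string | undefined>): string[] {
  const origins = (env["ALLOWED_ORIGINS"] ?? "")
    .split(",")
    .map((origin) => origin.trim())
    .filter((origin) => origin.length > 0);
  const isProduction = env["NODE_ENV"] === "production";
  if (isProduction && origins.length === 0) {
    throw new Error("ALLOWED_ORIGINS must be set in production");
  }
  if (isProduction && origins.includes("*")) {
    throw new Error("wildcard origin is not allowed in production");
  }
  // Assumed development default: allow any origin when nothing is configured.
  return origins.length > 0 ? origins : ["*"];
}

function isOriginAllowed(origin: string, allowed: string[]): boolean {
  return allowed.includes("*") || allowed.includes(origin);
}
```

Origins are compared as exact strings, so a scheme or port mismatch (http versus https, for example) is treated as a different origin.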

Security Headers

Pilot applies these headers automatically via Hono's secureHeaders():

X-Content-Type-Options: nosniff
X-Frame-Options: SAMEORIGIN
X-XSS-Protection: 0 (modern CSP preferred)
Referrer-Policy: no-referrer
Strict-Transport-Security (when behind HTTPS proxy)

Trust Boundary

The orchestrator enforces a multi-layer trust boundary on every agent action:

Kill Switch → Policy Validation → Tool Blocklist → Budget → Connector Allowlist → Content Bans → Approval Gate
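
The ordering matters because the first failing stage decides the outcome. A simplified sketch of such a pipeline follows; it is not the code in services/orchestrator/src/trust.ts. The types, stage names, and the single remainingBudget figure are hypothetical, and policy validation is reduced to a precomputed flag.

```typescript
interface PolicyConfig {
  killSwitch: boolean;
  toolBlocklist: string[];
  contentBans: string[];
  connectorAllowlist: string[];
  requireApprovalFor: string[];
}

// Hypothetical shapes for illustration only.
interface AgentAction {
  tool: string;
  connector?: string;
  content: string;
  cost: number;
}

interface TrustContext {
  policy: PolicyConfig;
  policyValid: boolean; // result of config validation, computed elsewhere
  remainingBudget: number;
}

type Decision =
  | { outcome: "allow" }
  | { outcome: "needs_approval" }
  | { outcome: "deny"; stage: string };

type Check = (action: AgentAction, ctx: TrustContext) => boolean; // true = pass

// Stages run in the documented order; the first failure denies the action.
const DENY_STAGES: Array<[string, Check]> = [
  ["kill_switch", (_a, c) => !c.policy.killSwitch],
  ["policy_validation", (_a, c) => c.policyValid],
  ["tool_blocklist", (a, c) => !c.policy.toolBlocklist.includes(a.tool)],
  ["budget", (a, c) => a.cost <= c.remainingBudget],
  [
    "connector_allowlist",
    (a, c) => a.connector === undefined || c.policy.connectorAllowlist.includes(a.connector),
  ],
  [
    "content_bans",
    (a, c) => !c.policy.contentBans.some((ban) => a.content.toLowerCase().includes(ban.toLowerCase())),
  ],
];

function evaluate(action: AgentAction, ctx: TrustContext): Decision {
  for (const [stage, passes] of DENY_STAGES) {
    if (!passes(action, ctx)) return { outcome: "deny", stage };
  }
  // The approval gate runs last: it pauses the action instead of denying it.
  if (ctx.policy.requireApprovalFor.includes(action.tool)) {
    return { outcome: "needs_approval" };
  }
  return { outcome: "allow" };
}
```

In this sketch a tool that is both blocklisted and approval-gated is denied outright, because the blocklist stage runs before the approval gate.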

Configuration

Set via workspace settings (/api/workspace/settings):

{
  "policyConfig": {
    "killSwitch": false,
    "failClosed": true,
    "toolBlocklist": ["github_create_repo"],
    "contentBans": ["password", "secret"],
    "connectorAllowlist": ["github"],
    "requireApprovalFor": ["gmail_send", "github_create_repo"]
  },
  "budgetConfig": {
    "dailyTotalMax": 500,
    "perTaskMax": 100,
    "perOperatorMax": 200,
    "emergencyKill": 1000,
    "currency": "EUR"
  }
}

Fail-Closed Behavior

If failClosed is true (default), any policy validation failure blocks all actions. This means:

Missing or malformed budget config → all actions denied
Budget values out of range → all actions denied
perTaskMax greater than dailyTotalMax → all actions denied
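
The fail-closed rule can be illustrated with a small validator. This is a sketch under assumptions: the accepted range (non-negative finite numbers) and the function names are hypothetical, and the guide does not describe what happens when failClosed is false, so the permissive branch here is only a placeholder.

```typescript
const NUMERIC_FIELDS = ["dailyTotalMax", "perTaskMax", "perOperatorMax", "emergencyKill"] as const;

// Returns a list of validation errors; an empty list means the config is valid.
function validateBudget(raw: unknown): string[] {
  if (typeof raw !== "object" || raw === null) {
    return ["budgetConfig is missing or not an object"];
  }
  const cfg = raw as Record<string, unknown>;
  const errors: string[] = [];
  for (const field of NUMERIC_FIELDS) {
    const value = cfg[field];
    // Assumed range: any non-negative finite number.
    if (typeof value !== "number" || !Number.isFinite(value) || value < 0) {
      errors.push(`${field} must be a non-negative finite number`);
    }
  }
  if (errors.length === 0 && (cfg["perTaskMax"] as number) > (cfg["dailyTotalMax"] as number)) {
    errors.push("perTaskMax exceeds dailyTotalMax");
  }
  return errors;
}

// With failClosed (the default), any validation error blocks every action.
function actionsPermitted(budgetConfig: unknown, failClosed: boolean = true): boolean {
  return validateBudget(budgetConfig).length === 0 || !failClosed;
}
```

The point of the default is that a broken config never widens access: an operator has to fix the config before any action runs again.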

Database Security

Connection Security

For production PostgreSQL:

DATABASE_URL=postgresql://helm:STRONG_PASSWORD@db-host:5432/pilot?sslmode=require

Backup Encryption

Production backup uploads are encrypted before leaving the Droplet. Set BACKUP_ENCRYPTION_PASSPHRASE and DO Spaces S3_* settings, then use:

# Create, GPG-encrypt, checksum, and upload
bash scripts/backup.sh create-and-upload

# Verify or restore an encrypted backup
bash scripts/backup.sh verify backups/pilot_YYYY...sql.gz.gpg
bash scripts/backup.sh restore backups/pilot_YYYY...sql.gz.gpg

Remote plaintext upload is blocked unless BACKUP_ALLOW_PLAINTEXT_UPLOAD=1 is explicitly set for a non-production drill.

Audit Trail

All mutating API requests are logged to the audit_events table:

User ID, workspace ID, action, resource, timestamp
Request body (sanitized — tokens/secrets redacted)
Response status code

Query audit logs via GET /api/audit?workspaceId=....
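
Redaction of request bodies before they are written to the audit log can be sketched like this. The key pattern and the "[REDACTED]" marker are assumptions for illustration; the gateway's actual sanitizer may use a different list.

```typescript
// Hypothetical list of key names treated as sensitive.
const SENSITIVE_KEY = /(token|secret|password|authorization|api[_-]?key)/i;

// Returns a deep copy with sensitive values replaced; the input is not mutated.
function redact(value: unknown): unknown {
  if (Array.isArray(value)) return value.map(redact);
  if (value !== null && typeof value === "object") {
    const out: Record<string, unknown> = {};
    for (const [key, inner] of Object.entries(value as Record<string, unknown>)) {
      out[key] = SENSITIVE_KEY.test(key) ? "[REDACTED]" : redact(inner);
    }
    return out;
  }
  return value; // primitives pass through unchanged
}
```

Key-based redaction only catches secrets stored under recognizable names, so free-text fields that might carry credentials still deserve separate care.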

Checklist

Before going to production, verify:

SESSION_SECRET is a unique random value (not change-me-in-production)
ENCRYPTION_KEY is set (not using dev fallback)
TELEGRAM_WEBHOOK_SECRET is set for webhook mode
ALLOWED_ORIGINS is set to specific domains (not *)
NODE_ENV=production is set
PostgreSQL uses SSL (?sslmode=require)
HTTPS is terminated via reverse proxy
Database password is strong and unique
Backups are configured and tested
launch-gate.sh passes all checks
npm run production:readiness passes; DigitalOcean deploy blocks while any capability remains below production_ready

Troubleshooting

Symptom	Likely Cause	Fix
users cannot log in	email provider, session secret, or database is misconfigured	check auth routes, email provider, and session rows
connector token cannot decrypt	encryption key rotated without migration	use the rotation procedure and verify rows before deploy
webhook accepts unexpected requests	webhook secret is missing or mismatched	set provider webhook secret and verify request validation
autonomous action bypasses approval	trust boundary policy is too permissive	update workspace policy and require approval for sensitive tools
production keeps running without HELM	fail-closed config is missing	set `HELM_FAIL_CLOSED=1` and verify `/health` reports HELM state