Inter-Worker Governance
Meta-DCC: the same algorithm governing the fleet that governs the search.
14 workers running independently waste compute. On nu3496 (3496 cities), the best worker found 2.02% gap while the worst sat at 4.34% — a 2,212-unit spread representing pure waste. The fix: a Level 1 DCC that monitors all workers, detects fleet stagnation, and (in Phase 2) intervenes by injecting good tours into stuck workers.
Same algorithm at both levels. The Meta-DCC uses the identical LZ76 compression sensor, coupling parameter, and escalation ladder as the base-level worker DCC. The only difference: what the sensor points at. Base-level DCC monitors individual search improvement streams. Meta-DCC monitors fleet-wide state snapshots. Same code. Same math. Different semantics.
The Fleet Lifecycle on uy734 (81 Meta-DCC Steps)
v2.5 ran 14 workers on uy734 (734 cities, optimal 79114) with Meta-DCC monitoring every 30 seconds. The resulting 81-step log revealed a clear three-phase lifecycle:
Steps 1–20
Healthy Fleet
All 14 workers IMPROVING. Fleet best drops rapidly: 83188 → 79942. meta_u climbs 10 → 18 (trust the workers). Spread narrows 1860 → 1342.
Steps 21–55
Stagnation
DEAD workers appear (5–13 per snapshot). Improvements rare. meta_u holds at 18 then starts dropping. Escalation climbs to SHARE level.
Steps 56–81
Fleet Exhaustion
meta_u drops 18 → 3. Step 65: all 14 workers DEAD, zero IMPROVING. Escalation reaches RESTRUCTURE. The sensor says: intervene NOW.
The Self-Calibrating Bands Discovery (P17)
v2.4 hardcoded the LZ compression thresholds at [0.25, 0.65]. Real fleet data lived at [0.01, 0.08] — off by 10×. meta_u drifted to floor and stayed there: useless signal. This was the same mistake as the or-opt exclusion — overriding the system instead of letting it learn.
v2.5 fixed it: self-calibrating bands. The system tracks its own LZ ratio history and derives thresholds from the 25th/75th percentiles of what it has actually observed. After a 10-observation warmup, bands converged to [0.040, 0.061]. meta_u then used its full range [3, 18] with 22 meaningful direction changes. The system that was built to let data decide now lets data decide about its own parameters. MDL all the way down.
Worker Classification
Each 30-second snapshot classifies every worker:
- IMPROVING: Found improvement since last snapshot. Stuck counter resets to 0.
- GRINDING: No improvement but gap ≤ fleet median. Stuck counter decays by 1 (not accumulate). This prevents false escalation.
- STUCK: No improvement and gap > fleet median. Stuck counter increments.
- DEAD: At nuclear escalation level (4). Stuck counter increments.
Fleet snapshot encoding: 1 byte per worker: state(2 bits) + escalation_bucket(2 bits) + stuck_bucket(2 bits). Fed to LZ76 sensor.
Escalation triggers: persistently_stuck = count of workers with stuck_intervals ≥ meta_stuck_threshold (default 3). Escalation: 0 stuck=OBSERVE, 1=INFORM, 2+=SHARE, 3+=CROSS-POLLINATE, >50% of fleet=RESTRUCTURE.
Band Self-Calibration
Warmup: First 10 LZ observations collected, meta_u frozen at midpoint. No coupling adjustment during warmup.
Calibration: Rolling window of last 100 LZ ratios. band_low = 25th percentile. band_high = 75th percentile. Guard: band_high = max(band_high, band_low + 0.001).
Coupling rule (search polarity): lz_ratio < band_low → meta_u decreases (fleet stuck, explore). lz_ratio > band_high → meta_u increases (fleet diverse, trust). Same polarity as base-level search DCC. Opposite from trading DCC.
Verified result: Bands converged from [0.000, 1.000] to [0.040, 0.061] on uy734. meta_u range after calibration: [3, 18] with 22 direction changes (vs 0 useful changes in v2.4 with hardcoded bands).
Result: uy734 reached 0.46% gap (79478) with 14 workers in 40 minutes. Phase 1 is observer-only — workers are identical to v2.3. The value of v2.5 is diagnostic: for the first time, we can see fleet dynamics in real time and have a calibrated sensor for Phase 2 interventions.