Applied control arena · interim evidence page

AMR Strategy Control Arena

A simulated antimicrobial-resistance strategy arena for testing whether MDLxDCC can discover robust control kernels under deceptive, rugged, multi-basin escape landscapes. This is a governance and strategy-calibration experiment, not a medical or wet-lab protocol.
Version v0.1 · May 4, 2026 · built from amr_arena_v1.3-final interim snapshot · run still in progress
42,682 clean runs592 / 900 complete strategiesWEAKEN_ESCAPED kernelDCC basic failed
Boundary. This page reports an abstract simulation arena. Operator names such as KILL, WEAKEN, and WEAKEN_ESCAPED are model-side pressure operators, not clinical recommendations, biological protocols, or treatment guidance. The value is in the MDLxDCC method: strategy search, null controls, robustness testing, and governance failure analysis.
Progress
65.9%
42,682 / 64,800 tasks completed in the uploaded snapshot.
Health
0
duplicate tasks, JSON errors, checkpoint/runlog mismatches, or pending/runlog overlap.
Viable strategies
18
all 18 complete viable strategies contained WEAKEN_ESCAPED.
Current winner
0.888
combo__weaken_escaped__freeze_dim control score.

Abstract#

The AMR arena asks a MDLxDCC-native question: in a deceptive, rugged escape landscape, can a wide strategy catalog discover a small control kernel that prevents population escape better than baseline or manually obvious pressure patterns?

The interim answer is already strong enough to document: among the 592 fully completed strategies in the snapshot, every viable strategy contained WEAKEN_ESCAPED. The current robust winner is combo__weaken_escaped__freeze_dim, while the MDL winner is the cheaper combo__weaken__weaken_escaped. The run is not complete, so this is not the final AMR verdict. It is a strong interim signal and a clear target for DCC improvement.

1. Why AMR deserves its own arena page#

Verdict: yes, this is page-worthy now — as an interim result page, not as a final paper.

AMR is different from a normal candidate domain because it tests three things at once:

This makes AMR a better fit for a dedicated page than for a footnote inside the domain map. The page can later become the stable place for final results, vNext arena upgrades, and DCC selector patches.

2. Arena configuration#

The uploaded run is a large, deceptive-landscape strategy sweep:

python amr_arena.py --profile custom --pop 700 --timesteps 350 --landscapes 12 --seeds 6 --strategy-mode all --dcc basic --workers 5 --max-generated-strategies 900 --sampling-mode family_balanced --mutation 0.05 --transfer 0.02 --ruggedness 0.70 --landscape-mode deceptive --n-basins 16 --tunnel-prob 0.24 --deadend-frac 0.20 --jackpot-frac 0.10 --corridor-frac 0.24 --pop-profile batch --topology batch --checkpoint-every-seconds 60 --out-prefix amr_240h_v13_w5
DimensionValueWhy it matters
Population700 agentsLarge enough to expose escape and late-collapse behavior.
Landscape suite12 landscapes × 6 seedsEnables fair per-strategy ranking over 72 runs per complete strategy.
Strategy catalog900 strategiesWide search across mono, combo, staged, triple, pulse, and adaptive families.
Landscape modedeceptive, ruggedness 0.70, 16 basinsDesigned to punish naive pressure and reveal robust control kernels.
CheckpointingJSONL runlog + checkpointAllows interim analysis without stopping the long run.

3. Interim results#

3.1 Health and coverage

MetricSnapshot value
Runlog rows42,682
Checkpoint completed / total42,682 / 64,800
Complete strategies592 / 900
Partial strategies1 — combo__weaken__tax_diversity
Untouched strategies307
Duplicate task keys0
Checkpoint/runlog mismatch0

3.2 Current leaderboard

RankStrategyTierControlMDLMean final escapedInterpretation
1combo__weaken_escaped__freeze_dimMEDIUM0.888222.062.3 / 700best robust control
2combo__weaken__weaken_escapedCHEAP0.886186.2134.8 / 700MDL seed
3combo__weaken_escaped__force_mixMEDIUM0.865225.791.8 / 700strong dynamic/niche add-on
4combo__weaken_escaped__slowCHEAP0.863220.3100.3 / 700cheap stabilizer
5combo__weaken_escaped__burdenCHEAP0.861217.2108.0 / 700cheap stabilizer

3.3 Landscape wound

The hardest current regime is L3 high_cost_adaptive / patches. Even the best complete strategy still leaves about 257.8 / 700 final escaped there. This is the main target for the follow-up run: not a sign the arena failed, but the best identified stress point.

4. Kernel discovery#

Current kernel: WEAKEN_ESCAPED. All 18 viable complete strategies contain it. Without it, no completed strategy is currently viable.
Operator presentComplete strategiesViableBest controlMedian controlBest strategy
WEAKEN_ESCAPED23180.8880.826combo__weaken_escaped__freeze_dim
FREEZE_DIM3110.8880.338combo__weaken_escaped__freeze_dim
FORCE_MIX3120.8650.209combo__weaken_escaped__force_mix
SLOW4710.8630.237combo__weaken_escaped__slow
BURDEN4410.8610.206combo__weaken_escaped__burden

The key interpretation is not “freeze dimension wins.” FREEZE_DIM alone is weak. It becomes strong when paired with the escape-targeted weakening kernel. The same pattern holds for FORCE_MIX, SLOW, and BURDEN: they are add-ons, not the root.

5. DCC result: useful failure#

StrategyFinal escapedControlSelected bundleVerdict
adaptive_dcc_basic700 / 7000.196WEAKEN + RAISE_ESCAPE_COST + SLOWfailed selector

This is a valuable negative result. It does not show that DCC is useless. It shows that dcc basic selected the wrong bundle because its candidate pool or selection prior missed the discovered kernel. In MDLxDCC terms, the arena has generated the patch target: DCC must learn from historical strategy evidence or include an expert-pool prior.

6. Honest limits#

7. AMR Arena vNext#

The next arena should not simply run a larger random grid. It should preserve the current wide run, then build a narrowed vNext around the discovered kernel.

UpgradePurpose
DCC expert poolForce candidate consideration of WEAKEN_ESCAPED, WEAKEN_ESCAPED + FREEZE_DIM, WEAKEN_ESCAPED + FORCE_MIX, WEAKEN + WEAKEN_ESCAPED, WEAKEN_ESCAPED + SLOW, and WEAKEN_ESCAPED + BURDEN.
Historical priorLet DCC initialize from completed runlog evidence instead of choosing static bundles blind.
Landscape-specialized selectorSeparate high_cost_adaptive / patches, dynamic fragmented landscapes, and well-mixed transfer-heavy landscapes.
Evidence metric packExport a universal metric bundle comparable with TSP, Sudoku, NAS, CW, RH, Poincare, and future AMR runs.
Kill-test suiteTest whether the WEAKEN_ESCAPED kernel survives changed seeds, different basin geometry, lower/higher transfer, and alternate scoring weights.

8. Completion update slot#

Next update after run completion: replace interim counts with the final 64,800-run summary, add full operator-family heatmaps, confirm or revise the current winner, and document whether the remaining WEAKEN_ESCAPED neighborhood changes the top rank. If the kernel survives completion, AMR becomes one of the cleanest applied-control demonstrations in the MDLxDCC arena family.


Version 0.1 · May 4, 2026 · AMR interim page. Snapshot: 42,682 / 64,800 tasks complete, 592 complete strategies, current winner combo__weaken_escaped__freeze_dim, MDL seed combo__weaken__weaken_escaped, DCC basic selector failure documented.