MDLxDCC Arena — Riemann Hypothesis / Prime Gaps v0.12

Abstract#

This working paper reframes the previous speculative Riemann Hypothesis brainstorming note into a smaller, testable research program. The empirical question is: Do ordered prime gaps contain measurable compression structure beyond their marginal distribution?

Using the first 20,000 and 100,000 primes, we first compared LZ76 complexity of real prime-gap order against shuffled controls preserving the same multiset. At 100,000 primes, the raw gap encoding produced delta −167.96 and mean z-score −32.38 against shuffled controls.

The final E2/SEA batch now extends the test ladder to stronger controls and larger ranges. It includes 15 runs, 60 summary rows, and 1,960 window rows, covering Markov-preserving, wheel-aware, block-shuffle, and shuffle controls across 100k, 500k, 1M, and 2M prime ranges.

Final SEA finding: the broad E1 shuffle signal is partly explained by local/arithmetic structure, but the raw gap signal survives wheel-aware and Markov-preserving controls from 100k through 2M primes. The 500k bridge is positive, the 1M shuffle baseline is now present, and the strongest line remains the 2M Markov-preserving raw gap test: delta −142.34, z −8.76, with 50/50 windows negative (100 trials at 2M; limited p-value resolution; see Section 10.5).

The v0.4 generator-floor run now tests the next question: can any compact generator beat the Markov2 baseline on holdout? The answer is empirically yes at 1M scale. The run completed 1,620 window tasks and 72 generator tasks. At the sensor/control layer, raw gap still fails against Markov2. At the generator layer, however, density_markov2 wins raw gap/gap_div2, and all 9/9 tested encodings have a non-Markov2 winner beating the Markov2 holdout floor.

The v0.7 carrier-stress ladder now tests the sharper placebo question opened by v0.6: is the raw-gap generator gain really prime-density information, or a more generic monotone scale/clock state? The answer is empirically sharper: at 1M, clock_loglog_markov2 wins raw gap in every split, with Δholdout vs Markov2 −8,784.21 on the main split. density_wrong_scale_markov2 and density-DCC variants remain strong, but shuffled/lagged/reversed density labels cluster closely enough that exact prime-coordinate density is not isolated as the unique cause.

This does not prove RH. It shows a robust, order-sensitive prime-gap compression signal, a repeated empirical breach of the Markov2 generator floor, and now a stronger negative result against pure-density interpretation. The next phase is a corrected law-sheet build plus 2M/5M replication of the top carrier laws.

1. Purpose#

The original brainstorming paper mixed three levels: empirical observations, reasoned interpretations, and metaphysical speculation. This version separates them. The empirical core is now simple: prime gaps are treated as a sequence, the real order is compared against null models, and MDL/LZ76 decides whether additional structure exists.

2. Background: Why Prime Gaps?#

Prime gaps \(g_n = p_{n+1} - p_n\) are not independent. Their distribution changes with scale, and primes obey arithmetic constraints. The sharper question is: After preserving basic statistics, does the ordered sequence still compress better than appropriate controls? That is exactly the kind of question the 8Z/MDL/DCC program is built to ask.

3. DCC / Edge-of-Chaos Motivation#

DCC balances between seizure (excessive order) and noise (excessive disorder). The original speculative analogy placed Re(s)=1/2 as the edge-of-chaos line. This remains a conceptual analogy, not a proof. The empirical test only asks whether prime gaps show measurable structure under compression tools.

4. Important Boundary: Not an RH Proof#

This paper makes no claim to prove the Riemann Hypothesis. It only asserts that ordered prime gaps are more compressible than shuffled controls preserving the same gap distribution. Any bridge to RH remains speculative future motivation.

5. E1 Experiment: Prime-Gap LZ76 Against Shuffled Controls#

We generated the first N primes, computed gaps, and applied several encodings (gap, delta, abs_delta, mod6, bucket_log2). Each real ordered window was compared against shuffled controls. The main statistic is \(\Delta_{LZ} = LZ76(real) - mean(LZ76(control))\); a negative value indicates the real sequence is more compressible.

6. E1 Results#

6.1 First run: 20,000 primes#

Configuration: 20,000 primes, 30 trials, window 5,000. Summary:

Encoding	Real LZ76	Shuffled LZ76	Delta	Z-score	Interpretation
gap	1486.75	1579.31	−92.56	−22.85	strong
gap_div2	1486.75	1579.14	−92.39	−23.86	strong
delta	1753.00	1820.93	−67.93	−16.72	strong, but needs special control
abs_delta	1520.00	1537.62	−17.62	−4.44	moderate/strong
mod6	714.75	882.10	−167.35	−88.43	sanity check; partly expected
bucket_log2	1045.25	1054.22	−8.97	−3.92	weak but present

6.2 Stronger run: 100,000 primes#

Configuration: 100,000 primes, 50 trials, window 10,000. Summary:

Encoding	Real LZ76	Shuffled LZ76	Delta	Z-score	Interpretation
gap	2898.80	3066.76	−167.96	−32.38	very strong
gap_div2	2898.80	3067.18	−168.38	−32.83	very strong
delta	3395.10	3532.89	−137.79	−23.90	very strong, needs control
abs_delta	2959.50	3002.05	−42.55	−8.25	strong
mod6	1281.40	1603.32	−321.92	−127.43	sanity check
bucket_log2	1974.80	1984.15	−9.35	−3.01	weak but present

Raw prime gaps are about 5.5% more compressible in real order than in shuffled order.

7. Interpretation of E1#

Supported: The order of prime gaps carries structure beyond marginal distribution. Not yet shown: that it survives Markov-preserving controls, that it connects to zeta zeros, or that AC/Zero Framework is correct. mod6 is a sanity check; delta needs caution due to lag-1 artifacts.

8. E2: Stronger Null Controls#

C0 Full Shuffle – already done. Distribution alone does not explain compressibility.
C1 Block Shuffle – preserve local order; test long-range structure.
C2 Wheel-Aware – preserve modular/residue constraints; test if signal is just residue arithmetic.
C3 Markov-Preserving – preserve first-order transition behavior; the key hard control.
C4 Cramér-Like – random-prime heuristic synthetic gaps.
C5 Depth/Scaling – early vs. late windows; see if signal grows.

9. E2 Acceptance Criteria#

Minimal success: raw gap outperforms full shuffle, block shuffle, and wheel-aware. Strong success: outperforms Markov surrogates. Very strong: survives all controls and shows scaling growth.

10. Final E2 / SEA Results: Stronger Controls and Scaling#

The E2/SEA batch tested whether the initial shuffle signal survives harder null models and larger prime ranges. SEA means Scaling Experiment Arena. The final uploaded batch is complete: 15 runs, 60 summary rows, and 1,960 window rows.

The final SEA claim is no longer merely “prime gaps beat shuffle.” It is narrower and stronger: raw prime-gap order contains compression-detectable sequential structure not fully explained by marginal gap distribution, mod-6 wheel structure, or first-order Markov transitions, with positive scaling evidence through 2M primes.

10.1 Completed SEA final runs#

The final SEA package includes all planned continuation runs:

100k: Markov t2000, wheel6 t2000, block50 t1000, block100 t1000
500k: Markov t700, wheel6 t700, block50 t700, block100 t700
1M: shuffle t300, wheel6 t300, Markov t300, block50 t300, block100 t300
2M: Markov t100, wheel6 t100

This completes the missing 100k wheel replication, the 500k bridge, and the 1M shuffle baseline that were absent from the partial version.

10.2 Final raw `gap` summary#

Run	Control	Delta	Z-score	Window result	Median p	Reading
100k Markov t2000	markov	−32.64	−3.69	10/10 negative	0.00175	survives first-order transitions
100k wheel6 t2000	wheel6	−19.83	−3.78	10/10 negative	0.00075	survives mod-6 wheel
100k block50 t1000	block50	−14.08	−2.58	10/10 negative	0.01499	survives local block shuffle
100k block100 t1000	block100	−8.70	−1.63	10/10 negative	0.09091	marginal / local-scale boundary
500k Markov t700	markov	−65.13	−5.41	25/25 negative	0.00143	survives first-order transitions
500k wheel6 t700	wheel6	−33.45	−4.75	25/25 negative	0.00143	survives mod-6 wheel
500k block50 t700	block50	−21.95	−3.04	25/25 negative	0.00428	survives local block shuffle
500k block100 t700	block100	−11.48	−1.64	25/25 negative	0.08845	marginal / local-scale boundary
1M shuffle t300	shuffle	−301.62	−41.62	50/50 negative	0.00332	E1-style baseline at 1M
1M wheel6 t300	wheel6	−31.02	−4.39	50/50 negative	0.00332	survives mod-6 wheel
1M Markov t300	markov	−58.26	−4.84	50/50 negative	0.00332	survives first-order transitions
1M block50 t300	block50	−21.10	−2.88	50/50 negative	0.00664	survives local block shuffle
1M block100 t300	block100	−11.13	−1.55	49/50 negative	0.09801	marginal / local-scale boundary
2M Markov t100	markov	−142.34	−8.76	50/50 negative	0.00990	survives first-order transitions
2M wheel6 t100	wheel6	−64.42	−7.01	50/50 negative	0.00990	survives mod-6 wheel

10.3 Interpretation of the final SEA result#

Markov survival is now the central result. Markov-preserving controls keep first-order gap transitions, yet raw gap remains more compressible at every tested depth: 100k, 500k, 1M, and 2M.

Depth	Delta	Z-score	Negative windows	Median p
100k Markov t2000	−32.64	−3.69	10/10	0.00175
500k Markov t700	−65.13	−5.41	25/25	0.00143
1M Markov t300	−58.26	−4.84	50/50	0.00332
2M Markov t100	−142.34	−8.76	50/50	0.00990

Wheel survival is also confirmed. The raw signal survives mod-6-aware controls at 100k, 500k, 1M, and 2M. This means the signal is not merely the trivial 6k ± 1 residue structure.

Depth	Delta	Z-score	Negative windows	Median p
100k wheel6 t2000	−19.83	−3.78	10/10	0.00075
500k wheel6 t700	−33.45	−4.75	25/25	0.00143
1M wheel6 t300	−31.02	−4.39	50/50	0.00332
2M wheel6 t100	−64.42	−7.01	50/50	0.00990

Block controls narrow the scale claim. Block50 remains stable from 100k through 1M. Block100 is consistently weaker and mostly marginal. This suggests that a meaningful part of the signal lives at local-to-medium sequential scales, while the Markov/wheel survival shows the signal is not exhausted by the simplest local arithmetic explanations.

Run	Delta	Z-score	Negative windows	Median p
100k block50 t1000	−14.08	−2.58	10/10	0.01499
100k block100 t1000	−8.70	−1.63	10/10	0.09091
500k block50 t700	−21.95	−3.04	25/25	0.00428
500k block100 t700	−11.48	−1.64	25/25	0.08845
1M block50 t300	−21.10	−2.88	50/50	0.00664
1M block100 t300	−11.13	−1.55	49/50	0.09801

Sensor ranking is now clearer. Raw gap is the primary sensor. gap_div2 is effectively the same signal. abs_delta and bucket_log2 are diagnostic only: they weaken, flip, or become near-zero under harder controls.

10.4 Final E2 / SEA thesis#

Prime-gap order contains a surviving raw-gap compression signal that is not fully explained by gap distribution, simple wheel residue structure, or first-order Markov transition statistics. The effect is smaller than the initial shuffle signal but more meaningful, and the final SEA batch confirms survival through 100k, 500k, 1M, and 2M prime ranges.

10.5 Caution#

The 2M runs currently use only 100 trials, so the permutation p-value resolution is limited. The z-scores and window stability are more informative at this stage. The final result is strong enough to justify the next phase, but not strong enough to claim an RH proof or a final mathematical invariant.

11. Relation to 8Z/DCC Founding Hypothesis#

The first E2/SEA results support number theory as a credible ninth-domain candidate for the 8Z/DCC approach. The strongest current claim is not that DCC proves RH, but that 8Z/DCC compression tools detect a nontrivial order-sensitive signal in prime gaps that survives first-pass wheel-aware and Markov-preserving controls up to 2M primes.

11.1 Two-track strategy#

This project should not choose prematurely between “prove RH” and “build an independent MDLxDCC theory of prime structure.” Both tracks are useful and should run in parallel:

Track A — RH-facing path: determine whether the discovered compression structure can be formalized into an invariant that implies RH, becomes equivalent to RH, or forbids off-critical-line zeros through a prime-counting or explicit-formula bridge.
Track B — MDLxDCC-native path: develop a new information-theoretic account of prime distribution based on compression, multi-scale structure, and DCC-style invariants, valuable even if it does not directly prove RH.

These are not competing goals. RH is the gold-standard benchmark; the MDLxDCC invariant is the deeper search object.

12. Revised Position on Original RH/AC Hypothesis#

The critical line as edge-of-chaos remains speculative motivation only. Verified: E1 signal. Reasoned: compression advantage suggests order-sensitive structure. Speculative: AC/Zero Framework explanations are not evidence.

13. Mathematics, Reality, and the Zero Framework#

This section records the philosophical motivation behind the Zero Framework while keeping it separate from the empirical E1/E2 result. The central issue is not whether mathematics is "wrong." Formal mathematics can define many internally consistent worlds. The stricter question is:

When a formal object is used to describe reality or nature, what exactly is the mapping between the symbol and the thing being described?

In pure mathematics, definitions are allowed to be abstract. A structure can be studied because it is consistent, elegant, or fruitful. In applied mathematics and mathematical physics, however, a formal object earns physical meaning only through a disciplined bridge: what it measures, what it predicts, what it preserves, and where the mapping stops.

This distinction matters for the present paper because RH lives inside pure mathematics, while the 8Z/DCC framing asks a more reality-facing question: whether prime-gap dynamics expose an information structure that can be measured, compressed, and related to deeper arithmetic regularities. The empirical compression tests do not decide ontology. They only ask whether a measurable signal exists.

13.1 The two zeros#

Standard mathematics treats zero as a single formal object: the additive identity, the result of \(x - x\), and the value attained when a function vanishes. The Zero Framework proposes that ordinary language often mixes two different meanings under the same word:

Numeric zero \(0\): a position or value inside a formal system. It exists within a structure of relations.
Ontological nothingness \(\emptyset\): absence of existence itself. Not a point, not a value, and not reached by ordinary arithmetic operations.

On this view, a function that equals zero at some point has not "become nothing." It has reached a structured value. A zeta zero is therefore better described as an exact cancellation or destructive-interference point, not an annihilation into non-being.

13.2 Analytic continuation as ontological smuggling#

Analytic continuation is an internally valid and powerful mathematical operation. The concern here is not its consistency. The concern is what happens when a result obtained by extension is presented under the same notation as the original object — without making the substitution explicit.

The Dirichlet series \(\zeta(s) = \sum_{n=1}^{\infty} n^{-s}\) converges for \(\text{Re}(s) > 1\). For \(s = 1\) the harmonic series diverges. For \(s = -1\) the sum \(1 + 2 + 3 + \cdots\) diverges to infinity. Yet analytic continuation assigns \(\zeta(-1) = -\frac{1}{12}\). This is a correct statement about the analytically continued function. But the informal phrase \(1 + 2 + 3 + \cdots = -\frac{1}{12}\) silently substitutes one object for another under the same notation. The two agree where the original sum converges. They are not the same thing where it does not.

The ontological import is this: when the output of an extended object is presented as if it were the output of the original process, the interpretation can carry a hidden metaphysical claim — that the "true value" of a divergent process exists and is reachable by algebraic extension — without stating that claim explicitly or subjecting it to scrutiny. Whether this affects internal consistency is no. Whether it affects what mathematics describes about reality is an open question. This is what we call ontological smuggling: not an error in the proof, but an unacknowledged step in the interpretation.

13.3 Axiom-level vs. interpretation-level critique#

There are two versions of the Zero Framework critique, and they should not be confused:

Interpretation-level: Current mathematics is largely correct, but some language misleads. A "zero of a function" should not be described as ontological nothingness. A divergent series assigned a finite value through continuation or regularization should be presented with that context intact. This is already partially implemented in Section 14.
Axiom-level: The identification of numeric zero with ontological nothingness is not just a language habit — it may be embedded in the foundations. A mathematics built on a clean distinction between \(0\) and \(\emptyset\) from the start might produce a different structure, one where certain currently valid moves require explicit ontological justification before proceeding. This remains a research program, not a theorem.

The historical analogy is Lobachevsky. For two millennia, Euclid's parallel postulate appeared self-evident — not a choice but a necessity. Replacing it with a different axiom produced a geometry that was equally consistent and turned out to be physically more accurate than the original. The Zero Framework at the axiom level is asking whether the identification \(0 \equiv \text{"nothing"}\) is a parallel postulate of arithmetic: a convenience that has been mistaken for a necessity. The interpretation-level critique is conservative and already useful. The axiom-level critique is speculative but not without historical precedent.

13.4 Implication for RH specifically#

The implication for RH is indirect. RH is a formal statement about the zeros of the analytically continued zeta function. The Zero Framework does not currently change that statement, prove it, or disprove it.

What it may do is sharpen the language around what a zero means. If a zeta zero is treated as a structured cancellation point rather than "nothingness," then the research question becomes cleaner:

What structure forces all nontrivial cancellation points of the zeta function onto the critical line?

That question is still mathematical. The 8Z/DCC contribution, if any, would be to search for measurable information-structure invariants in prime-gap dynamics or prime-counting error that might later connect to the zeta-zero structure.

Current status: interpretation-level critique useful; axiom-level critique unformalized; RH implication speculative. This remains L8 in the interpretation ladder unless a formal invariant is discovered.

14. Corrected Language About Zeta Zeros#

Zeta zeros are points of exact cancellation or destructive interference in the analytic structure of the zeta function, not annihilation into ontological nothingness. This correction is an interpretation-level clarification: a function attaining the value zero is not the same as a quantity ceasing to exist.

15. Current Working Thesis#

Ordered prime gaps contain measurable compression structure beyond their marginal distribution. After the final E2/SEA batch, the surviving raw-gap signal also exceeds wheel-aware and first-order Markov-preserving controls from 100k through 2M primes, while block tests show that much of the broader signal remains local-to-medium scale.

The strongest current thesis is narrower than the original speculation but stronger than the initial shuffle-only result. v0.6 showed that the Markov2 floor can be beaten on holdout and that the carrier is scale-conditioned transition structure rather than a clean one-term density explanation. v0.7 sharpens this further: a generic loglog position-clock conditioned Markov2 law beats density variants on raw gap across the tested split ladder. The longer-term target is a formal information-theoretic invariant of prime distribution, not merely another route to the existing RH statement.

Operationally, the project now runs on three tracks: Track A seeks an RH-facing formal bridge; Track B seeks a native MDLxDCC invariant describing prime-gap structure directly; Track C isolates the generator carrier that can beat Markov2 without leakage or split artifacts.

16. Next Experiments#

The v0.7 ladder is now complete through 1M. The next work should not simply widen the arena. The current bottleneck is formal clarity: distinguish generic scale-clock laws from true prime-density information, then ask whether the surviving law has a clean mathematical description.

v0.7.1 hygiene: fix the law-sheet split-name issue where rolling1/rolling2/rolling3 are all reported as missing_splits=rolling. The generator results are usable; the law-sheet failure label is over-penalized.
Top carrier replication: rerun only the top raw-gap carriers — clock_loglog_markov2, clock_log_markov2, density_wrong_scale_markov2, density_dcc_no_motif, density_markov2, density_shuffled_markov2, and clock_dcc_no_motif — at 2M or 5M.
Prime-specific density test: add stricter placebos: rank-preserving random monotone clocks, permuted density blocks, shifted-log clocks, and density residuals orthogonalized against loglog clock.
Invariant extraction: treat clock_loglog_markov2 as the first raw-gap law-sheet candidate and gap_mod6 variable_markov as the residue-memory candidate; attempt compact symbolic descriptions of both.
Formal bridge: only after carrier replication, connect the candidate residuals to prime-counting error, explicit-formula residuals, or a new RH-adjacent criterion.

17. Preliminary Conclusion#

E1 delivered a clear shuffle signal. E2 narrowed and strengthened the result. The final SEA batch now confirms that the broad shuffle advantage is partly local/arithmetic, but the raw gap signal survives wheel-aware and Markov-preserving controls at 100k, 500k, 1M, and 2M.

The current conclusion is therefore:

The prime-gap compression signal is real, smaller under harder controls, and still alive where it matters most: raw gap order survives beyond marginal distribution, simple wheel structure, and first-order transition statistics. The completed SEA run adds a 500k bridge, a 1M shuffle baseline, and stronger scale evidence through 2M primes.

The most stable current signal is not only the aggregate z-score, but the window-level consistency: Markov and wheel controls show negative raw-gap deltas in every tested window at 100k, 500k, 1M, and 2M. The only near-boundary result in the key raw-gap family is block100 at 1M, with 49/50 negative windows and a marginal z-score.

This is not RH evidence yet, but it is a legitimate 8Z/DCC number-theory signal candidate. The deeper goal is to discover a formal invariant that explains prime-distribution structure; RH is the benchmark, not the only possible prize.

18. Three-Track Roadmap: RH, MDLxDCC Invariant, Generator Arena#

The current compression experiments cannot prove the Riemann Hypothesis by themselves. They are empirical signal tests. A proof would require a formal mathematical invariant or theorem that applies to all relevant cases, not only to tested prime ranges.

18.1 Three live tracks#

The roadmap now has three live tracks:

Track	Name	Target	Status
A	RH-facing formal bridge	Use the empirical signal to search for a formal invariant connected to prime-counting error, zeta-zero structure, explicit formulas, or an RH-equivalent criterion.	Future formal work; not claimed by E1/E2/SEA/Arena.
B	MDLxDCC-native prime-structure invariant	Build a new information-theoretic description of prime-gap structure where RH is a benchmark, not the only definition of success.	Motivated by surviving compression signal.
C	Inverse generator arena	Search over compact generators, grammars, automata, dictionaries, and multi-scale programs that reproduce prime-gap structure better than null models.	v0.4 achieved the first 1M Markov2-floor breach on generator holdout; next target is scaling, ablation, and formalization.

This distinction matters. A successful MDLxDCC invariant could be valuable even before it proves RH, and a generator that survives holdout and scaling could become the first concrete object from which a stronger invariant is extracted.

The higher target is not merely “prove RH by compression.” The higher target is to discover a formal invariant or compact generator of prime-distribution structure. RH then becomes one benchmark such an invariant may imply, explain, or sit beside.

18.2 The required bridge#

A possible proof path would need the following form:

Use 8Z/DCC/MDL tools to discover a stable invariant in prime-gap dynamics, prime-counting error, or another RH-adjacent arithmetic trace.
Formalize that invariant as a precise mathematical statement.
Prove that the invariant holds for all relevant values, not merely for tested ranges.
Show that the invariant implies RH, is equivalent to RH, strengthens an RH-style distribution bound, or explains a structure that RH does not directly describe.

In short:

empirical signal → generator candidate → formal invariant → distribution theorem / RH implication / new criterion

18.3 Why not just prove RH?#

RH matters because it is one of the cleanest known formulations of prime-distribution regularity. It would strongly constrain the error term in the prime number theorem and would affect many related results in analytic number theory.

But a new 8Z/DCC invariant could be valuable even before it proves RH, and potentially more valuable if it explains more than RH alone. For example, it could:

give a new formal description of structure in prime gaps,
produce a bound on prime-counting error comparable to an RH-style bound,
explain why the distribution is stable rather than merely state that it is,
generalize to other L-functions, arithmetic traces, or structured sequences,
or discover a new RH-equivalent criterion from an information-theoretic direction.

Thus RH is not the only possible victory condition. It is the gold-standard benchmark. The deeper goal is a new invariant that explains prime-distribution structure in a way that can later be connected to RH or to a stronger distribution theorem.

18.4 Prime-counting error route#

One possible route is through prime-counting error. RH is deeply connected to how tightly prime-counting functions stay near their expected main terms. A compression-based approach would need to discover a bound or regularity in the prime-gap trace that implies an appropriate bound on prime-counting error.

A rough proof-seed form would be:

If every prime-gap window has bounded normalized MDL excess,
then prime-counting error remains within an RH-compatible bound.

This is not yet a theorem. It is a target shape for future formalization.

18.5 Explicit-formula route#

The most RH-adjacent route would connect compression structure in prime gaps to the oscillatory terms generated by zeta zeros in explicit formulas. In that direction, the desired contradiction would look like this:

Assume an off-critical-line zero exists.
Show that such a zero forces a detectable oscillatory or compressible signature in prime-counting error or prime-gap dynamics.
Show that a proven MDL/DCC invariant forbids that signature.
Conclude that no off-critical-line zero can exist.

Symbolically:

off-line zero → forbidden oscillation/compression signature → contradiction → RH

18.6 Equivalent-criterion route#

Another route is discovery of a new RH-equivalent condition. 8Z/DCC could search for a new boundedness, monotonicity, spectral, or compression condition that appears empirically stable. The mathematical task would then be:

prove the new condition is equivalent to RH or implies RH,
then prove the new condition directly.

This may be more realistic than directly proving statements about zeta zeros from compression data.

18.7 Proof-roadmap phases#

Phase	Goal	Status
1	Detect empirical signal in prime-gap order	completed first pass; E1 positive
2	Test signal against stronger null controls	final E2/SEA pass positive; deeper tests needed
3	Extract LZ76 dictionaries, motifs, and multi-scale MDL spectrum	started by RH Arena v0.1/v0.2b
4	Run inverse generator benchmark with holdout validation	started; Markov baseline currently strongest
5	Repair controls and scoring after 1M Arena diagnosis	v0.3 running
6	Relate surviving generator/invariant to prime-counting error or explicit-formula terms	planned
7	Extract candidate invariant	future
8	Prove invariant and distribution/RH implication where possible	future
9	Formal verification where possible	future

18.8 Current position#

The current strongest honest formulation is:

8Z/DCC may help discover a new RH-adjacent regularity by treating prime gaps as a multi-scale information trace and searching for stable compression or generator invariants that bound or constrain prime-distribution error. RH is the benchmark; the deeper prize is the invariant. The project should pursue all three tracks: RH-facing proof bridge, MDLxDCC-native structure invariant, and inverse generator arena.

This keeps the door open without pretending that E1, E2, SEA, or the current RH Arena runs can prove RH by themselves.

19. Inverse Generator Hypothesis#

RH gives a constraint on prime-distribution error. MDLxDCC asks the inverse question:

Can we discover a compact generator, grammar, automaton, dictionary, or multi-scale program that reproduces prime-gap structure better than shuffle, wheel, Markov, or Cramér-like null models?

This is not necessarily a search for a one-line formula for primes. The target is the shortest useful generator of prime-gap structure: an algorithmic or MDL formula that explains more of the real trace than competing null models.

If such a generator survives holdout and scaling, it becomes an invariant candidate. If the invariant later constrains prime-counting error, explicit-formula residuals, or zero-spacing structure, it may become an RH bridge — or a separate MDLxDCC-native path to prime-distribution theory.

19.1 Arena flow#

trace → encoding → sensors → controls → generators → MDL score → holdout validation → invariant candidate

The critical change is that controls are no longer only adversaries. They become baselines that generators must beat. A generator that cannot beat Markov, wheel, or Cramér baselines is not yet an invariant candidate, even if it produces visually plausible prime-gap structure.

19.2 Generator classes#

Class	Role	Interpretation
G0 shuffle	frequency baseline	Destroys order; useful but weak.
G1 wheel-only	arithmetic residue baseline	Captures simple modular constraints.
G2/G3 Markov	transition baseline	Tests whether signal is mostly local first/second-order dynamics.
G4 motif grammar	dictionary/motif baseline	Tests recurring gap-token phrases.
G5 LZ dictionary	compressive replay baseline	Tests whether dictionary fragments can generate realistic structure.
G6 local-density Cramér	density trend baseline	Tests whether changing prime density explains the trace.
G7 hybrid DCC	candidate engine	Combines local density, motif grammar, Markov fallback, and wheel/density fallback under MDL scoring.

19.3 Acceptance rule#

A generator becomes interesting only if it satisfies all of these:

trained only on early data, evaluated on later holdout data,
beats simpler controls under total_mdl_score, not merely raw accuracy,
matches LZ76, entropy, motif spectrum, and next-gap predictive cost within acceptable error,
survives scaling across 100k → 500k → 1M → 2M and beyond,
produces a compact description that can be inspected mathematically.

In this language, the Arena does not “prove RH.” It searches for compact generator/invariant candidates. Proof would be a later certificate.

20. RH Arena v0.3 Results#

The completed v0.3 Arena run is the first cleaned-up version after the v0.2b diagnosis. It separates gap_mod6/gap_mod30 from prime_residue6/prime_residue30, uses wheel-aware Cramér controls with warmup, excludes the Cramér global window-0 artifact from summary aggregation, and scores generators using predictive bits over the holdout length.

20.1 Run integrity#

Field	Value
Script	`8z-rh-arena-v0.3`
Prime range	1,000,000 primes; last prime 15,485,863; 999,999 gaps
Train / test	500,000 / 1,000,000 primes
Window / stride	50,000 / 50,000
Trials	400 per control task; p-value floor ≈ 0.00249
Workers	10 requested / 10 used
Completed work	2,200 window tasks; 88 generator tasks; 30,491 motif rows; 16,892 invariant-candidate rows
Elapsed	52,970 seconds, about 14.7 hours

The run completed cleanly with windows.done.json, generators.done.json, and run.done.json. The result is usable as a v0.3 baseline.

20.2 Core sensor readout#

The core raw-gap signal remains real under most controls, but Markov2 is still the hard boundary.

Trace	Control	Mean ΔLZ	Mean z	Window stability	Readout
`gap`	shuffle	−785.94	−77.49	20/20 negative	Strong order signal.
`gap`	block10	−241.47	−21.32	20/20 negative	Survives short-block preservation.
`gap`	markov1	−204.54	−11.35	20/20 negative	Survives first-order transition baseline.
`gap`	wheel6	−86.68	−8.66	20/20 negative	Survives simple wheel-aware control.
`gap`	local_cramer	−165.82	−7.69	20/20 negative	Survives wheel-aware local-density surrogate.
`gap`	cramer_global	−124.69	−5.80	19/19 negative	Survives after excluding window-0 artifact.
`gap`	block50	−57.68	−5.54	20/20 negative	Still stable.
`gap`	block100	−29.86	−2.92	20/20 negative	Weaker but present.
`gap`	wheel30	−8.86	−0.88	16/20 negative	Mostly absorbed by stronger wheel structure.
`gap`	markov2	+44.71	+2.59	1/20 negative	Signal killed/reversed; current boundary.

gap_div2 repeats the same picture almost exactly. This confirms that the core raw-gap signal is not a token-scaling artifact.

20.3 Residue and Cramér readout#

The v0.3 split clarified the old residue confusion. gap_mod6 and gap_mod30 contain strong, stable gap-residue motif structure. prime_residue6 and prime_residue30 are now explicit prime-residue traces and should be treated as wheel-state diagnostics, not as raw evidence for a new invariant.

Trace	Control	Mean z	Readout
`gap_mod6`	markov2	−29.53	Gap-residue structure survives second-order local baseline.
`gap_mod30`	markov2	−2.44	Weaker but still negative in 20/20 windows.
`prime_residue6`	markov2	−0.15	Mostly absorbed; useful as wheel-state diagnostic.
`prime_residue30`	markov2	+1.02	Mostly absorbed/reversed; not a primary invariant signal.
`gap_mod6`	local_cramer	+14.20	Cramér surrogate over-explains or mismatches this residue view; read cautiously.
`gap_mod30`	local_cramer	+14.93	Same caution.

The earlier huge residue/Cramér readout is no longer treated as a discovery by itself. It is now a diagnostic showing where surrogate design and residue-state modeling still need care.

20.4 Generator leaderboard#

The generator arena changed meaningfully after predictive-bits scoring. For raw gaps, markov2 now beats markov1, and hybrid_dcc moves to second place but still does not beat the Markov2 baseline.

Encoding	Winner	Second	Third	Readout
`gap`	markov2	hybrid_dcc	markov1	Markov2 is the current raw-gap baseline to beat.
`gap_div2`	markov2	hybrid_dcc	markov1	Same as raw gap.
`abs_delta`	markov2	hybrid_dcc	markov1	Second-order transition structure dominates.
`gap_mod6`	markov2	hybrid_dcc	markov1	Strong residue-state transition structure.
`gap_mod30`	markov2	hybrid_dcc	markov1	Same hierarchy at finer wheel scale.
`prime_residue6`	markov2	hybrid_dcc	markov1	Mostly wheel-state transition modeling.
`prime_residue30`	markov2	hybrid_dcc	markov1	Mostly wheel-state transition modeling.
`cumulative_error_walk`	markov1	motif_grammar	lz_dictionary	Still interesting, but not yet a clean invariant.
`normalized_gap`	local_cramer	shuffle	wheel_only	Density trend dominates this encoding.
`gap_minus_logp`	markov1	motif_grammar	lz_dictionary	Needs better density-corrected generator design.

Current generator verdict: the Arena has not yet found a DCC/motif generator that beats Markov2 on raw gaps. That is not a failure. It is the clean next target.

20.5 Invariant candidates#

The v0.3 run emitted invariant_candidates.csv. The strongest stable candidates are not yet raw-gap formulas. They are mostly gap-residue and coarse gap-class motif families.

Encoding	Candidate motif	Controls survived	Window support	Min z	Readout
`gap_mod6`	`2,0,4`	9	20	35.09	Very strong residue motif family.
`gap_mod6`	`4,0,2`	7	20	35.85	Companion residue motif.
`gap_mod6`	`2,4,2`	6	20	37.51	Stable alternating residue motif.
`gap_mod30`	`4,6,14`	6	14	9.67	Finer wheel-residue candidate.
`gap_mod30`	`10,2,10`	5	17	8.61	Stable finer gap-residue motif.
`bucket_log2`	`4,1,4`	7	20	5.03	Coarse size-class candidate.
`gap`	`2,12,10`	4	10	2.30	Weak raw-gap candidate; not invariant-level yet.

This is the right shape of result for an early inverse-generator arena: it does not magically produce an RH invariant, but it tells us where the stable motifs live. The strongest motifs are currently residue/gap-class structures. Raw integer gap motifs are weaker and need deeper filtering.

20.6 Conclusion and next step#

The v0.3 result sharpens the project:

Confirmed: raw gap/gap_div2 LZ structure survives shuffle, block, wheel6, Markov1, and wheel-aware Cramér controls at 1M scale.
Boundary: Markov2 absorbs/reverses the raw-gap LZ advantage.
Generator floor: Markov2 is now the baseline to beat for raw gaps.
DCC status: hybrid_dcc improved to second place for many encodings, but it still does not beat Markov2.
Candidate status: strongest motif families are gap-residue/coarse-class motifs, not a raw-gap invariant yet.

v0.3 does not close the RH problem. It gives a cleaner target: build a generator or invariant that beats Markov2 on holdout while preserving the stable motif and residue-family structure discovered by the Arena.

The next code step should be a focused v0.4 generator arena: Markov2+DCC hybrids, variable-order Markov, motif-conditioned Markov, density-feedback states, and per-encoding generator leaderboards. The next paper step is to update this page after v0.4 shows whether any compact generator can actually beat the Markov2 floor.

21. RH Arena v0.4 Results: Markov2 Floor Attack#

The v0.4 Arena run executes the next step proposed by v0.3: stop asking only whether raw gaps survive controls, and ask whether a compact generator can beat the markov2 holdout floor. This section should be read with a strict split:

Sensor/control layer: raw gap still hits the Markov2 boundary.
Generator/holdout layer: new density-conditioned and DCC-style generators now beat the Markov2 floor.

v0.4 does not remove the Markov2 boundary from the LZ control test. It crosses the Markov2 floor in the generator arena.

21.1 Run integrity#

Field	Value
Script	`8z-rh-arena-v0.4`
Schema	`rh_arena_schema_v0.4`
Prime range	1,000,000 primes; last prime 15,485,863; 999,999 gaps
Train / test	500,000 / 1,000,000 primes
Window / stride	50,000 / 50,000
Trials	400 per control task; p-value floor ≈ 0.00249
Workers	10 requested / 10 used
Encodings	`gap`, `gap_div2`, `abs_delta`, `bucket_log2`, `normalized_gap`, `gap_minus_logp`, `cumulative_error_walk`, `gap_mod6`, `gap_mod30`
Control modes	`shuffle`, `block50`, `block100`, `wheel6`, `wheel30`, `markov1`, `markov2`, `local_cramer`, `cramer_global`
Generator modes	`markov2`, `markov3`, `markov4`, `variable_markov`, `motif_markov2`, `density_markov2`, `markov2_dcc`, `density_dcc`
Completed work	1,620 window tasks; 72 generator tasks; 10,696 motif rows; 5,826 invariant-candidate rows
Elapsed	31,280.007 seconds, about 8 h 41 min 20 s

The run completed cleanly with windows.done.json, generators.done.json, and run.done.json. No completed tasks came from checkpoint; this was a fresh complete run.

21.2 Sensor readout: Markov2 still kills raw-gap LZ advantage#

The raw-gap sensor picture remains almost exactly the v0.3 lesson. gap survives many controls, including Markov1 and wheel-aware Cramér, but not Markov2.

Control	Mean ΔLZ	Mean z	Window stability	Median p	Readout
`shuffle`	-785.94	-77.49	20/20 negative	0.00249	Strong order signal; same broad baseline as before.
`markov1`	-204.54	-11.35	20/20 negative	0.00249	Survives first-order transition baseline.
`wheel6`	-86.68	-8.66	20/20 negative	0.00249	Survives simple mod-6 wheel-aware control.
`local_cramer`	-165.82	-7.69	20/20 negative	0.00249	Survives wheel-aware local-density surrogate.
`cramer_global`	-124.69	-5.80	19/19 negative	0.00249	Survives after excluding the known window-0 artifact.
`block50`	-57.68	-5.54	20/20 negative	0.00249	Stable local/medium-scale signal.
`block100`	-29.86	-2.92	20/20 negative	0.00249	Weaker but still negative in all windows.
`wheel30`	-8.86	-0.88	16/20 negative	0.15835	Mostly absorbed by stronger wheel structure.
`markov2`	44.71	+2.59	1/20 negative	0.99875	Signal killed/reversed; sensor-level boundary.

gap_div2 repeats the same pattern: Markov2 is again positive/reversed, while shuffle, Markov1, wheel6, block, and Cramér controls remain negative. The important interpretation is that the LZ sensor no longer says “raw gaps beat everything.” It says “raw gaps beat first-order/local/wheel/Cramér controls, but second-order transition memory is enough to absorb this sensor.”

21.3 Generator floor: Markov2 is beaten on holdout#

The generator result is the new signal. The v0.4 challengers were designed specifically to attack the Markov2 floor. They succeeded across all tested encodings.

Encoding	Winner	Runner-up	Δ holdout vs Markov2	Δ bits/token vs Markov2	Winner margin
`gap`	`density_markov2`	`density_dcc`	-7,953.28	-0.015946	4,297.46
`gap_div2`	`density_markov2`	`density_dcc`	-7,954.26	-0.015946	4,298.46
`abs_delta`	`density_markov2`	`density_dcc`	-7,262.93	-0.014544	5,486.79
`bucket_log2`	`density_markov2`	`markov2`	-1,329.38	-0.002755	1,329.38
`normalized_gap`	`markov2_dcc`	`density_dcc`	-1,568.44	-0.000723	43.88
`gap_minus_logp`	`density_dcc`	`markov2_dcc`	-372,137.19	-0.743387	2,470.89
`cumulative_error_walk`	`density_dcc`	`markov2_dcc`	-112,715.09	-0.225453	14,854.76
`gap_mod6`	`variable_markov`	`markov4`	-50,185.35	-0.100330	5,160.61
`gap_mod30`	`markov3`	`density_markov2`	-40,125.27	-0.080438	39,471.28

For raw gap, the first accepted breach is not a complicated motif grammar. The best generator is density_markov2: a Markov2 spine conditioned by local density state. The DCC hybrid is second, meaning DCC helps but is not yet the simplest winning explanation.

Rank	Generator	Holdout score	Δ holdout vs Markov2	Bits/token	Δ bits/token	Motif distance	LZ error
1	`density_markov2`	1,936,623.45	-7,953.28	3.872776	-0.015946	0.01028	0.02110
2	`density_dcc`	1,940,920.91	-3,655.82	3.881239	-0.007483	0.03494	0.03376
3	`markov2`	1,944,576.73	0.00	3.888722	0.000000	0.01262	0.02565
4	`markov2_dcc`	1,947,542.12	2,965.39	3.894477	0.005755	0.05036	0.04351
5	`motif_markov2`	1,965,111.90	20,535.17	3.929558	0.040836	0.07454	0.06210

v0.4 generator verdict: the Markov2 floor is no longer unbroken. The best current raw-gap explanation is Markov2 plus local density state, with DCC hybrids close behind but not yet dominant.

21.4 Invariant candidates: robust motifs still live mostly in residue space#

The invariant-candidate file contains 5,826 rows. Of these, 168 survive at least 6 controls, 39 survive at least 8 controls, and 22 survive all 9 controls. Among the all-control survivors, 21/22 are gap_mod6 motif families and one is bucket_log2.

Encoding	Motif	Length	Controls	Window support sum	Min z	Median z	Median lift	Total count	Score
`gap_mod6`	`2,0,0,4,2`	5	9	175	20.15	93.05	1.979	148,955	141.27
`gap_mod6`	`4,2,0,0,4`	5	9	171	19.43	89.34	1.956	143,686	136.21
`gap_mod6`	`2,4,2,0,4`	5	9	171	4.48	34.95	1.208	196,367	31.40
`bucket_log2`	`3,3,4,1`	4	9	91	3.94	5.53	1.186	13,199	27.60
`gap_mod6`	`4,2,0,4`	4	9	176	3.17	69.15	1.400	357,528	22.23

This confirms the v0.3 motif lesson at higher focus: the most stable candidate layer is still residue/gap-class structure, especially gap_mod6. Raw integer gap motifs exist, but they are weaker and do not yet look like the final invariant object.

21.5 Conclusion and next step#

Confirmed: v0.4 completed the targeted Markov2-floor attack cleanly.
Sensor boundary: raw gap/gap_div2 still lose or reverse against Markov2 in LZ-control testing.
Generator breakthrough: all 9 tested encodings now have a non-Markov2 generator winner beating the Markov2 holdout baseline.
Raw-gap winner: density_markov2, not full DCC, is the smallest current winner.
DCC status: density_dcc and markov2_dcc are strong, especially on density-adjusted/error-walk encodings, but they still need ablation and scaling.
Invariant status: stable motifs remain mostly gap-residue/coarse-class candidates, not a theorem-ready raw-gap formula.

v0.4 changes the project state from “Markov2 is the generator floor to beat” to “Markov2 has been beaten empirically at 1M by density-conditioned generators; now prove the gain survives scaling, ablation, and formalization.”

The v0.5 smoke now changes the next code step: scale the whole ablation ladder, not only density_markov2/density_dcc. At smoke scale, density_markov1 is the raw-gap winner, variable_markov is the gap_mod6 winner, and DCC/motif terms remain candidates rather than confirmed carriers. The next paper step is to compress the stable gap_mod6 motif families and the raw-gap density-Markov gain into one smaller invariant candidate.

22. RH Arena v0.5 Smoke Results: Density Ablation and Split Stability#

The first v0.5 run is a smoke-scale ablation, not a replacement for the v0.4 1M result. Its value is that it cleanly separates the raw Markov2 boundary from the generator layer and begins isolating which ingredient carries the generator gain.

v0.5 smoke verdict: the raw-gap LZ sensor boundary is still Markov2, but the generator floor is beaten again. At 120k scale the raw-gap winner is density_markov1, not a motif/DCC hybrid. This points to density-conditioned transition structure as the first ingredient to scale-test.

22.1 Run integrity#

Field	Value
Script	`8z-rh-arena-v0.5`
Schema	`rh_arena_schema_v0.5`
Run size	120,000 primes; last prime 1,583,539; 119,999 gaps
Train / test	60,000 / 120,000 primes
Windows	20,000-token window; 20,000 stride; 6 windows per encoding/control
Trials	50 sampled controls; p floor 0.019608
Workers	1 requested / 1 used
Completed work	120 window tasks; 272 generator tasks; 544 generator replicates
Generator splits	`main`, `early`, `mid`, `late`
Density guard	`safe` mode; safe density generators do not use actual holdout prime coordinates; oracle diagnostic intentionally flagged

22.2 Sensor readout: Markov2 still absorbs raw-gap LZ advantage#

The sensor layer remains consistent with the earlier v0.3/v0.4 story. Raw gap and gap_div2 are strongly more compressible than shuffle, wheel6, Markov1, and local Cramér controls, but not under Markov2. The gap_mod6 residue trace remains strongly structured even under Markov2.

Encoding	Control	Mean ΔLZ	Mean z	Negative windows	Readout
`gap`	shuffle	−343.81	−54.76	6/6	strong order signal
`gap`	local_cramer	−151.40	−9.81	6/6	survives wheel-aware local density control
`gap`	markov1	−83.46	−6.58	6/6	survives first-order transition control
`gap`	markov2	+25.79	+2.19	1/6	raw-gap LZ advantage absorbed/reversed
`gap_div2`	markov2	+26.32	+2.20	1/6	same boundary as raw gap
`gap_mod6`	markov2	−73.37	−14.58	6/6	residue-state structure survives Markov2
`normalized_gap`	markov2	+152.83	+7.77	0/6	density-adjusted trace is not a raw sensor win here

22.3 Generator ablation: density_markov1 wins raw gap at smoke scale#

The generator layer is where v0.5 changes the working hypothesis. On raw gap/gap_div2, the best safe non-Markov2 generator is density_markov1 across all four tested splits. That is a useful simplification: at this scale, adding Markov order 2, motif injection, or full DCC does not improve the raw-gap winner.

Split	Encoding	Winner	Δholdout vs Markov2	Runner-up	Readout
main	`gap`	`density_markov1`	−2,423.76	`density_wrong_scale_markov2`	density + first-order transition wins raw gap
early	`gap`	`density_markov1`	−2,343.44	`density_wrong_scale_markov2`	same winner in early split
mid	`gap`	`density_markov1`	−3,146.49	`density_only`	density itself becomes strong
late	`gap`	`density_markov1`	−4,150.22	`density_only`	density signal strengthens later
main	`gap_div2`	`density_markov1`	−2,424.96	`density_wrong_scale_markov2`	same as raw gap
main	`gap_mod6`	`variable_markov`	−5,853.89	`markov3`	residue trace prefers variable-order memory
main	`normalized_gap`	`dcc_only`	−39,430.74	`density_dcc_full`	read as density-adjusted diagnostic, not raw-gap invariant

The leakage guard is also behaving as intended: all safe density generators are marked non-leaking, while density_oracle_markov2 is deliberately flagged as using actual test-prime coordinates and should not be used as a public winner.

22.4 Conclusion and next step#

Confirmed by smoke: v0.5 runs cleanly, resumes/writes all expected reports, and the leakage guard works.
Sensor layer: raw-gap Markov2 boundary remains; gap_mod6 still carries strong residue-state structure.
Generator layer: Markov2 holdout floor is beaten again, but the raw-gap winner is simpler than expected: density_markov1.
Ablation readout: density clock + low-order transition is the first ingredient to scale-test; motif/DCC interactions remain candidates, not winners, on raw gap at this smoke scale.

v0.5 changed the next test: do not only scale density_markov2. Scale the whole ablation ladder — density_only, density_markov0, density_markov1, density_markov2, density-DCC variants, and variable Markov — and ask which ingredient survives at 1M, 2M, and beyond.

v0.6 update: that scaling test has now been run through 1M. The smoke winner density_markov1 did not remain the raw-gap winner; the scaled story is density/scale vs generic clock-DCC.

23. RH Arena v0.6 Carrier Isolation: Density vs Clock vs Residual#

The v0.6 run answers the question opened by v0.5: was the raw-gap generator gain really a density signal, or mostly a generic position/scale clock? The 4-worker sequence ran three stages — 150k smoke, 500k bridge, and 1M carrier isolation — with the optional 2M raw-carrier stage left off.

v0.6 verdict: the Markov2 generator floor is still beaten at 1M, but the carrier is not clean pure density. The full raw-gap split is won by density_wrong_scale_markov2; the mid and late raw-gap splits are won by clock_dcc_no_motif. The best current description is scale-conditioned transition structure, with density and clock-DCC variants both active.

23.1 Run integrity and scale sequence#

Stage	Primes	Train	Trials	Workers	Elapsed	Window tasks	Generator replicates
150k smoke	150,000	75,000	50	4/4	1.144 h	120	800
500k bridge	500,000	250,000	75	4/4	5.957 h	360	1,800
1M carrier	1,000,000	500,000	100	4/4	18.199 h	840	3,000

The 1M carrier run used 1,000,000 primes, 500,000 train primes, 50,000-token windows, 100 sampled controls, and 4/4 workers. It completed 840 window tasks and 3,000 generator replicates. The leakage guard is clean: 0 failed guard replicates and no oracle-density diagnostics in the default v0.6 run.

23.2 Sensor readout: raw-gap Markov2 boundary remains#

The sensor layer remains consistent with v0.3–v0.5: raw gap is strongly more compact than shuffle, wheel6, Markov1, and Cramér-like controls, but Markov2 still absorbs or reverses the raw-gap LZ advantage.

1M raw `gap` control	Mean ΔLZ	Mean z	Negative fraction	Median p	Reading
`shuffle`	-785.97	-76.55	100.0%	0.00990	huge order signal
`wheel6`	-86.50	-8.70	100.0%	0.00990	survives mod-6 wheel
`wheel30`	-8.61	-0.84	75.0%	0.19802	weak / near local wheel boundary
`markov1`	-204.73	-11.49	100.0%	0.00990	survives first-order transition
`markov2`	44.97	2.60	5.0%	1.00000	absorbed/reversed boundary
`local_cramer`	-165.53	-7.62	100.0%	0.00990	survives local Cramér
`cramer_global`	-124.34	-5.81	100.0%	0.00990	survives global Cramér

The stronger residue traces remain visible: gap_mod6 vs wheel30 has ΔLZ −1,476.84 and z −362.69; gap_mod6 vs Markov2 still has ΔLZ −198.95 and z −29.99. This keeps the residue-state track separate from the raw-gap generator track.

23.3 Generator carrier: density wins main, clock-DCC wins mid/late#

At 1M, non-Markov2 generators still beat the Markov2 holdout floor. The key result is the split diagnosis: density wins the full/main raw-gap split, while clock-DCC wins the mid and late raw-gap splits.

Split	Winner	Δ vs Markov2	Best clock	Best density	Best residual	Diagnosis
early	`density_wrong_scale_markov2`	-4,057.07	`clock_dcc_no_motif`	`density_wrong_scale_markov2`	`density_residual_dcc_no_motif`	density_beats_generic_clock
main	`density_wrong_scale_markov2`	-8,641.49	`clock_dcc_no_motif`	`density_wrong_scale_markov2`	`density_residual_dcc_no_motif`	density_beats_generic_clock
mid	`clock_dcc_no_motif`	-1,402.59	`clock_dcc_no_motif`	`density_dcc_no_motif`	`density_residual_dcc_no_motif`	generic_clock_beats_density
late	`clock_dcc_no_motif`	-1,414.21	`clock_dcc_no_motif`	`density_dcc_no_motif`	`density_residual_dcc_no_motif`	generic_clock_beats_density

For the 1M main raw-gap split, the top generator leaderboard is:

Rank	Generator	Holdout score	Δ holdout vs Markov2	Δ predictive bits	Δ bits/token	Guard
1	`density_wrong_scale_markov2`	1,935,933.72	-8,641.49	-8,642.41	-0.017285	pass
2	`density_dcc_no_motif`	1,935,952.28	-8,622.92	-8,667.85	-0.017336	pass
3	`density_lagged_markov2`	1,936,595.22	-7,979.99	-7,980.31	-0.015961	pass
4	`density_markov2`	1,936,595.39	-7,979.81	-7,980.31	-0.015961	pass
5	`density_shuffled_markov2`	1,936,595.42	-7,979.79	-7,980.31	-0.015961	pass
6	`clock_dcc_no_motif`	1,942,231.68	-2,343.53	-2,277.21	-0.004554	pass
7	`density_residual_dcc_no_motif`	1,942,242.75	-2,332.45	-2,272.05	-0.004544	pass
8	`clock_sqrt_markov2`	1,942,629.68	-1,945.52	-1,905.45	-0.003811	pass
9	`clock_wrong_scale_markov2`	1,943,066.91	-1,508.30	-1,468.36	-0.002937	pass
10	`clock_square_markov2`	1,943,262.57	-1,312.63	-1,275.22	-0.002550	pass
11	`clock_markov2`	1,943,369.42	-1,205.78	-1,166.67	-0.002333	pass
12	`clock_lagged_markov2`	1,943,369.59	-1,205.61	-1,166.04	-0.002332	pass

The close cluster of density_markov2, density_lagged_markov2, and density_shuffled_markov2 remains important: it says scale state matters, but exact prime-coordinate density is not yet isolated as the sole cause. The new seed is clock_dcc_no_motif, because it wins mid/late raw-gap splits and is the best clock-family challenger.

23.4 Encoding separation: raw scale carrier vs residue memory#

Encoding	Winner	Δ vs Markov2	Carrier diagnosis	Reading
`gap`	`density_wrong_scale_markov2`	-8,641.49	density_beats_generic_clock	raw carrier: density-scale main split
`gap_div2`	`density_wrong_scale_markov2`	-8,641.19	density_beats_generic_clock	same as raw gap
`gap_minus_logp`	`clock_markov1`	-1,051,637.89	generic_clock_beats_density	generic clock dominates density-adjusted residual scale
`normalized_gap`	`density_dcc_full`	-3,360.80	density_beats_generic_clock	DCC-density wins normalized scale
`gap_mod6`	`variable_markov`	-50,173.70	generic_clock_beats_density	higher / variable-order memory
`gap_mod30`	`markov3`	-40,124.87	density_beats_generic_clock	higher-order Markov memory

This separation is useful. Raw gap/gap_div2 are now a scale-carrier problem. gap_mod6 and gap_mod30 are not primarily density-clock stories; they prefer higher-order or variable-order memory.

23.5 Invariant candidates#

The 1M v0.6 run produced 2,050 invariant candidates. High-control survivors remain concentrated in gap_mod6, with one sparse gap_div2 survivor.

Threshold	Candidate count
≥ 3 controls	271
≥ 4 controls	97
≥ 5 controls	34
≥ 6 controls	7
≥ 7 controls	4

All-control candidates in this run:

Encoding	Motif	Length	Controls	Support sum	Min z	Median z
`gap_mod6`	`4,2,0,4`	4	7	100	3.13	137.14
`gap_mod6`	`4,2,0,4,2`	5	7	93	2.83	72.05
`gap_mod6`	`2,0,4,2`	4	7	88	2.43	139.43
`gap_div2`	`4,3,2,7,21`	5	7	7	7.25	19.23

The highest-support all-control motifs again live mostly in the residue alphabet. They are strong empirical candidates, not theorem-ready invariants.

23.6 Conclusion and next step#

Confirmed: v0.6 completed the density-vs-clock-vs-residual carrier isolation run cleanly.
Sensor boundary: raw gap still hits Markov2 as the hard LZ-control boundary.
Generator floor: Markov2 is beaten again at 1M on holdout.
Carrier: not pure density; best wording is scale-conditioned transition structure.
DCC status: clock_dcc_no_motif is now important because it wins mid/late raw-gap splits.
Residue status: gap_mod6/gap_mod30 remain a higher-order memory track.

Do not claim that density explains the prime-gap signal. The current result says that density, generic clock, and clock-DCC are now separated enough to design the next sharper arena.

The next code step should be a smaller v0.7 carrier-stress arena: hold density_wrong_scale_markov2, density_markov2, density_lagged_markov2, density_shuffled_markov2, clock_dcc_no_motif, and density_residual_dcc_no_motif in direct paired competition, with stricter split-consistency penalties and an optional focused 2M raw-gap carrier run.

24. RH Arena v0.7 Carrier Stress: Density Placebo vs Generic Clock#

The v0.7 run is the surgical test proposed after v0.6. It asks whether the raw-gap generator gain comes from true train-extrapolated prime-density information, a generic monotone position/scale clock, wrong-scale regularization, or DCC state-conditioning. The ladder ran from 20k through 1M with 4 workers.

v0.7 verdict: the Markov2 generator floor is still beaten, but the raw-gap carrier is not isolated prime density. At 1M, clock_loglog_markov2 wins raw gap in all seven tested splits. Density variants remain strong, especially in the main split, but the density-placebo cluster shows that generic scale-clock state is the stronger current explanation.

24.1 Run integrity and ladder#

Stage	Primes	Train	Trials	Workers	Elapsed	Window tasks	Generator tasks	Replicates
20k smoke	20,000	10,000	10	4/4	0.22s	80	216	216
150k validation	150,000	75,000	40	4/4	2.210 h	525	945	2835
500k bridge	500,000	250,000	60	4/4	13.494 h	980	1323	5292
1M focused	1,000,000	500,000	80	4/4	41.416 h	980	1323	6615

The 1M focused stage used 1,000,000 primes, 500,000 train primes, 50,000-token windows, 80 sampled controls, and 4/4 workers. It completed 980 window tasks, 1,323 generator score tasks, and 6,615 generator replicates. The leakage guard is clean: 0 failed guard replicates and no oracle-density diagnostics.

24.2 Sensor readout: raw-gap Markov2 boundary still holds#

The v0.7 sensor layer preserves the v0.3–v0.6 boundary. Raw gap is strongly more compact than shuffle, Markov1, wheel6, and Cramér-like controls, but Markov2 absorbs/reverses the LZ advantage.

1M raw `gap` control	Mean ΔLZ	Mean z	Negative fraction	Median p	Dict overlap
`shuffle`	-786.00	-76.91	100.0%	0.01235	0.165
`markov1`	-204.84	-11.55	100.0%	0.01235	0.230
`wheel6`	-86.65	-8.77	100.0%	0.01235	0.250
`local_cramer`	-164.90	-7.64	100.0%	0.01235	0.230
`cramer_global`	-124.31	-5.86	100.0%	0.01235	0.231
`wheel30`	-8.51	-0.84	75.0%	0.19753	0.294
`markov2`	45.13	2.65	5.0%	1.00000	0.271

The residue track is still separate: gap_mod6 vs Markov2 has ΔLZ −199.06 and z −30.59; gap_mod30 vs Markov2 has ΔLZ −31.17 and z −2.50.

24.3 Generator carrier: loglog clock wins raw gap across splits#

Split	Winner	Δ vs Markov2	Best density	Density − clock Δ	Diagnosis
early	`clock_loglog_markov2`	-4378.92	`density_wrong_scale_markov2`	321.61	generic_clock_beats_density
main	`clock_loglog_markov2`	-8784.21	`density_wrong_scale_markov2`	142.46	density_clock_tie
mid	`clock_loglog_markov2`	-5187.52	`density_dcc_no_motif`	4552.44	generic_clock_beats_density
late	`clock_loglog_markov2`	-5245.96	`density_dcc_no_motif`	4579.67	generic_clock_beats_density
rolling1	`clock_loglog_markov2`	-6009.94	`density_wrong_scale_markov2`	348.21	generic_clock_beats_density
rolling2	`clock_loglog_markov2`	-6248.92	`density_dcc_no_motif`	5526.25	generic_clock_beats_density
rolling3	`clock_loglog_markov2`	-6293.48	`density_dcc_no_motif`	5484.38	generic_clock_beats_density

For the 1M main raw-gap split, the top generator leaderboard is:

Rank	Generator	Family	Holdout score	Δ holdout vs Markov2	Δ bits/token	Guard
1	`clock_loglog_markov2`	clock_placebo	1,935,792.89	-8,784.21	-0.017473	pass
2	`density_wrong_scale_markov2`	density_ablation	1,935,935.36	-8,641.74	-0.017285	pass
3	`density_dcc_no_motif`	dcc_ablation	1,935,953.87	-8,623.23	-0.017336	pass
4	`density_reversed_markov2`	density_placebo	1,936,557.75	-8,019.35	-0.015961	pass
5	`density_lagged_markov2`	density_ablation	1,936,596.81	-7,980.29	-0.015961	pass
6	`density_markov2`	density_ablation	1,936,597.07	-7,980.03	-0.015961	pass
7	`density_shuffled_markov2`	density_ablation	1,936,597.07	-7,980.03	-0.015961	pass
8	`clock_log_markov2`	clock_placebo	1,937,320.22	-7,256.88	-0.014410	pass
9	`clock_dcc_no_motif`	clock_dcc_ablation	1,942,238.61	-2,338.49	-0.004554	pass
10	`density_residual_dcc_no_motif`	density_residual_ablation	1,942,249.72	-2,327.38	-0.004544	pass
11	`clock_sqrt_markov2`	clock_ablation	1,942,631.62	-1,945.48	-0.003811	pass
12	`clock_wrong_scale_markov2`	clock_ablation	1,943,068.84	-1,508.26	-0.002937	pass

The main split is close: clock_loglog_markov2 beats density_wrong_scale_markov2 by only 142.46 holdout units, so it is a density/clock tie under the configured carrier margin. But the split ladder is not close: early, mid, late, and rolling splits consistently prefer generic log/loglog clock states.

24.4 Density/clock placebo matrix#

The density placebo matrix is the main new information. On the 1M main raw-gap split, density_markov2, density_lagged_markov2, and density_shuffled_markov2 all sit at about Δ −7,980 vs Markov2; density_reversed_markov2 is also strong at Δ −8,019. This cluster means exact prime-coordinate density is not isolated as the unique cause. The signal behaves more like a monotone scale-state transition law.

By encoding, the 1M main split reads:

Encoding	Winner	Δ vs Markov2	Carrier diagnosis
`cumulative_error_walk`	`clock_loglog_markov2`	-163,772.17	generic_clock_beats_density
`gap`	`clock_loglog_markov2`	-8,784.21	density_clock_tie
`gap_minus_logp`	`clock_loglog_markov2`	-456,581.80	generic_clock_beats_density
`gap_mod30`	`markov3`	-40,124.81	density_beats_generic_clock
`gap_mod6`	`variable_markov`	-50,186.51	generic_clock_beats_density
`normalized_gap`	`density_dcc_full`	-3,200.57	density_beats_generic_clock
`prime_residue30`	`density_markov2`	-529.95	density_clock_tie

24.5 Candidate law sheets#

The new candidate_law_sheets.csv/json output works as a formalization-facing summary. For raw gap, the top law is clock_loglog_markov2: split count 7, split-win count 7, mean Δ vs Markov2 -6,021.28, best Δ -8,784.21, and candidate law “position-clock conditioned transition law.”

For the residue track, the top law remains gap_mod6 variable_markov: split count 7, split-win count 7, mean Δ vs Markov2 -25,018.00, best Δ -50,186.51, and candidate law “symbolic Markov transition law.”

Implementation note: the v0.7 law-sheet failure-mode column over-penalizes all candidates with missing_splits=rolling. The generator output contains rolling1, rolling2, and rolling3; the law-sheet checker expects literal rolling. This is a reporting/hygiene bug, not a generator-result failure. It should be fixed in v0.7.1 before public release.

24.6 Conclusion and next step#

Confirmed: v0.7 completed the carrier-stress ladder through 1M with clean leakage guard.
Sensor boundary: raw gap still hits Markov2 as the hard LZ-control boundary.
Generator floor: Markov2 is beaten again on holdout.
Raw-gap carrier: clock_loglog_markov2 is now the best raw-gap law candidate across splits; pure density is not isolated.
Residue carrier: gap_mod6 remains a variable-order Markov memory problem; gap_mod30 remains a higher-order Markov problem.
DCC status: DCC variants are still useful challengers, but they are not the raw-gap winner in v0.7.

v0.7 moves the project from “density/clock/DCC are all alive” to a sharper statement: the raw-gap holdout gain is best explained by generic log/loglog scale-clock conditioned transition structure, while residue alphabets carry separate higher-order symbolic memory.

The next code step should be v0.7.1 hygiene plus a narrow 2M/5M replication of the top law candidates. The next formal step is to write the clock_loglog_markov2 transition law and the gap_mod6 variable_markov residue law as compact mathematical objects, then test whether either connects to prime-counting error or an RH-adjacent criterion.

Appendix A – E1 Result Tables#

Tables are shown in Section 6 above.

Appendix B – Interpretation Ladder#

Level	Claim	Status
L1	Prime-gap order compresses better than shuffled gaps	✅ supported by E1
L2	Signal is not only marginal distribution	✅ supported by E1
L3	Signal survives wheel-aware controls	✅ supported by final E2/SEA batch
L4	Signal survives Markov-preserving controls	✅ supported by final E2/SEA batch
L5	Signal scales with prime depth	✅ supported through 100k → 500k → 1M → 2M for raw gap under Markov/wheel controls
L6	Signal relates to zeta-zero statistics	🔮 speculative
L7	DCC edge-of-chaos explains critical line	🔮 speculative
L8	AC/Zero Framework explains RH truth/proof difficulty	🔮 highly speculative
L9	Zero Framework at axiom level: \(0 \ne \emptyset\) as a possible foundational distinction	🔮 philosophical hypothesis, pre-formal
L10	Compact generator beats the Markov2 holdout floor	✅ empirically achieved in v0.4 across 9/9 tested encodings; still not a formal invariant or proof
L11	Generator gain survives scaling, ablation, and independent seeds	✅ partially supported by v0.6 150k→500k→1M sequence; carrier not yet uniquely identified
L12	Carrier is isolated as pure density, generic clock, residual, or DCC	🧪 v0.6 narrows it to scale-conditioned transition structure; density and clock-DCC both active
L13	Generator/invariant connects to prime-counting error, explicit formula residuals, or RH-equivalent criterion	🔮 future formal bridge
L14	Carrier stress separates prime-specific density from generic clock/scale state	🧪 v0.7: raw-gap winner is `clock_loglog_markov2`; exact density is not isolated

Appendix C – Working Language Rules#

Use: “compression-detectable structure”, “order-sensitive signal”, “generator candidate”, “invariant candidate”, “holdout validation”, “preliminary empirical result”, “null controls”, “not an RH proof”. Avoid: “RH confirmed”, “proof”, “annihilation of zeta zeros”, “AC requires RH”, “generator proves RH”.

Appendix D – Methodology & Reproducibility#

D1. E1 Shuffle Protocol#

For each analysis window independently, the exact multiset of gap tokens inside that window is shuffled. This preserves the local gap distribution of that window and destroys only the ordering inside the window.

This matters because the E1 claim is not that prime gaps have a special distribution. The claim is narrower: the real order of those same gaps is more compressible than shuffled order.

D2. Windowing#

E1 20k run: window = 5,000, stride = 5,000.
E1/E2 100k runs: window = 10,000, stride = 10,000.
E2/SEA 500k and 1M runs: window = 20,000, stride = 20,000.
E2/SEA 2M runs: window = 40,000, stride = 40,000.
Mean LZ76: averaged across windows.

The final SEA batch contains non-overlapping main windows: 10 windows at 100k, 25 windows at 500k, 50 windows at 1M, and 50 windows at 2M.

D3. Encodings#

Each encoding maps the prime-gap sequence into a token sequence before LZ76 is computed:

gap: raw integer prime-gap values.
gap_div2: normalized even gap values after the initial special case.
delta: first difference, \(g_{n+1} - g_n\).
abs_delta: absolute first difference, \(|g_{n+1} - g_n|\).
gap_mod6 / gap_mod30: gap values modulo wheel bases.
prime_residue6 / prime_residue30: prime residue-state traces, kept separate from gap-residue traces.
bucket_log2: logarithmic bucket encoding of gap size.
normalized_gap: gap divided by a local logarithmic scale.
gap_minus_logp: gap minus a local expected logarithmic scale.
cumulative_error_walk: cumulative walk of gap excess against local expectation.

D3b. LZ76 Implementation Note#

LZ76 is computed as a phrase-count complexity measure over token sequences, not as compressed byte size. Each encoding first maps the prime-gap sequence into symbolic integer tokens. The custom parser then scans the token stream left-to-right and counts the number of new phrases needed to parse it. Lower phrase count means the sequence is more compressible under this LZ76-style dictionary construction.

phrases = {}
i = 0
while i < len(tokens):
    extend the current phrase until it is new
    add phrase to dictionary
    phrase_count += 1
    i = next unread token

This is sufficient for the current signal tests, but future reproducibility releases should include the exact parser code, fixed test vectors, and a comparison with byte-level compressors.

D4. Main Statistic#

The primary statistic is:

Delta_LZ = LZ76(real) - mean(LZ76(control))

A negative value means the real ordered sequence is more compressible than the control sequences.

D5. Permutation p-value#

The permutation p-value is computed as:

p = (1 + count(control_LZ <= real_LZ)) / (trials + 1)

The minimum possible p-value depends on trial count: with 100 trials it is about 0.0099, with 300 trials about 0.00332, with 700 trials about 0.00143, with 1000 trials about 0.000999, and with 2000 trials about 0.0005. Low p-values here should still be read as sampled-control dominance, not as a final theorem-level significance claim.

D6. Known Likely Sources of Signal#

The E1 compression advantage may partly reflect known arithmetic constraints and scale effects, including:

changing average prime-gap size with scale, approximately related to \(\log n\),
residue-class restrictions such as \(6k \pm 1\),
local gap correlations,
twin-prime and prime-constellation effects,
sieve structure and wheel-like constraints.

This is why E2 adds block, wheel-aware, Markov-preserving, Cramér-like, and depth/scaling controls. The purpose is to separate obvious arithmetic structure from deeper sequential compression structure.

D7. Current Limitation#

The current result is a completed first-pass E2/SEA signal plus v0.3–v0.7 generator-carrier evidence, not a final theorem. Raw gap defeats shuffle, wheel6, Markov1, and Cramér-like controls but still hits Markov2 as the hard LZ-control boundary. The generator layer now beats Markov2 on holdout at 1M, and v0.7 shows that the raw-gap carrier is not clean pure density: generic loglog scale-clock conditioning currently wins the split ladder. The work still needs a v0.7.1 law-sheet hygiene patch, focused 2M/5M replication, independent code review, exact parser test vectors, and a formal bridge to prime-counting error or an RH-equivalent criterion.

D8. Arena Version Notes#

v0.1: first RH Arena: windows, controls, sensors, motif extraction, generators, holdout outputs.
v0.2b: resume/workers/progress hardening and 1M generator benchmark.
v0.3: gap-residue vs prime-residue split, wheel-aware Cramér, Cramér warmup, predictive-bits generator scoring, invariant_candidates.csv, and completed 1M analysis.
v0.4: focused Markov2-floor generator attack; non-Markov2 generators beat Markov2 across tested encodings at 1M.
v0.5: density/DCC ablation and split-validation smoke; identified density-conditioned transition structure as first carrier candidate.
v0.6: clock-vs-density-vs-residual carrier isolation; 150k→500k→1M sequence completed; scale-conditioned transition carrier found, not pure density.
v0.7: carrier-stress ladder with density/clock placebo matrix, DCC-no-motif decomposition, split-consistency law sheets; raw-gap carrier shifts to clock_loglog_markov2, while gap_mod6 remains variable_markov.

Appendix E – Next Update Slot#

Next: Patch v0.7.1 law-sheet split naming, then run a narrow 2M/5M focused replication for clock_loglog_markov2, clock_log_markov2, density_wrong_scale_markov2, density_dcc_no_motif, density_markov2, density_shuffled_markov2, clock_dcc_no_motif, variable_markov, and markov3. Do not widen until the log/loglog-clock law and the residue variable-Markov law are written as compact candidate invariants.

Version 0.12 · May 12, 2026 · v0.7 carrier-stress integrated · raw-gap Markov2 sensor boundary preserved · Markov2 holdout floor beaten again at 1M · carrier narrowed from density-scale to generic log/loglog scale-clock transition structure · no RH proof claim · next: v0.7.1 law-sheet patch and 2M/5M focused replication.

Abstract#

1. Purpose#

2. Background: Why Prime Gaps?#

3. DCC / Edge-of-Chaos Motivation#

4. Important Boundary: Not an RH Proof#

5. E1 Experiment: Prime-Gap LZ76 Against Shuffled Controls#

6. E1 Results#

6.1 First run: 20,000 primes#

6.2 Stronger run: 100,000 primes#

7. Interpretation of E1#

8. E2: Stronger Null Controls#

9. E2 Acceptance Criteria#

10. Final E2 / SEA Results: Stronger Controls and Scaling#

10.1 Completed SEA final runs#

10.2 Final raw gap summary#

10.3 Interpretation of the final SEA result#

10.4 Final E2 / SEA thesis#

10.5 Caution#

11. Relation to 8Z/DCC Founding Hypothesis#

11.1 Two-track strategy#

12. Revised Position on Original RH/AC Hypothesis#

13. Mathematics, Reality, and the Zero Framework#

13.1 The two zeros#

13.2 Analytic continuation as ontological smuggling#

13.3 Axiom-level vs. interpretation-level critique#

13.4 Implication for RH specifically#

14. Corrected Language About Zeta Zeros#

15. Current Working Thesis#

16. Next Experiments#

17. Preliminary Conclusion#

18. Three-Track Roadmap: RH, MDLxDCC Invariant, Generator Arena#

18.1 Three live tracks#

18.2 The required bridge#

18.3 Why not just prove RH?#

18.4 Prime-counting error route#

18.5 Explicit-formula route#

18.6 Equivalent-criterion route#

18.7 Proof-roadmap phases#

18.8 Current position#

19. Inverse Generator Hypothesis#

19.1 Arena flow#

19.2 Generator classes#

19.3 Acceptance rule#

20. RH Arena v0.3 Results#

20.1 Run integrity#

20.2 Core sensor readout#

20.3 Residue and Cramér readout#

20.4 Generator leaderboard#

20.5 Invariant candidates#

20.6 Conclusion and next step#

21. RH Arena v0.4 Results: Markov2 Floor Attack#

21.1 Run integrity#

21.2 Sensor readout: Markov2 still kills raw-gap LZ advantage#

21.3 Generator floor: Markov2 is beaten on holdout#

21.4 Invariant candidates: robust motifs still live mostly in residue space#

21.5 Conclusion and next step#

22. RH Arena v0.5 Smoke Results: Density Ablation and Split Stability#

22.1 Run integrity#

22.2 Sensor readout: Markov2 still absorbs raw-gap LZ advantage#

22.3 Generator ablation: density_markov1 wins raw gap at smoke scale#

22.4 Conclusion and next step#

23. RH Arena v0.6 Carrier Isolation: Density vs Clock vs Residual#

23.1 Run integrity and scale sequence#

23.2 Sensor readout: raw-gap Markov2 boundary remains#

23.3 Generator carrier: density wins main, clock-DCC wins mid/late#

23.4 Encoding separation: raw scale carrier vs residue memory#

23.5 Invariant candidates#

23.6 Conclusion and next step#

24. RH Arena v0.7 Carrier Stress: Density Placebo vs Generic Clock#

24.1 Run integrity and ladder#

24.2 Sensor readout: raw-gap Markov2 boundary still holds#

24.3 Generator carrier: loglog clock wins raw gap across splits#

24.4 Density/clock placebo matrix#

24.5 Candidate law sheets#

24.6 Conclusion and next step#

Appendix A – E1 Result Tables#

Appendix B – Interpretation Ladder#

Appendix C – Working Language Rules#

Appendix D – Methodology & Reproducibility#

D1. E1 Shuffle Protocol#

10.2 Final raw `gap` summary#