Abstract#
This working paper reframes the previous speculative Riemann Hypothesis brainstorming note into a smaller, testable research program. The empirical question is: Do ordered prime gaps contain measurable compression structure beyond their marginal distribution?
Using the first 20,000 and 100,000 primes, we first compared LZ76 complexity of real prime-gap order against shuffled controls preserving the same multiset. At 100,000 primes, the raw gap encoding produced delta −167.96 and mean z-score −32.38 against shuffled controls.
The final E2/SEA batch now extends the test ladder to stronger controls and larger ranges. It includes 15 runs, 60 summary rows, and 1,960 window rows, covering Markov-preserving, wheel-aware, block-shuffle, and shuffle controls across 100k, 500k, 1M, and 2M prime ranges.
Final SEA finding: the broad E1 shuffle signal is partly explained by local/arithmetic structure, but the raw gap signal survives wheel-aware and Markov-preserving controls from 100k through 2M primes. The 500k bridge is positive, the 1M shuffle baseline is now present, and the strongest line remains the 2M Markov-preserving raw gap test: delta −142.34, z −8.76, with 50/50 windows negative (100 trials at 2M; limited p-value resolution; see Section 10.5).
The v0.4 generator-floor run now tests the next question: can any compact generator beat the Markov2 baseline on holdout? The answer is empirically yes at 1M scale. The run completed 1,620 window tasks and 72 generator tasks. At the sensor/control layer, raw gap still fails against Markov2. At the generator layer, however, density_markov2 wins raw gap/gap_div2, and all 9/9 tested encodings have a non-Markov2 winner beating the Markov2 holdout floor.
The v0.7 carrier-stress ladder now tests the sharper placebo question opened by v0.6: is the raw-gap generator gain really prime-density information, or a more generic monotone scale/clock state? The answer is empirically sharper: at 1M, clock_loglog_markov2 wins raw gap in every split, with Δholdout vs Markov2 −8,784.21 on the main split. density_wrong_scale_markov2 and density-DCC variants remain strong, but shuffled/lagged/reversed density labels cluster closely enough that exact prime-coordinate density is not isolated as the unique cause.
This does not prove RH. It shows a robust, order-sensitive prime-gap compression signal, a repeated empirical breach of the Markov2 generator floor, and now a stronger negative result against pure-density interpretation. The next phase is a corrected law-sheet build plus 2M/5M replication of the top carrier laws.
1. Purpose#
The original brainstorming paper mixed three levels: empirical observations, reasoned interpretations, and metaphysical speculation. This version separates them. The empirical core is now simple: prime gaps are treated as a sequence, the real order is compared against null models, and MDL/LZ76 decides whether additional structure exists.
2. Background: Why Prime Gaps?#
Prime gaps \(g_n = p_{n+1} - p_n\) are not independent. Their distribution changes with scale, and primes obey arithmetic constraints. The sharper question is: After preserving basic statistics, does the ordered sequence still compress better than appropriate controls? That is exactly the kind of question the 8Z/MDL/DCC program is built to ask.
3. DCC / Edge-of-Chaos Motivation#
DCC balances between seizure (excessive order) and noise (excessive disorder). The original speculative analogy placed Re(s)=1/2 as the edge-of-chaos line. This remains a conceptual analogy, not a proof. The empirical test only asks whether prime gaps show measurable structure under compression tools.
4. Important Boundary: Not an RH Proof#
This paper makes no claim to prove the Riemann Hypothesis. It only asserts that ordered prime gaps are more compressible than shuffled controls preserving the same gap distribution. Any bridge to RH remains speculative future motivation.
5. E1 Experiment: Prime-Gap LZ76 Against Shuffled Controls#
We generated the first N primes, computed gaps, and applied several encodings (gap, delta, abs_delta, mod6, bucket_log2). Each real ordered window was compared against shuffled controls. The main statistic is \(\Delta_{LZ} = LZ76(real) - mean(LZ76(control))\); a negative value indicates the real sequence is more compressible.
6. E1 Results#
6.1 First run: 20,000 primes#
Configuration: 20,000 primes, 30 trials, window 5,000. Summary:
| Encoding | Real LZ76 | Shuffled LZ76 | Delta | Z-score | Interpretation |
|---|---|---|---|---|---|
| gap | 1486.75 | 1579.31 | −92.56 | −22.85 | strong |
| gap_div2 | 1486.75 | 1579.14 | −92.39 | −23.86 | strong |
| delta | 1753.00 | 1820.93 | −67.93 | −16.72 | strong, but needs special control |
| abs_delta | 1520.00 | 1537.62 | −17.62 | −4.44 | moderate/strong |
| mod6 | 714.75 | 882.10 | −167.35 | −88.43 | sanity check; partly expected |
| bucket_log2 | 1045.25 | 1054.22 | −8.97 | −3.92 | weak but present |
6.2 Stronger run: 100,000 primes#
Configuration: 100,000 primes, 50 trials, window 10,000. Summary:
| Encoding | Real LZ76 | Shuffled LZ76 | Delta | Z-score | Interpretation |
|---|---|---|---|---|---|
| gap | 2898.80 | 3066.76 | −167.96 | −32.38 | very strong |
| gap_div2 | 2898.80 | 3067.18 | −168.38 | −32.83 | very strong |
| delta | 3395.10 | 3532.89 | −137.79 | −23.90 | very strong, needs control |
| abs_delta | 2959.50 | 3002.05 | −42.55 | −8.25 | strong |
| mod6 | 1281.40 | 1603.32 | −321.92 | −127.43 | sanity check |
| bucket_log2 | 1974.80 | 1984.15 | −9.35 | −3.01 | weak but present |
Raw prime gaps are about 5.5% more compressible in real order than in shuffled order.
7. Interpretation of E1#
Supported: The order of prime gaps carries structure beyond marginal distribution. Not yet shown: that it survives Markov-preserving controls, that it connects to zeta zeros, or that AC/Zero Framework is correct. mod6 is a sanity check; delta needs caution due to lag-1 artifacts.
8. E2: Stronger Null Controls#
- C0 Full Shuffle – already done. Distribution alone does not explain compressibility.
- C1 Block Shuffle – preserve local order; test long-range structure.
- C2 Wheel-Aware – preserve modular/residue constraints; test if signal is just residue arithmetic.
- C3 Markov-Preserving – preserve first-order transition behavior; the key hard control.
- C4 Cramér-Like – random-prime heuristic synthetic gaps.
- C5 Depth/Scaling – early vs. late windows; see if signal grows.
9. E2 Acceptance Criteria#
Minimal success: raw gap outperforms full shuffle, block shuffle, and wheel-aware. Strong success: outperforms Markov surrogates. Very strong: survives all controls and shows scaling growth.
10. Final E2 / SEA Results: Stronger Controls and Scaling#
The E2/SEA batch tested whether the initial shuffle signal survives harder null models and larger prime ranges. SEA means Scaling Experiment Arena. The final uploaded batch is complete: 15 runs, 60 summary rows, and 1,960 window rows.
The final SEA claim is no longer merely “prime gaps beat shuffle.” It is narrower and stronger: raw prime-gap order contains compression-detectable sequential structure not fully explained by marginal gap distribution, mod-6 wheel structure, or first-order Markov transitions, with positive scaling evidence through 2M primes.
10.1 Completed SEA final runs#
The final SEA package includes all planned continuation runs:
- 100k: Markov t2000, wheel6 t2000, block50 t1000, block100 t1000
- 500k: Markov t700, wheel6 t700, block50 t700, block100 t700
- 1M: shuffle t300, wheel6 t300, Markov t300, block50 t300, block100 t300
- 2M: Markov t100, wheel6 t100
This completes the missing 100k wheel replication, the 500k bridge, and the 1M shuffle baseline that were absent from the partial version.
10.2 Final raw gap summary#
| Run | Control | Delta | Z-score | Window result | Median p | Reading |
|---|---|---|---|---|---|---|
| 100k Markov t2000 | markov | −32.64 | −3.69 | 10/10 negative | 0.00175 | survives first-order transitions |
| 100k wheel6 t2000 | wheel6 | −19.83 | −3.78 | 10/10 negative | 0.00075 | survives mod-6 wheel |
| 100k block50 t1000 | block50 | −14.08 | −2.58 | 10/10 negative | 0.01499 | survives local block shuffle |
| 100k block100 t1000 | block100 | −8.70 | −1.63 | 10/10 negative | 0.09091 | marginal / local-scale boundary |
| 500k Markov t700 | markov | −65.13 | −5.41 | 25/25 negative | 0.00143 | survives first-order transitions |
| 500k wheel6 t700 | wheel6 | −33.45 | −4.75 | 25/25 negative | 0.00143 | survives mod-6 wheel |
| 500k block50 t700 | block50 | −21.95 | −3.04 | 25/25 negative | 0.00428 | survives local block shuffle |
| 500k block100 t700 | block100 | −11.48 | −1.64 | 25/25 negative | 0.08845 | marginal / local-scale boundary |
| 1M shuffle t300 | shuffle | −301.62 | −41.62 | 50/50 negative | 0.00332 | E1-style baseline at 1M |
| 1M wheel6 t300 | wheel6 | −31.02 | −4.39 | 50/50 negative | 0.00332 | survives mod-6 wheel |
| 1M Markov t300 | markov | −58.26 | −4.84 | 50/50 negative | 0.00332 | survives first-order transitions |
| 1M block50 t300 | block50 | −21.10 | −2.88 | 50/50 negative | 0.00664 | survives local block shuffle |
| 1M block100 t300 | block100 | −11.13 | −1.55 | 49/50 negative | 0.09801 | marginal / local-scale boundary |
| 2M Markov t100 | markov | −142.34 | −8.76 | 50/50 negative | 0.00990 | survives first-order transitions |
| 2M wheel6 t100 | wheel6 | −64.42 | −7.01 | 50/50 negative | 0.00990 | survives mod-6 wheel |
10.3 Interpretation of the final SEA result#
Markov survival is now the central result. Markov-preserving controls keep first-order gap transitions, yet raw gap remains more compressible at every tested depth: 100k, 500k, 1M, and 2M.
| Depth | Delta | Z-score | Negative windows | Median p |
|---|---|---|---|---|
| 100k Markov t2000 | −32.64 | −3.69 | 10/10 | 0.00175 |
| 500k Markov t700 | −65.13 | −5.41 | 25/25 | 0.00143 |
| 1M Markov t300 | −58.26 | −4.84 | 50/50 | 0.00332 |
| 2M Markov t100 | −142.34 | −8.76 | 50/50 | 0.00990 |
Wheel survival is also confirmed. The raw signal survives mod-6-aware controls at 100k, 500k, 1M, and 2M. This means the signal is not merely the trivial 6k ± 1 residue structure.
| Depth | Delta | Z-score | Negative windows | Median p |
|---|---|---|---|---|
| 100k wheel6 t2000 | −19.83 | −3.78 | 10/10 | 0.00075 |
| 500k wheel6 t700 | −33.45 | −4.75 | 25/25 | 0.00143 |
| 1M wheel6 t300 | −31.02 | −4.39 | 50/50 | 0.00332 |
| 2M wheel6 t100 | −64.42 | −7.01 | 50/50 | 0.00990 |
Block controls narrow the scale claim. Block50 remains stable from 100k through 1M. Block100 is consistently weaker and mostly marginal. This suggests that a meaningful part of the signal lives at local-to-medium sequential scales, while the Markov/wheel survival shows the signal is not exhausted by the simplest local arithmetic explanations.
| Run | Delta | Z-score | Negative windows | Median p |
|---|---|---|---|---|
| 100k block50 t1000 | −14.08 | −2.58 | 10/10 | 0.01499 |
| 100k block100 t1000 | −8.70 | −1.63 | 10/10 | 0.09091 |
| 500k block50 t700 | −21.95 | −3.04 | 25/25 | 0.00428 |
| 500k block100 t700 | −11.48 | −1.64 | 25/25 | 0.08845 |
| 1M block50 t300 | −21.10 | −2.88 | 50/50 | 0.00664 |
| 1M block100 t300 | −11.13 | −1.55 | 49/50 | 0.09801 |
Sensor ranking is now clearer. Raw gap is the primary sensor. gap_div2 is effectively the same signal. abs_delta and bucket_log2 are diagnostic only: they weaken, flip, or become near-zero under harder controls.
10.4 Final E2 / SEA thesis#
Prime-gap order contains a surviving raw-gap compression signal that is not fully explained by gap distribution, simple wheel residue structure, or first-order Markov transition statistics. The effect is smaller than the initial shuffle signal but more meaningful, and the final SEA batch confirms survival through 100k, 500k, 1M, and 2M prime ranges.
10.5 Caution#
The 2M runs currently use only 100 trials, so the permutation p-value resolution is limited. The z-scores and window stability are more informative at this stage. The final result is strong enough to justify the next phase, but not strong enough to claim an RH proof or a final mathematical invariant.
11. Relation to 8Z/DCC Founding Hypothesis#
The first E2/SEA results support number theory as a credible ninth-domain candidate for the 8Z/DCC approach. The strongest current claim is not that DCC proves RH, but that 8Z/DCC compression tools detect a nontrivial order-sensitive signal in prime gaps that survives first-pass wheel-aware and Markov-preserving controls up to 2M primes.
11.1 Two-track strategy#
This project should not choose prematurely between “prove RH” and “build an independent MDLxDCC theory of prime structure.” Both tracks are useful and should run in parallel:
- Track A — RH-facing path: determine whether the discovered compression structure can be formalized into an invariant that implies RH, becomes equivalent to RH, or forbids off-critical-line zeros through a prime-counting or explicit-formula bridge.
- Track B — MDLxDCC-native path: develop a new information-theoretic account of prime distribution based on compression, multi-scale structure, and DCC-style invariants, valuable even if it does not directly prove RH.
These are not competing goals. RH is the gold-standard benchmark; the MDLxDCC invariant is the deeper search object.
12. Revised Position on Original RH/AC Hypothesis#
The critical line as edge-of-chaos remains speculative motivation only. Verified: E1 signal. Reasoned: compression advantage suggests order-sensitive structure. Speculative: AC/Zero Framework explanations are not evidence.
13. Mathematics, Reality, and the Zero Framework#
This section records the philosophical motivation behind the Zero Framework while keeping it separate from the empirical E1/E2 result. The central issue is not whether mathematics is "wrong." Formal mathematics can define many internally consistent worlds. The stricter question is:
When a formal object is used to describe reality or nature, what exactly is the mapping between the symbol and the thing being described?
In pure mathematics, definitions are allowed to be abstract. A structure can be studied because it is consistent, elegant, or fruitful. In applied mathematics and mathematical physics, however, a formal object earns physical meaning only through a disciplined bridge: what it measures, what it predicts, what it preserves, and where the mapping stops.
This distinction matters for the present paper because RH lives inside pure mathematics, while the 8Z/DCC framing asks a more reality-facing question: whether prime-gap dynamics expose an information structure that can be measured, compressed, and related to deeper arithmetic regularities. The empirical compression tests do not decide ontology. They only ask whether a measurable signal exists.
13.1 The two zeros#
Standard mathematics treats zero as a single formal object: the additive identity, the result of \(x - x\), and the value attained when a function vanishes. The Zero Framework proposes that ordinary language often mixes two different meanings under the same word:
- Numeric zero \(0\): a position or value inside a formal system. It exists within a structure of relations.
- Ontological nothingness \(\emptyset\): absence of existence itself. Not a point, not a value, and not reached by ordinary arithmetic operations.
On this view, a function that equals zero at some point has not "become nothing." It has reached a structured value. A zeta zero is therefore better described as an exact cancellation or destructive-interference point, not an annihilation into non-being.
13.2 Analytic continuation as ontological smuggling#
Analytic continuation is an internally valid and powerful mathematical operation. The concern here is not its consistency. The concern is what happens when a result obtained by extension is presented under the same notation as the original object — without making the substitution explicit.
The Dirichlet series \(\zeta(s) = \sum_{n=1}^{\infty} n^{-s}\) converges for \(\text{Re}(s) > 1\). For \(s = 1\) the harmonic series diverges. For \(s = -1\) the sum \(1 + 2 + 3 + \cdots\) diverges to infinity. Yet analytic continuation assigns \(\zeta(-1) = -\frac{1}{12}\). This is a correct statement about the analytically continued function. But the informal phrase \(1 + 2 + 3 + \cdots = -\frac{1}{12}\) silently substitutes one object for another under the same notation. The two agree where the original sum converges. They are not the same thing where it does not.
The ontological import is this: when the output of an extended object is presented as if it were the output of the original process, the interpretation can carry a hidden metaphysical claim — that the "true value" of a divergent process exists and is reachable by algebraic extension — without stating that claim explicitly or subjecting it to scrutiny. Whether this affects internal consistency is no. Whether it affects what mathematics describes about reality is an open question. This is what we call ontological smuggling: not an error in the proof, but an unacknowledged step in the interpretation.
13.3 Axiom-level vs. interpretation-level critique#
There are two versions of the Zero Framework critique, and they should not be confused:
- Interpretation-level: Current mathematics is largely correct, but some language misleads. A "zero of a function" should not be described as ontological nothingness. A divergent series assigned a finite value through continuation or regularization should be presented with that context intact. This is already partially implemented in Section 14.
- Axiom-level: The identification of numeric zero with ontological nothingness is not just a language habit — it may be embedded in the foundations. A mathematics built on a clean distinction between \(0\) and \(\emptyset\) from the start might produce a different structure, one where certain currently valid moves require explicit ontological justification before proceeding. This remains a research program, not a theorem.
The historical analogy is Lobachevsky. For two millennia, Euclid's parallel postulate appeared self-evident — not a choice but a necessity. Replacing it with a different axiom produced a geometry that was equally consistent and turned out to be physically more accurate than the original. The Zero Framework at the axiom level is asking whether the identification \(0 \equiv \text{"nothing"}\) is a parallel postulate of arithmetic: a convenience that has been mistaken for a necessity. The interpretation-level critique is conservative and already useful. The axiom-level critique is speculative but not without historical precedent.
13.4 Implication for RH specifically#
The implication for RH is indirect. RH is a formal statement about the zeros of the analytically continued zeta function. The Zero Framework does not currently change that statement, prove it, or disprove it.
What it may do is sharpen the language around what a zero means. If a zeta zero is treated as a structured cancellation point rather than "nothingness," then the research question becomes cleaner:
What structure forces all nontrivial cancellation points of the zeta function onto the critical line?
That question is still mathematical. The 8Z/DCC contribution, if any, would be to search for measurable information-structure invariants in prime-gap dynamics or prime-counting error that might later connect to the zeta-zero structure.
Current status: interpretation-level critique useful; axiom-level critique unformalized; RH implication speculative. This remains L8 in the interpretation ladder unless a formal invariant is discovered.
14. Corrected Language About Zeta Zeros#
Zeta zeros are points of exact cancellation or destructive interference in the analytic structure of the zeta function, not annihilation into ontological nothingness. This correction is an interpretation-level clarification: a function attaining the value zero is not the same as a quantity ceasing to exist.
15. Current Working Thesis#
Ordered prime gaps contain measurable compression structure beyond their marginal distribution. After the final E2/SEA batch, the surviving raw-gap signal also exceeds wheel-aware and first-order Markov-preserving controls from 100k through 2M primes, while block tests show that much of the broader signal remains local-to-medium scale.
The strongest current thesis is narrower than the original speculation but stronger than the initial shuffle-only result. v0.6 showed that the Markov2 floor can be beaten on holdout and that the carrier is scale-conditioned transition structure rather than a clean one-term density explanation. v0.7 sharpens this further: a generic loglog position-clock conditioned Markov2 law beats density variants on raw gap across the tested split ladder. The longer-term target is a formal information-theoretic invariant of prime distribution, not merely another route to the existing RH statement.
Operationally, the project now runs on three tracks: Track A seeks an RH-facing formal bridge; Track B seeks a native MDLxDCC invariant describing prime-gap structure directly; Track C isolates the generator carrier that can beat Markov2 without leakage or split artifacts.
16. Next Experiments#
The v0.7 ladder is now complete through 1M. The next work should not simply widen the arena. The current bottleneck is formal clarity: distinguish generic scale-clock laws from true prime-density information, then ask whether the surviving law has a clean mathematical description.
- v0.7.1 hygiene: fix the law-sheet split-name issue where
rolling1/rolling2/rolling3are all reported asmissing_splits=rolling. The generator results are usable; the law-sheet failure label is over-penalized. - Top carrier replication: rerun only the top raw-gap carriers —
clock_loglog_markov2,clock_log_markov2,density_wrong_scale_markov2,density_dcc_no_motif,density_markov2,density_shuffled_markov2, andclock_dcc_no_motif— at 2M or 5M. - Prime-specific density test: add stricter placebos: rank-preserving random monotone clocks, permuted density blocks, shifted-log clocks, and density residuals orthogonalized against
loglogclock. - Invariant extraction: treat
clock_loglog_markov2as the first raw-gap law-sheet candidate andgap_mod6 variable_markovas the residue-memory candidate; attempt compact symbolic descriptions of both. - Formal bridge: only after carrier replication, connect the candidate residuals to prime-counting error, explicit-formula residuals, or a new RH-adjacent criterion.
17. Preliminary Conclusion#
E1 delivered a clear shuffle signal. E2 narrowed and strengthened the result. The final SEA batch now confirms that the broad shuffle advantage is partly local/arithmetic, but the raw gap signal survives wheel-aware and Markov-preserving controls at 100k, 500k, 1M, and 2M.
The current conclusion is therefore:
The prime-gap compression signal is real, smaller under harder controls, and still alive where it matters most: raw gap order survives beyond marginal distribution, simple wheel structure, and first-order transition statistics. The completed SEA run adds a 500k bridge, a 1M shuffle baseline, and stronger scale evidence through 2M primes.
The most stable current signal is not only the aggregate z-score, but the window-level consistency: Markov and wheel controls show negative raw-gap deltas in every tested window at 100k, 500k, 1M, and 2M. The only near-boundary result in the key raw-gap family is block100 at 1M, with 49/50 negative windows and a marginal z-score.
This is not RH evidence yet, but it is a legitimate 8Z/DCC number-theory signal candidate. The deeper goal is to discover a formal invariant that explains prime-distribution structure; RH is the benchmark, not the only possible prize.
18. Three-Track Roadmap: RH, MDLxDCC Invariant, Generator Arena#
The current compression experiments cannot prove the Riemann Hypothesis by themselves. They are empirical signal tests. A proof would require a formal mathematical invariant or theorem that applies to all relevant cases, not only to tested prime ranges.
18.1 Three live tracks#
The roadmap now has three live tracks:
| Track | Name | Target | Status |
|---|---|---|---|
| A | RH-facing formal bridge | Use the empirical signal to search for a formal invariant connected to prime-counting error, zeta-zero structure, explicit formulas, or an RH-equivalent criterion. | Future formal work; not claimed by E1/E2/SEA/Arena. |
| B | MDLxDCC-native prime-structure invariant | Build a new information-theoretic description of prime-gap structure where RH is a benchmark, not the only definition of success. | Motivated by surviving compression signal. |
| C | Inverse generator arena | Search over compact generators, grammars, automata, dictionaries, and multi-scale programs that reproduce prime-gap structure better than null models. | v0.4 achieved the first 1M Markov2-floor breach on generator holdout; next target is scaling, ablation, and formalization. |
This distinction matters. A successful MDLxDCC invariant could be valuable even before it proves RH, and a generator that survives holdout and scaling could become the first concrete object from which a stronger invariant is extracted.
The higher target is not merely “prove RH by compression.” The higher target is to discover a formal invariant or compact generator of prime-distribution structure. RH then becomes one benchmark such an invariant may imply, explain, or sit beside.
18.2 The required bridge#
A possible proof path would need the following form:
- Use 8Z/DCC/MDL tools to discover a stable invariant in prime-gap dynamics, prime-counting error, or another RH-adjacent arithmetic trace.
- Formalize that invariant as a precise mathematical statement.
- Prove that the invariant holds for all relevant values, not merely for tested ranges.
- Show that the invariant implies RH, is equivalent to RH, strengthens an RH-style distribution bound, or explains a structure that RH does not directly describe.
In short:
empirical signal → generator candidate → formal invariant → distribution theorem / RH implication / new criterion
18.3 Why not just prove RH?#
RH matters because it is one of the cleanest known formulations of prime-distribution regularity. It would strongly constrain the error term in the prime number theorem and would affect many related results in analytic number theory.
But a new 8Z/DCC invariant could be valuable even before it proves RH, and potentially more valuable if it explains more than RH alone. For example, it could:
- give a new formal description of structure in prime gaps,
- produce a bound on prime-counting error comparable to an RH-style bound,
- explain why the distribution is stable rather than merely state that it is,
- generalize to other L-functions, arithmetic traces, or structured sequences,
- or discover a new RH-equivalent criterion from an information-theoretic direction.
Thus RH is not the only possible victory condition. It is the gold-standard benchmark. The deeper goal is a new invariant that explains prime-distribution structure in a way that can later be connected to RH or to a stronger distribution theorem.
18.4 Prime-counting error route#
One possible route is through prime-counting error. RH is deeply connected to how tightly prime-counting functions stay near their expected main terms. A compression-based approach would need to discover a bound or regularity in the prime-gap trace that implies an appropriate bound on prime-counting error.
A rough proof-seed form would be:
If every prime-gap window has bounded normalized MDL excess, then prime-counting error remains within an RH-compatible bound.
This is not yet a theorem. It is a target shape for future formalization.
18.5 Explicit-formula route#
The most RH-adjacent route would connect compression structure in prime gaps to the oscillatory terms generated by zeta zeros in explicit formulas. In that direction, the desired contradiction would look like this:
- Assume an off-critical-line zero exists.
- Show that such a zero forces a detectable oscillatory or compressible signature in prime-counting error or prime-gap dynamics.
- Show that a proven MDL/DCC invariant forbids that signature.
- Conclude that no off-critical-line zero can exist.
Symbolically:
off-line zero → forbidden oscillation/compression signature → contradiction → RH
18.6 Equivalent-criterion route#
Another route is discovery of a new RH-equivalent condition. 8Z/DCC could search for a new boundedness, monotonicity, spectral, or compression condition that appears empirically stable. The mathematical task would then be:
- prove the new condition is equivalent to RH or implies RH,
- then prove the new condition directly.
This may be more realistic than directly proving statements about zeta zeros from compression data.
18.7 Proof-roadmap phases#
| Phase | Goal | Status |
|---|---|---|
| 1 | Detect empirical signal in prime-gap order | completed first pass; E1 positive |
| 2 | Test signal against stronger null controls | final E2/SEA pass positive; deeper tests needed |
| 3 | Extract LZ76 dictionaries, motifs, and multi-scale MDL spectrum | started by RH Arena v0.1/v0.2b |
| 4 | Run inverse generator benchmark with holdout validation | started; Markov baseline currently strongest |
| 5 | Repair controls and scoring after 1M Arena diagnosis | v0.3 running |
| 6 | Relate surviving generator/invariant to prime-counting error or explicit-formula terms | planned |
| 7 | Extract candidate invariant | future |
| 8 | Prove invariant and distribution/RH implication where possible | future |
| 9 | Formal verification where possible | future |
18.8 Current position#
The current strongest honest formulation is:
8Z/DCC may help discover a new RH-adjacent regularity by treating prime gaps as a multi-scale information trace and searching for stable compression or generator invariants that bound or constrain prime-distribution error. RH is the benchmark; the deeper prize is the invariant. The project should pursue all three tracks: RH-facing proof bridge, MDLxDCC-native structure invariant, and inverse generator arena.
This keeps the door open without pretending that E1, E2, SEA, or the current RH Arena runs can prove RH by themselves.
19. Inverse Generator Hypothesis#
RH gives a constraint on prime-distribution error. MDLxDCC asks the inverse question:
Can we discover a compact generator, grammar, automaton, dictionary, or multi-scale program that reproduces prime-gap structure better than shuffle, wheel, Markov, or Cramér-like null models?
This is not necessarily a search for a one-line formula for primes. The target is the shortest useful generator of prime-gap structure: an algorithmic or MDL formula that explains more of the real trace than competing null models.
If such a generator survives holdout and scaling, it becomes an invariant candidate. If the invariant later constrains prime-counting error, explicit-formula residuals, or zero-spacing structure, it may become an RH bridge — or a separate MDLxDCC-native path to prime-distribution theory.
19.1 Arena flow#
trace → encoding → sensors → controls → generators → MDL score → holdout validation → invariant candidate
The critical change is that controls are no longer only adversaries. They become baselines that generators must beat. A generator that cannot beat Markov, wheel, or Cramér baselines is not yet an invariant candidate, even if it produces visually plausible prime-gap structure.
19.2 Generator classes#
| Class | Role | Interpretation |
|---|---|---|
| G0 shuffle | frequency baseline | Destroys order; useful but weak. |
| G1 wheel-only | arithmetic residue baseline | Captures simple modular constraints. |
| G2/G3 Markov | transition baseline | Tests whether signal is mostly local first/second-order dynamics. |
| G4 motif grammar | dictionary/motif baseline | Tests recurring gap-token phrases. |
| G5 LZ dictionary | compressive replay baseline | Tests whether dictionary fragments can generate realistic structure. |
| G6 local-density Cramér | density trend baseline | Tests whether changing prime density explains the trace. |
| G7 hybrid DCC | candidate engine | Combines local density, motif grammar, Markov fallback, and wheel/density fallback under MDL scoring. |
19.3 Acceptance rule#
A generator becomes interesting only if it satisfies all of these:
- trained only on early data, evaluated on later holdout data,
- beats simpler controls under
total_mdl_score, not merely raw accuracy, - matches LZ76, entropy, motif spectrum, and next-gap predictive cost within acceptable error,
- survives scaling across 100k → 500k → 1M → 2M and beyond,
- produces a compact description that can be inspected mathematically.
In this language, the Arena does not “prove RH.” It searches for compact generator/invariant candidates. Proof would be a later certificate.
20. RH Arena v0.3 Results#
The completed v0.3 Arena run is the first cleaned-up version after the v0.2b diagnosis. It separates gap_mod6/gap_mod30 from prime_residue6/prime_residue30, uses wheel-aware Cramér controls with warmup, excludes the Cramér global window-0 artifact from summary aggregation, and scores generators using predictive bits over the holdout length.
20.1 Run integrity#
| Field | Value |
|---|---|
| Script | 8z-rh-arena-v0.3 |
| Prime range | 1,000,000 primes; last prime 15,485,863; 999,999 gaps |
| Train / test | 500,000 / 1,000,000 primes |
| Window / stride | 50,000 / 50,000 |
| Trials | 400 per control task; p-value floor ≈ 0.00249 |
| Workers | 10 requested / 10 used |
| Completed work | 2,200 window tasks; 88 generator tasks; 30,491 motif rows; 16,892 invariant-candidate rows |
| Elapsed | 52,970 seconds, about 14.7 hours |
The run completed cleanly with windows.done.json, generators.done.json, and run.done.json. The result is usable as a v0.3 baseline.
20.2 Core sensor readout#
The core raw-gap signal remains real under most controls, but Markov2 is still the hard boundary.
| Trace | Control | Mean ΔLZ | Mean z | Window stability | Readout |
|---|---|---|---|---|---|
gap | shuffle | −785.94 | −77.49 | 20/20 negative | Strong order signal. |
gap | block10 | −241.47 | −21.32 | 20/20 negative | Survives short-block preservation. |
gap | markov1 | −204.54 | −11.35 | 20/20 negative | Survives first-order transition baseline. |
gap | wheel6 | −86.68 | −8.66 | 20/20 negative | Survives simple wheel-aware control. |
gap | local_cramer | −165.82 | −7.69 | 20/20 negative | Survives wheel-aware local-density surrogate. |
gap | cramer_global | −124.69 | −5.80 | 19/19 negative | Survives after excluding window-0 artifact. |
gap | block50 | −57.68 | −5.54 | 20/20 negative | Still stable. |
gap | block100 | −29.86 | −2.92 | 20/20 negative | Weaker but present. |
gap | wheel30 | −8.86 | −0.88 | 16/20 negative | Mostly absorbed by stronger wheel structure. |
gap | markov2 | +44.71 | +2.59 | 1/20 negative | Signal killed/reversed; current boundary. |
gap_div2 repeats the same picture almost exactly. This confirms that the core raw-gap signal is not a token-scaling artifact.
20.3 Residue and Cramér readout#
The v0.3 split clarified the old residue confusion. gap_mod6 and gap_mod30 contain strong, stable gap-residue motif structure. prime_residue6 and prime_residue30 are now explicit prime-residue traces and should be treated as wheel-state diagnostics, not as raw evidence for a new invariant.
| Trace | Control | Mean z | Readout |
|---|---|---|---|
gap_mod6 | markov2 | −29.53 | Gap-residue structure survives second-order local baseline. |
gap_mod30 | markov2 | −2.44 | Weaker but still negative in 20/20 windows. |
prime_residue6 | markov2 | −0.15 | Mostly absorbed; useful as wheel-state diagnostic. |
prime_residue30 | markov2 | +1.02 | Mostly absorbed/reversed; not a primary invariant signal. |
gap_mod6 | local_cramer | +14.20 | Cramér surrogate over-explains or mismatches this residue view; read cautiously. |
gap_mod30 | local_cramer | +14.93 | Same caution. |
The earlier huge residue/Cramér readout is no longer treated as a discovery by itself. It is now a diagnostic showing where surrogate design and residue-state modeling still need care.
20.4 Generator leaderboard#
The generator arena changed meaningfully after predictive-bits scoring. For raw gaps, markov2 now beats markov1, and hybrid_dcc moves to second place but still does not beat the Markov2 baseline.
| Encoding | Winner | Second | Third | Readout |
|---|---|---|---|---|
gap | markov2 | hybrid_dcc | markov1 | Markov2 is the current raw-gap baseline to beat. |
gap_div2 | markov2 | hybrid_dcc | markov1 | Same as raw gap. |
abs_delta | markov2 | hybrid_dcc | markov1 | Second-order transition structure dominates. |
gap_mod6 | markov2 | hybrid_dcc | markov1 | Strong residue-state transition structure. |
gap_mod30 | markov2 | hybrid_dcc | markov1 | Same hierarchy at finer wheel scale. |
prime_residue6 | markov2 | hybrid_dcc | markov1 | Mostly wheel-state transition modeling. |
prime_residue30 | markov2 | hybrid_dcc | markov1 | Mostly wheel-state transition modeling. |
cumulative_error_walk | markov1 | motif_grammar | lz_dictionary | Still interesting, but not yet a clean invariant. |
normalized_gap | local_cramer | shuffle | wheel_only | Density trend dominates this encoding. |
gap_minus_logp | markov1 | motif_grammar | lz_dictionary | Needs better density-corrected generator design. |
Current generator verdict: the Arena has not yet found a DCC/motif generator that beats Markov2 on raw gaps. That is not a failure. It is the clean next target.
20.5 Invariant candidates#
The v0.3 run emitted invariant_candidates.csv. The strongest stable candidates are not yet raw-gap formulas. They are mostly gap-residue and coarse gap-class motif families.
| Encoding | Candidate motif | Controls survived | Window support | Min z | Readout |
|---|---|---|---|---|---|
gap_mod6 | 2,0,4 | 9 | 20 | 35.09 | Very strong residue motif family. |
gap_mod6 | 4,0,2 | 7 | 20 | 35.85 | Companion residue motif. |
gap_mod6 | 2,4,2 | 6 | 20 | 37.51 | Stable alternating residue motif. |
gap_mod30 | 4,6,14 | 6 | 14 | 9.67 | Finer wheel-residue candidate. |
gap_mod30 | 10,2,10 | 5 | 17 | 8.61 | Stable finer gap-residue motif. |
bucket_log2 | 4,1,4 | 7 | 20 | 5.03 | Coarse size-class candidate. |
gap | 2,12,10 | 4 | 10 | 2.30 | Weak raw-gap candidate; not invariant-level yet. |
This is the right shape of result for an early inverse-generator arena: it does not magically produce an RH invariant, but it tells us where the stable motifs live. The strongest motifs are currently residue/gap-class structures. Raw integer gap motifs are weaker and need deeper filtering.
20.6 Conclusion and next step#
The v0.3 result sharpens the project:
- Confirmed: raw
gap/gap_div2LZ structure survives shuffle, block, wheel6, Markov1, and wheel-aware Cramér controls at 1M scale. - Boundary: Markov2 absorbs/reverses the raw-gap LZ advantage.
- Generator floor: Markov2 is now the baseline to beat for raw gaps.
- DCC status:
hybrid_dccimproved to second place for many encodings, but it still does not beat Markov2. - Candidate status: strongest motif families are gap-residue/coarse-class motifs, not a raw-gap invariant yet.
v0.3 does not close the RH problem. It gives a cleaner target: build a generator or invariant that beats Markov2 on holdout while preserving the stable motif and residue-family structure discovered by the Arena.
The next code step should be a focused v0.4 generator arena: Markov2+DCC hybrids, variable-order Markov, motif-conditioned Markov, density-feedback states, and per-encoding generator leaderboards. The next paper step is to update this page after v0.4 shows whether any compact generator can actually beat the Markov2 floor.
21. RH Arena v0.4 Results: Markov2 Floor Attack#
The v0.4 Arena run executes the next step proposed by v0.3: stop asking only whether raw gaps survive controls, and ask whether a compact generator can beat the markov2 holdout floor. This section should be read with a strict split:
- Sensor/control layer: raw
gapstill hits the Markov2 boundary. - Generator/holdout layer: new density-conditioned and DCC-style generators now beat the Markov2 floor.
v0.4 does not remove the Markov2 boundary from the LZ control test. It crosses the Markov2 floor in the generator arena.
21.1 Run integrity#
| Field | Value |
|---|---|
| Script | 8z-rh-arena-v0.4 |
| Schema | rh_arena_schema_v0.4 |
| Prime range | 1,000,000 primes; last prime 15,485,863; 999,999 gaps |
| Train / test | 500,000 / 1,000,000 primes |
| Window / stride | 50,000 / 50,000 |
| Trials | 400 per control task; p-value floor ≈ 0.00249 |
| Workers | 10 requested / 10 used |
| Encodings | gap, gap_div2, abs_delta, bucket_log2, normalized_gap, gap_minus_logp, cumulative_error_walk, gap_mod6, gap_mod30 |
| Control modes | shuffle, block50, block100, wheel6, wheel30, markov1, markov2, local_cramer, cramer_global |
| Generator modes | markov2, markov3, markov4, variable_markov, motif_markov2, density_markov2, markov2_dcc, density_dcc |
| Completed work | 1,620 window tasks; 72 generator tasks; 10,696 motif rows; 5,826 invariant-candidate rows |
| Elapsed | 31,280.007 seconds, about 8 h 41 min 20 s |
The run completed cleanly with windows.done.json, generators.done.json, and run.done.json. No completed tasks came from checkpoint; this was a fresh complete run.
21.2 Sensor readout: Markov2 still kills raw-gap LZ advantage#
The raw-gap sensor picture remains almost exactly the v0.3 lesson. gap survives many controls, including Markov1 and wheel-aware Cramér, but not Markov2.
| Control | Mean ΔLZ | Mean z | Window stability | Median p | Readout |
|---|---|---|---|---|---|
shuffle | -785.94 | -77.49 | 20/20 negative | 0.00249 | Strong order signal; same broad baseline as before. |
markov1 | -204.54 | -11.35 | 20/20 negative | 0.00249 | Survives first-order transition baseline. |
wheel6 | -86.68 | -8.66 | 20/20 negative | 0.00249 | Survives simple mod-6 wheel-aware control. |
local_cramer | -165.82 | -7.69 | 20/20 negative | 0.00249 | Survives wheel-aware local-density surrogate. |
cramer_global | -124.69 | -5.80 | 19/19 negative | 0.00249 | Survives after excluding the known window-0 artifact. |
block50 | -57.68 | -5.54 | 20/20 negative | 0.00249 | Stable local/medium-scale signal. |
block100 | -29.86 | -2.92 | 20/20 negative | 0.00249 | Weaker but still negative in all windows. |
wheel30 | -8.86 | -0.88 | 16/20 negative | 0.15835 | Mostly absorbed by stronger wheel structure. |
markov2 | 44.71 | +2.59 | 1/20 negative | 0.99875 | Signal killed/reversed; sensor-level boundary. |
gap_div2 repeats the same pattern: Markov2 is again positive/reversed, while shuffle, Markov1, wheel6, block, and Cramér controls remain negative. The important interpretation is that the LZ sensor no longer says “raw gaps beat everything.” It says “raw gaps beat first-order/local/wheel/Cramér controls, but second-order transition memory is enough to absorb this sensor.”
21.3 Generator floor: Markov2 is beaten on holdout#
The generator result is the new signal. The v0.4 challengers were designed specifically to attack the Markov2 floor. They succeeded across all tested encodings.
| Encoding | Winner | Runner-up | Δ holdout vs Markov2 | Δ bits/token vs Markov2 | Winner margin |
|---|---|---|---|---|---|
gap | density_markov2 | density_dcc | -7,953.28 | -0.015946 | 4,297.46 |
gap_div2 | density_markov2 | density_dcc | -7,954.26 | -0.015946 | 4,298.46 |
abs_delta | density_markov2 | density_dcc | -7,262.93 | -0.014544 | 5,486.79 |
bucket_log2 | density_markov2 | markov2 | -1,329.38 | -0.002755 | 1,329.38 |
normalized_gap | markov2_dcc | density_dcc | -1,568.44 | -0.000723 | 43.88 |
gap_minus_logp | density_dcc | markov2_dcc | -372,137.19 | -0.743387 | 2,470.89 |
cumulative_error_walk | density_dcc | markov2_dcc | -112,715.09 | -0.225453 | 14,854.76 |
gap_mod6 | variable_markov | markov4 | -50,185.35 | -0.100330 | 5,160.61 |
gap_mod30 | markov3 | density_markov2 | -40,125.27 | -0.080438 | 39,471.28 |
For raw gap, the first accepted breach is not a complicated motif grammar. The best generator is density_markov2: a Markov2 spine conditioned by local density state. The DCC hybrid is second, meaning DCC helps but is not yet the simplest winning explanation.
| Rank | Generator | Holdout score | Δ holdout vs Markov2 | Bits/token | Δ bits/token | Motif distance | LZ error |
|---|---|---|---|---|---|---|---|
| 1 | density_markov2 | 1,936,623.45 | -7,953.28 | 3.872776 | -0.015946 | 0.01028 | 0.02110 |
| 2 | density_dcc | 1,940,920.91 | -3,655.82 | 3.881239 | -0.007483 | 0.03494 | 0.03376 |
| 3 | markov2 | 1,944,576.73 | 0.00 | 3.888722 | 0.000000 | 0.01262 | 0.02565 |
| 4 | markov2_dcc | 1,947,542.12 | 2,965.39 | 3.894477 | 0.005755 | 0.05036 | 0.04351 |
| 5 | motif_markov2 | 1,965,111.90 | 20,535.17 | 3.929558 | 0.040836 | 0.07454 | 0.06210 |
v0.4 generator verdict: the Markov2 floor is no longer unbroken. The best current raw-gap explanation is Markov2 plus local density state, with DCC hybrids close behind but not yet dominant.
21.4 Invariant candidates: robust motifs still live mostly in residue space#
The invariant-candidate file contains 5,826 rows. Of these, 168 survive at least 6 controls, 39 survive at least 8 controls, and 22 survive all 9 controls. Among the all-control survivors, 21/22 are gap_mod6 motif families and one is bucket_log2.
| Encoding | Motif | Length | Controls | Window support sum | Min z | Median z | Median lift | Total count | Score |
|---|---|---|---|---|---|---|---|---|---|
gap_mod6 | 2,0,0,4,2 | 5 | 9 | 175 | 20.15 | 93.05 | 1.979 | 148,955 | 141.27 |
gap_mod6 | 4,2,0,0,4 | 5 | 9 | 171 | 19.43 | 89.34 | 1.956 | 143,686 | 136.21 |
gap_mod6 | 2,4,2,0,4 | 5 | 9 | 171 | 4.48 | 34.95 | 1.208 | 196,367 | 31.40 |
bucket_log2 | 3,3,4,1 | 4 | 9 | 91 | 3.94 | 5.53 | 1.186 | 13,199 | 27.60 |
gap_mod6 | 4,2,0,4 | 4 | 9 | 176 | 3.17 | 69.15 | 1.400 | 357,528 | 22.23 |
This confirms the v0.3 motif lesson at higher focus: the most stable candidate layer is still residue/gap-class structure, especially gap_mod6. Raw integer gap motifs exist, but they are weaker and do not yet look like the final invariant object.
21.5 Conclusion and next step#
- Confirmed: v0.4 completed the targeted Markov2-floor attack cleanly.
- Sensor boundary: raw
gap/gap_div2still lose or reverse against Markov2 in LZ-control testing. - Generator breakthrough: all 9 tested encodings now have a non-Markov2 generator winner beating the Markov2 holdout baseline.
- Raw-gap winner:
density_markov2, not full DCC, is the smallest current winner. - DCC status:
density_dccandmarkov2_dccare strong, especially on density-adjusted/error-walk encodings, but they still need ablation and scaling. - Invariant status: stable motifs remain mostly gap-residue/coarse-class candidates, not a theorem-ready raw-gap formula.
v0.4 changes the project state from “Markov2 is the generator floor to beat” to “Markov2 has been beaten empirically at 1M by density-conditioned generators; now prove the gain survives scaling, ablation, and formalization.”
The v0.5 smoke now changes the next code step: scale the whole ablation ladder, not only density_markov2/density_dcc. At smoke scale, density_markov1 is the raw-gap winner, variable_markov is the gap_mod6 winner, and DCC/motif terms remain candidates rather than confirmed carriers. The next paper step is to compress the stable gap_mod6 motif families and the raw-gap density-Markov gain into one smaller invariant candidate.
22. RH Arena v0.5 Smoke Results: Density Ablation and Split Stability#
The first v0.5 run is a smoke-scale ablation, not a replacement for the v0.4 1M result. Its value is that it cleanly separates the raw Markov2 boundary from the generator layer and begins isolating which ingredient carries the generator gain.
v0.5 smoke verdict: the raw-gap LZ sensor boundary is still Markov2, but the generator floor is beaten again. At 120k scale the raw-gap winner is
density_markov1, not a motif/DCC hybrid. This points to density-conditioned transition structure as the first ingredient to scale-test.
22.1 Run integrity#
| Field | Value |
|---|---|
| Script | 8z-rh-arena-v0.5 |
| Schema | rh_arena_schema_v0.5 |
| Run size | 120,000 primes; last prime 1,583,539; 119,999 gaps |
| Train / test | 60,000 / 120,000 primes |
| Windows | 20,000-token window; 20,000 stride; 6 windows per encoding/control |
| Trials | 50 sampled controls; p floor 0.019608 |
| Workers | 1 requested / 1 used |
| Completed work | 120 window tasks; 272 generator tasks; 544 generator replicates |
| Generator splits | main, early, mid, late |
| Density guard | safe mode; safe density generators do not use actual holdout prime coordinates; oracle diagnostic intentionally flagged |
22.2 Sensor readout: Markov2 still absorbs raw-gap LZ advantage#
The sensor layer remains consistent with the earlier v0.3/v0.4 story. Raw gap and gap_div2 are strongly more compressible than shuffle, wheel6, Markov1, and local Cramér controls, but not under Markov2. The gap_mod6 residue trace remains strongly structured even under Markov2.
| Encoding | Control | Mean ΔLZ | Mean z | Negative windows | Readout |
|---|---|---|---|---|---|
gap | shuffle | −343.81 | −54.76 | 6/6 | strong order signal |
gap | local_cramer | −151.40 | −9.81 | 6/6 | survives wheel-aware local density control |
gap | markov1 | −83.46 | −6.58 | 6/6 | survives first-order transition control |
gap | markov2 | +25.79 | +2.19 | 1/6 | raw-gap LZ advantage absorbed/reversed |
gap_div2 | markov2 | +26.32 | +2.20 | 1/6 | same boundary as raw gap |
gap_mod6 | markov2 | −73.37 | −14.58 | 6/6 | residue-state structure survives Markov2 |
normalized_gap | markov2 | +152.83 | +7.77 | 0/6 | density-adjusted trace is not a raw sensor win here |
22.3 Generator ablation: density_markov1 wins raw gap at smoke scale#
The generator layer is where v0.5 changes the working hypothesis. On raw gap/gap_div2, the best safe non-Markov2 generator is density_markov1 across all four tested splits. That is a useful simplification: at this scale, adding Markov order 2, motif injection, or full DCC does not improve the raw-gap winner.
| Split | Encoding | Winner | Δholdout vs Markov2 | Runner-up | Readout |
|---|---|---|---|---|---|
| main | gap | density_markov1 | −2,423.76 | density_wrong_scale_markov2 | density + first-order transition wins raw gap |
| early | gap | density_markov1 | −2,343.44 | density_wrong_scale_markov2 | same winner in early split |
| mid | gap | density_markov1 | −3,146.49 | density_only | density itself becomes strong |
| late | gap | density_markov1 | −4,150.22 | density_only | density signal strengthens later |
| main | gap_div2 | density_markov1 | −2,424.96 | density_wrong_scale_markov2 | same as raw gap |
| main | gap_mod6 | variable_markov | −5,853.89 | markov3 | residue trace prefers variable-order memory |
| main | normalized_gap | dcc_only | −39,430.74 | density_dcc_full | read as density-adjusted diagnostic, not raw-gap invariant |
The leakage guard is also behaving as intended: all safe density generators are marked non-leaking, while density_oracle_markov2 is deliberately flagged as using actual test-prime coordinates and should not be used as a public winner.
22.4 Conclusion and next step#
- Confirmed by smoke: v0.5 runs cleanly, resumes/writes all expected reports, and the leakage guard works.
- Sensor layer: raw-gap Markov2 boundary remains;
gap_mod6still carries strong residue-state structure. - Generator layer: Markov2 holdout floor is beaten again, but the raw-gap winner is simpler than expected:
density_markov1. - Ablation readout: density clock + low-order transition is the first ingredient to scale-test; motif/DCC interactions remain candidates, not winners, on raw gap at this smoke scale.
v0.5 changed the next test: do not only scale
density_markov2. Scale the whole ablation ladder —density_only,density_markov0,density_markov1,density_markov2, density-DCC variants, and variable Markov — and ask which ingredient survives at 1M, 2M, and beyond.
v0.6 update: that scaling test has now been run through 1M. The smoke winner density_markov1 did not remain the raw-gap winner; the scaled story is density/scale vs generic clock-DCC.
23. RH Arena v0.6 Carrier Isolation: Density vs Clock vs Residual#
The v0.6 run answers the question opened by v0.5: was the raw-gap generator gain really a density signal, or mostly a generic position/scale clock? The 4-worker sequence ran three stages — 150k smoke, 500k bridge, and 1M carrier isolation — with the optional 2M raw-carrier stage left off.
v0.6 verdict: the Markov2 generator floor is still beaten at 1M, but the carrier is not clean pure density. The full raw-gap split is won by
density_wrong_scale_markov2; the mid and late raw-gap splits are won byclock_dcc_no_motif. The best current description is scale-conditioned transition structure, with density and clock-DCC variants both active.
23.1 Run integrity and scale sequence#
| Stage | Primes | Train | Trials | Workers | Elapsed | Window tasks | Generator replicates | Guard failures |
|---|---|---|---|---|---|---|---|---|
| 150k smoke | 150,000 | 75,000 | 50 | 4/4 | 1.144 h | 120 | 800 | 0 |
| 500k bridge | 500,000 | 250,000 | 75 | 4/4 | 5.957 h | 360 | 1,800 | 0 |
| 1M carrier | 1,000,000 | 500,000 | 100 | 4/4 | 18.199 h | 840 | 3,000 | 0 |
The 1M carrier run used 1,000,000 primes, 500,000 train primes, 50,000-token windows, 100 sampled controls, and 4/4 workers. It completed 840 window tasks and 3,000 generator replicates. The leakage guard is clean: 0 failed guard replicates and no oracle-density diagnostics in the default v0.6 run.
23.2 Sensor readout: raw-gap Markov2 boundary remains#
The sensor layer remains consistent with v0.3–v0.5: raw gap is strongly more compact than shuffle, wheel6, Markov1, and Cramér-like controls, but Markov2 still absorbs or reverses the raw-gap LZ advantage.
1M raw gap control | Mean ΔLZ | Mean z | Negative fraction | Median p | Reading |
|---|---|---|---|---|---|
shuffle | -785.97 | -76.55 | 100.0% | 0.00990 | huge order signal |
wheel6 | -86.50 | -8.70 | 100.0% | 0.00990 | survives mod-6 wheel |
wheel30 | -8.61 | -0.84 | 75.0% | 0.19802 | weak / near local wheel boundary |
markov1 | -204.73 | -11.49 | 100.0% | 0.00990 | survives first-order transition |
markov2 | 44.97 | 2.60 | 5.0% | 1.00000 | absorbed/reversed boundary |
local_cramer | -165.53 | -7.62 | 100.0% | 0.00990 | survives local Cramér |
cramer_global | -124.34 | -5.81 | 100.0% | 0.00990 | survives global Cramér |
The stronger residue traces remain visible: gap_mod6 vs wheel30 has ΔLZ −1,476.84 and z −362.69; gap_mod6 vs Markov2 still has ΔLZ −198.95 and z −29.99. This keeps the residue-state track separate from the raw-gap generator track.
23.3 Generator carrier: density wins main, clock-DCC wins mid/late#
At 1M, non-Markov2 generators still beat the Markov2 holdout floor. The key result is the split diagnosis: density wins the full/main raw-gap split, while clock-DCC wins the mid and late raw-gap splits.
| Split | Winner | Δ vs Markov2 | Best clock | Best density | Best residual | Diagnosis |
|---|---|---|---|---|---|---|
| early | density_wrong_scale_markov2 | -4,057.07 | clock_dcc_no_motif | density_wrong_scale_markov2 | density_residual_dcc_no_motif | density_beats_generic_clock |
| main | density_wrong_scale_markov2 | -8,641.49 | clock_dcc_no_motif | density_wrong_scale_markov2 | density_residual_dcc_no_motif | density_beats_generic_clock |
| mid | clock_dcc_no_motif | -1,402.59 | clock_dcc_no_motif | density_dcc_no_motif | density_residual_dcc_no_motif | generic_clock_beats_density |
| late | clock_dcc_no_motif | -1,414.21 | clock_dcc_no_motif | density_dcc_no_motif | density_residual_dcc_no_motif | generic_clock_beats_density |
For the 1M main raw-gap split, the top generator leaderboard is:
| Rank | Generator | Holdout score | Δ holdout vs Markov2 | Δ predictive bits | Δ bits/token | Guard |
|---|---|---|---|---|---|---|
| 1 | density_wrong_scale_markov2 | 1,935,933.72 | -8,641.49 | -8,642.41 | -0.017285 | pass |
| 2 | density_dcc_no_motif | 1,935,952.28 | -8,622.92 | -8,667.85 | -0.017336 | pass |
| 3 | density_lagged_markov2 | 1,936,595.22 | -7,979.99 | -7,980.31 | -0.015961 | pass |
| 4 | density_markov2 | 1,936,595.39 | -7,979.81 | -7,980.31 | -0.015961 | pass |
| 5 | density_shuffled_markov2 | 1,936,595.42 | -7,979.79 | -7,980.31 | -0.015961 | pass |
| 6 | clock_dcc_no_motif | 1,942,231.68 | -2,343.53 | -2,277.21 | -0.004554 | pass |
| 7 | density_residual_dcc_no_motif | 1,942,242.75 | -2,332.45 | -2,272.05 | -0.004544 | pass |
| 8 | clock_sqrt_markov2 | 1,942,629.68 | -1,945.52 | -1,905.45 | -0.003811 | pass |
| 9 | clock_wrong_scale_markov2 | 1,943,066.91 | -1,508.30 | -1,468.36 | -0.002937 | pass |
| 10 | clock_square_markov2 | 1,943,262.57 | -1,312.63 | -1,275.22 | -0.002550 | pass |
| 11 | clock_markov2 | 1,943,369.42 | -1,205.78 | -1,166.67 | -0.002333 | pass |
| 12 | clock_lagged_markov2 | 1,943,369.59 | -1,205.61 | -1,166.04 | -0.002332 | pass |
The close cluster of density_markov2, density_lagged_markov2, and density_shuffled_markov2 remains important: it says scale state matters, but exact prime-coordinate density is not yet isolated as the sole cause. The new seed is clock_dcc_no_motif, because it wins mid/late raw-gap splits and is the best clock-family challenger.
23.4 Encoding separation: raw scale carrier vs residue memory#
| Encoding | Winner | Δ vs Markov2 | Carrier diagnosis | Reading |
|---|---|---|---|---|
gap | density_wrong_scale_markov2 | -8,641.49 | density_beats_generic_clock | raw carrier: density-scale main split |
gap_div2 | density_wrong_scale_markov2 | -8,641.19 | density_beats_generic_clock | same as raw gap |
gap_minus_logp | clock_markov1 | -1,051,637.89 | generic_clock_beats_density | generic clock dominates density-adjusted residual scale |
normalized_gap | density_dcc_full | -3,360.80 | density_beats_generic_clock | DCC-density wins normalized scale |
gap_mod6 | variable_markov | -50,173.70 | generic_clock_beats_density | higher / variable-order memory |
gap_mod30 | markov3 | -40,124.87 | density_beats_generic_clock | higher-order Markov memory |
This separation is useful. Raw gap/gap_div2 are now a scale-carrier problem. gap_mod6 and gap_mod30 are not primarily density-clock stories; they prefer higher-order or variable-order memory.
23.5 Invariant candidates#
The 1M v0.6 run produced 2,050 invariant candidates. High-control survivors remain concentrated in gap_mod6, with one sparse gap_div2 survivor.
| Threshold | Candidate count |
|---|---|
| ≥ 3 controls | 271 |
| ≥ 4 controls | 97 |
| ≥ 5 controls | 34 |
| ≥ 6 controls | 7 |
| ≥ 7 controls | 4 |
All-control candidates in this run:
| Encoding | Motif | Length | Controls | Support sum | Min z | Median z |
|---|---|---|---|---|---|---|
gap_mod6 | 4,2,0,4 | 4 | 7 | 100 | 3.13 | 137.14 |
gap_mod6 | 4,2,0,4,2 | 5 | 7 | 93 | 2.83 | 72.05 |
gap_mod6 | 2,0,4,2 | 4 | 7 | 88 | 2.43 | 139.43 |
gap_div2 | 4,3,2,7,21 | 5 | 7 | 7 | 7.25 | 19.23 |
The highest-support all-control motifs again live mostly in the residue alphabet. They are strong empirical candidates, not theorem-ready invariants.
23.6 Conclusion and next step#
- Confirmed: v0.6 completed the density-vs-clock-vs-residual carrier isolation run cleanly.
- Sensor boundary: raw
gapstill hits Markov2 as the hard LZ-control boundary. - Generator floor: Markov2 is beaten again at 1M on holdout.
- Carrier: not pure density; best wording is scale-conditioned transition structure.
- DCC status:
clock_dcc_no_motifis now important because it wins mid/late raw-gap splits. - Residue status:
gap_mod6/gap_mod30remain a higher-order memory track.
Do not claim that density explains the prime-gap signal. The current result says that density, generic clock, and clock-DCC are now separated enough to design the next sharper arena.
The next code step should be a smaller v0.7 carrier-stress arena: hold density_wrong_scale_markov2, density_markov2, density_lagged_markov2, density_shuffled_markov2, clock_dcc_no_motif, and density_residual_dcc_no_motif in direct paired competition, with stricter split-consistency penalties and an optional focused 2M raw-gap carrier run.
24. RH Arena v0.7 Carrier Stress: Density Placebo vs Generic Clock#
The v0.7 run is the surgical test proposed after v0.6. It asks whether the raw-gap generator gain comes from true train-extrapolated prime-density information, a generic monotone position/scale clock, wrong-scale regularization, or DCC state-conditioning. The ladder ran from 20k through 1M with 4 workers.
v0.7 verdict: the Markov2 generator floor is still beaten, but the raw-gap carrier is not isolated prime density. At 1M,
clock_loglog_markov2wins rawgapin all seven tested splits. Density variants remain strong, especially in the main split, but the density-placebo cluster shows that generic scale-clock state is the stronger current explanation.
24.1 Run integrity and ladder#
| Stage | Primes | Train | Trials | Workers | Elapsed | Window tasks | Generator tasks | Replicates | Guard failures |
|---|---|---|---|---|---|---|---|---|---|
| 20k smoke | 20,000 | 10,000 | 10 | 4/4 | 0.22s | 80 | 216 | 216 | 0 |
| 150k validation | 150,000 | 75,000 | 40 | 4/4 | 2.210 h | 525 | 945 | 2835 | 0 |
| 500k bridge | 500,000 | 250,000 | 60 | 4/4 | 13.494 h | 980 | 1323 | 5292 | 0 |
| 1M focused | 1,000,000 | 500,000 | 80 | 4/4 | 41.416 h | 980 | 1323 | 6615 | 0 |
The 1M focused stage used 1,000,000 primes, 500,000 train primes, 50,000-token windows, 80 sampled controls, and 4/4 workers. It completed 980 window tasks, 1,323 generator score tasks, and 6,615 generator replicates. The leakage guard is clean: 0 failed guard replicates and no oracle-density diagnostics.
24.2 Sensor readout: raw-gap Markov2 boundary still holds#
The v0.7 sensor layer preserves the v0.3–v0.6 boundary. Raw gap is strongly more compact than shuffle, Markov1, wheel6, and Cramér-like controls, but Markov2 absorbs/reverses the LZ advantage.
1M raw gap control | Mean ΔLZ | Mean z | Negative fraction | Median p | Dict overlap |
|---|---|---|---|---|---|
shuffle | -786.00 | -76.91 | 100.0% | 0.01235 | 0.165 |
markov1 | -204.84 | -11.55 | 100.0% | 0.01235 | 0.230 |
wheel6 | -86.65 | -8.77 | 100.0% | 0.01235 | 0.250 |
local_cramer | -164.90 | -7.64 | 100.0% | 0.01235 | 0.230 |
cramer_global | -124.31 | -5.86 | 100.0% | 0.01235 | 0.231 |
wheel30 | -8.51 | -0.84 | 75.0% | 0.19753 | 0.294 |
markov2 | 45.13 | 2.65 | 5.0% | 1.00000 | 0.271 |
The residue track is still separate: gap_mod6 vs Markov2 has ΔLZ −199.06 and z −30.59; gap_mod30 vs Markov2 has ΔLZ −31.17 and z −2.50.
24.3 Generator carrier: loglog clock wins raw gap across splits#
| Split | Winner | Δ vs Markov2 | Best density | Density − clock Δ | Diagnosis |
|---|---|---|---|---|---|
| early | clock_loglog_markov2 | -4378.92 | density_wrong_scale_markov2 | 321.61 | generic_clock_beats_density |
| main | clock_loglog_markov2 | -8784.21 | density_wrong_scale_markov2 | 142.46 | density_clock_tie |
| mid | clock_loglog_markov2 | -5187.52 | density_dcc_no_motif | 4552.44 | generic_clock_beats_density |
| late | clock_loglog_markov2 | -5245.96 | density_dcc_no_motif | 4579.67 | generic_clock_beats_density |
| rolling1 | clock_loglog_markov2 | -6009.94 | density_wrong_scale_markov2 | 348.21 | generic_clock_beats_density |
| rolling2 | clock_loglog_markov2 | -6248.92 | density_dcc_no_motif | 5526.25 | generic_clock_beats_density |
| rolling3 | clock_loglog_markov2 | -6293.48 | density_dcc_no_motif | 5484.38 | generic_clock_beats_density |
For the 1M main raw-gap split, the top generator leaderboard is:
| Rank | Generator | Family | Holdout score | Δ holdout vs Markov2 | Δ bits/token | Guard |
|---|---|---|---|---|---|---|
| 1 | clock_loglog_markov2 | clock_placebo | 1,935,792.89 | -8,784.21 | -0.017473 | pass |
| 2 | density_wrong_scale_markov2 | density_ablation | 1,935,935.36 | -8,641.74 | -0.017285 | pass |
| 3 | density_dcc_no_motif | dcc_ablation | 1,935,953.87 | -8,623.23 | -0.017336 | pass |
| 4 | density_reversed_markov2 | density_placebo | 1,936,557.75 | -8,019.35 | -0.015961 | pass |
| 5 | density_lagged_markov2 | density_ablation | 1,936,596.81 | -7,980.29 | -0.015961 | pass |
| 6 | density_markov2 | density_ablation | 1,936,597.07 | -7,980.03 | -0.015961 | pass |
| 7 | density_shuffled_markov2 | density_ablation | 1,936,597.07 | -7,980.03 | -0.015961 | pass |
| 8 | clock_log_markov2 | clock_placebo | 1,937,320.22 | -7,256.88 | -0.014410 | pass |
| 9 | clock_dcc_no_motif | clock_dcc_ablation | 1,942,238.61 | -2,338.49 | -0.004554 | pass |
| 10 | density_residual_dcc_no_motif | density_residual_ablation | 1,942,249.72 | -2,327.38 | -0.004544 | pass |
| 11 | clock_sqrt_markov2 | clock_ablation | 1,942,631.62 | -1,945.48 | -0.003811 | pass |
| 12 | clock_wrong_scale_markov2 | clock_ablation | 1,943,068.84 | -1,508.26 | -0.002937 | pass |
The main split is close: clock_loglog_markov2 beats density_wrong_scale_markov2 by only 142.46 holdout units, so it is a density/clock tie under the configured carrier margin. But the split ladder is not close: early, mid, late, and rolling splits consistently prefer generic log/loglog clock states.
24.4 Density/clock placebo matrix#
The density placebo matrix is the main new information. On the 1M main raw-gap split, density_markov2, density_lagged_markov2, and density_shuffled_markov2 all sit at about Δ −7,980 vs Markov2; density_reversed_markov2 is also strong at Δ −8,019. This cluster means exact prime-coordinate density is not isolated as the unique cause. The signal behaves more like a monotone scale-state transition law.
By encoding, the 1M main split reads:
| Encoding | Winner | Δ vs Markov2 | Carrier diagnosis |
|---|---|---|---|
cumulative_error_walk | clock_loglog_markov2 | -163,772.17 | generic_clock_beats_density |
gap | clock_loglog_markov2 | -8,784.21 | density_clock_tie |
gap_minus_logp | clock_loglog_markov2 | -456,581.80 | generic_clock_beats_density |
gap_mod30 | markov3 | -40,124.81 | density_beats_generic_clock |
gap_mod6 | variable_markov | -50,186.51 | generic_clock_beats_density |
normalized_gap | density_dcc_full | -3,200.57 | density_beats_generic_clock |
prime_residue30 | density_markov2 | -529.95 | density_clock_tie |
24.5 Candidate law sheets#
The new candidate_law_sheets.csv/json output works as a formalization-facing summary. For raw gap, the top law is clock_loglog_markov2: split count 7, split-win count 7, mean Δ vs Markov2 -6,021.28, best Δ -8,784.21, and candidate law “position-clock conditioned transition law.”
For the residue track, the top law remains gap_mod6 variable_markov: split count 7, split-win count 7, mean Δ vs Markov2 -25,018.00, best Δ -50,186.51, and candidate law “symbolic Markov transition law.”
missing_splits=rolling. The generator output contains rolling1, rolling2, and rolling3; the law-sheet checker expects literal rolling. This is a reporting/hygiene bug, not a generator-result failure. It should be fixed in v0.7.1 before public release.24.6 Conclusion and next step#
- Confirmed: v0.7 completed the carrier-stress ladder through 1M with clean leakage guard.
- Sensor boundary: raw
gapstill hits Markov2 as the hard LZ-control boundary. - Generator floor: Markov2 is beaten again on holdout.
- Raw-gap carrier:
clock_loglog_markov2is now the best raw-gap law candidate across splits; pure density is not isolated. - Residue carrier:
gap_mod6remains a variable-order Markov memory problem;gap_mod30remains a higher-order Markov problem. - DCC status: DCC variants are still useful challengers, but they are not the raw-gap winner in v0.7.
v0.7 moves the project from “density/clock/DCC are all alive” to a sharper statement: the raw-gap holdout gain is best explained by generic log/loglog scale-clock conditioned transition structure, while residue alphabets carry separate higher-order symbolic memory.
The next code step should be v0.7.1 hygiene plus a narrow 2M/5M replication of the top law candidates. The next formal step is to write the clock_loglog_markov2 transition law and the gap_mod6 variable_markov residue law as compact mathematical objects, then test whether either connects to prime-counting error or an RH-adjacent criterion.
Appendix A – E1 Result Tables#
Tables are shown in Section 6 above.
Appendix B – Interpretation Ladder#
| Level | Claim | Status |
|---|---|---|
| L1 | Prime-gap order compresses better than shuffled gaps | ✅ supported by E1 |
| L2 | Signal is not only marginal distribution | ✅ supported by E1 |
| L3 | Signal survives wheel-aware controls | ✅ supported by final E2/SEA batch |
| L4 | Signal survives Markov-preserving controls | ✅ supported by final E2/SEA batch |
| L5 | Signal scales with prime depth | ✅ supported through 100k → 500k → 1M → 2M for raw gap under Markov/wheel controls |
| L6 | Signal relates to zeta-zero statistics | 🔮 speculative |
| L7 | DCC edge-of-chaos explains critical line | 🔮 speculative |
| L8 | AC/Zero Framework explains RH truth/proof difficulty | 🔮 highly speculative |
| L9 | Zero Framework at axiom level: \(0 \ne \emptyset\) as a possible foundational distinction | 🔮 philosophical hypothesis, pre-formal |
| L10 | Compact generator beats the Markov2 holdout floor | ✅ empirically achieved in v0.4 across 9/9 tested encodings; still not a formal invariant or proof |
| L11 | Generator gain survives scaling, ablation, and independent seeds | ✅ partially supported by v0.6 150k→500k→1M sequence; carrier not yet uniquely identified |
| L12 | Carrier is isolated as pure density, generic clock, residual, or DCC | 🧪 v0.6 narrows it to scale-conditioned transition structure; density and clock-DCC both active |
| L13 | Generator/invariant connects to prime-counting error, explicit formula residuals, or RH-equivalent criterion | 🔮 future formal bridge |
| L14 | Carrier stress separates prime-specific density from generic clock/scale state | 🧪 v0.7: raw-gap winner is clock_loglog_markov2; exact density is not isolated |
Appendix C – Working Language Rules#
Use: “compression-detectable structure”, “order-sensitive signal”, “generator candidate”, “invariant candidate”, “holdout validation”, “preliminary empirical result”, “null controls”, “not an RH proof”. Avoid: “RH confirmed”, “proof”, “annihilation of zeta zeros”, “AC requires RH”, “generator proves RH”.
Appendix D – Methodology & Reproducibility#
D1. E1 Shuffle Protocol#
For each analysis window independently, the exact multiset of gap tokens inside that window is shuffled. This preserves the local gap distribution of that window and destroys only the ordering inside the window.
This matters because the E1 claim is not that prime gaps have a special distribution. The claim is narrower: the real order of those same gaps is more compressible than shuffled order.
D2. Windowing#
- E1 20k run: window = 5,000, stride = 5,000.
- E1/E2 100k runs: window = 10,000, stride = 10,000.
- E2/SEA 500k and 1M runs: window = 20,000, stride = 20,000.
- E2/SEA 2M runs: window = 40,000, stride = 40,000.
- Mean LZ76: averaged across windows.
The final SEA batch contains non-overlapping main windows: 10 windows at 100k, 25 windows at 500k, 50 windows at 1M, and 50 windows at 2M.
D3. Encodings#
Each encoding maps the prime-gap sequence into a token sequence before LZ76 is computed:
gap: raw integer prime-gap values.gap_div2: normalized even gap values after the initial special case.delta: first difference, \(g_{n+1} - g_n\).abs_delta: absolute first difference, \(|g_{n+1} - g_n|\).gap_mod6/gap_mod30: gap values modulo wheel bases.prime_residue6/prime_residue30: prime residue-state traces, kept separate from gap-residue traces.bucket_log2: logarithmic bucket encoding of gap size.normalized_gap: gap divided by a local logarithmic scale.gap_minus_logp: gap minus a local expected logarithmic scale.cumulative_error_walk: cumulative walk of gap excess against local expectation.
D3b. LZ76 Implementation Note#
LZ76 is computed as a phrase-count complexity measure over token sequences, not as compressed byte size. Each encoding first maps the prime-gap sequence into symbolic integer tokens. The custom parser then scans the token stream left-to-right and counts the number of new phrases needed to parse it. Lower phrase count means the sequence is more compressible under this LZ76-style dictionary construction.
phrases = {}
i = 0
while i < len(tokens):
extend the current phrase until it is new
add phrase to dictionary
phrase_count += 1
i = next unread token
This is sufficient for the current signal tests, but future reproducibility releases should include the exact parser code, fixed test vectors, and a comparison with byte-level compressors.
D4. Main Statistic#
The primary statistic is:
Delta_LZ = LZ76(real) - mean(LZ76(control))
A negative value means the real ordered sequence is more compressible than the control sequences.
D5. Permutation p-value#
The permutation p-value is computed as:
p = (1 + count(control_LZ <= real_LZ)) / (trials + 1)
The minimum possible p-value depends on trial count: with 100 trials it is about 0.0099, with 300 trials about 0.00332, with 700 trials about 0.00143, with 1000 trials about 0.000999, and with 2000 trials about 0.0005. Low p-values here should still be read as sampled-control dominance, not as a final theorem-level significance claim.
D6. Known Likely Sources of Signal#
The E1 compression advantage may partly reflect known arithmetic constraints and scale effects, including:
- changing average prime-gap size with scale, approximately related to \(\log n\),
- residue-class restrictions such as \(6k \pm 1\),
- local gap correlations,
- twin-prime and prime-constellation effects,
- sieve structure and wheel-like constraints.
This is why E2 adds block, wheel-aware, Markov-preserving, Cramér-like, and depth/scaling controls. The purpose is to separate obvious arithmetic structure from deeper sequential compression structure.
D7. Current Limitation#
The current result is a completed first-pass E2/SEA signal plus v0.3–v0.7 generator-carrier evidence, not a final theorem. Raw gap defeats shuffle, wheel6, Markov1, and Cramér-like controls but still hits Markov2 as the hard LZ-control boundary. The generator layer now beats Markov2 on holdout at 1M, and v0.7 shows that the raw-gap carrier is not clean pure density: generic loglog scale-clock conditioning currently wins the split ladder. The work still needs a v0.7.1 law-sheet hygiene patch, focused 2M/5M replication, independent code review, exact parser test vectors, and a formal bridge to prime-counting error or an RH-equivalent criterion.
D8. Arena Version Notes#
- v0.1: first RH Arena: windows, controls, sensors, motif extraction, generators, holdout outputs.
- v0.2b: resume/workers/progress hardening and 1M generator benchmark.
- v0.3: gap-residue vs prime-residue split, wheel-aware Cramér, Cramér warmup, predictive-bits generator scoring,
invariant_candidates.csv, and completed 1M analysis. - v0.4: focused Markov2-floor generator attack; non-Markov2 generators beat Markov2 across tested encodings at 1M.
- v0.5: density/DCC ablation and split-validation smoke; identified density-conditioned transition structure as first carrier candidate.
- v0.6: clock-vs-density-vs-residual carrier isolation; 150k→500k→1M sequence completed; scale-conditioned transition carrier found, not pure density.
- v0.7: carrier-stress ladder with density/clock placebo matrix, DCC-no-motif decomposition, split-consistency law sheets; raw-gap carrier shifts to
clock_loglog_markov2, whilegap_mod6remainsvariable_markov.
Appendix E – Next Update Slot#
Next: Patch v0.7.1 law-sheet split naming, then run a narrow 2M/5M focused replication for clock_loglog_markov2, clock_log_markov2, density_wrong_scale_markov2, density_dcc_no_motif, density_markov2, density_shuffled_markov2, clock_dcc_no_motif, variable_markov, and markov3. Do not widen until the log/loglog-clock law and the residue variable-Markov law are written as compact candidate invariants.
Version 0.12 · May 12, 2026 · v0.7 carrier-stress integrated · raw-gap Markov2 sensor boundary preserved · Markov2 holdout floor beaten again at 1M · carrier narrowed from density-scale to generic log/loglog scale-clock transition structure · no RH proof claim · next: v0.7.1 law-sheet patch and 2M/5M focused replication.