EE Manual

Architecture and experiment handbook. Self/We/Codex assembly, configuration, and the four experiment domains. Read GERUON_MANUAL.md first for instrument primitives.

中文版: EE_MANUAL_CN.md

Practical Guide

1. Quick Start — Three-Cavity Self

The repository includes a runnable Self quickstart:

git clone https://github.com/JackeyLGene/GBE.git
cd GBE
python docs\quickstart_ee_self.py

Expected output:

step  harm     tau_fast  tau_mid  tau_slow
   1   0.0000     0.600    0.600     0.600
  30   0.0356     0.738    0.738     0.710
  60   0.2429     0.740    0.745     0.741
  90   0.7265     0.743    0.748     0.741
 120   0.7507     0.746    0.737     0.738

summary
inputs:       120
harm samples: 120
harm mean:    0.3710
harm max:     0.8499
bias count:   0

OK: 3-cavity Self produced a cross-harm series.

Minimal code:

from pathlib import Path
import sys, math
ROOT = Path(__file__).resolve().parents[1]
sys.path.insert(0, str(ROOT / "code"))

from geruon import Geruon, BiasField

# 3-cavity Self: three time lenses sharing a BiasField
bias = BiasField(vec_dim=16)
cavities = [
    Geruon(vec_dim=16, memory_cap=24, kappa_tau=0.5,  bias_field=bias),
    Geruon(vec_dim=16, memory_cap=24, kappa_tau=10.0, bias_field=bias),
    Geruon(vec_dim=16, memory_cap=24, kappa_tau=100.0,bias_field=bias),
]

def make_stream(n=120, dim=16):
    for i in range(n):
        block = (i // 30) % 4
        vec = [0.0] * dim
        vec[block] = 1.0
        vec[(block + 1) % dim] = 0.25
        yield vec, f"block_{block}_{i}"

for i, (vec, sig) in enumerate(make_stream()):
    for g in cavities:
        g.process_vec(vec, sig)
    cs = [g.memory.centroid() for g in cavities]
    cs = [c for c in cs if c is not None]
    if len(cs) >= 2:
        harm = sum(
            math.sqrt(sum((cs[a][k]-cs[b][k])**2 for k in range(16)))
            for a in range(len(cs)) for b in range(a+1, len(cs))
        ) / (len(cs) * (len(cs)-1) / 2)
        print(f"step={i}  harm={harm:.4f}")

Low harm = three time lenses agree on current structure. High harm = lenses diverge — a structural boundary is passing through.

2. Self — Multi-Lens Amplifier

2.1 Configuration Reference

Configuration	kappa values	Cavities	Purpose
Standard 3-cavity	0.5 / 10 / 100	3	General structure detection, CDS-UTR separation
Extreme spread	0.005 / 10 / 5000	3	Transition magnitude detection, AUG discrimination
2-cavity boundary	0.01 / 500	2	CDS boundary precise localization
2-cavity minimal	0.005 / 1000	2	Simplest boundary signal

Changing kappa configuration = changing measurement scale. Not parameter tuning — selecting which timescale the instrument is sensitive to.

2.2 Cross-Harm

def cross_harm(cavities, D):
    cs = [g.memory.centroid() for g in cavities]
    cs = [c for c in cs if c is not None]
    if len(cs) < 2: return None
    s, n = 0.0, 0
    for a in range(len(cs)):
        for b in range(a+1, len(cs)):
            s += math.sqrt(sum((cs[a][k]-cs[b][k])**2 for k in range(D)))
            n += 1
    return s / n

Low harm = cavities agree on structure. High harm = time-lens divergence — this IS the signal. Harm sequences produce detectable spikes at structural boundaries.

2.3 Amplification Gain

Solo Geruon is blind in some domains (WTC: F ~= 0 at all kappa). Self’s multiple time lenses amplify faint structure invisible to a single lens.

G_disc = |delta_L3| / (3 * cap)

L3 = chain frame count. cap = frame capacity.

delta_L3

= max L3 difference across three cavities.

ECG: G_disc = 0.65 (RR encoding, cap=20)
WTC: G_disc = 0.52 (frequency-bin encoding, cap=12)
Cap sweet spot 20-32: too small loses signal, too large dilutes L3 chains

2.4 Encoding

Encoding is part of measurement. Principles:

Encoding must preserve predictive structure
A single encoding is always lossy — dual-encoding cross-validation
Harm surviving across encodings = true structure

Common encodings:

RR encoding (ECG): RR intervals + differences + timestamps
chroma + IOI (WTC): 12-dim pitch class + 12-dim inter-onset intervals
3-mer (DNA/RNA): 64-dim trinucleotide frequency
fork column (DNA): 4x4=16-dim multi-species alignment vector

3. We — Cross-Self Disagreement and Codex Formation

3.1 Architecture

We:
├── Self_0: 3 cavities (encoding A)
├── Self_1: 3 cavities (encoding B)
├── Self_2: 3 cavities (encoding C)
└── collective: Geruon(kappa=10) <- cross-Self harm collector

3.2 Harm

Harm is not a moral judgment. It is “another Self’s observation modified my frame economy” — the structural trace of cross-Self cognitive conflict. When one Self’s residual arrow is routed to another Self, frames created or modified in the target are marked as harm.

3.3 Codex Formation Loop

for gen in range(n_generations):
    we = We(n_selves=3)
    for event in stream:
        we.process(event)
    we.finalize()          # harm frames -> collective -> collective_codex
    we.archive()

survivors = archive.survivals(min_gens=2)  # cross-generation survivors

3.4 Quantum Switches

We(cavity_quantum=False, collective_quantum=False)

cavity_quantum: cavity-level quantum branching. Off during deposit phase.
collective_quantum: collective quantum. Optionally on during precipitation analysis.
Ablation confirmed: classical mode zero signal difference. Off by default.

4. Codex — Externalized Precipitate

4.1 Five-Stage Loop

Stage	Operation	Status
Formation	Cross-Self harm filtering -> stable centroid precipitation	Verified
Inscription	Precipitated frames written to collective_codex	Verified
Transmission	collective_codex -> next-generation Self’s `g.codex`	Verified
Confirmation	Inherited entries matching environment -> weight growth	Verified
Rejection	Non-matching entries frozen -> not decayed	Verified

Stage six (not yet closed): Codex operation — Self actively queries Codex under stress/noise conditions, inheriting centroids as corrective signals into the frame economy.

4.2 Inheritance Path

Gen N end:    collective_codex -> self._prev_codex
Gen N+1 start: _prev_codex -> Self codex -> Codex lookup in process_vec()

Inheritance goes through Codex, not BiasField. BiasField averages all ancestors together (normalization flattens alpha) — ineffective for generational transmission.

4.3 Scaffolding

Codex operation currently exists as scaffolding: manually triggered Codex nearest-neighbor lookup -> soft blend into cavity input -> generational integration. These manual triggers are verification probes for future native mechanisms.

Scaffolding vs shortcuts: shortcuts skip the hard part. Scaffolding builds a temporary bridge to the hard part — verifies the destination exists, then decides whether to build a road.

5. Experiment Domains

Four domains share the same instrument. Domains are not independent discoveries — they are cross-domain stress tests of a single instrument.

Domain	Blind to	Core result	Controls	Limitations
WTC	Chord templates, key signatures	Transposition equivariance 100%; 5-stage Codex loop	Cross-key Codex: non-matching entries frozen	Codex operation scaffolded
DNA	Codon table, gene annotations	Exon/intron d=-0.97 (n=200)	Shuffle collapses to d=-0.10	AHSG fork: single gene
RNA	AUG motifs, ORF annotations	CDS stop 98-100% within 3 windows	Internal ATG control 98%	CDS start 66%; passive detection
UN	Event labels, causal narratives	2025 disp=0.448, rank 1/79	P5 ablation; FRED negative control	Prospective (2026-2028); n=79 years

5.1 WTC

Bach Well-Tempered Clavier, 36 pieces. Dual-encoding (chroma + IOI), 10-generation Codex convergence. Zero musicological priors. Transposition equivariance 100%. Codex selection layer is decoupled from frame economy — externalization creates a history parallel to processing efficiency.

5.2 DNA

Human TE atlas + 4-species hominid alignments. 3-mer encoding, genomic-order windows. Exon/intron d=-0.97 (n=200). AHSG fork column d=+1.51 (single-gene calibration — AHSG annotated as positive selection by multiple genome scans). Shuffle ablation confirms sequence order is load-bearing. Kappa ablation confirms multi-lens is load-bearing.

5.3 RNA

Human TE atlas. Temporal-lens divergence produces cross-harm spikes at UTR/CDS boundaries. CDS stop 98-100% within 3 windows, parameters unchanged. CDS start 66% within 3 windows. AUG control 98% — transition magnitude detection, not motif detection. The instrument does not recognize codons. It detects the structural transition of molecular operations entering and exiting the coding regime.

Stop > start is a biological prediction — 3’UTR is a cliff, 5’UTR is a gentle slope.

5.4 UN

1946-2025 UN ideal points, 79 years, 193 nations. 2025 displacement=0.448, highest in history. P5 ablation: signal concentrated in major-power layer. FRED negative control: pure economic data judges 2025 as structurally normal. Prospective prediction 2026-2028: P5-aligned system structural collapse. Falsifiable: any two of three indicators satisfied = confirmed.

Principles

6. Architecture Physics

6.1 Four-Layer Amplification Chain

Geruon (depth 0)   -> Single lens. F/wit measurement.
Self (depth 1)     -> Multi-lens * BiasField. cross-harm = inter-lens structural divergence.
                      Solo-blind domains (WTC) produce detectable signal at Self layer.
We (depth 2)       -> Multi-Self * multi-encoding. harm crossing -> Codex formation material.
                      Structure a single Self cannot process precipitates at We layer.
Codex (externalized) -> Precipitate. Cross-generation survival. Forms a selection layer
                      decoupled from frame economy.

6.2 Inter-layer Communication

Communication between layers is primarily residual/refraction — “what did this cavity fail to absorb?” Not centroid averaging, but the direction of disagreement.

Key principles:

Cavities keep a fresh eye: reborn each generation, Codex not written into cavity innate memory
Self is the future Codex operation subject: book queries triggered at Self layer
We is currently a research tool: cross-Self disagreement detector, not a community subject
GI recurs at each layer: not a universal constant, but an economic trough. BGM: GI=4 optimal. EE Self (n=3): GI=5 signal-optimal, GI=4 conservative operating point

7. Cross-Harm — Multi-Lens Structural Divergence

7.1 Physical Picture

Three cavities are like three lenses of different focal lengths aimed at the same stream. The fast lens (kappa=0.005) adapts instantly to new patterns — its centroid displaces immediately after structural change. The slow lens (kappa=5000) barely moves — its centroid retains memory of the previous structure.

When the stream’s structure is stable, the three lens centroids converge to the same position -> harm low. When the stream crosses a structural boundary, the fast lens has already moved while the slow lens hasn’t -> harm spike.

This is temporal-lens divergence — the mechanism by which the instrument detects structural boundaries. The same mechanism detects tonal skeleton transitions in WTC, distinguishes exons from introns in DNA, locates CDS boundaries in RNA, and detects international system displacement in UN.

7.2 2-cavity vs 3-cavity

2-cavity (fast+slow) gives the cleanest boundary position — only one dimension of divergence. 3-cavity (fast+mid+slow) captures richer divergence structure — the middle lens provides an additional reference, making it more sensitive to transition magnitude.

In RNA experiments, 2-cavity CDS start localization was more precise (median -0.9 window). 3-cavity extreme spread discriminated AUG magnitude better (98% vs 78%).

8. Information Economics

8.1 BiasField Attention Economics

BiasField is a shared gradient field — the accumulated gradient of centroid deposits from multiple cavities. deposit(vec, weight) accumulates; blend_into(vec, weight) blends the field direction into new input. The field is continuous — it does not seek a “nearest neighbor”; it imposes a directional bias.

Economic meaning: attention allocation. Strong-field dimensions receive more modulation weight — the system allocates more attention to “dimensions that once had structure.” Weak-field dimensions are ignored. This is not programmed attention — it is weight asymmetry naturally formed by deposited gradients.

BiasField vs Codex division of labor:

Codex = discrete precise lookup. Known symbols match directly.
BiasField = continuous gradient bias. Nameless directional influence — “GABA mode.”

8.2 BiasField Game Theory

When multiple cavities share a BiasField, the field is no longer a neutral information channel — it becomes an implicit game space.

Strategy space. Each cavity’s strategy is the centroid it deposits into the field — its estimate of “what the current structure is.” Cavities do not directly know other cavities’ states — they only feel their accumulated influence through the field gradient.

Payoff. A cavity’s payoff is merge success — how well its input matches its own frame economy. If the field bias helps the cavity find matches faster -> positive payoff. If the field bias pulls the cavity away from its own structural region -> negative payoff.

Equilibrium dynamics. High-weight, frequently-merging cavities deposit more and with greater weight — they dominate the field gradient. Low-weight cavities are pulled by the field gradient, their centroids dragged toward the mainstream direction. This is not “consensus” — it is the field’s attention allocation naturally converging to the region of highest information density.

Three findings from multi-cavity game theory (M15):

Attention concentration. When multiple cavities process different encodings of the same stream, the field gradient rapidly concentrates on structural dimensions shared across encodings. Encoding-specific dimensions are diluted — the field automatically separates signal (survives cross-encoding) from noise (encoding-specific).
Small cavities covered by large ones. Low-weight cavity centroids are pulled by the field to positions far from their own data — they lose expressive rights over their own structure in the game. This is not a bug — it is precisely the source of “cross-Self unabsorbable residuals” in Codex formation: when a small Self’s structure is inconsistent with the dominant field direction, it is suppressed by field attention allocation and becomes residual harm.
Field cliff effect. When the field gradient accumulates to critical density in some direction, it exerts non-negligible bias on ALL cavities — even when that direction is inconsistent with some cavities’ data. This is the BiasField-layer counterpart of BGM’s conf_threshold cliff: the field’s attention allocation is not linear — it has a phase transition point beyond which the field shifts from “suggestion” to “command.”

Connection to We-layer harm. We-layer harm is essentially the output of the multi-cavity BiasField game. When a cavity is presumed by the field gradient to be in some region, but its actual input points elsewhere — this irreconcilable difference is harm. Harm is not error — it is the inevitable byproduct of attention allocation in the game. Codex formation extracts cross-generationally surviving structure from these harms.

8.3 Landauer-Godel Bill

Landauer’s principle (Landauer, 1961): every irreversible information operation dissipates at least kT ln 2. The frame economy has three classes of irreversible operations:

Operation	Irreversibility	Bill
Merge	Two frames become one — old centroid permanently lost	O(1) per merge
Prune	Frame discarded — accumulated vec information permanently lost	O(weight) per frame
Precipitate	Frame written to Codex — immutable thereafter	O(1) per precipitation

landauer_skips counts prediction operations skipped during LOCKED — the system proactively shuts down the most expensive self-referential operations to cap the bill.

8.4 Economics of Tau Convergence

Tau converges to ~0.74-0.75 across all domains and depths. This is not set — it is the economic equilibrium the frame economy finds between merge pressure and differentiation pressure. When merging is too easy (tau low), the system over-generalizes — structure is lost. When merging is too hard (tau high), the system over-differentiates — frame economy congests, stress accumulates. 0.74 is the equilibrium that maximizes information rate.

8.5 Economic Status of GI

GI=4 was discovered in BGM’s kappa_tau parameter scan as Pareto-optimal for single-GEME+G0 architecture.

EE’s CALIB-GI scan produced more refined results for the Self architecture:

n_cav	GI_opt	GI upper	Notes
2	1	~2	Two cavities need no long cycle
3	5	5	GI=4 conservative operating point (0.0890 vs GI=5 0.0950)
5	1	~3	More cavities need more frequent sync
8	3	~5	GI*(n-1) is not constant

Key finding: GI=4 is optimal for BGM architecture, not for EE Self architecture. For n=3 standard Self, GI=5 is signal-optimal; GI=4 retains ~94% signal with one notch of safety margin. The difference arises from different information economics — BGM’s G0 is an external observer, EE’s Self is bidirectional inter-cavity communication. Different game structures produce different optimal communication rhythms.

Current recommendation: n=3 Self with standard lenses (0.5/10/100) uses GI=5 as signal calibration point, GI=4 as conservative operating point.

8.6 The P/NP Boundary in Operational Form

Aaronson (2011) proposed that deep questions about mind and knowledge may depend on computational complexity rather than abstract computability.

The frame economy provides an operational form. Self-referential operations create an identity search problem — the system must find “the frame for that pattern I just processed.” The cost of this search is endogenously regulated by tau. When tau is low, merge thresholds are wide — search is easy, cost is low. When tau is high, merge thresholds are narrow — search is hard, cost grows nonlinearly. At LOCKED, process_prediction() is skipped — the system proactively caps the most expensive operation.

The P vs NP boundary here is not a theorem to be proved — it is a cost endogenously regulated by tau. The system does not “solve” NP problems in polynomial time; it regulates self-reference depth under Landauer-bill constraints so that it never crosses the economically infeasible boundary.

9. Expression-Action Spectrum

The instrument’s applicability to different data types is determined by data structure type — not domain category:

Data type	Definition	Example	Instrument sensitivity
Decision	Action as data	UN votes, MIDI note-on, DNA bases	Strongest
Action	Entity choice records	NASDAQ volume direction	Moderate
Expression	Process trace	CPI, price indices, raw ECG voltage	Weak

The instrument needs relational structure in the input. Decision data natively carries inter-entity structural relations — uncompressed, not downsampled. Expression data only records process outputs — the relational structure is not in the data. This is not an instrument defect; it is a boundary of information itself.

ECG lesson: raw voltage is Expression — of six encodings tested, only one produced signal. Encoding is part of measurement.

10. Scaffolding Method

Every architectural layer’s mechanism was first verified through scaffolding — temporarily injecting what the architecture should eventually do natively — then dismantled, replaced by native mechanism.

L4 self_observation: originally explicit steps (108 self_observe calls) -> dismantled
doubt condition: originally pengshu’s fifth condition -> M2 removed
27-dim formula alphabet: originally hardcoded vector space -> dismantled to configurable vec_dim
Codex operation: currently scaffolded — manual Codex lookup + soft blend -> awaiting native

Scaffolding is not a shortcut. A shortcut skips the hard part. Scaffolding builds a temporary bridge to the hard part — verifies the destination exists, then decides whether to build a road.

11. Reproducibility Checklist

Fair coin baseline — same D, same cap, same kappa sweep
Shuffled control — destroy sequential structure, preserve component statistics
Kappa ablation — disable multi-lens, reproduce with single kappa -> confirm multi-kappa is load-bearing
Multi-seed — minimum 3 seeds (42, 123, 456)
structon calibration — minimum detectable displacement at current (D, cap, kappa)
Three-step threshold — report error <= 2 / <= 3 / <= 5 simultaneously
Parameter freeze — no geruon.py modifications within one batch; parameters fixed or swept and reported
Code archive — experiment scripts frozen in experiments/ by domain

Dependencies: core instrument Python 3.8+ stdlib only; data processing pandas/numpy/pyarrow; RNA bigWig requires pybigtools (Windows).

EE Manual v2.0. Practical: Quick Start -> Self -> We -> Codex -> Experiment Domains. Principles: Architecture Physics -> Cross-Harm -> Information Economics -> Expression-Action -> Scaffolding -> Reproducibility. Instrument primitives: GERUON_MANUAL.md.