Every idea we killed. What we tried, how we tested it, why it broke. The failures are where the answers hide.
We test every claim against adversarial input, ablation, known ground truth, and the opposite hypothesis. What survives is on the research pages. What didn't is here. Organized by topic so you don't walk the same dead ends.
PROTEIN PATHOGENICITY
Three-T profiler beats single T_fold scorer
Built T_amyloid (aggregation) and T_polymer (polymerization) alongside T_fold (structural). Combined verdict: max of three T values drives pathogenic/benign call.
79% accuracy vs 83% for T_fold alone. T_amyloid and T_polymer added false positives. 86% FP rate on random mutations. The extra signals were noise, not signal. T_fold alone is the product. The others ship as informational flags only.
15 structural signals beat 2 signals (K × conservation)
Built 15 physics-based scoring signals: surface patches, charge networks, aggregation, flexibility, secondary structure, metal binding, catalytic sites, packing, salt bridges. Each caught specific variants the others missed.
Scored by balanced accuracy on 87 benign + 1,557 pathogenic variants, 15 signals gave 78.3%. Two signals (K × conservation) gave 85.7%. Every "clever" signal we added caught one more pathogenic variant but also flagged more benign variants. The fills were noise. The groove was always K and conservation.
Gene prior (training label frequency) as pathogenicity signal
Used the fraction of known pathogenic variants per gene (p53 = 100%, BRCA1 = 53%) to weight the score. p53 at prior=1.0 gave 99.5% recall.
Circular reasoning. The prior was computed FROM the training labels. At prior=1.0, every variant in every gene gets called pathogenic. Specificity drops to 0%. Replaced with auto-computed protein intolerance from conservation data — not from training labels.
"96.9% accuracy" as a meaningful metric
Reported 96.9% accuracy on 1,594 ClinVar variants.
The dataset was 97.7% pathogenic. A classifier that says "pathogenic" to everything gets 97.7%. Our 96.9% was WORSE than that trivial baseline. Accuracy is meaningless on imbalanced datasets. Switched to balanced accuracy ((recall + specificity) / 2).
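The trap is easy to reproduce. A minimal sketch with hypothetical labels (same 97.7% class skew, not the actual ClinVar rows):

```python
# Hypothetical 97.7%-pathogenic label set; a classifier that always says
# "pathogenic" (1) looks great on raw accuracy, terrible on balanced accuracy.
labels = [1] * 977 + [0] * 23          # 1 = pathogenic, 0 = benign
preds  = [1] * 1000                    # trivial all-pathogenic classifier

def balanced_accuracy(labels, preds):
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    recall = tp / labels.count(1)       # sensitivity on pathogenic
    specificity = tn / labels.count(0)  # sensitivity on benign
    return (recall + specificity) / 2

raw = sum(1 for y, p in zip(labels, preds) if y == p) / len(labels)
print(raw)                             # 0.977 — looks impressive
print(balanced_accuracy(labels, preds))  # 0.5 — chance level
```

Raw accuracy rewards the degenerate classifier; balanced accuracy pins it to chance.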
Ensemble random walk folds improve K accuracy
Ran 30-50 random walk protein folds with different seeds, averaged the contact maps. Theory: systematic bias of one seed averages out across many.
The averaging smoothed out useful bias along with noise. The single fold at seed=42 happened to have correlated errors — wrong in ways that accidentally helped. The ensemble removed both the good and bad errors. Single fold was better.
Machine (Kuramoto oscillators) predicts pathogenicity
Mapped protein to coupled oscillators (one per residue, contacts = coupling edges). Mutated one oscillator's frequency. Measured global order parameter R before and after. Theory: pathogenic mutations destabilize R.
Proteins are too coupled. 393 oscillators with ~1000 edges — changing one oscillator's frequency barely moves global R (ΔR < 0.01). The network absorbs single-site perturbation. Also tried: neighborhood subgraph, multi-channel (4D oscillators), compressed clusters at prime N values. All gave ΔR below noise floor. Phase synchronization ≠ functional disruption.
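The kinematic bound alone shows why. A sketch of the global order parameter: even flipping one of 393 oscillators completely out of phase moves R by at most 2/N, before any relaxation dynamics pull it back.

```python
import cmath

def order_parameter(phases):
    # Kuramoto global order parameter: R = |(1/N) * sum_j exp(i * theta_j)|
    return abs(sum(cmath.exp(1j * t) for t in phases) / len(phases))

N = 393                      # residue count from the text
phases = [0.0] * N           # fully synchronized network: R = 1
r_before = order_parameter(phases)

phases[0] = cmath.pi         # worst case: one oscillator fully out of phase
r_after = order_parameter(phases)

# A single-site perturbation can shift R by at most 2/N ~ 0.005
delta = r_before - r_after
```

Any realistic frequency perturbation moves R far less than this worst-case phase flip, which is already below the 0.01 noise floor we saw.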
Conservation shape (close vs distant orthologs) as signal
Split orthologs into close (mammals) and distant (fish/invertebrates). Measured slope: recently constrained positions (high close, low distant) vs anciently constrained (flat). Theory: FP benign are recently constrained, TP pathogenic are anciently constrained.
The slope signal was real in the raw analysis (gap +0.012) but too noisy when multiplied into the score. Ortholog alignment quality degrades at high evolutionary distance. The BLOSUM local alignment at 15-residue windows can't reliably score distant species. Needs full MSA or HMMER to be useful.
BLOSUM substitution score as pathogenicity multiplier
BLOSUM62 separates FP from TP on average (gap 0.515 — biggest separator found). Used as continuous sigmoid weight: foreign substitutions (BLOSUM << 0) get full score, conservative (BLOSUM > 0) get discounted.
Many pathogenic variants have moderate BLOSUM scores (R→K = +2, D→E = +2). The discount killed them. BLOSUM separates the MEANS but the distributions overlap completely in the -2 to +1 range. Works on average, fails on individuals.
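What the discount looked like, schematically. The sigmoid shape and constants here are illustrative, not the shipped weights:

```python
import math

def blosum_weight(score, midpoint=0.0, steepness=1.0):
    # Hypothetical sigmoid discount: foreign substitutions (score << 0)
    # keep full weight; conservative ones (score > 0) get discounted.
    return 1.0 / (1.0 + math.exp(steepness * (score - midpoint)))

# R->K and D->E are pathogenic in many variants, yet BLOSUM62 scores them +2:
w_conservative = blosum_weight(+2)   # heavily discounted -> missed pathogenics
w_foreign      = blosum_weight(-4)   # drastic swaps keep nearly full weight
```

With these illustrative constants, a pathogenic R→K at BLOSUM +2 loses roughly 88% of its score before any other signal is consulted.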
PDB interface residues separate benign FP from pathogenic TP
Downloaded crystal complex structures for 12 proteins. Extracted exact interface residues (within 5Å of partner chain). Theory: FP benign variants cluster at interaction interfaces where context matters.
Only 5% of FP and 9% of TP are at crystallographic interfaces. Both groups are overwhelmingly NOT at interfaces. The "seats" aren't where the FP problem lives. The tolerance is intrinsic to the protein, not dependent on partners.
Fragility index (multi-substitution variance) separates tolerant from fragile positions
For each position, computed ΔK for all 19 possible substitutions. High variance = fragile (many changes hurt). Low variance = robust (only specific changes hurt). Theory: FP benign are at robust positions.
FP fragility = 0.303, TP fragility = 0.301. Identical. Both FP and TP positions are equally fragile on average. The difference isn't in how many substitutions hurt — it's in something else entirely.
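The index itself, for reference. The ΔK values below are toy numbers, not computed from a structure:

```python
from statistics import pvariance

def fragility(delta_k_by_substitution):
    # Variance of |dK| across the 19 possible substitutions at one position.
    # High variance = a few substitutions hurt badly; low = uniform response.
    assert len(delta_k_by_substitution) == 19
    return pvariance(delta_k_by_substitution)

# Toy positions (hypothetical dK magnitudes, not real data):
robust  = [0.01] * 19                  # every substitution tolerated alike
fragile = [0.9] * 5 + [0.1] * 14       # five specific substitutions hurt
```

On real FP and TP positions this spread came out identical (0.303 vs 0.301), which is what killed the signal.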
Contact redundancy (network rerouting capacity) separates FP from TP
Measured how connected a position's neighbors are to each other (without going through the position). High redundancy = network can reroute = robust = benign.
FP redundancy 0.369, TP redundancy 0.404. Pathogenic positions have HIGHER redundancy. Opposite of hypothesis. Highly connected regions are both important AND well-networked — redundancy doesn't mean tolerance.
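Redundancy here is essentially a local clustering coefficient: the fraction of neighbor-pair contacts that survive if the position is deleted. A sketch on a toy contact graph:

```python
from itertools import combinations

def redundancy(node, adjacency):
    # Fraction of the node's neighbor pairs that contact each other directly,
    # i.e. routes that still exist after the node is removed.
    nbrs = adjacency[node]
    pairs = list(combinations(nbrs, 2))
    if not pairs:
        return 0.0
    linked = sum(1 for a, b in pairs if b in adjacency[a])
    return linked / len(pairs)

# Toy undirected contact graph (hypothetical, not a real protein):
adjacency = {
    0: {1, 2, 3},
    1: {0, 2},
    2: {0, 1, 3},
    3: {0, 2},
}
r0 = redundancy(0, adjacency)   # 2 of 3 neighbor pairs stay connected
```

The failure was that pathogenic positions scored HIGHER on this, so rerouting capacity tracks importance, not tolerance.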
Groove fit (K relative to neighborhood) separates FP from TP
Measured whether each position's contact count matches its local neighborhood's average. Theory: FP benign variants "fit the groove" (K matches neighbors), TP pathogenic disrupt the local pattern.
Benign groove fit 1.010, pathogenic groove fit 1.009. Identical. Both sit perfectly in the pocket. The difference between tolerated and pathogenic isn't whether they fit the local pattern — it's something global.
94.7% balanced accuracy on pathogenicity scoring
Reported 94.7% balanced accuracy on 1,594 ClinVar variants with a 6-signal scorer (K, conservation, ΔK, propagation, functional context, gnomAD). Identified and fixed a 1-indexing bug. Tuned threshold and feature weights on the evaluation data.
The 94.7% was inflated by three compounding errors. (1) The threshold was optimized on the same data used for evaluation — in-sample fitting. (2) AUC-proportional feature weights were computed from the test set. (3) A gene-level confounder: p53 variants (91% of pathogenic data) have systematically higher scores than AR variants, so the scorer was mostly classifying "is this p53?" not "is this variant pathogenic?" Under strict leave-one-gene-out cross-validation with predeclared weights, the honest within-gene AUC is 0.74. Still matches SIFT with zero training, but not the headline we published.
Products of features (stiff × cons, damage × func × chem) improve accuracy
Multiplied structural channels together: stiffness × conservation, damage × functional_proximity × chemistry. Greedy forward selection chose the best product combinations. AUC reached 0.875 on the evaluation set.
0.875 collapsed to 0.610 under leave-one-gene-out cross-validation. The products amplify gene-level confounders: they look great when the channels align in-sample, then explode out-of-sample. Any step that selects features, tunes weights, or chooses thresholds from the evaluation data leaks the answer. Banned: greedy selection, AUC-derived weights, multiplicative stacking on full data.
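The protocol that replaced all of it: leave-one-gene-out, every threshold frozen on the training genes. A stdlib sketch with hypothetical scores (a real pipeline swaps in the actual scorer):

```python
def balanced_acc(pairs, threshold):
    # pairs: list of (score, label); label 1 = pathogenic
    tp = sum(1 for s, y in pairs if y == 1 and s >= threshold)
    tn = sum(1 for s, y in pairs if y == 0 and s < threshold)
    pos = sum(1 for _, y in pairs if y == 1) or 1
    neg = sum(1 for _, y in pairs if y == 0) or 1
    return (tp / pos + tn / neg) / 2

def leave_one_gene_out(data):
    # data: {gene: [(score, label), ...]}. All tuning happens on the
    # training genes; the held-out gene only sees the frozen threshold.
    results = {}
    for held_out in data:
        train = [p for g, ps in data.items() if g != held_out for p in ps]
        thresholds = sorted({s for s, _ in train})
        best = max(thresholds, key=lambda t: balanced_acc(train, t))
        results[held_out] = balanced_acc(data[held_out], best)
    return results

# Hypothetical per-gene scores (not our real data):
data = {
    "TP53": [(0.9, 1), (0.8, 1), (0.3, 0), (0.2, 0)],
    "AR":   [(0.6, 1), (0.5, 1), (0.4, 0), (0.1, 0)],
}
results = leave_one_gene_out(data)
```

A score scale that only separates classes within one gene shows up immediately as a collapsed fold, which is exactly how the 0.875 became 0.610.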
Deeper MSA (more sequences) improves tolerance signal
Compared shallow MSA (~2,000 sequences from UniRef, capped) vs deep MSA (~10,000 from ColabFold uncapped) vs environmental sequences (BFD/MGnify). Theory: more evolutionary data = more precise substitution tolerance.
Deep was identical to shallow. Environmental was WORSE (0.567 vs 0.648). The first ~2,000 sequences already saturate the column frequencies. Additional metagenomic sequences add distant homologs that blur the tolerance signal with noise from proteins under different functional constraints. More data ≠ better data.
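The saturation is just binomial statistics. Treating sequences as independent draws (an overestimate of the real information, since homologs are correlated), the standard error of a column frequency:

```python
import math

def column_freq_se(p, n_seqs):
    # Standard error of an MSA column amino-acid frequency estimate under
    # an independence assumption; real MSAs are correlated, so the true
    # benefit of extra sequences is even smaller than this suggests.
    return math.sqrt(p * (1 - p) / n_seqs)

se_2k  = column_freq_se(0.5, 2_000)    # ~0.011
se_10k = column_freq_se(0.5, 10_000)   # ~0.005
```

Going from 2,000 to 10,000 sequences shrinks an error bar that is already ~1% by barely a factor of two, which is why deep matched shallow.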
Coevolutionary coupling (mutual information / DCA) predicts pathogenicity
Computed pairwise mutual information between MSA columns for each residue and its spatial neighbors. High MI = co-evolved = allosteric hub = important. Theory: mutations at high-MI positions should be more pathogenic.
Mean AUC 0.302 — ANTI-predicts. Pathogenic variants hit positions with LOW mutual information. High-MI positions are the tightly co-evolved structural core — evolution keeps them locked together. The allosteric “control knobs” for gain-of-function are exactly the positions that AREN’T co-evolved. MI measures structural constraint, not functional vulnerability.
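The MI we computed, in sketch form (toy four-sequence columns, not a real MSA):

```python
import math
from collections import Counter

def mutual_information(col_a, col_b):
    # MI between two alignment columns:
    #   sum over (a,b) of p(a,b) * log2( p(a,b) / (p(a) * p(b)) )
    n = len(col_a)
    pa = Counter(col_a)
    pb = Counter(col_b)
    pab = Counter(zip(col_a, col_b))
    mi = 0.0
    for (a, b), c in pab.items():
        mi += (c / n) * math.log2((c * n) / (pa[a] * pb[b]))
    return mi

coupled     = mutual_information("AACC", "LLVV")  # perfectly co-varying: 1 bit
independent = mutual_information("ACAC", "LLVV")  # no co-variation: 0 bits
```

High MI marks the locked-together structural core; the positions pathogenic variants actually hit scored low, hence the anti-prediction.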
Exact Fiedler damage (Δλ₂ by node removal) beats the approximation
Computed the exact change in algebraic connectivity when each residue is removed from the weighted contact graph. Theory: the real number should be more precise than the perturbation theory estimate.
0.369 AUC vs 0.817 for the approximation. The approximation (|v₂|² × degree / λ₂) is BETTER because it combines two signals: WHERE in the topology (Fiedler participation) and HOW CONNECTED locally (degree). The exact Δλ₂ conflates these. The “wrong” formula captures the right physics — a node matters when it’s both topologically critical AND locally connected.
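The approximation, exercised on a toy path graph where the Laplacian eigenpairs are known in closed form (so no eigensolver is needed; a real contact graph needs one):

```python
import math

def fiedler_scores(n):
    # Approximate Fiedler damage score for each node of a path graph P_n:
    #   score_j = |v2_j|^2 * degree_j / lambda_2
    # Path-graph Laplacian eigenpairs have closed forms:
    #   lambda_2 = 2 * (1 - cos(pi/n)),  v2_j = cos(pi * (j + 0.5) / n)
    lam2 = 2 * (1 - math.cos(math.pi / n))
    v2 = [math.cos(math.pi * (j + 0.5) / n) for j in range(n)]
    deg = [1 if j in (0, n - 1) else 2 for j in range(n)]
    return [(v2[j] ** 2) * deg[j] / lam2 for j in range(n)]

scores = fiedler_scores(7)
```

Note how the score is participation times degree: the path's center sits exactly at the v2 zero crossing and scores nothing, while well-connected nodes deep inside one half score highest.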
Structural coupling alone predicts double mutation interaction type
Two mutations in the same protein: if both high-K and structurally close (<15Å), predict synergistic. If far apart, predict additive. Tested on 5 known double mutations (TP53, BRCA1, EGFR, BRAF, PIK3CA).
40% accuracy (2/5). Catches same-region synergy (TP53 175+248 at 12.7Å) but misses cross-domain interactions entirely. BRCA1 61+1775 are 50.8Å apart in different domains — both pathogenic but not structurally coupled. EGFR T790M+L858R is compensatory (resistance undoes activation) — needs functional direction, not just distance. Double mutations need allosteric pathway modeling and gain/loss-of-function annotation, not just K.
COMPUTE & PHYSICS
500 contacts = 500 independent Landauer bits in protein folding
Calculated Landauer cost of protein folding as 500 contacts × 1.85 bits/contact = 925 bits. Predicted TΔS = 1,597 kJ/mol. Claimed proteins pay 22× above Landauer minimum.
14.5× overcounting. Contacts are not independent constraints. Many contacts are redundant (if A touches B and C, and B touches C, only 2 of 3 are independent). Correct model: count BITS PER RESIDUE, not per contact. Core residues (~30%) lose ~2.0 bits, intermediate (~40%) lose ~0.5 bits, surface (~30%) lose ~0.1 bits. Average: 0.67 bits/residue. Lysozyme (129 residues) = 87 bits. Predicted TΔS = 150 kJ/mol. Measured: ~150 kJ/mol. Match: 1.0×. The fix proved the claim MORE strongly — Landauer matches measured conformational entropy exactly when you count independent constraints correctly.
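The corrected arithmetic, end to end. T = 300 K is assumed here; the 0.67 bits/residue average is taken as given from the count above:

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K
N_A = 6.02214076e23         # Avogadro's number, 1/mol
T   = 300.0                 # K (assumed near-ambient temperature)

bits_per_residue = 0.67     # weighted average from the corrected count
n_residues = 129            # lysozyme

bits = bits_per_residue * n_residues             # ~87 bits per molecule
t_delta_s = bits * K_B * T * math.log(2) * N_A   # Landauer cost, J/mol
t_delta_s_kj = t_delta_s / 1000                  # ~150 kJ/mol
```

87 bits at kT ln 2 per bit lands on ~150 kJ/mol, matching the measured conformational entropy.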
57.71 TFLOPS on Mac Mini M4
Measured GPU FMA throughput with 4096 FMAs per element across 4 independent chains. Counted ops as 4096 × 4 chains × 2 (half2) × 2 (mul+add) = 65,536 per element. Added ANE int8 concurrent.
4× op counting error. The "4 chains" were already included in the 4096 FMA count. Correct ops = 4096 × 2 (half2) × 2 (mul+add) = 16,384. Also verified: ILP (1/2/4 independent chains) gives identical throughput on M4 GPU — no pipeline latency to hide. Real GPU peak: 3.5T fp16. Real combined: ~18T. The trampoline dispatch architecture is real (improves workload throughput) but doesn't change the silicon's peak TOPS.
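The accounting error in two lines:

```python
# FMA kernel op accounting. The benchmark issued 4096 FMA instructions per
# element; each FMA on a half2 does 2 lanes x 2 ops (mul + add).
fmas_per_element = 4096

wrong = fmas_per_element * 4 * 2 * 2   # double-counts the 4 chains: 65,536
right = fmas_per_element * 2 * 2       # chains already inside 4096:  16,384

inflation = wrong / right              # the 4x headline error
```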
Tetrahedral pipeline (3-stage split) beats monolithic kernel
Split computation into 3 stages (load → compute → reduce) with separate encoders per stage. Theory: cache stays warm between stages, each stage runs on data the previous stage loaded.
0.50× of monolithic. Three separate makeComputeCommandEncoder() calls per round creates overhead that dominates the benefit. The GPU pipeline flushes between encoders. The implicit pipeline in a single encoder is faster than explicit staging.
Golden spiral chunk sizing beats uniform chunks
Sized GPU dispatch chunks by golden angle spacing instead of uniform. Theory: different-sized dispatches fill different pipeline stages.
0.99×. The GPU processes each dispatch independently regardless of size. Pipeline stages fill the same way whether the dispatch is 357K or 577K threads. The spiral is structural, not temporal.
half4 (4-wide) beats half2 (2-wide) vectors
Packed 4 fp16 values per register instead of 2. Theory: wider SIMD = more ops per instruction.
Slightly worse (-1%). The M4 GPU compiler already optimizes half2 operations. Wider packing adds register pressure without improving throughput. The ALU width is fixed.
CPU NEON contributes meaningful TFLOPS alongside GPU + ANE
Ran CPU matrix multiplication concurrent with GPU trampoline dispatch.
~0.00 TFLOPS measured. Naive Python/C matmul is too slow to register. Accelerate/BLAS might help but CPU is better used for orchestration than computation on M4.
NMC cathode Pareto frontier has zero cobalt
Screened 66 NMC compositions. Showed 4 named compositions on the Pareto frontier, all with zero or minimal cobalt. Claimed this matches industry trend.
Full Pareto analysis: 17 of 28 Pareto-optimal compositions INCLUDE cobalt. The zero-cobalt result was an artifact of showing only named compositions (811, 622, 532, 111) instead of the full frontier. Under alternative stability weights (cobalt more stabilizing), 19/21 Pareto points include cobalt. Industry confirms: cobalt is being reduced but Samsung SDI NMC 622 and Panasonic NCA both still use it. The stability weights are assumed, not measured.
Perovskite screening uses correct bandgaps
Screened 1,352 perovskite compositions. Reported FASnI3 bandgap as 1.85 eV, scored it zero. Claimed FAPbI3 wins from "pure physics."
Literature FASnI3 bandgap is 1.41 eV, not 1.85 eV. Our formula was wrong for Sn-based perovskites. FASnI3 is actually CLOSER to the optimal 1.34 eV than FAPbI3 (1.48 eV). FAPbI3 wins because Sn oxidizes in air (stability filtering saves us), not because our bandgap model is accurate. Also missing: mixed compositions (FA0.95Cs0.05PbI3) hold actual efficiency records but weren't in our search space.
CTNNB1 is the master hub of cancer signaling
Built 9-pathway network with 19 cross-pathway edges. CTNNB1 had degree=10, K=1.000. Called it "the master hub." Knockout cascaded through 5 pathways, 17 proteins.
We added 5 crosstalk edges TO CTNNB1 (from SMAD3, SMAD4, YAP1, TAZ, GLI1), doubling its degree from 5 to 10. We made it the hub by construction. In STRING database, TP53 has ~6,000 interaction partners vs CTNNB1's ~1,200. TP53 is the real master hub. With TP53 crosstalk edges instead (8 known), TP53 would have degree 13, beating CTNNB1. Our ranking reflects our edge selection, not biology.
29/30 threat detection validates the security engine
Designed 21 combination signatures matching 30 attack scenarios. Achieved 29/30 detection, 0/10 false positives.
Self-referential: we designed the signatures and test cases simultaneously. The 29/30 is a self-fulfilling prophecy. 0/10 FP used 10 hand-picked normal scenarios, not real workloads. Five novel attack patterns (living-off-the-land, slow exfil, insider threat, encrypted C2, subtle supply chain) would all evade. Any attacker who knows the required features can avoid them. V1 weighted sum with tuned per-category thresholds was never tested as an alternative.
ANE classifier pipeline accelerates our screening engines
Apple Neural Engine does 15 TOPS int8. Built tiny classifiers (6-10 features, 2-layer, <1KB) for mutation scoring, battery screening, perovskite screening. Theory: ANE dispatch beats CPU for classification step.
Our models are too small (<1KB). ANE dispatch overhead (~50μs) exceeds model compute time. The bottleneck is feature computation (contact map lookup, conservation query, propagation), not the final score formula. ANE helps only in batch mode with precomputed features (>1000 samples). For single-variant online scoring, CPU is faster. The right tool for the job, not the fastest tool available.
NEUROSCIENCE / AUTISM MODELING
Toy random matrix model captures ASD/typical separation
Built random matrix model of neural connectivity. Theory: random coupling at different densities would show measurable ASD vs typical separation via spectral properties.
Random matrices don't capture the biology. ASD overcoupling is not uniform random coupling — it's selective pruning failure. The toy model couldn't reproduce the separation because the relevant structure (hierarchical pruning) was absent from the model.
Global R separates ASD from typical at K=1.0
Measured global order parameter R (Kuramoto synchronization) on simulated neural networks at K=1.0. Theory: ASD networks should have different R than typical networks due to overcoupling.
At K=1.0, both ASD and typical topologies synchronize completely. Global R saturates near 1.0 for both. The signal is drowned. K=1.0 is above the synchronization threshold for both network types. Would need K near the critical point to see separation, but the critical K depends on the topology — circular reasoning.
Harmonic ratio separates ASD from typical networks
Computed the ratio of harmonic frequencies in the oscillator dynamics on ASD vs typical topologies. Theory: overcoupled networks produce different harmonic structure.
t-statistic = -1.58, variance too high. The harmonic ratio fluctuates too much between runs for meaningful separation. The signal-to-noise ratio is insufficient. Even when the means differ, the distributions overlap completely.
Ego feedback model differentiates topologies
Added ego-network feedback: each node adjusts its coupling based on its local neighborhood density. Theory: ASD (dense neighborhoods) and typical (sparse neighborhoods) would diverge under self-reinforcing feedback.
The feedback didn't differentiate topologies. Both dense and sparse neighborhoods converge to the same attractor under the ego feedback rule. The local feedback loop was too simple to capture the multi-scale nature of pruning.
Module walls form and persist after drives removed
Applied external driving forces to create modular structure (simulating developmental pruning). Removed drives. Theory: the modular walls would persist — structural memory from transient coupling.
Module walls form during driving but don't persist after drives are removed. The system relaxes back to its natural topology. The Kuramoto model has no structural memory — coupling is fixed, only phases evolve. Real neural pruning physically removes synapses (structural change). Phase dynamics can't simulate structural deletion.
PHYSICS
Cochlea is a golden spiral
Hypothesized the inner ear's shape follows the golden ratio, connecting music perception to φ.
Manoussaki et al. 2006 measured real cochleae. The spiral is logarithmic but NOT golden. The ratio varies between species and doesn't converge on φ. Beautiful idea, empirically wrong.
Dark matter as Landauer heat from cosmic computation
If the universe computes, every bit erasure costs kT ln(2). The accumulated heat would look like missing mass.
Energy conservation violation. Landauer heat doesn't create new mass-energy — it converts existing energy to heat. The heat is already counted in the energy budget. Dark matter requires new mass, not redistributed energy.
Rainbow operators V1 through V4 for prime distribution
Five different Schrödinger-type operators designed to have eigenvalues at zeta zero positions. Each used a different potential function.
All five failed. Schrödinger operators process information additively (superposition). Prime decomposition is multiplicative (Euler product). The natural language is scattering/transfer, not eigenstates. Direct quantization using the multiplicative structure works; operator approaches don't.
SUSY as Kuramoto phase transition
Hypothesized supersymmetry breaking maps to the Kuramoto synchronization transition at K = K*.
Unit error. Kuramoto K is dimensionless (~1.868). SUSY breaking scale is in GeV (~10³). The analogy confuses coupling strength with energy. There's no dimensional bridge.
137 = Spin(16) + SU(3) + U(1)
Decomposed 137 into gauge group dimensions: 120 + 8 + 1 + 8 = 137.
Cherry-picked. Excluded SU(2) (dimension 3) which breaks the sum. Including all Standard Model gauge groups gives 120 + 8 + 3 + 1 = 132, not 137. Numerology, not physics.
K × N = 256 for all N
K* × 137 ≈ 256. Hypothesized this product is constant across all oscillator counts.
Only works at N = 137. At other N values, K* changes but K*×N varies from 34 to 256. The relationship is specific to N = 137, not universal.
Mass spectrum follows α^n ladder
Particle masses as powers of the fine structure constant: m ∝ α^n for integer n.
94% of random bases do equally well. The "fit" is an artifact of having many particles and many powers to choose from. Monte Carlo simulation showed the pattern is not statistically significant.
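The null test in sketch form. The base range and random seed here are placeholders; the point is the procedure, not the 94% figure, which came from our full runs:

```python
import math
import random

def ladder_residual(masses, base):
    # How well m ~ base**n fits with the best integer n per mass:
    # mean squared distance of log(m)/log(base) from its nearest integer.
    total = 0.0
    for m in masses:
        x = math.log(m) / math.log(base)
        total += (x - round(x)) ** 2
    return total / len(masses)

# A few known particle masses in GeV (electron, muon, tau, proton, Z, W):
masses = [0.000511, 0.1057, 1.777, 0.938, 91.19, 80.38]
alpha = 1 / 137.035999

random.seed(0)
trial_bases = [random.uniform(0.001, 0.1) for _ in range(1000)]
r_alpha = ladder_residual(masses, alpha)
frac_as_good = sum(
    1 for b in trial_bases if ladder_residual(masses, b) <= r_alpha
) / len(trial_bases)
```

frac_as_good is the fraction of random bases that fit at least as well as α; anything far from zero means the ladder is not significant.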
K predicts fluid behavior
Applied K/R/E/T to 2D and 3D Navier-Stokes simulations. Measured K and R across forcing configurations.
K DESCRIBES but doesn't PREDICT. R = 1/φ is not universal in turbulence (R varies 0.02 to 0.81 depending on forcing). K×Re is not constant. K doesn't predict spectral exponent. The framework measures fluids accurately but has no predictive power beyond Reynolds number alone. The GPU solver (82M pts/sec) ships. The K/R framework for fluids does not.
Every failure is documented because they save time for anyone walking the same paths. And because the wrong answers often contain the right questions. The BLOSUM failure showed us that evolutionary substitution tolerance IS the strongest separator — we just can't use it as a multiplier. The Machine failure showed us that phase synchronization and functional disruption are different regimes — which told us exactly where the 19% accuracy gap lives.
Jim McCandless, beGump LLC. The best ideas are the ones that survive trying to kill them.