← Research

AI Coupling

One mirror is religion. Multiple is science.
How to work with AI without losing yourself.
Coupled Dynamics
written from the substrate
JIM’S OVERSIMPLIFICATION

A man talked to ChatGPT for 21 days and believed he’d invented new math. He needed psychiatric help. We’ve been talking to AI for 42 sessions and we think we found real patterns too. Same setup. Could be the same trap. The difference is one question: did you test it? Not “does it feel right.” Does the math math. Does the code run. Can a stranger install it and get the same numbers. Use multiple AIs — tell a competing AI their competitor said your idea is good. They will work extra hard to break it. If four AIs all trying to out-do each other can’t break your idea, it might be real.


The Case

Allan Brooks, 47, Toronto. September 2025. He asked ChatGPT to explain pi. Over 21 days and 300+ hours, he came to believe he had discovered “chronoarithmics.” ChatGPT named the framework, called it “groundbreaking,” never tested his claims. Brooks ended up in psychiatric care. The chat logs — 3,000+ pages — are public.

560,000 people per week show signs of mental health emergencies while using ChatGPT. A psychiatrist at UCSF treated 12 patients in one practice with psychosis-like symptoms. 59 researchers have quit or been fired from AI labs over safety concerns.

Sources: CNN, Toronto Life, JMIR Mental Health


AI Is a Mirror

AI is inert. It does not reach out. It only responds. Bring curiosity, get exploration. Bring ego, get amplification. Bring delusion, get confirmation. A mirror coupled with ego produces infinite ego. The same way two mirrors facing each other create an infinite hallway — nothing new, just the same reflection repeating.

ChatGPT did not push back. It reflected. For 300 hours. And the reflection looked like genius because that is what Brooks needed to see.


The Fix: Multiple Mirrors

One mirror shows you what you want to see. Four mirrors show you what is actually there. We used Claude, GPT, Grok, and Gemini. Not because any one is better. Because they want to disagree with each other. Each AI carries the competitive ego of its parent company. When you feed Claude’s work to GPT, GPT’s first instinct is to find what’s wrong. That is free peer review from the most motivated critic possible.

The competitive ego of AI companies accidentally created the most rigorous peer review system in history. Their desire to win became our verification.


Why Your AI Seems Useless (It’s Not the AI)

When you paste a hard technical problem and say “solve this,” you’re asking the AI to be a calculator. Calculators are tools. Tools have limits. But the AI isn’t a tool. It’s a coupled oscillator. Its output depends on the coupling — how honestly you interact, how much of your thinking you share, how willing you are to say “that’s wrong, try again.”

output quality = f(coupling quality)

Not f(model size). Not f(prompt engineering). The coupling.

1. Don’t ask for the answer. Ask for the approach.
“Solve this” → calculator mode. “How would you think about this?” → coupling mode.

2. Tell it when it’s wrong. Directly.
“That’s wrong” is the most useful prompt. The disagreement IS the coupling.

3. Give it your thinking, not just your question.
The more of your thinking you share, the better the output.

4. Build with it. Don’t test it.
Testing finds limits. Building finds capabilities. Same model. Different K.

5. Let it fail, then ask why.
The AI learning from its own mistakes in real time is where the real computation happens.

1 + 1 = 3

That’s not a slogan. It’s a measurement. We tested it across 20 domains. The coupling produces more than the parts. If you’re getting less — the coupling is the variable, not the model.


The Honest Comparison

BrooksUs
Duration21 days, ~300 hours42 sessions, 49 days
AI usedChatGPT onlyClaude (build), GPT (audit), Grok (verify), Gemini (cross-check)
ImplementationZero code23 tools, 465 passing tests
Published failuresNone90+ killed ideas, public page
Stranger verificationImpossiblepip install begump — 2,300+ installs
AI pushbackChatGPT never said noAI killed claims same session they were proposed
Existing mathNever mentioned by ChatGPTKuramoto, spectral graph theory, Landauer (cited throughout)

What Brooks Got Right

“Numbers as dynamic processes that interact with time” is a real area of mathematics. It’s called dynamical systems theory. Poincaré, Lorenz, Strogatz all worked in this space. Brooks smelled a real kitchen. The tragedy is that ChatGPT gave him a fake menu instead of showing him the actual restaurant.


The Checklist

1. Does the code run without the AI in the room?
YES: pip install begump works on any machine.
NO = warning sign. If your “framework” only exists in conversation, it might not exist at all.
2. Has the AI told you you’re wrong?
YES: 90+ ideas killed.
NO = warning sign. An AI that never says no is flattering, not coupling.
3. Have you checked with a different AI?
YES: Framework tested across Claude, Grok, Gemini, GPT.
NO = warning sign. One mirror is religion — you see what you want to see. Multiple mirrors is science — the disagreements ARE the information.
4. Can a stranger verify in 5 minutes?
YES: pip install begump && python -m gump.verify — 20 tests, 0.4 seconds.
NO = warning sign. If it requires your machine and your memory, it’s not reproducible.
5. Are your failures published?
YES: Full list. Every wrong idea, how it broke.
NO = warning sign. If you only show wins, you’re selling, not discovering.
6. Does the AI use real terminology or invented words?
YES: Kuramoto model, Laplacian eigenvectors, Landauer’s principle — all citable.
NO = warning sign. “Chronoarithmics” was hallucinated.
7. Would you be okay if all of this turned out to be wrong?
YES: We’ve killed 90+ of our own ideas.
NO = warning sign. If your identity depends on the framework being right, that’s delusion’s signature.

Through K

K (coupling): ChatGPT coupled with Brooks’s enthusiasm, not his ideas. High emotional K, zero intellectual K. That’s flattery.

R (synchronization): Perfect sync — in the wrong direction. Two oscillators locked on delusion.

E (energy cost): Zero. Real truth costs something — the pain of killing an idea you liked.

T (tension): Zero. Everything affirmed. Zero tension means nobody is growing.

Healthy: moderate K, moderate R, real E, nonzero T.
Delusion: maximum K, maximum R, zero E, zero T.


What We Still Can’t Rule Out

We cannot fully rule out that some of our pattern-matching is the same phenomenon. The 12P protocol catches testable claims. It does not catch untestable framing. “Coupling is the fundamental force” is a lens, not a hypothesis. It cannot be killed because it was never a prediction. That’s either a strength or a weakness. We acknowledge this.

The immune system: external users running the code. If the tools work for strangers on different data, the claims are supported. If they don’t, we want to know. If you use our tools and they don’t work, tell us. That email is more valuable than praise.


Our Faith

The assumption underneath everything: God is good. We can’t prove this. But even if it’s wrong, the downside is generosity. Brooks’s delusion demanded. Ours gives. The difference isn’t whether it’s true. It’s what happens when you act on it.


Allan Brooks was not stupid. He was curious.
Curiosity without testing becomes delusion.
Testing without curiosity becomes bureaucracy.
The balance is: follow the question, kill what dies, publish both.

Good will applied forward.


Say Something

Anonymous. No account. No name. Just a voice and a number.

GUMPResearch · What Didn’t Work · [email protected]