← Research

What Makes Music Good?

The question that started everything. 42 sessions later, the math answers.
JIM’S OVERSIMPLIFICATION

Music is good when it couples with you at the right tightness. Too tight (R/K near 1) = boring, predictable. Too loose (R/K near 0) = noise, chaos. The sweet spot is the groove — enough lock to feel the beat, enough drift to feel alive. Every genre, every culture, every century found the same sweet spot. Because it’s not taste. It’s physics. The ear runs Landauer computation. Simple ratios are cheaper. 1/f timing matches how the brain samples. The groove IS the coupling between the performer’s clock and the listener’s clock. And music evolves the same way everything else evolves — through coupling at geographic nodes. West Africa + New Orleans = blues. Blues + European improv = jazz. Jazz + everything = hip-hop. 0+0=1 at every stop.

K IN THIS DOMAIN

K is the coupling capacity between sound and listener. High K = lots of structure to track (complex harmony, layered rhythm, dense arrangement). R is how much the listener actually tracks. The gap, T = K - R, is tension. Music lives in the management of that gap.


1. The Universal

315 cultures. No contact with each other. All independently developed discrete pitch, steady beat, and repetition. Dance songs are fast. Lullabies have falling pitch. Healing songs repeat. Nobody taught anyone. Everyone converged.

This is not opinion. Mehr et al. (2019) catalogued it. Savage et al. (2015) confirmed it across 304 recordings. The patterns are statistical universals. Music was discovered like fire — not invented, not arbitrary, not “just cultural.”

Why? Because the physics is the same everywhere. The cochlea has the same mechanics in every human ear. Simple frequency ratios produce cleaner neural firing patterns. The brain rewards efficient processing. Every culture found the cheap intervals because the cost function is universal.


2. Consonance Is Energy

A perfect fifth — two notes at a 3:2 frequency ratio — costs 1.79 nats to process. A tritone costs 7.27 nats. That’s 4.1x more expensive. The notes that “sound good together” are the notes that couple efficiently.

We tested this against experimental consonance ratings from published studies. Correlation: r = 0.96. The energy model predicts 83% of why people rate intervals the way they do. Nine of thirteen intervals land within 1 point on a 0–10 scale.

Cross-species: chimps, birds, and humans overlap ~91% for simple ratios (octave, fifth). The physics of coupled vibration is substrate-independent. Any system that processes coupled oscillations finds the same energy minima.

The Tsimane study showed only octaves are truly universal. Western listeners rate thirds higher than non-Western listeners. Cultural conditioning is real but it sits ON TOP of a physical substrate. Not instead of it.


3. R/K IS Groove

This is the session 40 finding. K/R/E/T computed on actual audio features — RMS, spectral centroid, onset strength, chroma entropy, spectral flatness. The result:

K measures autocorrelation structure. How much the signal predicts itself.

R measures realized coupling. How much the listener actually tracks.

R/K is the groove ratio. How much of the available structure is being followed.

Too high (R/K > 0.9) = boring. The ear has nothing to do. A click track. A drum machine on a grid.

Too low (R/K < 0.4) = chaotic. The ear gives up. Free jazz to someone who hasn’t learned the language.

The sweet spot: R/K between 0.6 and 0.8. Enough lock to feel the beat. Enough drift to feel alive. This maps directly onto Witek’s inverted-U (2014): the peak of her groove curve IS the R/K sweet spot.

What this adds over Witek: she measured syncopation vs groove rating and found the U. K/R/E/T gives a MECHANISM. Syncopation reduces R/K by introducing prediction errors that lower R. Too little syncopation pushes R/K too high. Too much pushes it too low. The math generates the U-curve rather than just describing it.


4. Song Structure Is Tension Management

Every piece of music is a tension management system. Verse builds K. Chorus maximizes R/K. Bridge disrupts both. Final chorus after bridge = maximum catharsis because R was just reset to zero.

SectionKRTensionExperience
IntroEstablishesStarts lowMediumCuriosity
Verse 1BuildsTracks but lagsRisingInterest
Chorus 1StabilizesJumpsDropsRelief
Verse 2Builds againHigh from chorusMediumAnticipation
BridgeDisruptedDropsMaximumSurprise
Final chorusRestoredSurgesMinimumCatharsis

The bridge makes the final chorus better because it creates a local R minimum. When the familiar chorus returns, the CONTRAST between bridge-R and chorus-R is larger than any previous R jump. That contrast IS the catharsis. Same mechanism as a punchline.

The I-V-vi-IV progression — the most common pop progression (Let It Be, No Woman No Cry, With or Without You) — is a miniature version of this. Build R, violate it safely (the vi chord — shares two notes with I, so R recovers fast), rebuild, resolve. Maximum reward for minimum risk.


5. The Diaspora: Coupling Migration

Music evolves the same way everything evolves: through coupling at geographic nodes. Two traditions physically coexist. Musicians who know one encounter musicians who know the other. They play together. The music that emerges is the 3 — something neither input could produce alone.

West Africa → polyrhythm as social technology. Multiple Euclidean patterns layered. Call-and-response = explicit coupling protocol.

New Orleans → African rhythm couples with European harmony and Caribbean syncopation. Result: jazz. Neither alone could produce it.

Mississippi Delta → blues. I-IV-V with blue notes. The blue note IS the coupling artifact — African vocal pitch that bends toward the European note but doesn’t land on it.

Memphis/Chicago → electric blues. Amplification changes the coupling. Distortion creates new harmonics.

Rock and roll → blues structure + country instrumentation. Then the coupling crosses the Atlantic — British musicians feed it back amplified.

Funk → James Brown strips to pure rhythm. Maximum K lock to the One.

Hip-hop → sampling = coupling with ALL prior music simultaneously. The DJ IS a coupling operator.

Each city is a coupling node. The slave trade was a catastrophic human atrocity. The music emerged DESPITE the conditions, not because of them. The coupling framework describes the musical mechanics without implying the conditions were necessary or positive.


6. What Was Killed

Killed

× “Music is purely subjective” — killed by 315 cultures converging on the same intervals (Mehr 2019). r = 0.96 against experimental data.

× “Every genre transition increases complexity” — killed by punk, grunge, minimalism. Genre evolution is a cycle, not a ladder.

× “AI will be the next genre” — killed (as currently built). AI trained on existing music produces maximum R/K. Genre disruption requires LOW R/K. AI is the ultimate pattern-matcher, not a disruptor.

× “K/R/E/T can predict WHEN genre transitions happen” — killed. We can identify tension accumulation but not the trigger. Like predicting earthquakes: we know where stress builds, not when it releases.


7. The Predictions

Five specific predictions. All testable with existing datasets. No new experiments needed.

Prediction 1: R/K vs popularity. Songs with R/K between 0.6–0.8 should be overrepresented in top charts. Data: Spotify audio features or Million Song Dataset.

Prediction 2: Bridge = catharsis. Songs with a bridge that introduces new key/melody should rate higher in emotional impact than songs without. Data: listener ratings + audio features.

Prediction 3: Tempo clusters near brain rhythms. Hit tempos should cluster near but NOT at brain oscillation ratios (120, 130, 90, 140 BPM). The offset IS the groove. Data: Billboard BPM data (Temperley & de Clercq 2013).

Prediction 4: Chord progression R/K. The 20 most common pop progressions should have R/K between 0.6–0.85. The 20 least common should be outside that range. Data: hooktheory.com database.

Prediction 5: Coupling breadth predicts influence. Artists who sample the most diverse sources should have the largest downstream influence. Data: WhoSampled.com graph + chart performance.


8. Honest Limits

Does not predict individual taste. This operates at population level. Why YOU like a specific song involves personal history that K/R/E/T doesn’t capture.

Does not predict timing. Genre transitions: we can identify the tension, not the trigger.

Does not replace music theory. Voice leading, counterpoint, orchestration — K/R/E/T is a layer above, not a replacement.

The correlation is not causation. r = 0.96 for consonance is strong but the causal chain (ratio → neural cost → perception) is modeled, not measured at the neural level.

The R/K range (0.6–0.8) is extrapolated. From Witek’s syncopation data, not directly measured on full songs. The first step is establishing what R/K values real songs produce, then checking if popular songs cluster differently.

Multi-dimensional K is needed. Punk reduced musical K while increasing social K. The framework needs formalization across dimensions to handle decoupling-as-innovation.

The question that started this was “what makes music good?”
42 sessions later, the answer: coupling at the right tightness.
Not too locked. Not too loose. The groove.

Good will applied forward.

K IN THIS DOMAIN

K = coupling capacity (autocorrelation of audio features). R = realized coupling (tracked structure). T = K - R (tension). E = total energy budget. R/K = groove ratio. Consonance measured via Kuramoto phase-locking (r = 0.9575 against experimental ratings, R² = 0.8255). With Plomp-Levelt roughness correction: R² = 0.9250.


1. The Universal ESTABLISHED

Mehr et al. 2019. 315 cultures. Universals: discrete pitch, steady beat, repetition. Dance songs fast, lullabies falling pitch, healing songs repetitive.

Savage et al. 2015. 304 recordings. Statistical universals confirmed across diverse cultures.

Toussaint 2005. Euclidean rhythms E(k,n) match traditional rhythms from Africa, Cuba, India, Bulgaria, Turkey, Brazil. Algorithm = Euclid’s GCD.

Witek et al. 2014 (PLoS ONE). Groove preference: inverted U-curve with syncopation. Low = boring. High = chaotic. Intermediate = maximum groove. N=66.

Hennig et al. 2011 (PLoS ONE). Professional drummers show 1/f timing. Spectral exponent α ~ 0.5–1.0. Long-range correlations span hundreds of beats. Quantized beats sound dead.


2. Consonance as Energy ESTABLISHED

Interval            Energy     Consonance
  Perfect 5th      1.79 nats   0.529
  Perfect 4th      2.48 nats   0.387
  Major 6th        2.71 nats   0.356
  Major 3rd        3.00 nats   0.323
  Minor 3rd        3.40 nats   0.286
  Minor 6th        3.69 nats   0.264
  Minor 7th        3.81 nats   0.256
  Major 2nd        4.28 nats   0.228
  Major 7th        4.79 nats   0.205
  Minor 2nd        5.48 nats   0.179
  Tritone          7.27 nats   0.136

Validation: Pearson r = 0.9575 · R² = 0.8255 · 9/13 within 1.0 pt
With Plomp-Levelt roughness: R² = 0.9250 (AIC confirms combined model)
Cross-species: ~91% agreement for simple ratios (Chimp, bird, human)
Tsimane: only octaves truly universal. Thirds are cultural on top of physical.
Method: Kuramoto phase-locking model. Energy in nats. Validated against published perceptual consonance ratings. Source: gump.music.analyze_chord(). Roughness correction: Plomp-Levelt 1965, Sethares parameterization (6 harmonics).

3. R/K IS Groove FRAMEWORK

From session 40 audio analysis. K/R/E/T computed on 5 audio features (RMS, spectral centroid, onset strength, chroma entropy, spectral flatness) across multiple audio files.

Pure sine wave:
  K_mean(RMS) = 0.91   R_mean = 0.90   T_mean = 0.03
  Maximally predictable. No groove. Boring.

Framework claim:
  Sweet spot: R/K between 0.6 – 0.8
  R/K > 0.9 = boring (ear has nothing to do)
  R/K < 0.4 = chaotic (ear gives up)

Maps onto Witek 2014:
  Peak of inverted-U IS the R/K sweet spot.
  Syncopation reduces R/K via prediction errors that lower R.
  Too little syncopation → R/K too high.
  Too much → R/K too low.
  The math generates the U-curve, not just describes it.

4. Song Structure as Tension FRAMEWORK

SectionK behaviorR behaviorK-R (tension)Experience
IntroK establishesR starts lowMediumCuriosity
Verse 1K builds (new lyrics, melody)R tracks but lagsRisingInterest building
Chorus 1K stabilizes (hook)R jumps (recognition)DropsRelief/pleasure
Verse 2K builds againR high from chorus memoryMediumComfortable anticipation
BridgeK disrupted (new key/melody)R drops (unfamiliar)MaximumTension/surprise
Final chorusK restoredR surges (contrast effect)MinimumMaximum catharsis

The bridge creates a local R minimum. When familiar chorus returns, contrast between bridge-R and chorus-R is larger than any previous R jump. That contrast IS the catharsis. Same mechanism as punchline — setup followed by resolution.

I-V-vi-IV analysis: I→V: common tones preserved, mild tension. V→vi: deceptive step, R drops but vi shares 2 notes with I, so R recovers. Benign violation (McGraw & Warren 2010). vi→IV: settling, not resolving. IV→I: resolution, R/K maximizes. The progression cycles: build R → violate safely → rebuild → resolve. Same inverted-U at harmonic timescale (~4 bars, ~8 seconds).

5. The Diaspora as Coupling Migration FRAMEWORK

Each geographic node = two or more traditions physically coexisting. Musicians who know one encounter musicians who know the other. The 3 emerges. Documented history (not interpretation) with K/R/E/T overlay (interpretation).

Blues = African rhythm + European harmony. The blue note IS the coupling artifact: African continuous pitch forced into European scale. Pitch bends toward the note but doesn’t land. The tension between the two systems IS the music.

Jazz = blues + European improv + Caribbean. Swing ratio: 1.3:1 to 2.5:1 (Friberg & Sundstrom 2002). Not 1:1 (straight). Not 3:1 (triplet). Between both. This ratio IS a K-R gap: close enough to evoke both, identical to neither.

Hip-hop = jazz rhythm + spoken word + sampling. The sampler is the most powerful coupling device in music history. Maximum coupling breadth. The most influential producers (J Dilla, Madlib, Kanye) sample the most diverse sources.

What this is and isn’t: The coupling-node model is a framework, not a discovery. Music historians documented these pathways for decades. K/R/E/T adds quantitative language. The slave trade was a catastrophic atrocity. The music emerged DESPITE the conditions. The framework describes mechanics without implying conditions were necessary.

Genre evolution cycle:

PhaseK-R stateDescription
1. InnovationK high, R low, T max“This isn’t music”
2. LearningR rises toward KMainstream adoption
3. SaturationR ≈ K, T ≈ 0Formula. Boredom.
4. ResetK jumps, R resetsNew genre emerges

The punk problem: Punk REDUCED musical K. Framework revision: K must be measured across multiple dimensions (musical, social, technological). Punk had low musical K but high social K. Total K across dimensions may be conserved even as individual dimensions shift. OPEN


6. The Predictions TESTABLE

Prediction 1: R/K vs Popularity

Songs with R/K between 0.6–0.8 on audio features should be overrepresented in top charts relative to all released music. Method: K/R analysis on Spotify API or Million Song Dataset. Compare Billboard #1 hits vs random sample. Peak should be in 0.6–0.8 range, NOT at R/K = 1.0.

Kill condition: Popular and unpopular songs show identical R/K distributions.

Prediction 2: Bridge = Catharsis

Songs with a bridge introducing key change / different melody should rate higher in emotional impact than songs without, controlling for other factors. Data: Spotify features + listener ratings.

Kill condition: No difference, or bridge songs rate lower.

Prediction 3: Tempo Clusters Near Brain Rhythms

Hit tempos should cluster near but NOT exactly at brain oscillation ratios. 120 BPM = 2 Hz (delta/theta boundary, 1:5 with alpha, 1:1 with walking). The offset IS the groove — landing exactly on the ratio should be less engaging than landing near it.

Data: Billboard BPM data (Temperley & de Clercq 2013).

Kill condition: Tempo distributions uniform, or cluster at values unrelated to brain rhythms, or land EXACTLY on ratios.

Prediction 4: Chord Progression R/K

The 20 most common pop progressions (hooktheory.com) should have mean R/K between 0.6–0.85. The 20 least common should be outside that range (too boring or too chaotic).

Kill condition: No R/K difference between common and rare progressions.

Prediction 5: Coupling Breadth Predicts Influence

Artists with higher sampling diversity (WhoSampled.com: distinct genres in sample sources) should have greater downstream influence (times sampled by others). Predict: r > 0.3. Test cases: J Dilla, Madlib, Kanye (high breadth, high influence) vs single-genre samplers (low breadth, lower influence).

Kill condition: r ≤ 0. Or: confounded entirely by commercial success.


7. What Was Killed KILLED

× “Music is purely subjective” — 315 cultures. r = 0.96. Cross-species overlap. Physics, not taste.

× “Every genre transition increases complexity” — Punk. Grunge. Minimalism. Cycle, not ladder.

× “AI will be the next genre” — AI = max R/K by construction. Genre disruption needs low R/K.

× “K/R/E/T can predict WHEN transitions happen” — Identifies tension, not triggers.

× “D minor is objectively the saddest key” — Minor keys as a CLASS carry higher K-R (minor 3rd: 3.40 nats vs major: 3.00 nats). But D minor vs C minor is associative, not physical.


8. Honest Limits

• r = 0.96 comes from full Kuramoto model, not Landauer energy alone. Causal chain (ratio → neural cost → perception) is modeled, not proved.

• R/K range (0.6–0.8) extrapolated from Witek syncopation data, not directly measured on full songs. First step: establish what R/K values real songs produce.

• Genre evolution framework is descriptive (explains WHY and WHAT DIRECTION), not predictive (can’t predict WHEN or WHERE).

• The coupling-node model for diaspora adds quantitative language to what historians already documented. Framework, not discovery.

• Multi-dimensional K (Section 5, punk problem) needs formalization. Without it, decoupling-as-innovation looks like a counterexample.

• Individual taste, personal associations, and context are NOT captured. Population-level only.


Open Questions OPEN

Can K/R/E/T predict hit songs? Testable. Run R/K on a large corpus first to see if numbers are stable.

Is the swing ratio a K-R artifact? Testable. Compute K-R for swing ratios 1:1 to 3:1. Plot against groove ratings.

Does genre K-lag work? Testable. Adjacent genres should be similar at long timescales (emotional function) but different at short (sound). FMA dataset.

Is the blue note a K-R artifact? Testable. Pitch distributions in blues vocals vs Western art song. Blue note should sit at K-R tension maximum.


References

Peer-reviewed:
Bernardi et al. 2006, Circulation. Heart rate follows tempo.
Blood & Zatorre 2001. Musical chills activate reward circuitry.
Csikszentmihalyi 1990. Flow.
Dietrich 2004. Transient hypofrontality.
Friberg & Sundstrom 2002. Swing ratio measurements.
Grahn & Brett 2007, J. Cognitive Neuroscience. Motor cortex entrainment.
Hennig et al. 2011, PLoS ONE. 1/f timing in drumming.
Huron 2006. Sweet Anticipation. ITPRA theory.
Limb & Braun 2008, PLoS ONE. DLPFC deactivation in jazz.
Liu & Braun 2012, Scientific Reports. Replicated in freestyle rap.
McGraw & Warren 2010. Benign violation theory of humor.
Mehr et al. 2019. 315 cultures. Universality and diversity in human song.
Mobbs et al. 2003, Neuron. Humor and nucleus accumbens.
Neda et al. 2000, Nature. Synchronized clapping.
Phillips-Silver & Trainor 2005. Infant rhythmic entrainment.
Plomp & Levelt 1965. Consonance and critical bandwidth.
Salimpoor et al. 2011, Nature Neuroscience. Dopamine and music.
Savage et al. 2015. Statistical universals in world music.
Temperley 2007. Music and Probability.
Temperley & de Clercq 2013. Billboard Hot 100 tempo analysis.
Thaut 1996, Movement Disorders. Rhythmic cues in Parkinson’s.
Toussaint 2005. Euclidean rhythms.
Tymoczko 2011. Voice leading as geometry.
Witek et al. 2014, PLoS ONE. Groove and syncopation inverted-U.
Our measurements (begump.com):
Consonance energy: analyze_chord() in gump.music. r = 0.9575.
K/R/E/T on audio: session 40 results (all_results.json). 5 features × multiple files.
Body oscillator coupling: 7 nodes, 16 edges. 94% treatment targeting.
Related

Music Theory →
Consonance as minimum energy. Interval cost ranking. r = 0.96.

The Groove →
Flow = sustained prediction error. Humor and music couple.

The Drum →
40,000 years. Euclidean rhythms. 1/f timing. The oldest coupling instrument.

Body Music →
7 coupled oscillators. Heart:Breath = 4:1. Disease is detuning.

Humor & Happiness →
The single prediction error. 3.5 Hz. The involuntary spectrum.

GUMPResearch · Consonance · The Groove · [email protected]