We thought we found signal. We added coupling features to simplified molecular descriptors and got +0.055 AUC. Then someone pointed out our baseline was weak — no Morgan fingerprints, no hyperparameter optimization, 30-minute vibe code. They were right. We re-ran with ECFP4 2048-bit Morgan fingerprints, 21 RDKit descriptors, grid-searched hyperparameters, and the full 41,120-compound dataset. Delta: −0.003 (p=0.50). Not significant. Morgan fingerprints already capture the topological information our coupling features were providing. This page is the correction.
K here is molecular coupling. How tightly bonded the atom graph is. High K = robust connectivity = harder to fragment = different binding behavior. The Fiedler value measures this directly: the algebraic connectivity of the molecular graph.
A valid criticism noted our original comparison used simplified descriptors without Morgan fingerprints or hyperparameter optimization. We re-ran with a proper cheminformatics pipeline: ECFP4 2048-bit Morgan fingerprints + 21 RDKit physicochemical descriptors (2,069 total features), grid search over 36 hyperparameter combinations (same search applied to all models), and the full 41,120-compound dataset with no subsampling. K/R/E/T adds −0.003 AUC-ROC (p=0.50) and −0.012 AUC-PR (p=0.14). Neither metric significant. The original +0.055 was real but only because the baseline was artificially weak. Someone did our MM12P for free. They were right on every point.
| Comparison | Standard | Combined | Delta | p-value |
|---|---|---|---|---|
| Original (weak baseline) | 0.753 (16 simplified, 4,943 cpds) | 0.808 | +0.055 | <0.001 |
| Proper (Morgan+RDKit) | 0.816 (2,069 features, 41,120 cpds) | 0.812 | −0.003 | 0.50 |
Given a molecule, predict whether it is active against HIV. Standard approach: compute molecular descriptors (weight, polarity, hydrogen bonds), feed them to a classifier. We asked: what happens if you also describe the molecule as a coupled system — measuring its connectivity structure, energy distribution, and internal tension?
MoleculeNet HIV: 41,127 compounds from real screening data. 1,443 active (3.5%). Original analysis subsampled to 4,943 molecules. Corrected analysis uses the full 41,120 valid compounds (7 unparseable SMILES excluded) with no subsampling. Class imbalance handled via scale_pos_weight in XGBoost.
ECFP4 2048-bit Morgan fingerprints + 21 RDKit physicochemical descriptors. Grid search: n_estimators {100, 300, 500} x max_depth {3, 5, 7, 9} x learning_rate {0.01, 0.05, 0.1} = 36 combinations, 3-fold CV. Same search for all models. Final evaluation: stratified 5-fold CV on full dataset.
| Model | Features | AUC-ROC (95% CI) | AUC-PR (95% CI) |
|---|---|---|---|
| A (Standard) | Morgan+RDKit (2,069) | 0.816 [0.795, 0.836] | 0.476 [0.432, 0.520] |
| B (K/R/E/T only) | K/R/E/T (26) | 0.751 [0.734, 0.768] | 0.234 [0.200, 0.267] |
| C (Combined) | Standard+K/R/E/T (2,095) | 0.812 [0.787, 0.838] | 0.464 [0.418, 0.510] |
| Fold | A (Standard) | B (K/R/E/T) | C (Combined) | Delta (C−A) |
|---|---|---|---|---|
| 1 | 0.841 | 0.754 | 0.838 | −0.003 |
| 2 | 0.811 | 0.744 | 0.793 | −0.018 |
| 3 | 0.810 | 0.771 | 0.808 | −0.002 |
| 4 | 0.796 | 0.732 | 0.794 | −0.003 |
| 5 | 0.820 | 0.753 | 0.830 | +0.010 |
| Mean | 0.816 | 0.751 | 0.812 | −0.003 |
Top 10 features from Model C trained on full dataset. K/R/E/T features account for 1.9% of total importance despite being 1.2% of feature count. Only K_wiener_norm appears in the top 20.
| # | Feature | Importance | Source |
|---|---|---|---|
| 1 | morgan_1648 | 0.0051 | Standard |
| 2 | morgan_1519 | 0.0051 | Standard |
| 3 | morgan_577 | 0.0048 | Standard |
| 4 | morgan_1254 | 0.0047 | Standard |
| 5 | K_wiener_norm | 0.0042 | K/R/E/T |
| 6 | morgan_1185 | 0.0041 | Standard |
| 7 | morgan_800 | 0.0040 | Standard |
| 8 | morgan_572 | 0.0040 | Standard |
| 9 | morgan_758 | 0.0039 | Standard |
| 10 | morgan_1991 | 0.0039 | Standard |
Compare to original: K_fiedler was #2. Against Morgan fingerprints, it does not even appear in the top 20. The topology it captured is already encoded in Morgan bits.
These results compared K/R/E/T against 16 simplified descriptors from a minimal SMILES parser — no Morgan fingerprints, no ECFP, no MACCS keys, no hyperparameter optimization, subsampled to 4,943 compounds. The delta was real but the baseline was not competitive.
| Model | Features | AUC (mean ± std) |
|---|---|---|
| A | Standard only (16 simplified) | 0.753 ± 0.008 |
| B | K/R/E/T only (29) | 0.796 ± 0.013 |
| C | Combined (45) | 0.808 ± 0.014 |
Five original claims. The proper baseline test killed the main ones. Updated verdicts:
Against 16 simplified descriptors, yes. Against 2,069 Morgan+RDKit features with optimized hyperparameters on the full dataset: −0.003 (p=0.50). The +0.055 was an artifact of comparing against a baseline that did not capture molecular topology. Morgan fingerprints already encode the substructure patterns K_fiedler was providing. The critic who flagged this was correct on every point.
K_fiedler looked important only because the simplified baseline missed topology. In the proper model, K/R/E/T features account for 1.9% of total importance. Standard features: 98.1%. K_fiedler does not appear in the top 20. K_wiener_norm (normalized mean path length) appears at #5, the only K/R/E/T feature in the top 20.
The permutation test confirmed that K/R/E/T features carry real signal vs. noise. That is still true — K/R/E/T alone gets 0.751 AUC, far above random. But the relevant question was never "do these features beat noise?" It was "do they beat Morgan fingerprints?" They do not.
K/R/E/T only: 0.751. Morgan+RDKit: 0.816. Delta: −0.065, t=−8.48, p=0.001. Not close. The original MM12P audit predicted this kill. We just had not run the test yet.
Still true — Fiedler works on any graph. But in this domain, the standard tools already capture what Fiedler provides. "Same math" does not mean "additional signal." The framework provides language, not magic.
We didn’t stop at the first kill. We chased every angle.
| # | Attempt | Delta | Status |
|---|---|---|---|
| 1 | Static K/R/E/T vs weak baseline | +0.055 | KILLED (weak baseline) |
| 2 | Static K/R/E/T vs Morgan (41K) | -0.003 | Dead |
| 3 | Dynamic K (Kuramoto, spectral, full 41K) | -0.002 | Dead |
| 4 | Cross-coupling (molecule × pocket, 2D) | -0.002 | Dead |
| 5 | 3D shape + USR reference similarity | -0.018 | Dead |
| 6 | Regression: lipophilicity (logD) | -0.030 R² | HURT (p=0.004) |
| 7 | Regression: solubility (ESOL) | +0.006 R² | p=0.064 (suggestive) |
Seven attempts. Six dead. One suggestive (ESOL, p=0.064 — five K/R/E/T features in the top 20 for solubility prediction, but doesn’t cross the significance threshold).
The boundary found: Morgan fingerprints encode molecular coupling more efficiently than K/R/E/T on graphs. Our graph features are a lower-resolution view of the same information Morgan captures in 1024 bits. Not a different view. A subset. Classification or regression — the wall holds.
The framework’s contribution to drug discovery: interpretability, not performance. K_fiedler tells you WHY a molecule might bind. Morgan tells you THAT it binds. For a medicinal chemist who needs mechanism, coupling features explain what the fingerprint can’t. For pure prediction, Morgan wins.
ESOL is the open thread: solubility IS a coupling measurement (molecule coupling with solvent). Electronegativity differences and bond energy have direct physical meaning there. Five K/R/E/T features ranked in the top 20. p=0.064. Not proven. Not dead. Open.
K/R/E/T does not improve Morgan’s predictions. But it does characterize WHERE Morgan fails. We ran error analysis across four MoleculeNet datasets (ESOL, FreeSolv, Lipophilicity, HIV) and found that Morgan’s worst predictions cluster in K-space. Three failure archetypes emerged.
| Archetype | K Signature | Why Morgan Fails |
|---|---|---|
| High-polarity | Elevated R_en_var, R_en_max_diff, en_autocorr | Internal electrical tension that substructure fingerprints do not encode |
| Large / sprawling | High Wiener index, high bond count, low density, low Fiedler value | Morgan’s radius-2 window loses spatial context across extended molecules |
| Ring-strained | High clustering coefficient, high ring strain energy | Morgan encodes ring topology but not the strain energy stored in it |
The practical application: A drug discovery team using Morgan + XGBoost gets predictions. For molecules matching these three archetypes — high-polarity, large/sprawling, or ring-strained — those predictions are less reliable. The team should flag them for additional analysis: docking, MD simulation, or experimental validation. Not every compound. Just the ones the coupling profile identifies as edge cases. This saves compute and improves outcomes. Not by replacing Morgan. By knowing where Morgan is blind.
Honest caveat: This is retrospective characterization, not prospective prediction. The clustering is visible after the fact — we identified failure archetypes by looking at where Morgan already failed. Turning this into a real-time reliability flag requires validation on external datasets not used in the analysis. The pattern is statistically significant. The utility is not yet proven in production.
Mutation Scanner — Fiedler damage on protein contact graphs (0.82 AUC single feature)
Financial Crime — Fiedler vector on transaction networks (5/5 fraud detected)
Framework — K/R/E/T definitions across 20 domains
K-Lag Spectrum — K as a function of timescale, not a single number
Jim McCandless, beGump LLC. All computation on Mac Mini M4, 16GB, 35W. No cloud. MoleculeNet HIV dataset (41,120 valid compounds). Original: XGBoost 300 trees, depth 6, lr 0.1, custom SMILES parser. Correction: XGBoost with grid-searched hyperparameters, RDKit Morgan fingerprints (ECFP4 2048-bit) + 21 RDKit descriptors. Page corrected May 10, 2026. Thank you to the anonymous critic who did our MM12P for free.