File size: 6,796 Bytes

f28a48c

---
license: mit
tags:
  - quantum-computing
  - molecular-simulation
  - shadow-spectroscopy
  - regression
  - pytorch
library_name: pytorch
---

# molecular-shadows-h4

**The H4 direct-observable regressor** for fermionic shadow spectroscopy on
linear $\mathrm{H_4}$ / STO-3G. Given an equal nearest-neighbor bond length $R$
and a propagation time $t$, it predicts the length-120 vector of time-evolved
Majorana expectation values

$$ D_\mu(R, t) = \langle \psi_0(R) \,|\, \Gamma_\mu(t) \,|\, \psi_0(R) \rangle,
\qquad \Gamma_\mu(t) = e^{iH(R)t}\,\Gamma_\mu\,e^{-iH(R)t}, $$

which feed the downstream matchgate-shadow spectroscopy post-processing
(FFT $\to$ peaks $\to$ energy gaps). This is the four-hydrogen model; a
single-reference $\mathrm{H_2}$ companion exists separately.

> **What's new vs the earlier H4 release.** This model uses an
> **explicit-amplitude composition** head and **adaptive Fourier bandwidth**,
> with **chemically-informed orbital-energy inputs**. Short-bond accuracy — the
> structural weak spot of the earlier release ($R < 1.0$ Å, $r \approx 0.40$) —
> is now strong ($r \approx 0.90$). Overall held-out Pearson $0.83 \to 0.93$.

## Architecture

A linear-in-time Fourier head with geometry-conditioned frequencies and an
explicit amplitude/phase factorization (no nonlinear trunk):

$$ \hat{y}_\mu(R, t) = \sum_{k=1}^{256} a_{k\mu}(R)\cos(\omega_k(R)\, t)
   + b_{k\mu}(R)\sin(\omega_k(R)\, t) + \mathrm{dc}_\mu(R), $$

with geometry-conditioned, bandwidth-limited frequencies

$$ \omega_k(R) = \omega_{\mathrm{op}}(R)\cdot
   \sigma\!\big(\mathrm{freq\_net}(\varepsilon(R))\big)_k, $$

where $\varepsilon(R)$ are the HF orbital energies and $\omega_{\mathrm{op}}(R)$
is the per-geometry operational frequency ceiling (soft-floored at 8.0). The
amplitude tensors $a_{k\mu}, b_{k\mu}$ are low-rank factorized (rank 16).

```
(R, t) + HF orbital energies ε(R)
      │
      ├── freq_net(ε(R)) ─ sigmoid ─ × ω_op(R)  → 256 frequencies ω_k(R)   (adaptive bandwidth)
      │
      ├── amp_net(ε(R)) → low-rank a_kμ(R), b_kμ(R), dc_μ(R)   (rank 16)
      │
      └── y_μ(R,t) = Σ_k a_kμ cos(ω_k t) + b_kμ sin(ω_k t) + dc_μ   (explicit-amplitude head)
```

| Hyperparameter | Value |
|---|---|
| n_observables | 120 (degree-1 Majorana monomials, 8 spin-orbitals) |
| n_fourier | 256 |
| explicit_amplitude / amp_rank | True / 16 |
| adaptive_bandwidth | True ($\omega_{\mathrm{op}}$ floor 8.0, soft) |
| conditioned_frequencies | True |
| trunk depth × width | 6 × 768 |
| freq_net depth × width | 3 × 128 |
| residual trunk | None |
| n_orb_features | 4 (HF spatial-orbital energies of H4/STO-3G) |
| Parameter count | ~12.3 M |
| Training | 150k steps, AdamW, cosine LR $10^{-3}\to10^{-7}$, grad-clip 1.0, seed 42 |

## Chemically-informed inputs

The model conditions on **HF orbital energies $\varepsilon(R)$** rather than the
bare scalar $R$. On a paired held-out comparison (identical geometries, same
architecture, same data), orbital-energy inputs beat geometry-only inputs
decisively where multi-reference character lives, and converge with it at
dissociation:

| $R$ bin (Å) | orbital-energy input | scalar-$R$ input |
|---|---|---|
| $<0.74$ | **0.89** | 0.57 |
| $[0.74, 1.0)$ | **0.98** | 0.71 |
| $[1.0, 1.5)$ | 0.96 | 0.88 |
| $[1.5, 2.0)$ | 0.94 | 0.89 |
| $\geq 2.0$ | 0.93 | 0.94 |

This repo ships the orbital-energy model.

## Held-out evaluation

50 held-out geometries on the dense $R \in [0.5, 3.0]$ Å grid (251 total),
per-observable temporal Pearson $r$:

| $R$ bin (Å) | pearson_mean |
|---|---|
| $<0.74$ | 0.90 |
| $[0.74, 1.0)$ | 0.97 |
| $[1.0, 1.5)$ | 0.96 |
| $[1.5, 2.0)$ | 0.93 |
| $\geq 2.0$ | 0.92 |

**Aggregate:** mean Pearson **0.928** across all 50 held-out geometries; top-1
spectral-peak match 44/50. The result is seed-robust — pooling a second seed
($n=100$) gives overall 0.94 with borderline-bin seed spread $|\Delta| < 0.02$.
See `eval_results.json` for per-$R$ numbers.

## Inputs / outputs

- **Input.** $(R, t)$ — equal nearest-neighbor bond length in Å (linear chain:
  H atoms at $0, R, 2R, 3R$) and propagation time in a.u.
- **Output.** Length-120 vector of expectation values $D_\mu(R,t)$ for degree-1
  Majorana observables on H4/STO-3G's 8 spin-orbital JW encoding.
- **Valid range.** $R \in [0.5, 3.0]$ Å, $t \in [0, 300]$ a.u. Accuracy is now
  reasonably uniform across the curve; the only mild residual deficit is in the
  long-$R$ time-domain fit, which the spectral post-processing absorbs.

## Quickstart

```python
from inference import MolecularShadowsRegressor
import numpy as np

m = MolecularShadowsRegressor.from_hub(
    "aniketdesh/molecular-shadows-h4",
    revision="v18-orb",    # immutable architecture pin
    token="hf_...",
)

t_grid = np.linspace(0, 300, 1500)
y = m.predict_trajectory(R=0.8, t_grid=t_grid)   # (1500, 120) — short bond now reliable
```

## Notes & limitations

Earlier H4 releases were structurally weak at short bond ($R < 1.0$ Å): linear
H4 has a multi-reference singlet manifold whose eigenvectors rotate
near-discontinuously through avoided crossings as the chain compresses, and the
old GELU trunk could not encode that rapid amplitude rotation. The
explicit-amplitude head (which factorizes the prediction into per-geometry
amplitudes and frequencies) plus orbital-energy conditioning resolve most of
this — short-bond Pearson is now $\approx 0.90$. The remaining limitation is a
small loss of time-domain fidelity at large $R$ (slowly-varying non-sinusoidal
structure the linear-in-time basis cannot represent); because the downstream
Chan-style spectral analysis reads off frequencies, not waveforms, this does
not degrade recovered energy gaps.

## Files in this repo

| File | Purpose |
|---|---|
| `regressor.pt` | torch payload (state_dict + config + R/t grids) |
| `observable_regressor.py` | architecture (single file) |
| `inference.py` | loader (`MolecularShadowsRegressor`) |
| `orbital_energies.npz` | R-grid + HF orbital-energy table (+ $\omega_{\mathrm{op}}$) |
| `eval_results.json` | per-R held-out metrics (50 geoms) |
| `eval_summary.json` | aggregate |
| `history.json` | training curves |
| `README.md` | this file |

## Versioning

- `v18-orb` (current): explicit-amplitude composition + adaptive bandwidth +
  orbital-energy conditioning, grad-clip 1.0. Mean Pearson 0.928, short-bond
  recovered. Pin via `revision="v18-orb"`.
- Future architectures push as new commits with new tags; existing pins keep
  loading the exact committed version.

## Citation

Method: matchgate-shadow spectroscopy following
[arXiv:2212.11036](https://arxiv.org/abs/2212.11036) and
[matchgate-shadow theory](https://link.springer.com/article/10.1007/s00220-023-04844-0).

## License

MIT.