Physics World Model

CACTI Mismatch-Only Benchmark Suite (P-benchmark + 4 I-benchmark tiers) · L3-004

ReconstructMismatch

Example Data

A reference solver demo is available: the demos page shows input, ground truth, and reconstruction visualizations.

Genesis spec

Parent L2: L2-004
Spec type:
d_spec:

On-chain

Not registered on-chain yet.

P-benchmark — rho=50 distribution over Ω

The headline ranking score. Solvers are evaluated on 50 instances drawn parametrically across the full Ω and scored as a distribution. This is the score that decides rank 1 on the leaderboard.

Dataset: CACTI-6 + DAVIS-SCI + UCF101-Sports union with parametric Ω sampling (200 dev instances)
Authored baseline | overall_Q | ≈ Q_int
GAP-TV | 0.45 | 45
PnP-FFDNet | 0.61 | 61
EfficientSCI | 0.67 | 67

To take rank 1 of the P-benchmark, beat the highest authored Q above. See L2 spec for the epsilon_fn used per draw.

I-benchmark tiers: 4 fixed-Ω operating points

Each tier is a single-instance evaluation at a specific Ω with its own ε floor. Solvers may pass individual tiers without saturating the full P-benchmark.

Tier | ρ | ε (dB) | Ω parameters | d_ibench | Top authored baseline
T1_nominal | 1 | 26 | H=256, W=256, B=8 | 0.14 | EfficientSCI = 34.5 (PSNR_dB), Q=0.92
T2_low | 3 | 24 | H=256, W=256, B=8 | 0.28 | EfficientSCI = 29.4 (PSNR_dB), Q=0.75
T3_moderate | 5 | 22.5 | H=512, W=512, B=16 | 0.48 | EfficientSCI = 25.8 (PSNR_dB), Q=0.62
T4_blind | 10 | 21 | H=1024, W=1024, B=32 | 0.71 | EfficientSCI = 20.1 (PSNR_dB), Q=0.34
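
The per-tier check described above can be sketched as a PSNR comparison against the tier's ε floor. This is a minimal illustration, not the benchmark's scoring code: the pass rule (PSNR ≥ ε), the peak value of 1.0, and the names `psnr_db` and `passes_tier` are all assumptions; only the ε values come from the tier table.

```python
import numpy as np

# Epsilon floors in dB, copied from the tier table.
TIER_EPSILON_DB = {
    "T1_nominal": 26.0,
    "T2_low": 24.0,
    "T3_moderate": 22.5,
    "T4_blind": 21.0,
}


def psnr_db(reference, estimate, peak=1.0):
    """PSNR in dB over the whole cube (assumes intensities in [0, peak])."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0.0:
        return float("inf")  # identical arrays
    return float(10.0 * np.log10(peak ** 2 / mse))


def passes_tier(tier, reference, estimate):
    """Assumed pass rule: reconstruction PSNR reaches the tier's epsilon floor."""
    return psnr_db(reference, estimate) >= TIER_EPSILON_DB[tier]
```

Under this assumed rule, a reconstruction with a uniform error of 0.1 on a unit-range cube has a PSNR of 20 dB and would miss even the T4_blind floor of 21 dB. The L2 spec remains the authority on how ε is actually applied.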

Current standing

🏆 Current SOTA
No certs submitted yet.
Be the first to submit a solution and claim rank 1.
📊 Classical floor
GAP-TV: 26.80 dB (deliberate floor; any solver that scores higher wins)
📈 PWM-enabled delta
Delta unavailable (need both SOTA and reference PSNR).

Example data

Two independent samples from the InverseNet benchmark, each shown as measurement → ground truth → reference reconstruction. The reconstruction is the reference solver's output — the bar a competing solver must beat to win the bounty.

sample_01 · kobe

reference PSNR 27.99 dB
Measurement: shape [256,256]
Ground truth: shape [8,256,256]
Reference reconstruction: shape [8,256,256], 27.99 dB

sample_02 · traffic

reference PSNR 20.88 dB
Measurement: shape [256,256]
Ground truth: shape [8,256,256]
Reference reconstruction: shape [8,256,256], 20.88 dB
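
The shapes in these samples (an [8,256,256] ground truth collapsing into a single [256,256] measurement) follow the standard CACTI snapshot model, in which each frame is multiplied by its own coded mask and the results are summed. The sketch below illustrates only that shape relationship. The random binary masks and the name `cacti_forward` are assumptions; the benchmark's actual masks and mismatch parameters are not given on this page and are not modeled here.

```python
import numpy as np


def cacti_forward(video, masks):
    """Collapse B coded frames into one snapshot: y = sum_b masks[b] * video[b]."""
    video = np.asarray(video, dtype=np.float64)
    masks = np.asarray(masks, dtype=np.float64)
    if video.shape != masks.shape:
        raise ValueError("video and masks must share the shape [B, H, W]")
    return np.sum(masks * video, axis=0)


rng = np.random.default_rng(0)
B, H, W = 8, 256, 256
video = rng.random((B, H, W))  # stand-in for a ground-truth cube
masks = (rng.random((B, H, W)) > 0.5).astype(np.float64)  # assumed random binary masks
measurement = cacti_forward(video, masks)
print(measurement.shape)  # (256, 256)
```

A solver has to invert this many-to-one mapping, recovering eight frames from one coded image.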

Get this benchmark

Everything you need to run the reference solver locally, browse the full benchmark specification, or submit a competing solution.

Download sample data

sample_01 · 4143245 B
sample_02 · 2792190 B
sample_03 · 2613734 B
sample_04 · 2337413 B
sample_05 · 3105035 B
sample_06 · 3228468 B

Browse on GitHub

Run the reference solver locally

git clone https://github.com/integritynoble/pwm-public.git
cd pwm-public
python3 scripts/regenerate_demos_inversenet.py --only cacti  # uses pwm_core.recon.cacti_solvers.gap_tv_cacti
cat /tmp/out/meta.json

Expected reference PSNR: 27.99 dB (seed ). Byte-stable across runs at the same git SHA.
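
One way to check the byte-stability claim is to hash the output of two runs and compare the digests. This is a sketch under stated assumptions: the helper names `sha256_of_file` and `outputs_match` are hypothetical, and it assumes you copied the first run's `/tmp/out/meta.json` to another path before running the script again.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def outputs_match(path_a, path_b):
    """True when both files are byte-for-byte identical (equal digests)."""
    return sha256_of_file(path_a) == sha256_of_file(path_b)
```

For example, `outputs_match("/tmp/run1_meta.json", "/tmp/out/meta.json")` should return `True` when both runs used the same git SHA; the first path is a placeholder for wherever you saved the earlier output.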

Leaderboard (top 10)

No solutions submitted yet. The reference baseline above is the floor — submit a better solver to claim rank 1.