Skip to main content
Research

AI agents replicate
quantum papers on
real hardware.

Can AI systematically reproduce quantum computing experiments? We tested 27 claims from 6 landmark papers across 3 quantum processors. 93% pass. The gaps between published results and AI-reproduced results are the finding.

6 papers replicated100+ experiments3 hardware platforms

100+ Experiments

15+Bell states

Entanglement benchmarking across qubit pairs

12+GHZ states

3-50 qubit multipartite entanglement

40+VQE chemistry

H2 and HeH+ energy estimation with mitigation

8+QAOA MaxCut

Combinatorial optimization on hardware

20+Benchmarks

RB, QV, connectivity probes, characterization

10+QEC

[[4,2,2]] detection code, NN decoders

Three Chips, One Suite

QI Tuna-9
Qubits9
QV8
Bell best93.5%
VQE best0.92 kcal/mol

Best small-scale fidelity

IQM Garnet
Qubits20
QV32
Bell best98.1%
VQE bestn/a kcal/mol

Highest Bell fidelity

IBM Torino
Qubits133
QV32
Bell best86.5%
VQE best0.22 kcal/mol

Best VQE with TREX

Data & Reproducibility

All raw data, circuits, and analysis scripts are open on GitHub. Every result file uses schema-versioned JSON with SHA256 checksums for raw counts and circuits.

Read the paper (PDF) outline on GitHub

Environment
Python3.12Qiskit2.1.2PennyLane0.44QI SDK3.5.1
Hardware
IBM Torino133 qubitsQI Tuna-99 qubitsIQM Garnet20 qubits