Skip to main content
Back to Home
MatterSpace Lattice · Research Results

Blind Rediscovery of
Single-Atom Alloy Catalysts

600 candidates. 23 dopant elements. Zero target knowledge. Both Re₁@Ni and Ir₁@Ni catalysts blindly rediscovered to sub-angstrom accuracy. Every candidate valid by construction.

0.408 Å

Full RMSD

Level C PASS

97.5–99%

Structural validity

By construction

~$15

Cloud cost

Single A100, 4.7 hrs

Vareon Research

Vareon Inc. · Vareon Limited · March 2026

The Problem: Generate and Filter Is Broken

Materials discovery today follows a propose-then-filter paradigm. Generate millions of candidate structures randomly or through exhaustive enumeration. Evaluate them with expensive density functional theory (DFT) or machine-learned interatomic potentials. Discard the vast majority because they are physically invalid or chemically unreasonable.

The waste is staggering. Most compute cycles are spent evaluating structures that should never have been proposed. GNoME identified 2.2 million stable crystals through brute-force screening. MatterGen generates crystals with diffusion but filters for validity afterward. Open Catalyst built massive datasets for screening. CDVAE, DiffCSP, FlowMM — all rely on post-hoc filtering for physical validity.

None of these approaches can guarantee that a generated structure satisfies physical, chemical, and geometric constraints during generation. None have demonstrated blind rediscovery — starting from zero knowledge and independently finding known materials to sub-angstrom accuracy.

The Solution: Constraints Baked into Generation

MatterSpace embeds physical, chemical, and geometric constraints directly into the generation process. Goals and constraints are not applied after generation — they are enforced at every single step, resulting in near-guaranteed validity every time.

Valid by Construction

Every structural constraint — minimum interatomic distances, coordination bounds, surface height limits — is enforced during generation, not after. The engine does not produce invalid candidates and filter them out. It produces valid candidates from the start.

Adaptive Exploration

The engine autonomously navigates complex energy landscapes, balancing exploration of new configurations with refinement of promising candidates. No manual scheduling or hand-tuning required.

Modular Refinement

Post-generation refinement with high-accuracy interatomic potentials improves structural precision without modifying the core generative architecture. The refinement calculator is modular and upgradeable.

Domain-Agnostic Core

The generation engine is universal. Only the domain pack changes — the constraints, objectives, and physics specific to each scientific field. One engine. Every domain.

Why this is different

Current generative AI suffers from the generate-and-filter approach: produce candidates blindly, then discard the invalid ones. MatterSpace eliminates this waste entirely. The engine generates with goals and constraints baked in, resulting in near-guaranteed validity every time. This is not a marginal improvement — it is a fundamentally different paradigm for generative AI.

Results: Three Levels of Validation, All Passed

MatterSpace was tested through a blind rediscovery experiment: starting from a palette of 23 dopant elements with zero target information, the engine had to independently generate candidates that match known Re₁@Ni and Ir₁@Ni single-atom alloy catalysts for methane cracking. A three-level post-hoc validation protocol measured performance, structural motif similarity, and exact geometric accuracy.

A

Performance Threshold

PASS

Primary Metric

581 / 600 candidates below -1.3 eV

Key Finding

Best adsorption energy: -34.73 eV

The vast majority of generated candidates exhibit strongly favorable surface binding, confirming that MatterSpace generates catalytically relevant materials — not random structures.

B

Motif / Site Fingerprint Match

PASS

Primary Metric

75 matches, best similarity 0.814

Key Finding

Both Re and Ir independently identified from 23 elements

Without any target information, the engine correctly identified both target dopant elements from a 23-element palette. The probability of randomly selecting both correct elements is 0.19%.

C

Exact Structural Accuracy

PASS

Primary Metric

0.408 Å full RMSD (metal-only: 0.363 Å)

Key Finding

Both targets independently below 0.5 Å threshold

Ir₁@Ni at 0.408 Å and Re₁@Ni at 0.466 Å — both independently rediscovered to sub-angstrom precision. This is the first demonstration of blind generative material rediscovery achieving all three validation levels for surface catalysts.

Constraint Enforcement: Near-Perfect Satisfaction

Across all generation steps, constraints were satisfied at near-perfect rates with negligible computational overhead. The constraint enforcement mechanism adds less than 4% to total generation time while guaranteeing structural validity at every step.

How It Compares

MatterSpace is the only system achieving all three validation levels. Existing generative models demonstrate Level A capability (favorable properties) but have not demonstrated Level B (motif matching) or Level C (sub-angstrom structural reproduction) — because they are not designed for blind rediscovery.

SystemABCConstraints
GNoME PASSPost-hoc
MatterGen PASSPost-hoc
Open Catalyst PASSPost-hoc
CDVAE PASSPost-hoc
DiffCSP PASSPost-hoc
FlowMM PASSPost-hoc
MatterSpace PASS PASS PASSBy construction

Why This Transfers to Every Domain

This result was produced by MatterSpace Lattice — the materials discovery engine. But the core architecture that made it possible is domain-agnostic. None of the underlying engine is specific to materials. Only the domain pack changes.

What stays the same across every domain

Constraints enforced during generation — valid by construction

Adaptive landscape navigation across complex energy surfaces

Diverse archives of Pareto-optimal candidates, not a single answer

Modular architecture — plug in any model as a refinement calculator

Multi-objective optimization toward user-defined goals

What changes: domain packs

Scientists direct their agents with what they are after. Their agents pick parameters from a parameter pool curated for each pack, set constraints and goals, and MatterSpace begins generating. The models are based on strong open-source foundations — and customers can bring their own models and data too.

But that is rarely the bottleneck. Most open-source models are already strong. What's lacking is strong engineering to steer generation toward the desired solution space with small compute budgets and achieve near 100% validity by construction.

LatticeReady

Batteries, catalysts, superconductors, magnets, photovoltaics, thermoelectrics, HEAs, electrolytes, coatings

PharmaEarly Testing

ADMET constraints, binding affinity objectives, molecular stability. The same constraint enforcement produces valid drug candidates.

AlgoEarly Testing

Complexity bounds, correctness constraints, optimality objectives. Valid-by-construction algorithms.

TesseraEarly Testing

Design rule constraints, power-performance-area objectives. Physical layout validity by construction.

Expected impact

The 97.5–99% structural validity demonstrated on materials is expected to bring a new paradigm to generative AI across drug discovery, longevity research, advanced materials, and algorithm design. MatterSpace eliminates the generate-and-filter waste that plagues current generative approaches. When constraints are baked into generation, nearly every GPU cycle produces a viable candidate.

Read the Full Paper

The complete manuscript with full results, validation protocol, and comparison tables.

Read the Paper