Skip to main content
Journal cover image

Data generation with optimal experimental design for operator learning

Publication ,  Journal Article
Xu, X; Guilleminot, J; Tarokh, V
Published in: Computer Methods in Applied Mechanics and Engineering
March 1, 2026

Partial differential equations (PDEs) are fundamental to modeling complex physical phenomena across scientific disciplines. While operator learning offers a promising alternative to conventional PDE solvers, it generally requires substantial high-fidelity training data, resulting in significant computational costs. Existing approaches typically sample PDE parameters uniformly or heuristically, which can be inefficient in computational resources and may lead to suboptimal neural operator performance. Leveraging functional encoding, we propose a systematic framework that adapts (finite-dimensional) optimal experimental design (OED) principles for generating informative training datasets while minimizing computational costs. The OED framework employs physics-infused informativeness metrics—solution variance, energy dissipation, and high-frequency spectral content—to guide strategic parameter sampling using Bayesian inference with adaptive acquisition strategies. The framework achieves substantial improvements in data efficiency through computationally lightweight OED operations that incur negligible overhead compared to expensive PDE simulations. We empirically demonstrate that strategic parameter sampling guided by informativeness metrics significantly outperforms uniform random sampling strategies across multiple PDE benchmarks, including Burgers, Darcy, and Navier-Stokes equations. Comprehensive evaluations using both Fourier Neural Operators (FNO) and Deep Operator Networks (DeepONet), two leading neural operator architectures, confirm the generality of the approach. Using equivalent computational budgets, the method achieves substantially lower validation errors by concentrating simulations on parameter regions that provide maximum learning value. We provide a principled yet practical approach to training data generation that reduces the computational barrier to deploying neural operators for complex parametric PDEs. We release an open-source implementation to make this data-efficient operator learning framework accessible.

Duke Scholars

Published In

Computer Methods in Applied Mechanics and Engineering

DOI

ISSN

0045-7825

Publication Date

March 1, 2026

Volume

450

Related Subject Headings

  • Applied Mathematics
  • 49 Mathematical sciences
  • 40 Engineering
  • 09 Engineering
  • 01 Mathematical Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Xu, X., Guilleminot, J., & Tarokh, V. (2026). Data generation with optimal experimental design for operator learning (Accepted). Computer Methods in Applied Mechanics and Engineering, 450. https://doi.org/10.1016/j.cma.2025.118675
Xu, X., J. Guilleminot, and V. Tarokh. “Data generation with optimal experimental design for operator learning (Accepted).” Computer Methods in Applied Mechanics and Engineering 450 (March 1, 2026). https://doi.org/10.1016/j.cma.2025.118675.
Xu X, Guilleminot J, Tarokh V. Data generation with optimal experimental design for operator learning (Accepted). Computer Methods in Applied Mechanics and Engineering. 2026 Mar 1;450.
Xu, X., et al. “Data generation with optimal experimental design for operator learning (Accepted).” Computer Methods in Applied Mechanics and Engineering, vol. 450, Mar. 2026. Scopus, doi:10.1016/j.cma.2025.118675.
Xu X, Guilleminot J, Tarokh V. Data generation with optimal experimental design for operator learning (Accepted). Computer Methods in Applied Mechanics and Engineering. 2026 Mar 1;450.
Journal cover image

Published In

Computer Methods in Applied Mechanics and Engineering

DOI

ISSN

0045-7825

Publication Date

March 1, 2026

Volume

450

Related Subject Headings

  • Applied Mathematics
  • 49 Mathematical sciences
  • 40 Engineering
  • 09 Engineering
  • 01 Mathematical Sciences