Goto

Collaborating Authors

 boundary condition


HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions

Neural Information Processing Systems

We present HyPINO, a multi-physics neural operator designed for zero-shot generalization across a broad class of PDEs without requiring task-specific fine-tuning. Our approach combines a Swin Transformer-based hypernetwork with mixed supervision: (i) labeled data from analytical solutions generated via the Method of Manufactured Solutions (MMS), and (ii) unlabeled samples optimized using physics-informed objectives. The model maps PDE parameterizations to target Physics-Informed Neural Networks (PINNs) and can handle linear elliptic, hyperbolic, and parabolic equations in two dimensions with varying source terms, geometries, and mixed Dirichlet/Neumann boundary conditions, including interior boundaries. HyPINO achieves strong zero-shot accuracy on seven benchmark problems from PINN literature, outperforming U-Nets, Poseidon, and Physics-Informed Neural Operators (PINO). Further, we introduce an iterative refinement procedure that treats the residual of the generated PINN as "delta PDE" and performs another forward pass to generate a corrective PINN. Summing their contributions and repeating this process forms an ensemble whose combined solution progressively reduces the error on six benchmarks and achieves a >100 lower L2 loss in the best case, while retaining forward-only inference. Additionally, we evaluate the fine-tuning behavior of PINNs initialized by HyPINO and show that they converge faster and to lower final error than both randomly initialized and Reptilemeta-learned PINNs on five benchmarks, performing on par on the remaining two. Our results highlight the potential of this scalable approach as a foundation for extending neural operators toward solving increasingly complex, nonlinear, and high-dimensional PDE problems. The code and model weights are publicly available at https://github.com/rbischof/hypino.


Axial Neural Networks for Dimension-Free Foundation Models

Neural Information Processing Systems

The advent of foundation models in AI has significantly advanced general-purpose learning, enabling remarkable capabilities in zero-shot inference and in-context learning. However, training such models on physics data, including solutions to partial differential equations (PDEs), poses a unique challenge due to varying dimensionalities across different systems. Traditional approaches either fix a maximum dimension or employ separate encoders for different dimensionalities, resulting in inefficiencies. To address this, we propose a dimension-agnostic neural network architecture, the Axial Neural Network (XNN), inspired by parametersharing structures such as Deep Sets and Graph Neural Networks.


HARDMath2: ABenchmark for Applied Mathematics Built by Students as Part of a Graduate Class

Neural Information Processing Systems

Large language models (LLMs) have shown remarkable progress in mathematical problem-solving, but evaluation has largely focused on problems that have exact analytical solutions or involve formal proofs, often overlooking approximationbased problems ubiquitous in applied science and engineering. To fill this gap, we build on prior work and present HARDMath2, a dataset of 211 original problems covering the core topics in an introductory graduate applied math class, including boundary-layer analysis, WKB methods, asymptotic solutions of nonlinear partial differential equations, and the asymptotics of oscillatory integrals. This dataset was designed and verified by the students and instructors of a core graduate applied mathematics course at Harvard. We built the dataset through a novel collaborative environment that challenges students to write and refine difficult problems consistent with the class syllabus, peer-validate solutions, test different models, and automatically check LLM-generated solutions against their own answers and numerical ground truths. Evaluation results show that leading frontier models still struggle with many of the problems in the dataset, highlighting a gap in the mathematical reasoning skills of current LLMs. Importantly, students identified strategies to create increasingly difficult problems by interacting with the models and exploiting common failure modes. This back-and-forth with the models not only resulted in a richer and more challenging benchmark but also led to qualitative improvements in the students' understanding of the course material, which is increasingly important as we enter an age where state-of-the-art language models can solve many challenging problems across a wide domain of fields.


Fast Reconstruction of Exact Maxwell Dynamics from Sparse Data

arXiv.org Machine Learning

We introduce FLASH-MAX, a shallow, exact-by-construction neural network architecture for predicting homogeneous electromagnetic fields from sparse pointwise observations. Each hidden neuron represents a separate exact solution to Maxwell's equations, so that the network satisfies the governing equations symbolically by construction and can be trained end-to-end from sparse data within seconds. We prove a universal approximation result showing that this exact model class remains universal on arbitrary domains. FLASH-MAX reaches sub-1% relative validation error from about 1K sparse pointwise observations in seconds, all while maintaining a zero PDE residual, and keeps single-digit errors even for only 100 observations sampled from 3D space. These results suggest that moving governing structure from the loss into the hypothesis class can dramatically improve the trade-off between precision and optimization speed in scientific machine learning.


f8e55d98b0c2569bd0aa25b076e6b3f8-Supplemental-Conference.pdf

Neural Information Processing Systems

Motion Compensation We compare our method to the traditional motion-compensated coding378 approach that forms the core of inter-picture coding in well established compression standards such379 as MPEG. Block matching is an essential component of these standards, allowing the compression of380 video content by up to three orders of magnitude with moderate loss of information. For each block381 in a frame, typical coders search for the most similar spatially displaced block in the previous frame382 (typically measured with MSE), and communicate the displacement coordinates to allow prediction383 of frame content by translating blocks of the (already transmitted) previous frame. We implemented384 a "diamond search" algorithm [29] operating on blocks of 8 8 pixels, with a maximal search385 distance of 8 pixels which balances accuracy of motion estimates and speed of estimation (the search386 step is computationally intensive). We use the estimated displacements to perform causal motion387 compensation (cMC), using displacement vectors estimated from the previous two observed frames388 (xt 1 and xt) to predict the next frame (xt+1) rather than the current one (as in MPEG).389


NTopo: Mesh-free Topology Optimization using Implicit Neural Representations

Neural Information Processing Systems

Recent advances in implicit neural representations show great promise when it comes to generating numerical solutions to partial differential equations. Compared to conventional alternatives, such representations employ parameterized neural networks to define, in a mesh-free manner, signals that are highly-detailed, continuous, and fully differentiable. In this work, we present a novel machine learning approach for topology optimization--an important class of inverse problems with high-dimensional parameter spaces and highly nonlinear objective landscapes. To effectively leverage neural representations in the context of mesh-free topology optimization, we use multilayer perceptrons to parameterize both density and displacement fields. Our experiments indicate that our method is highly competitive for minimizing structural compliance objectives, and it enables self-supervised learning of continuous solution spaces for topology optimization problems.




Material

Neural Information Processing Systems

A.1 Data Configuration The inputs to a hydraulic simulation include an elevation map, initial conditions, and the boundary conditions. For a given elevation map, there is an infinite possible combinations of initial and boundary conditions that could potentially realize in future events. It is an interesting question how to automatically configure the most relevant initial and boundary conditions to train on, to get a representation that will be useful in potential future real-world scenarios. We suggest a basic configuration that adequate for the purpose of this paper. These include the water height h Rm m at each pixel and a staggered grid flux q R2 (m 1) (m 1) in each direction x,y.