publications | Alexander Heinlein

Journal Articles

2026

CMAME
PACMANN: Point adaptive collocation method for artificial neural networks

Coen Visser, Alexander Heinlein, and Bianca Giovanardi

Computer Methods in Applied Mechanics and Engineering, 2026

Abs Bib doi URL Preprint

Physics-Informed Neural Networks (PINNs) have emerged as a tool for approximating the solution of Partial Differential Equations (PDEs) in both forward and inverse problems. PINNs minimize a loss function which includes the PDE residual determined for a set of collocation points. Previous work has shown that the number and distribution of these collocation points have a significant influence on the accuracy of the PINN solution. Therefore, the effective placement of these collocation points is an active area of research. Specifically, available adaptive collocation point sampling methods have been reported to scale poorly in terms of computational cost when applied to high-dimensional problems. In this work, we address this issue and present the Point Adaptive Collocation Method for Artificial Neural Networks (PACMANN). PACMANN incrementally moves collocation points toward regions of higher residuals using gradient-based optimization algorithms guided by the gradient of the PINN loss function, that is, the squared PDE residual. We apply PACMANN to several forward and inverse problems, including one with a low-regularity solution and 3D Navier Stokes, and demonstrate that this method matches the performance of state-of-the-art methods in terms of the accuracy/efficiency tradeoff for the low-dimensional problems, while outperforming available approaches for high-dimensional problems. Key features of the method include its low computational cost and simplicity of integration into existing physics-informed neural network pipelines. The code is available at https://github.com/CoenVisser/PACMANN.
@article{Visser:2026:PAC, author = {Visser, Coen and Heinlein, Alexander and Giovanardi, Bianca}, doi = {https://doi.org/10.1016/j.cma.2025.118723}, journal = {Computer Methods in Applied Mechanics and Engineering}, pages = {118723}, title = {PACMANN: Point adaptive collocation method for artificial neural networks}, url = {https://www.sciencedirect.com/science/article/pii/S0045782525009958}, volume = {452}, year = {2026} }
Sci Rep
Mechanistic-driven graph neural network surrogates for pandemic response

Agatha Schmidt, Henrik Zunker, Alexander Heinlein, and 1 more author

Scientific Reports, Feb 2026

Abs Bib doi URL Preprint

During the COVID-19 crisis, mechanistic models have been proven fundamental to guide evidence-based decision making. However, time-critical decisions in a dynamically changing environment restrict the time available for modelers to gather supporting evidence. As infectious disease dynamics are often heterogeneous on a spatial or demographic scale, models should be resolved accordingly. In addition, with a large number of potential interventions, all scenarios can barely be computed on time, even when using supercomputing facilities. We suggest to combine complex mechanistic models with data-driven surrogate models to allow for on-the-fly model adaptations by public health experts. We build upon a spatially and demographically resolved infectious disease model and train a graph neural network for data sets representing early phases of the pandemic. The resulting networks reached an execution time of less than a second, a significant speedup compared to the metapopulation approach. The suggested approach yields potential for on-the-fly execution and, thus, integration of disease dynamics models in low-barrier website applications. For the approach to be used with decision-making, datasets with larger variance will have to be considered.
@article{Schmidt:2026:TGNN, author = {Schmidt, Agatha and Zunker, Henrik and Heinlein, Alexander and K\"uhn, Martin J.}, doi = {10.1038/s41598-026-39431-5}, journal = {Scientific Reports}, month = feb, pages = {6361}, title = {Mechanistic-driven graph neural network surrogates for pandemic response}, url = {https://www.nature.com/articles/s41598-026-39431-5}, volume = {16}, year = {2026} }

2025

CAMWA
High-order discretized ACMS method for the simulation of finite-size two-dimensional photonic crystals

Elena Giammatteo, Alexander Heinlein, Philip L. Lederer, and 1 more author

Computers & Mathematics with Applications, Feb 2025

Abs Bib doi URL Preprint

The computational complexity and efficiency of the approximate mode component synthesis (ACMS) method is investigated for the two-dimensional heterogeneous Helmholtz equations, aiming at the simulation of large but finite-size photonic crystals. The ACMS method is a Galerkin method that relies on a non-overlapping domain decomposition and special basis functions defined based on the domain decomposition. While, in previous works, the ACMS method was realized using first-order finite elements, we use an underlying hp–finite element method. We study the accuracy of the ACMS method for different wavenumbers, domain decompositions, and discretization parameters. Moreover, the computational complexity of the method is investigated theoretically and compared with computing times for an implementation based on the open source software package NGSolve. The numerical results indicate that, for relevant wavenumber regimes, the size of the resulting linear systems for the ACMS method remains moderate, such that sparse direct solvers are a reasonable choice. Moreover, the ACMS method exhibits only a weak dependence on the selected domain decomposition, allowing for greater flexibility in its choice. Additionally, the numerical results show that the error of the ACMS method achieves the predicted convergence rate for increasing wavenumbers. Finally, to display the versatility of the implementation, the results of simulations of large but finite-size photonic crystals with defects are presented.
@article{Giammatteo:2025:HDA, author = {Giammatteo, Elena and Heinlein, Alexander and Lederer, Philip L. and Schlottbom, Matthias}, doi = {https://doi.org/10.1016/j.camwa.2025.08.013}, journal = {Computers \& Mathematics with Applications}, pages = {376-394}, title = {High-order discretized ACMS method for the simulation of finite-size two-dimensional photonic crystals}, url = {https://www.sciencedirect.com/science/article/pii/S0898122125003396}, volume = {196}, year = {2025} }
ACM TOMS
Coupling deal.II and FROSch: A Sustainable and Accessible (O)RAS Preconditioner

Alexander Heinlein, Sebastian Kinnewig, and Thomas Wick

ACM Trans. Math. Softw., Sep 2025

Abs Bib doi URL Preprint

In this work, restricted additive Schwarz (RAS) and optimized restricted additive Schwarz (ORAS) preconditioners from the Trilinos package FROSch (Fast and Robust Overlapping Schwarz) are employed to solve model problems implemented using deal.II (differential equations analysis library). Therefore, a Tpetra-based interface for coupling deal.II and FROSch is implemented. While RAS preconditioners have been available before, ORAS preconditioners have been newly added to FROSch. The deal.II–FROSch interface works for both Lagrange-based and Nédélec finite elements. Here, as model problems, nonstationary, nonlinear, variational-monolithic fluid-structure interaction and the indefinite time-harmonic Maxwell’s equations are considered. Several numerical experiments in two and three spatial dimensions confirm the performance of the preconditioners as well as the FROSch-deal.II interface. In conclusion, the overall software interface is straightforward and easy to use while giving satisfactory solver performances for challenging PDE systems.
@article{Heinlein:2025:CDF, author = {Heinlein, Alexander and Kinnewig, Sebastian and Wick, Thomas}, doi = {10.1145/3766906}, journal = {ACM Trans. Math. Softw.}, month = sep, title = {Coupling deal.{II} and {FROSch}: {A} {Sustainable} and {Accessible} ({O}){RAS} {Preconditioner}}, url = {https://dl.acm.org/doi/10.1145/3766906}, year = {2025} }
CAMWA
Improving Pseudo-Time Stepping Convergence for CFD Simulations With Neural Networks

Anouk Zandbergen, Tycho van Noorden, and Alexander Heinlein

Computers & Mathematics with Applications, Sep 2025

Abs Bib doi URL Preprint

Computational fluid dynamics (CFD) simulations of viscous fluids described by the stationary Navier–Stokes equations are considered. Depending on the Reynolds number of the flow, the Navier–Stokes equations may exhibit a highly nonlinear behavior. The system of nonlinear equations resulting from the discretization of the Navier–Stokes equations can be solved using nonlinear iteration methods, such as Newton’s method. However, fast quadratic convergence is typically only obtained in a local neighborhood of the solution, and for many configurations, the classical Newton iteration does not converge at all. In such cases, so-called globalization techniques may help to improve convergence. In this paper, pseudo-time stepping (also known as pseudo-transient continuation) is employed in order to improve nonlinear convergence. The classical algorithm is enhanced by a neural network model that is trained to predict a local pseudo-time step. Generalization of the novel approach is facilitated by predicting the local pseudo-time step separately on each element using only local information on a patch of adjacent elements as input. Numerical results for standard benchmark problems, including flow over a backward facing step geometry and Couette flow, show the performance of the machine learning-enhanced globalization approach; as the software for the simulations, the CFD Module of COMSOL Multiphysics® is employed.
@article{Zandbergen:2025:IPT, author = {Zandbergen, Anouk and {van Noorden}, Tycho and Heinlein, Alexander}, doi = {10.1016/j.camwa.2025.07.006}, journal = {Computers \& Mathematics with Applications}, pages = {64-83}, title = {Improving Pseudo-Time Stepping Convergence for CFD Simulations With Neural Networks}, url = {https://www.sciencedirect.com/science/article/pii/S0898122125002871}, volume = {196}, year = {2025} }
JCOMP
Learning the solution operator of two-dimensional incompressible Navier-Stokes equations using physics-aware convolutional neural networks

Viktor Grimm, Alexander Heinlein, and Axel Klawonn

Journal of Computational Physics, Sep 2025

Abs Bib doi URL Preprint

In recent years, the concept of introducing physics to machine learning has become widely popular. Most physics-inclusive ML-techniques however are still limited to a single geometry or a set of parametrizable geometries. Thus, there remains the need to train a new model for a new geometry, even if it is only slightly modified. With this work we introduce a technique with which it is possible to learn approximate solutions to the steady-state Navier–Stokes equations in varying geometries without the need of parametrization. This technique is based on a combination of a U-Net-like CNN and well established discretization methods from the field of the finite difference method. The results of our physics-aware CNN are compared to a state-of-the-art data-based approach. Additionally, it is also shown how our approach performs when combined with the data-based approach.
@article{Grimm:2025:LSO, author = {Grimm, Viktor and Heinlein, Alexander and Klawonn, Axel}, doi = {https://doi.org/10.1016/j.jcp.2025.114027}, issn = {0021-9991}, journal = {Journal of Computational Physics}, pages = {114027}, title = {Learning the solution operator of two-dimensional incompressible Navier-Stokes equations using physics-aware convolutional neural networks}, url = {https://www.sciencedirect.com/science/article/pii/S0021999125003109}, volume = {535}, year = {2025} }
JCLB
Deep operator network models for predicting post-burn contraction

Selma Husanović, Ginger Egberts, Alexander Heinlein, and 1 more author

Clinical Biomechanics, Sep 2025

Abs Bib doi URL Preprint

Background Burn injuries present a significant global health challenge. Among the most severe long-term consequences are contractures, which can lead to functional impairments and disfigurement. Understanding and predicting the evolution of post-burn wounds is essential for developing effective treatment strategies. Traditional mathematical models, while accurate, are often computationally expensive and time-consuming, limiting their practical application. Recent advancements in machine learning, particularly in deep learning, offer promising alternatives for accelerating these predictions. Methods This study explores the use of a deep operator network, a type of neural operator, as a surrogate model for finite element simulations aimed at predicting post-burn contraction across multiple wound shapes. A deep operator network was trained on three distinct initial wound shapes, with enhancements made to the architecture by incorporating initial wound shape information and applying sine augmentation to enforce boundary conditions. Findings The performance of the trained deep operator network was evaluated on a test set including finite element simulations based on convex combinations of the three basic wound shapes. The model achieved an R2 score of 0.99, indicating strong predictive accuracy and generalization. Moreover, the model provided reliable predictions over an extended period of up to one year, with speedups of up to 128-fold on the Central Processing Unit and 235-fold on the Graphical Processing Unit, compared to the numerical model. Interpretation These findings suggest that deep operator networks can effectively serve as a surrogate for traditional finite element methods in simulating post-burn wound evolution, with potential applications in medical treatment planning.
@article{Husanovic:2025:DON, author = {Husanović, Selma and Egberts, Ginger and Heinlein, Alexander and Vermolen, Fred}, doi = {10.1016/j.clinbiomech.2025.106558}, issn = {0268-0033}, journal = {Clinical Biomechanics}, pages = {106558}, title = {Deep operator network models for predicting post-burn contraction}, url = {https://www.sciencedirect.com/science/article/pii/S0268003325001317}, year = {2025} }
CMAME
Overlapping Schwarz preconditioners for randomized neural networks with domain decomposition

Yong Shang, Alexander Heinlein, Siddhartha Mishra, and 1 more author

Computer Methods in Applied Mechanics and Engineering, Sep 2025

Abs Bib doi URL Preprint

Randomized neural networks (RaNNs), characterized by fixed hidden layers after random initialization, offer a computationally efficient alternative to fully parameterized neural networks trained using stochastic gradient descent-type algorithms. In this paper, we integrate RaNNs with overlapping Schwarz domain decomposition in two primary ways: firstly, to formulate the least-squares problem with localized basis functions, and secondly, to construct effective overlapping Schwarz preconditioners for solving the resulting linear systems. Specifically, neural networks are randomly initialized in each subdomain following a uniform distribution, and these localized solutions are combined through a partition of unity, providing a global approximation to the solution of the partial differential equation. Boundary conditions are imposed via a constraining operator, eliminating the necessity for penalty methods. Furthermore, we apply principal component analysis (PCA) within each subdomain to reduce the number of basis functions, thereby significantly improving the conditioning of the resulting linear system. By constructing additive Schwarz (AS) and restricted AS preconditioners, we efficiently solve the least-squares problems using iterative solvers such as the Conjugate Gradient (CG) and generalized minimal residual methods. Numerical experiments clearly demonstrate that the proposed methodology substantially reduces computational time, particularly for multi-scale and time-dependent PDE problems. Additionally, we present a three-dimensional numerical example illustrating the superior efficiency of employing the CG method combined with an AS preconditioner over direct methods like QR decomposition for solving the associated least-squares system.
@article{Shang:2025:DDM, author = {Shang, Yong and Heinlein, Alexander and Mishra, Siddhartha and Wang, Fei}, doi = {10.1016/j.cma.2025.118011}, journal = {Computer Methods in Applied Mechanics and Engineering}, pages = {118011}, title = {Overlapping {Schwarz} preconditioners for randomized neural networks with domain decomposition}, url = {https://www.sciencedirect.com/science/article/pii/S004578252500283X}, volume = {442}, year = {2025} }
IEEE Access
DDU-Net: A Domain Decomposition-Based CNN for High-Resolution Image Segmentation on Multiple GPUs

Corné Verburg, Alexander Heinlein, and Eric C. Cyr

IEEE Access, Sep 2025

Abs Bib doi URL Preprint

The segmentation of ultra-high resolution images poses challenges such as loss of spatial information or computational inefficiency. In this work, a novel approach that combines encoder-decoder architectures with domain decomposition strategies to address these challenges is proposed. Specifically, a domain decomposition-based U-Net (DDU-Net) architecture is introduced, which partitions input images into non-overlapping patches that can be processed independently on separate devices. A communication network is added to facilitate inter-patch information exchange to enhance the understanding of spatial context. Experimental validation is performed on a synthetic dataset that is designed to measure the effectiveness of the communication network. Then, the performance is tested on the DeepGlobe land cover classification dataset as a real-world benchmark data set. The results demonstrate that the approach, which includes inter-patch communication for images divided into 16\times16 non-overlapping subimages, achieves a 2-3\,% higher intersection over union (IoU) score compared to the same network without inter-patch communication. The performance of the network which includes communication is equivalent to that of a baseline U-Net trained on the full image, showing that our model provides an effective solution for segmenting ultra-high-resolution images while preserving spatial context. The code is available at https://github.com/corne00/DDU-Net.
@article{Verburg:2025:DDU, author = {Verburg, Corné and Heinlein, Alexander and Cyr, Eric C.}, doi = {10.1109/ACCESS.2025.3561033}, journal = {IEEE Access}, pages = {66967-66983}, title = {{DDU}-{Net}: {A} {Domain} {Decomposition}-{Based} {CNN} for {High}-{Resolution} {Image} {Segmentation} on {Multiple} {GPUs}}, url = {https://ieeexplore.ieee.org/document/10965628}, volume = {13}, year = {2025} }
SISC
Algebraic Construction of Adaptive Coarse Spaces for Two-Level Schwarz Preconditioners

Alexander Heinlein, and Kathrin Smetana

SIAM Journal on Scientific Computing, Sep 2025

Abs Bib doi URL Preprint

Two-level domain decomposition preconditioners lead to fast convergence and scalability of iterative solvers. However, for highly heterogeneous problems with a rapidly varying coefficient function, the condition number of the preconditioned system generally depends on the contrast of the coefficient function. As a result, the convergence may deteriorate. Enhancing the coarse space by functions constructed from suitable local eigenvalue problems restores robust, contrast-independent convergence; these coarse spaces are often denoted as adaptive or spectral coarse spaces. However, these eigenvalue problems typically rely on nonalgebraic information such that the adaptive coarse spaces cannot be constructed from the fully assembled system matrix. In this paper, a novel algebraic adaptive coarse space which relies on the $a$-orthogonal decomposition of (local) finite element (FE) spaces into functions that solve the elliptic PDE with some trace and FE functions that are zero on the boundary is proposed. In particular, the basis is constructed from eigenmodes of two types of local eigenvalue problems associated with the edges of the domain decomposition. To approximate functions that solve the PDE locally, we employ a transfer eigenvalue problem which has originally been proposed for the construction of optimal local approximation spaces for multiscale methods. In addition, we make use of a Dirichlet eigenvalue problem that is a slight modification of the Neumann eigenvalue problem used in the adaptive generalized Dryja–Smith–Widlund (AGDSW) coarse space. Both eigenvalue problems rely solely on local Dirichlet matrices, which can be extracted from the fully assembled system matrix, allowing for an algebraic construction. By combining arguments from multiscale and domain decomposition methods, we derive a contrast-independent upper bound for the condition number. While we restrict ourselves here to a two-dimensional diffusion problem discretized by low-order FEs on regular meshes, the proposed framework is general, and we conjecture that the approach can be readily extended, for instance, to other elliptic problems, three dimensions, or, under mild assumptions, higher-order discretizations. The robustness of the method is confirmed numerically for a variety of heterogeneous coefficient distributions, including binary random distributions and a coefficient function constructed from the SPE10 benchmark. The results are comparable to those of the nonalgebraic AGDSW coarse space as well as for those cases where the convergence of the classical algebraic generalized Dryja–Smith–Widlund coarse space deteriorates. Moreover, the coarse space dimension is the same as or comparable to the AGDSW coarse space for all numerical experiments.
@article{Heinlein:2025:ACA, author = {Heinlein, Alexander and Smetana, Kathrin}, doi = {10.1137/23M1618193}, journal = {SIAM Journal on Scientific Computing}, number = {2}, pages = {A1170-A1197}, title = {Algebraic Construction of Adaptive Coarse Spaces for Two-Level Schwarz Preconditioners}, url = {https://doi.org/10.1137/23M1618193}, volume = {47}, year = {2025} }
MATHMOD2025
Towards Model Discovery Using Domain Decomposition and PINNs

Tirtho S. Saha, Alexander Heinlein, and Cordula Reisch

IFAC-PapersOnLine, Sep 2025

11th Vienna International Conference on Mathematical Modelling MATHMOD 2025

Abs Bib doi URL Preprint

We enhance machine learning algorithms for learning model parameters in complex systems represented by differential equations with domain decomposition methods. The study evaluates the performance of two approaches, namely (vanilla) Physics-Informed Neural Networks (PINNs) and Finite Basis Physics-Informed Neural Networks (FBPINNs), in learning the dynamics of test models with a quasi-stationary longtime behavior. We test the approaches for data sets in different dynamical regions and with varying noise level. As results, the FBPINN approach better captures the overall dynamical behavior compared to the vanilla PINN approach, even in cases with data only from a time domain with quasi-stationary dynamics.
@article{Saha:2025:TMD, author = {Saha, Tirtho S. and Heinlein, Alexander and Reisch, Cordula}, doi = {10.1016/j.ifacol.2025.03.008}, journal = {IFAC-PapersOnLine}, note = {11th Vienna International Conference on Mathematical Modelling MATHMOD 2025}, number = {1}, pages = {37-42}, title = {Towards Model Discovery Using Domain Decomposition and PINNs}, url = {https://www.sciencedirect.com/science/article/pii/S2405896325002253}, volume = {59}, year = {2025} }

2024

IMAJNA
An extension of the approximate component mode synthesis method to the heterogeneous Helmholtz equation

Elena Giammatteo, Alexander Heinlein, and Matthias Schlottbom

IMA Journal of Numerical Analysis, Sep 2024

Abs Bib doi URL Preprint

In this work, we propose and analyze an extension of the approximate component mode synthesis (ACMS) method to the two-dimensional heterogeneous Helmholtz equation. The ACMS method has originally been introduced by Hetmaniuk and Lehoucq as a multiscale method to solve elliptic partial differential equations. The ACMS method uses a domain decomposition to separate the numerical approximation by splitting the variational problem into two independent parts: local Helmholtz problems and a global interface problem. While the former are naturally local and decoupled such that they can be easily solved in parallel, the latter requires the construction of suitable local basis functions relying on local eigenmodes and suitable extensions. We carry out a full error analysis of this approach focusing on the case where the domain decomposition is kept fixed, but the number of eigenfunctions is increased. The theoretical results in this work are supported by numerical experiments verifying algebraic convergence for the method. In certain, practically relevant cases, even super-algebraic convergence for the local Helmholtz problems can be achieved without oversampling.
@article{Giammatteo:2024:EAC, author = {Giammatteo, Elena and Heinlein, Alexander and Schlottbom, Matthias}, doi = {10.1093/imanum/drae076}, journal = {IMA Journal of Numerical Analysis}, pages = {drae076}, title = {An extension of the approximate component mode synthesis method to the heterogeneous Helmholtz equation}, url = {https://doi.org/10.1093/imanum/drae076}, year = {2024} }
CMAME
Multilevel domain decomposition-based architectures for physics-informed neural networks

Victorita Dolean, Alexander Heinlein, Siddhartha Mishra, and 1 more author

Computer Methods in Applied Mechanics and Engineering, Sep 2024

Abs Bib doi URL Preprint

Physics-informed neural networks (PINNs) are a powerful approach for solving problems involving differential equations, yet they often struggle to solve problems with high frequency and/or multi-scale solutions. Finite basis physics-informed neural networks (FBPINNs) improve the performance of PINNs in this regime by combining them with an overlapping domain decomposition approach. In this work, FBPINNs are extended by adding multiple levels of domain decompositions to their solution ansatz, inspired by classical multilevel Schwarz domain decomposition methods (DDMs). Analogous to typical tests for classical DDMs, we assess how the accuracy of PINNs, FBPINNs and multilevel FBPINNs scale with respect to computational effort and solution complexity by carrying out strong and weak scaling tests. Our numerical results show that the proposed multilevel FBPINNs consistently and significantly outperform PINNs across a range of problems with high frequency and multi-scale solutions. Furthermore, as expected in classical DDMs, we show that multilevel FBPINNs improve the accuracy of FBPINNs when using large numbers of subdomains by aiding global communication between subdomains.
@article{Dolean:2024:MDD, author = {Dolean, Victorita and Heinlein, Alexander and Mishra, Siddhartha and Moseley, Ben}, doi = {10.1016/j.cma.2024.117116}, journal = {Computer Methods in Applied Mechanics and Engineering}, number = {1}, pages = {117116}, title = {Multilevel domain decomposition-based architectures for physics-informed neural networks}, url = {https://www.sciencedirect.com/science/article/pii/S0045782524003724?via%3Dihub}, volume = {429}, year = {2024} }
GAMM-Mitteilungen
A computational framework for pharmaco-mechanical interactions in arterial walls using parallel monolithic domain decomposition methods

Daniel Balzani, Alexander Heinlein, Axel Klawonn, and 5 more authors

GAMM-Mitteilungen, Sep 2024

Abs Bib doi URL Preprint

A computational framework is presented to numerically simulate the effects of antihypertensive drugs, in particular calcium channel blockers, on the mechanical response of arterial walls. A stretch-dependent smooth muscle model by Uhlmann and Balzani is modified to describe the interaction of pharmacological drugs and the inhibition of smooth muscle activation. The coupled deformation-diffusion problem is then solved using the finite element software FEDDLib and overlapping Schwarz preconditioners from the Trilinos package FROSch. These preconditioners include highly scalable parallel GDSW (generalized Dryja–Smith–Widlund) and RGDSW (reduced GDSW) preconditioners. Simulation results show the expected increase in the lumen diameter of an idealized artery due to the drug-induced reduction of smooth muscle contraction, as well as a decrease in the rate of arterial contraction in the presence of calcium channel blockers. Strong and weak parallel scalability of the resulting computational implementation are also analyzed.
@article{Balzani:2024:CFP, author = {Balzani, Daniel and Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Ramesh, Sharan Nurani and Rheinbach, Oliver and Sa{\ss}mannshausen, Lea and Uhlmann, Klemens}, doi = {10.1002/gamm.202370002}, journal = {GAMM-Mitteilungen}, number = {1}, pages = {e202370002}, title = {A computational framework for pharmaco-mechanical interactions in arterial walls using parallel monolithic domain decomposition methods}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/gamm.202370002}, volume = {47}, year = {2024} }

2023

Ocean Engineering
Machine learning for phase-resolved reconstruction of nonlinear ocean wave surface elevations from sparse remote sensing data

Svenja Ehlers, Marco Klein, Alexander Heinlein, and 4 more authors

Ocean Engineering, Sep 2023

Abs Bib doi URL Preprint

Accurate short-term prediction of phase-resolved water wave conditions is crucial for decision-making in ocean engineering. However, the initialization of remote-sensing-based wave prediction models first requires a reconstruction of wave surfaces from sparse measurements like radar. Existing reconstruction methods either rely on computationally intensive optimization procedures or simplistic modeling assumptions that compromise real-time capability or accuracy of the entire prediction process. We therefore address these issues by proposing a novel approach for phase-resolved wave surface reconstruction using neural networks based on the U-Net and Fourier neural operator (FNO) architectures. Our approach utilizes synthetic yet highly realistic training data on uniform one-dimensional grids, that is generated by the high-order spectral method for wave simulation and a geometric radar modeling approach. The investigation reveals that both models deliver accurate wave reconstruction results and show good generalization for different sea states when trained with spatio-temporal radar data containing multiple historic radar snapshots in each input. Notably, the FNO-based network performs better in handling the data structure imposed by wave physics due to its global approach to learn the mapping between input and desired output in Fourier space.
@article{Ehlers:2023:MLP, author = {Ehlers, Svenja and Klein, Marco and Heinlein, Alexander and Wedler, Mathies and Desmars, Nicolas and Hoffmann, Norbert and Stender, Merten}, doi = {10.1016/j.oceaneng.2023.116059}, issn = {0029-8018}, journal = {Ocean Engineering}, number = {}, pages = {116059}, title = {Machine learning for phase-resolved reconstruction of nonlinear ocean wave surface elevations from sparse remote sensing data}, url = {https://www.sciencedirect.com/science/article/pii/S0029801823024435}, volume = {288}, year = {2023} }
JCOMP
Towards parallel time-stepping for the numerical simulation of atherosclerotic plaque growth

Stefan Frei, and Alexander Heinlein

Journal of Computational Physics, Sep 2023

Abs Bib doi URL

The numerical simulation of atherosclerotic plaque growth is computationally prohibitive, since it involves a complex cardiovascular fluid-structure interaction (FSI) problem with a characteristic time scale of milliseconds to seconds, as well as a plaque growth process governed by reaction-diffusion equations, which takes place over several months. In this work we combine a temporal homogenization approach, which separates the problem in computationally expensive FSI problems on a micro scale and a reaction-diffusion problem on the macro scale, with parallel time-stepping algorithms. It has been found in the literature that parallel time-stepping algorithms do not perform well when applied directly to the FSI problem. To circumvent this problem, a parareal algorithm is applied on the macro-scale reaction-diffusion problem instead of the micro-scale FSI problem. We investigate modifications in the coarse propagator of the parareal algorithm, in order to further reduce the number of costly micro problems to be solved. The approaches are tested in detailed numerical investigations based on serial simulations.
@article{Frei:2023:TPT, author = {Frei, Stefan and Heinlein, Alexander}, doi = {10.1016/j.jcp.2023.112347}, issn = {0021-9991}, journal = {Journal of Computational Physics}, number = {}, pages = {112347}, title = {Towards parallel time-stepping for the numerical simulation of atherosclerotic plaque growth}, url = {https://www.sciencedirect.com/science/article/pii/S0021999123004424}, volume = {}, year = {2023} }
CMAM
A Multi-Level Extension of the GDSW Overlapping Schwarz Preconditioner in Two Dimensions

Alexander Heinlein, Oliver Rheinbach, and Friederike Röver

Computational Methods in Applied Mathematics, Sep 2023

Abs Bib doi URL

Multilevel extensions of overlapping Schwarz domain decomposition preconditioners of Generalized Dryja–Smith–Widlund (GDSW) type are considered in this paper. The original GDSW preconditioner is a two-level overlapping Schwarz domain decomposition preconditioner, which can be constructed algebraically from the fully assembled stiffness matrix. The FROSch software, which belongs to the ShyLU package of the Trilinos software library, provides parallel implementations of different variants of GDSW preconditioners. The coarse problem can limit the parallel scalability of two-level GDSW preconditioners. As a remedy, in the past, three-level GDSW approaches have been proposed, which can significantly extend the range of scalability. Here, a multilevel extension of the GDSW preconditioner is introduced and analyzed. Finally, parallel results for the implementation in FROSch for up to 40 000 cores of the SuperMUC-NG supercomputer at Leibniz Supercomputing Centre (LRZ) and to 48 000 cores of the JUWELS supercomputer at Jülich Supercomputing Centre (JSC) are presented.
@article{Heinlein:2023:MLE, author = {Heinlein, Alexander and Rheinbach, Oliver and R\"over, Friederike}, doi = {10.1515/cmam-2022-0168}, issn = {}, journal = {Computational Methods in Applied Mathematics}, number = {}, pages = {}, title = {A Multi-Level Extension of the {GDSW} Overlapping {Schwarz} Preconditioner in Two Dimensions}, url = {https://doi.org/10.1515/cmam-2022-0168}, volume = {}, year = {2023} }
CM
Comparison of Arterial Wall Models in Fluid-Structure Interaction Simulations

Daniel Balzani, Alexander Heinlein, Axel Klawonn, and 2 more authors

Computational Mechanics, Sep 2023

Abs Bib doi URL Preprint

Monolithic fluid-structure interaction (FSI) of blood ﬂow with arterial walls is considered, making use of sophisticated nonlinear wall models. These incorporate the eﬀects of almost incompressibility as well as of the anisotropy caused by embedded collagen ﬁbers. In the literature, relatively simple structural models such as Neo-Hooke are often considered for FSI with arterial walls. Such models lack, both, anisotropy and incompressibility. In this paper, numerical simulations of idealized heart beats in a curved benchmark geometry, using simple and sophisticated arterial wall models, are compared: we consider three diﬀerent almost incompressible, anisotropic arterial wall models as a reference and, for comparison, a simple, isotropic Neo-Hooke model using four different parameter sets. The simulations show significant quantitative and qualitative diﬀerences in the stresses and displacements as well as the lumen cross sections. For the Neo-Hooke models, a significantly larger amplitude in the in- and outﬂow areas during the heart beat is observed, presumably due to the lack of ﬁber stiﬀening. For completeness, we also consider a linear elastic wall using 16 different parameter sets. However, using our benchmark setup, we were not successful in achieving good agreement with our nonlinear reference calculation.
@article{Balzani:2023:CAW, author = {Balzani, Daniel and Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and Schr{\"o}der, J{\"o}rg}, doi = {10.1007/s00466-023-02321-y}, journal = {Computational Mechanics}, number = {5}, pages = {949-965}, title = {Comparison of Arterial Wall Models in Fluid-Structure Interaction Simulations}, url = {https://link.springer.com/article/10.1007/s00466-023-02321-y}, volume = {72}, year = {2023} }

2022

SISC
Parallel Scalability of Three-Level FROSch Preconditioners to 220 000 Cores using the Theta Supercomputer

Alexander Heinlein, Oliver Rheinbach, and Friederike Röver

SIAM Journal on Scientific Computing, Sep 2022

Abs Bib doi URL Preprint

The parallel performance of the three-level fast and robust overlapping Schwarz (FROSch) preconditioners is investigated for linear elasticity. The FROSch framework is part of the Trilinos software library and contains a parallel implementation of different preconditioners with energy minimizing coarse spaces of generalized Dryja–Smith–Widlund type. The three-level extension is constructed by a recursive application of the FROSch preconditioner to the coarse problem. In this paper, the additional steps in the implementation in order to apply the FROSch preconditioner recursively are described in detail. Furthermore, it is shown that no explicit geometric information is needed in the recursive application of the preconditioner. In particular, the rigid body modes, including the rotations, can be interpolated on the coarse level without additional geometric information. Parallel results for a three-dimensional linear elasticity problem obtained on the Theta supercomputer (Argonne Leadership Computing Facility, Argonne, IL) using up to 220 000 cores are discussed and compared to results obtained on the SuperMUC-NG supercomputer (Leibniz Supercomputing Centre, Garching, Germany). Notably, it can be observed that a hierarchical communication operation in FROSch related to the coarse operator starts to dominate the computing time on Theta, which has a dragonfly interconnect, for 100 000 message passing interface (MPI) ranks or more. The same operation, however, scales well and stays within the order of a second in all experiments performed on SuperMUC-NG, which uses a fat tree network. Using hybrid MPI/OpenMP parallelization, the onset of the MPI communication problem on Theta can be delayed. Further analysis of the performance of FROSch on large supercomputers with dragonfly interconnects will be necessary.
@article{Heinlein:2022:PST, author = {Heinlein, Alexander and Rheinbach, Oliver and R{\"o}ver, Friederike}, doi = {10.1137/21M1431205}, journal = {SIAM Journal on Scientific Computing}, number = {0}, pages = {S173-S198}, title = {Parallel Scalability of Three-Level FROSch Preconditioners to 220 000 Cores using the Theta Supercomputer}, volume = {0}, year = {2022} }
SISC
Adaptive nonlinear domain decomposition methods with an application to the p-Laplacian

Alexander Heinlein, Axel Klawonn, and Martin Lanser

SIAM Journal on Scientific Computing, Sep 2022

Abs Bib doi URL Preprint

In this article, different nonlinear domain decomposition methods are applied to nonlinear problems with highly-heterogeneous coeﬃcient functions with jumps. In order to obtain a robust solver with respect to nonlinear as well as linear convergence, adaptive coarse spaces are employed. First, as an example for a nonlinearly left-preconditioned domain decomposition method, the two-level restricted nonlinear Schwarz method H1-RASPEN (Hybrid Restricted Additive Schwarz Preconditioned Exact Newton) is combined with an adaptive generalized Dryja–Smith–Widlund (GDSW) coarse space. Second, as an example for a nonlinearly right-preconditioned domain decomposition method, a nonlinear FETI-DP (Finite Element Tearing and Interconnecting - Dual Primal) method is equipped with an edge-based adaptive coarse space. Both approaches are compared with the respective nonlinear domain decomposition methods with classical coarse spaces as well as with the respective Newton-Krylov methods with adaptive coarse spaces. For some two-dimensional pLaplace model problems with diﬀerent spatial coeﬃcient distributions, it can be observed that the best linear and nonlinear convergence can only be obtained when combining the nonlinear domain decomposition methods with adaptive coarse spaces.
@article{Heinlein:2022:AND, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin}, doi = {10.1137/21M1433605}, journal = {SIAM Journal on Scientific Computing}, number = {}, pages = {152-S172}, title = {Adaptive nonlinear domain decomposition methods with an application to the $p$-Laplacian}, volume = {}, year = {2022} }
MRS Advances
Irradiation-dependent Topology Optimization of Metallization Grid Patterns and Variation of Contact Layer Thickness used for Latitude-based Yield Gain of Thin-film Solar Modules

Mario Zinßer, Benedikt Braun, Tim Helder, and 6 more authors

MRS Advances, Sep 2022

Abs Bib doi URL

We show that the concept of topology optimization for metallization grid patterns of thin-film solar devices can be applied to monolithically integrated solar cells. Different irradiation intensities favour different topological grid designs as well as a different thickness of the transparent conductive oxide (TCO) layer. For standard laboratory efficiency determination, an irradiation power of 1000 W/m2 is generally applied. However, this power rarely occurs for real world solar modules operating at mid-latitude locations. Therefore, contact layer thicknesses but also lateral grid patterns should be optimized for lower irradiation intensities. This results in material production savings for the grid and TCO layer of up to 50% and simultaneously a significant gain in yield of over 1% for regions with a low annual mean irradiation.
@article{Zinsser:2022:IDT, author = {Zin{\ss}er, Mario and Braun, Benedikt and Helder, Tim and Friedlmeier, Theresa Magorian and Pieters, Bart and Heinlein, Alexander and Denk, Martin and G\"oddeke, Dominik and Powalla, Michael}, doi = {10.1557/s43580-022-00321-3}, journal = {MRS Advances}, number = {}, pages = {}, title = {Irradiation-dependent Topology Optimization of Metallization Grid Patterns and Variation of Contact Layer Thickness used for Latitude-based Yield Gain of Thin-film Solar Modules}, url = {https://doi.org/10.1557/s43580-022-00321-3}, volume = {}, year = {2022} }
SISC
Adaptive GDSW Coarse Spaces of Reduced Dimension for Overlapping Schwarz Methods

Alexander Heinlein, Axel Klawonn, Jascha Knepper, and 2 more authors

SIAM Journal on Scientific Computing, Sep 2022

Abs Bib doi URL PDF Preprint

A new reduced-dimension adaptive generalized Dryja–Smith–Widlund (GDSW) overlapping Schwarz method for linear second-order elliptic problems in three dimensions is introduced. It is robust with respect to large contrasts of the coefficients of the partial differential equations. The condition number bound of the new method is shown to be independent of the coefficient contrast and only dependent on a user-prescribed tolerance. The interface of the nonoverlapping domain decomposition is partitioned into nonoverlapping patches. The new coarse space is obtained by selecting a few eigenvectors of certain local eigenproblems which are defined on these patches. These eigenmodes are energy-minimally extended to the interior of the nonoverlapping subdomains and added to the coarse space. By using a new interface decomposition, the reduced-dimension adaptive GDSW overlapping Schwarz method usually has a smaller coarse space than existing GDSW and adaptive GDSW domain decomposition methods. A robust condition number estimate is proven for the new reduced-dimension adaptive GDSW method which is also valid for existing adaptive GDSW methods. Numerical results for the equations of isotropic linear elasticity in three dimensions confirming the theoretical findings are presented.
@article{Heinlein:2022:AGD, author = {Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Rheinbach, Oliver and Widlund, Olof B.}, doi = {10.1137/20M1364540}, journal = {SIAM Journal on Scientific Computing}, number = {3}, pages = {A1176-A1204}, title = {Adaptive GDSW Coarse Spaces of Reduced Dimension for Overlapping Schwarz Methods}, url = {https://doi.org/10.1137/20M1364540}, volume = {44}, year = {2022} }
SISC
FROSch Preconditioners for Land Ice Simulations of Greenland and Antarctica

Alexander Heinlein, Mauro Perego, and Sivasankaran Rajamanickam

SIAM Journal on Scientific Computing, Sep 2022

Abs Bib doi URL Preprint

Numerical simulations of Greenland and Antarctic ice sheets involve the solution of large-scale highly nonlinear systems of equations on complex shallow geometries. This work is concerned with the construction of Schwarz preconditioners for the solution of the associated tangent problems, which are challenging for solvers mainly because of the strong anisotropy of the meshes and wildly changing boundary conditions that can lead to poorly constrained problems on large portions of the domain. Here, two-level generalized Dryja–Smith–Widlund (GDSW)–type Schwarz preconditioners are applied to different land ice problems, i.e., a velocity problem, a temperature problem, as well as the coupling of the former two problems. We employ the message passing interface (MPI)–parallel implementation of multilevel Schwarz preconditioners provided by the package FROSch (fast and robust Schwarz) from the Trilinos library. The strength of the proposed preconditioner is that it yields out-of-the-box scalable and robust preconditioners for the single physics problems. To the best of our knowledge, this is the first time two-level Schwarz preconditioners have been applied to the ice sheet problem and a scalable preconditioner has been used for the coupled problem. The preconditioner for the coupled problem differs from previous monolithic GDSW preconditioners in the sense that decoupled extension operators are used to compute the values in the interior of the subdomains. Several approaches for improving the performance, such as reuse strategies and shared memory OpenMP parallelization, are explored as well. In our numerical study we target both uniform meshes of varying resolution for the Antarctic ice sheet as well as nonuniform meshes for the Greenland ice sheet. We present several weak and strong scaling studies confirming the robustness of the approach and the parallel scalability of the FROSch implementation. Among the highlights of the numerical results are a weak scaling study for up to 32K processor cores (8K MPI ranks and 4 OpenMP threads) and 566M degrees of freedom for the velocity problem as well as a strong scaling study for up to 4K processor cores (and MPI ranks) and 68M degrees of freedom for the coupled problem.
@article{Heinlein:2022:FRO, author = {Heinlein, Alexander and Perego, Mauro and Rajamanickam, Sivasankaran}, doi = {10.1137/21M1395260}, journal = {SIAM Journal on Scientific Computing}, number = {2}, pages = {B339-B367}, title = {FROSch Preconditioners for Land Ice Simulations of Greenland and Antarctica}, url = {https://doi.org/10.1137/21M1395260}, volume = {44}, year = {2022} }
ETNA
Surrogate convolutional neural network models for steady computational fluid dynamics simulations

Matthias Eichinger, Alexander Heinlein, and Axel Klawonn

Electronic Transactions on Numerical Analysis (ETNA), Sep 2022

Abs Bib doi URL Preprint

A convolution neural network (CNN)-based approach for the construction of reduced order surrogate models for computational fluid dynamics (CFD) simulations is introduced; it is inspired by the approach of Guo, Li, and Iori [X. Guo, W. Li, and F. Iorio, Convolutional neural networks for steady flow approximation, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’16, New York, USA, 2016, ACM, pp. 481–490]. In particular, the neural networks are trained in order to predict images of the flow field in a channel with varying obstacle based on an image of the geometry of the channel. A classical CNN with bottleneck structure and a U-Net are compared while varying the input format, the number of decoder paths, as well as the loss function used to train the networks. This approach yields very low prediction errors, in particular, when using the U-Net architecture. Furthermore, the models are also able to generalize to unseen geometries of the same type. A transfer learning approach enables the model to be trained to a new type of geometries with very low training cost. Finally, based on this transfer learning approach, a sequential learning strategy is introduced, which significantly reduces the amount of necessary training data.
@article{Eichinger:2022:SCN, author = {Eichinger, Matthias and Heinlein, Alexander and Klawonn, Axel}, doi = {10.1553/etna_vol56s235}, journal = {Electronic Transactions on Numerical Analysis (ETNA)}, pages = {235--255}, title = {Surrogate convolutional neural network models for steady computational fluid dynamics simulations}, url = {https://epub.oeaw.ac.at/?arp=0x003d4c21}, volume = {56}, year = {2022} }
ETNA
Estimating the time-dependent contact rate of SIR and SEIR models in mathematical epidemiology using physics-informed neural networks

Viktor Grimm, Alexander Heinlein, Axel Klawonn, and 2 more authors

Electronic Transactions on Numerical Analysis (ETNA), Sep 2022

Abs Bib doi URL Preprint

The course of an epidemic can be often successfully described mathematically using compartment models. These models result in a system of ordinary diﬀerential equations. Two well-known examples are the SIR and the SEIR models. The transition rates between the diﬀerent compartments are deﬁned by certain parameters which are speciﬁc for the respective virus. Often, these parameters can be taken from the literature or can be determined from statistics. However, the contact rate or the related eﬀective reproduction number are in general not constant and thus cannot easily be determined. Here, a new machine learning approach based on physics-informed neural networks is presented that can learn the contact rate from given data for the dynamical systems given by the SIR and SEIR models. The new method generalizes an already known approach for the identiﬁcation of constant parameters to the variable or time-dependent case. After introducing the new method, it is tested for synthetic data generated by the numerical solution of SIR and SEIR models. Here, the case of exact and perturbed data is considered. In all cases, the contact rate can be learned very satisfactorily. Finally, the SEIR model in combination with physics-informed neural networks is used to learn the contact rate for COVID-19 data given by the course of the epidemic in Germany. The simulation of the number of infected individuals over the course of the epidemic, using the learned contact rate, is very promising.
@article{Grimm:2022:ETD, author = {Grimm, Viktor and Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, doi = {10.1553/etna_vol56s1}, journal = {Electronic Transactions on Numerical Analysis (ETNA)}, pages = {1–27}, title = {Estimating the time-dependent contact rate of SIR and SEIR models in mathematical epidemiology using physics-informed neural networks}, url = {https://epub.oeaw.ac.at/?arp=0x003cfd4a}, volume = {56}, year = {2022} }

2021

SISC
Combining Machine Learning and Adaptive Coarse Spaces - A Hybrid Approach for Robust FETI-DP Methods in Three Dimensions

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

SIAM Journal on Scientific Computing, Sep 2021

Abs Bib doi URL Preprint

The hybrid ML-FETI-DP algorithm combines the advantages of adaptive coarse spaces in domain decomposition methods and certain supervised machine learning techniques. Adaptive coarse spaces ensure robustness of highly scalable domain decomposition solvers, even for highly heterogeneous coefficient distributions with arbitrary coefficient jumps. However, their construction requires the setup and solution of local generalized eigenvalue problems, which is typically computationally expensive. The idea of ML-FETI-DP is to interpret the coefficient distribution as image data and predict whether an eigenvalue problem has to be solved or can be neglected while still maintaining robustness of the adaptive FETI-DP method. For this purpose, neural networks are used as image classifiers. In the present work, the ML-FETI-DP algorithm is extended to three dimensions, which requires both a complex data preprocessing procedure to construct consistent input data for the neural network as well as a representative training and validation data set to ensure generalization properties of the machine learning model. Numerical experiments for stationary diffusion and linear elasticity problems with realistic coefficient distributions show that a large number of eigenvalue problems can be saved; in the best case of the numerical results presented here, 97% of the eigenvalue problems can be avoided to be set up and solved.
@article{Heinlein:2021:CML1, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, doi = {10.1137/20M1344913}, journal = {SIAM Journal on Scientific Computing}, number = {5}, pages = {S816-S838}, title = {Combining Machine Learning and Adaptive Coarse Spaces - A Hybrid Approach for Robust FETI-DP Methods in Three Dimensions}, url = {https://doi.org/10.1137/20M1344913}, volume = {43}, year = {2021} }
GAMM-Mitteilungen
Combining Machine Learning and Domain Decomposition Methods for the Solution of Partial Diﬀerential Equations – A Review

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

GAMM-Mitteilungen, Sep 2021

Abs Bib doi URL Preprint

Scientiﬁc machine learning, an area of research where techniques from machine learning and scientiﬁc computing are combined, has become of increasing importance and receives growing attention. Here, our focus is on a very speciﬁc area within scientiﬁc machine learning given by the combination of domain decomposition methods with machine learning techniques. The aim of the present work is to make an attempt of providing a review of existing and also new approaches within this field as well as to present some known results in a uniﬁed framework; no claim of completeness is made. As a concrete example of machine learning enhanced domain decomposition methods, an approach is presented which uses neural networks to reduce the computational eﬀort in adaptive domain decomposition methods while retaining their robustness. More precisely, deep neural networks are used to predict the geometric location of constraints which are needed to deﬁne a robust coarse space. Additionally, two recently published deep domain decomposition approaches are presented in a uniﬁed framework. Both approaches use physics-constrained neural networks to replace the discretization and solution of the subdomain problems of a given decomposition of the computational domain. Finally, a brief overview is given of several further approaches which combine machine learning with ideas from domain decomposition methods to either increase the performance of already existing algorithms or to create completely new methods.
@article{Heinlein:2021:CML2, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, doi = {10.1002/gamm.202100001}, journal = {GAMM-Mitteilungen}, number = {1}, pages = {e202100001}, title = {Combining Machine Learning and Domain Decomposition Methods for the Solution of Partial Diﬀerential Equations – A Review}, url = {https://onlinelibrary.wiley.com/doi/10.1002/gamm.202100001}, volume = {44}, year = {2021} }

2020

ETNA
A Frugal FETI-DP and BDDC Coarse Space for Heterogeneous Problems

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

Electronic Transactions on Numerical Analysis (ETNA), Sep 2020

Abs Bib doi URL Preprint

The convergence rate of domain decomposition methods is generally determined by the eigenvalues of the preconditioned system. For second-order elliptic partial differential equations, coefficient discontinuities with a large contrast can lead to a deterioration of the convergence rate. Only by implementing an appropriate coarse space or second level, a robust domain decomposition method can be obtained. In this article, a new frugal coarse space for FETI-DP (Finite Element Tearing and Interconnecting - Dual Primal) and BDDC (Balancing Domain Decomposition by Constraints) methods is presented, which has a lower set-up cost than competing adaptive coarse spaces. In particular, in contrast to adaptive coarse spaces, it does not require the solution of any local generalized eigenvalue problems. The approach considered here aims at a low-dimensional approximation of the adaptive coarse space by using appropriate weighted averages and is robust for a broad range of coefficient distributions for diffusion and elasticity problems. In this article, the robustness is heuristically justified as well as numerically shown for several coefficient distributions. The new coarse space is compared to adaptive coarse spaces, and parallel scalability up to 262,144 parallel cores for a parallel BDDC implementation with the new coarse space is shown. The superiority of the new coarse space over classic coarse spaces with respect to parallel weak scalability and time to solution is confirmed by numerical experiments.
@article{Heinlein:2020:FFE, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, doi = {10.1553/etna_vol53s562}, journal = {Electronic Transactions on Numerical Analysis (ETNA)}, pages = {562-591}, title = {A Frugal {FETI-DP} and {BDDC} Coarse Space for Heterogeneous Problems}, url = {https://epub.oeaw.ac.at/?arp=0x003c1ad0}, volume = {53}, year = {2020} }
SISC
Additive and Hybrid Nonlinear Two-Level Schwarz Methods and Energy Minimizing Coarse Spaces for Unstructured Grids

Alexander Heinlein, and Martin Lanser

SIAM Journal on Scientific Computing, Sep 2020

Abs Bib doi URL Preprint

Nonlinear domain decomposition (DD) methods, such as ASPIN (additive Schwarz preconditioned inexact Newton), RASPEN (restricted additive Schwarz preconditioned inexact Newton), nonlinear FETI-DP (finite element tearing and interconnecting-dual primal), and nonlinear BDDC (balancing DD by constraints), can be reasonable alternatives to classical Newton–Krylov-DD methods for the solution of sparse nonlinear systems of equations, e.g., arising from a discretization of a nonlinear partial differential equation (PDE). These nonlinear DD approaches are often able to effectively tackle unevenly distributed nonlinearities and outperform Newton’s method with respect to convergence speed as well as global convergence behavior. Furthermore, they often improve parallel scalability due to a superior ratio of local to global work. Nonetheless, as for linear DD methods, it is often necessary to incorporate an appropriate coarse space in a second level to obtain numerical scalability for increasing numbers of subdomains. In addition, an appropriate coarse space can also improve the nonlinear convergence of nonlinear DD methods. In this paper, we introduce four variants for integrating coarse spaces in nonlinear Schwarz methods in an additive or multiplicative way using Galerkin projections. These new variants can be interpreted as natural nonlinear equivalents to well-known linear additive and hybrid two-level Schwarz preconditioners. Furthermore, they facilitate the use of various coarse spaces, e.g., coarse spaces based on energy-minimizing extensions, which can easily be used for irregular DDs, such as, e.g., those obtained by graph partitioners. In particular, multiscale finite element method (MsFEM)-type coarse spaces are considered, and it is shown that they outperform classical approaches for certain heterogeneous nonlinear problems. The new approaches are then compared with classical Newton–Krylov-DD and nonlinear one-level Schwarz approaches for different homogeneous and heterogeneous model problems based on the p-Laplace operator.
@article{Heinlein:2020:AHN, author = {Heinlein, Alexander and Lanser, Martin}, doi = {10.1137/19M1276972}, journal = {SIAM Journal on Scientific Computing}, number = {4}, pages = {A2461-A2488}, title = {Additive and Hybrid Nonlinear Two-Level Schwarz Methods and Energy Minimizing Coarse Spaces for Unstructured Grids}, url = {https://epubs.siam.org/doi/abs/10.1137/19M1276972}, volume = {42}, year = {2020} }
IJNME
Reduced Dimension GDSW Coarse Spaces for Monolithic Schwarz Domain Decomposition Methods for Incompressible Fluid Flow Problems

Alexander Heinlein, Christian Hochmuth, and Axel Klawonn

International Journal for Numerical Methods in Engineering, Sep 2020

Abs Bib doi URL Preprint

Summary Monolithic preconditioners for incompressible fluid flow problems can significantly improve the convergence speed compared with preconditioners based on incomplete block factorizations. However, the computational costs for the setup and the application of monolithic preconditioners are typically higher. In this article, several techniques are applied to monolithic two-level generalized Dryja-Smith-Widlund (GDSW) preconditioners to further improve the convergence speed and the computing time. In particular, reduced dimension GDSW coarse spaces, restricted and scaled versions of the first level, hybrid, and parallel coupling of the levels, and recycling strategies are investigated. Using a combination of all these improvements, for a small time-dependent Navier-Stokes problem on 240 message passing interface (MPI) ranks, a reduction of 86% of the time-to-solution can be obtained. Even without applying recycling strategies, the time-to-solution can be reduced by more than 50% for a larger steady Stokes problem on 4608 MPI ranks. For the largest problems with 11â979 MPI ranks, the scalability deteriorates drastically for the monolithic GDSW coarse space. On the other hand, using the reduced dimension coarse spaces, good scalability up to 11â979 MPI ranks, which corresponds to the largest problem configuration fitting on the employed supercomputer, could be achieved.
@article{Heinlein:2020:RDG, author = {Heinlein, Alexander and Hochmuth, Christian and Klawonn, Axel}, doi = {10.1002/nme.6258}, journal = {International Journal for Numerical Methods in Engineering}, number = {6}, pages = {1101-1119}, title = {Reduced Dimension {GDSW} Coarse Spaces for Monolithic {S}chwarz Domain Decomposition Methods for Incompressible Fluid Flow Problems}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/nme.6258}, volume = {121}, year = {2020} }

2019

SISC
Machine Learning in Adaptive Domain Decomposition Methods - Predicting the Geometric Location of Constraints

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

SIAM Journal on Scientific Computing, Sep 2019

Abs Bib doi URL Preprint

Domain decomposition methods are robust and parallel scalable, preconditioned iterative algorithms for the solution of the large linear systems arising in the discretization of elliptic partial differential equations by finite elements. The convergence rate of these methods is generally determined by the eigenvalues of the preconditioned system. For second-order elliptic partial differential equations, coefficient discontinuities with a large contrast can lead to a deterioration of the convergence rate. A remedy can be obtained by enhancing the coarse space with elements, which are often called constraints, that are computed by solving small eigenvalue problems on portions of the interface of the domain decomposition, i.e., edges in two dimensions or faces and edges in three dimensions. In the present work, without restriction of generality, the focus is on two dimensions. In general, it is difficult to predict where these constraints have to be added, and therefore the corresponding local eigenvalue problems have to be computed, i.e., on which edges. Here, a machine learning based strategy using neural networks is suggested to predict the geometric location of these edges in a preprocessing step. This reduces the number of eigenvalue problems that have to be solved before the iteration. Numerical experiments for model problems and realistic microsections using regular decompositions as well as decompositions from graph partitioners are provided, showing very promising results.
@article{Heinlein:2019:MLA, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, doi = {10.1137/18M1205364}, journal = {SIAM Journal on Scientific Computing}, number = {6}, pages = {A3887-A3912}, title = {Machine Learning in Adaptive Domain Decomposition Methods - Predicting the Geometric Location of Constraints}, url = {https://epubs.siam.org/doi/abs/10.1137/18M1205364?journalCode=sjoce3}, volume = {41}, year = {2019} }
SISC
Adaptive GDSW Coarse Spaces for Overlapping Schwarz Methods in Three Dimensions

Alexander Heinlein, Axel Klawonn, Jascha Knepper, and 1 more author

SIAM Journal on Scientific Computing, Sep 2019

Abs Bib doi URL Preprint

A robust two-level overlapping Schwarz method for scalar elliptic model problems with highly varying coefficient functions is introduced. While the convergence of standard coarse spaces may depend strongly on the contrast of the coefficient function, the condition number bound of the new method is independent of the coefficient function. Indeed, the condition number only depends on a user-prescribed tolerance. The coarse space is based on discrete harmonic extensions of vertex, edge, and face interface functions, which are computed from the solutions of corresponding local generalized edge and face eigenvalue problems. The local eigenvalue problems are of the size of the edges and faces of the decomposition, and the eigenvalue problems can be constructed solely from the local subdomain stiffness matrices and the fully assembled global stiffness matrix. The new AGDSW (adaptive generalized Dryja–Smith–Widlund) coarse space always contains the classical GDSW coarse space by construction of the generalized eigenvalue problems. Numerical results supporting the theory are presented for several model problems in three dimensions using structured as well as unstructured meshes and unstructured decompositions.
@article{Heinlein:2019:AGD1, author = {Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Rheinbach, Oliver}, doi = {10.1137/18M1220613}, journal = {SIAM Journal on Scientific Computing}, number = {5}, pages = {A3045-A3072}, title = {Adaptive {GDSW} Coarse Spaces for Overlapping {S}chwarz Methods in Three Dimensions}, url = {https://epubs.siam.org/doi/abs/10.1137/18M1220613}, volume = {41}, year = {2019} }
SISC
Monolithic Overlapping Schwarz Domain Decomposition Methods with GDSW Coarse Spaces for Incompressible Fluid Flow Problems

Alexander Heinlein, Christian Hochmuth, and Axel Klawonn

SIAM Journal on Scientific Computing, Sep 2019

Abs Bib doi URL Preprint

Monolithic overlapping Schwarz preconditioners for saddle point problems of Stokes and Navier–Stokes type are presented. In order to obtain numerically scalable algorithms, coarse spaces obtained from the generalized Dryja–Smith–Widlund (GDSW) approach are used. Numerical results of our parallel implementation are presented for various incompressible fluid flow problems. In particular, cases are considered where the problem cannot or should not be reduced using local static condensation, e.g., Stokes or Navier–Stokes problems with continuous pressure spaces. In the new monolithic preconditioners, the local overlapping problems and the coarse problem are saddle point problems with the same structure as the original problem. Our parallel implementation of these preconditioners is based on the fast and robust overlapping Schwarz (FROSch) library, which is part of the Trilinos package ShyLU. The implementation is essentially algebraic in the sense that, for the class of problems presented here, the preconditioners can be constructed from the fully assembled stiffness matrix and information about the block structure of the problem. Further information about the geometry or the null space of the underlying problem can improve the performance compared to the default settings. Parallel scalability results for several thousand cores for Stokes and Navier–Stokes model problems are reported. Each of the local problems is solved using a direct solver in serial mode, whereas the coarse problem is solved using a direct solver in serial or message passing interface (MPI)-parallel mode or using an MPI-parallel iterative Krylov solver.
@article{Heinlein:2019:MOS, author = {Heinlein, Alexander and Hochmuth, Christian and Klawonn, Axel}, doi = {10.1137/18M1184047}, journal = {SIAM Journal on Scientific Computing}, number = {4}, pages = {C291--C316}, title = {Monolithic Overlapping {S}chwarz Domain Decomposition Methods with {GDSW} Coarse Spaces for Incompressible Fluid Flow Problems}, url = {https://epubs.siam.org/doi/abs/10.1137/18M1184047}, volume = {41}, year = {2019} }

2018

ETNA
Multiscale coarse spaces for overlapping Schwarz methods based on the ACMS space in 2D

Alexander Heinlein, Axel Klawonn, Jascha Knepper, and 1 more author

Electronic Transactions on Numerical Analysis (ETNA), Sep 2018

Abs Bib doi URL Preprint

Two-level overlapping Schwarz domain decomposition methods for second-order elliptic problems in two dimensions are proposed using coarse spaces constructed from the Approximate Component Mode Synthesis (ACMS) multiscale discretization approach. These coarse spaces are based on eigenvalue problems using Schur complements on subdomain edges. It is then shown that the convergence of the resulting preconditioned Krylov method can be controlled by a user-specified tolerance and thus can be made independent of heterogeneities in the coefficient of the partial differential equation. The relations of this new approach to other known adaptive coarse space approaches for overlapping Schwarz methods are also discussed. Compared to one of the competing adaptive approaches, the new coarse space can be significantly smaller. Compared to other competing approaches, the eigenvalue problems are significantly cheaper to solve, i.e., the dimension of the eigenvalue problems is minimal among the competing adaptive approaches under consideration. Our local eigenvalue problems can be solved using one iteration of LobPCG for essentially the same cost as a Cholesky-decomposition of a Schur complement on a subdomain edge.
@article{Heinlein:2018:MCS, author = {Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Rheinbach, Oliver}, doi = {10.1553/etna_vol48s156}, journal = {Electronic Transactions on Numerical Analysis (ETNA)}, pages = {156--182}, title = {Multiscale coarse spaces for overlapping Schwarz methods based on the ACMS space in 2D}, url = {https://epub.oeaw.ac.at/?arp=0x0038c0cb}, volume = {48}, year = {2018} }

2016

SISC
A parallel implementation of a two-level overlapping Schwarz method with energy-minimizing coarse space based on Trilinos

Alexander Heinlein, Axel Klawonn, and Oliver Rheinbach

SIAM Journal on Scientific Computing, Sep 2016

Abs Bib doi URL Preprint

We describe a new implementation of a two-level overlapping Schwarz preconditioner with energy-minimizing coarse space (GDSW: generalized Dryja–Smith–Widlund) and show numerical results for an additive and a hybrid additive-multiplicative version. Our parallel implementation makes use of the Trilinos software library and provides a framework for parallel two-level Schwarz methods. We show parallel scalability for two- and three-dimensional scalar second-order elliptic and linear elasticity problems for several thousands of cores. We also discuss techniques for the parallel construction of coarse spaces which are also of interest for other parallel preconditioners and discretization methods using energy minimizing coarse functions. We finally show an application in monolithic fluid-structure interaction, where significant improvements are achieved compared to a standard algebraic, one-level overlapping Schwarz method.
@article{Heinlein:2016:PIT, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver}, doi = {10.1137/16M1062843}, journal = {SIAM Journal on Scientific Computing}, number = {6}, pages = {C713--C747}, title = {A parallel implementation of a two-level overlapping {S}chwarz method with energy-minimizing coarse space based on {T}rilinos}, url = {https://epubs.siam.org/doi/10.1137/16M1062843}, volume = {38}, year = {2016} }
IJNMBE
Numerical modeling of fluid-structure interaction in arteries with anisotropic polyconvex hyperelastic and anisotropic viscoelastic material models at finite strains

Daniel Balzani, Simone Deparis, Simon Fausten, and 6 more authors

International Journal for Numerical Methods in Biomedical Engineering, Sep 2016

Abs Bib doi URL Preprint

The accurate prediction of transmural stresses in arterial walls requires on the one hand robust and efficient numerical schemes for the solution of boundary value problems including fluid-structure interactions and on the other hand the use of a material model for the vessel wall that is able to capture the relevant features of the material behavior. One of the main contributions of this paper is the application of a highly nonlinear, polyconvex anisotropic structural model for the solid in the context of fluid-structure interaction, together with a suitable discretization. Additionally, the influence of viscoelasticity is investigated. The fluid-structure interaction problem is solved using a monolithic approach; that is, the nonlinear system is solved (after time and space discretizations) as a whole without splitting among its components. The linearized block systems are solved iteratively using parallel domain decomposition preconditioners. A simple but nonsymmetric curved geometry is proposed that is demonstrated to be suitable as a benchmark testbed for fluid-structure interaction simulations in biomechanics where nonlinear structural models are used. Based on the curved benchmark geometry, the influence of different material models, spatial discretizations, and meshes of varying refinement is investigated. It turns out that often-used standard displacement elements with linear shape functions are not sufficient to provide good approximations of the arterial wall stresses, whereas for standard displacement elements or F-bar formulations with quadratic shape functions, suitable results are obtained. For the time discretization, a second-order backward differentiation formula scheme is used. It is shown that the curved geometry enables the analysis of non-rotationally symmetric distributions of the mechanical fields. For instance, the maximal shear stresses in the fluidâstructure interface are found to be higher in the inner curve that corresponds to clinical observations indicating a high plaque nucleation probability at such locations.
@article{Balzani:2016:NMF, author = {Balzani, Daniel and Deparis, Simone and Fausten, Simon and Forti, Davide and Heinlein, Alexander and Klawonn, Axel and Quarteroni, Alfio and Rheinbach, Oliver and Schr\"oder, J\"org}, doi = {10.1002/cnm.2756}, journal = {International Journal for Numerical Methods in Biomedical Engineering}, number = {10}, pages = {e02756}, title = {Numerical modeling of fluid-structure interaction in arteries with anisotropic polyconvex hyperelastic and anisotropic viscoelastic material models at finite strains}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/cnm.2756}, volume = {32}, year = {2016} }

2015

JCAM
The approximate component mode synthesis special finite element method in two dimensions: Parallel implementation and numerical results

Alexander Heinlein, Ulrich Hetmaniuk, Axel Klawonn, and 1 more author

Journal of Computational and Applied Mathematics, Sep 2015

Sixth International Conference on Advanced Computational Methods in Engineering (ACOMEN 2014)

Abs Bib doi URL

A special finite element method based on approximate component mode synthesis (ACMS) was introduced in Hetmaniuk and Lehoucq (2010). ACMS was developed for second order elliptic partial differential equations with rough or highly varying coefficients. Here, a parallel implementation of ACMS is presented and parallel scalability issues are discussed for representative examples. Additionally, a parallel domain decomposition preconditioner (FETI-DP) is applied to solve the ACMS finite element system. Weak parallel scalability results for ACMS are presented for up to 1024 cores. Our numerical results also suggest a quadratic–logarithmic condition number bound for the preconditioned FETI-DP method applied to ACMS discretizations.
@article{Heinlein:2015:ACM, author = {Heinlein, Alexander and Hetmaniuk, Ulrich and Klawonn, Axel and Rheinbach, Oliver}, doi = {10.1016/j.cam.2015.02.053}, issn = {0377-0427}, journal = {Journal of Computational and Applied Mathematics}, note = {Sixth International Conference on Advanced Computational Methods in Engineering (ACOMEN 2014)}, number = {6}, pages = {116 - 133}, url = {http://www.sciencedirect.com/science/article/pii/S0377042715001405}, volume = {289}, year = {2015} }

Conference Proceedings

2024

Springer LNCSE
A short note on solving partial differential equations using convolutional neural networks

Viktor Grimm, Alexander Heinlein, and Axel Klawonn

In Domain Decomposition Methods in Science and Engineering XXVII, Sep 2024

Abs Bib doi URL Preprint

The approach of using physics-based machine learning to solve PDEs has recently become very popular. A recent approach to solve PDEs based on CNNs uses finite difference stencils to include the residual of the partial differential equation into the loss function. In this work, the relation between the network training and the solution of a respective finite difference linear system of equations using classical numerical solvers is discussed. It turns out that many beneficial properties of the linear equation system are neglected in the network training. Finally, numerical results which underline the benefits of classical numerical solvers are presented.
@inproceedings{Grimm:2024:SHS, author = {Grimm, Viktor and Heinlein, Alexander and Klawonn, Axel}, booktitle = {Domain Decomposition Methods in Science and Engineering XXVII}, doi = {10.1007/978-3-031-50769-4_1}, pages = {3-14}, publisher = {Springer International Publishing}, title = {A short note on solving partial differential equations using convolutional neural networks}, url = {https://link.springer.com/chapter/10.1007/978-3-031-50769-4_1}, year = {2024} }
Springer LNCSE
Finite basis physics-informed neural networks as a Schwarz domain decomposition method

Victorita Dolean, Alexander Heinlein, Siddhartha Mishra, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXVII, Sep 2024

Abs Bib doi URL Preprint

Physics-informed neural networks (PINNs) [4, 10] are an approach for solving boundary value problems based on differential equations (PDEs). The key idea of PINNs is to use a neural network to approximate the solution to the PDE and to incorporate the residual of the PDE as well as boundary conditions into its loss function when training it. This provides a simple and mesh-free approach for solving problems relating to PDEs. However, a key limitation of PINNs is their lack of accuracy and efficiency when solving problems with larger domains and more complex, multi-scale solutions. In a more recent approach, finite basis physics-informed neural networks (FBPINNs) [8] use ideas from domain decomposition to accelerate the learning process of PINNs and improve their accuracy. In this work, we show how Schwarz-like additive, multiplicative, and hybrid iteration methods for training FBPINNs can be developed. We present numerical experiments on the influence of these different training strategies on convergence and accuracy. Furthermore, we propose and evaluate a preliminary implementation of coarse space correction for FBPINNs.
@inproceedings{Dolean:2024:FBP, author = {Dolean, Victorita and Heinlein, Alexander and Mishra, Siddhartha and Moseley, Ben}, booktitle = {Domain Decomposition Methods in Science and Engineering XXVII}, doi = {10.1007/978-3-031-50769-4_19}, pages = {165-172}, publisher = {Springer International Publishing}, title = {Finite basis physics-informed neural networks as a Schwarz domain decomposition method}, url = {https://link.springer.com/chapter/10.1007/978-3-031-50769-4_19}, year = {2024} }

2023

Coupled2023
A Comparison Of Direct Solvers In FROSch Applied To Chemo-Mechanics

Alexander Heinlein, Björn Kiefer, Stefan Prüger, and 2 more authors

In 10th edition of the International Conference on Computational Methods for Coupled Problems in Science and Engineering, Sep 2023

Abs Bib doi URL Preprint

Sparse direct linear solvers are at the computational core of domain decomposition preconditioners and therefore have a strong impact on their performance. In this paper, we consider the Fast and Robust Overlapping Schwarz (FROSch) solver framework of the Trilinos software library, which contains a parallel implementations of the GDSW domain decomposition preconditioner. We compare three different sparse direct solvers used to solve the subdomain problems in FROSch. The preconditioner is applied to different model problems; linear elasticity and more complex fully-coupled deformation diffusion-boundary value problems from chemo-mechanics. We employ FROSch in fully algebraic mode, and therefore, we do not expect numerical scalability. Strong scalability is studied from 64 to 4096 cores, where good scaling results are obtained up to 1728 cores. The increasing size of the coarse problem increases the solution time for all sparse direct solvers.
@inproceedings{Heinlein:2023:CDS, author = {Heinlein, Alexander and Kiefer, Bj{\"o}rn and Pr{\"u}ger, Stefan and Rheinbach, Oliver and R{\"o}ver, Friederike}, booktitle = {10th edition of the International Conference on Computational Methods for Coupled Problems in Science and Engineering}, doi = {10.23967/c.coupled.2023.008}, title = {A Comparison Of Direct Solvers In FROSch Applied To Chemo-Mechanics}, url = {https://www.scipedia.com/public/_2023i}, year = {2023} }
IPDPS
An Experimental Study of Two-level Schwarz Domain-Decomposition Preconditioners on GPUs

Ichitaro Yamazaki, Alexander Heinlein, and Sivasankaran Rajamanickam

In 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Sep 2023

Abs Bib doi URL Preprint

The generalized Dryja–Smith–Widlund (GDSW) preconditioner is a two-level overlapping Schwarz domain decomposition (DD) preconditioner that couples a classical one-level overlapping Schwarz preconditioner with an energy-minimizing coarse space. When used to accelerate the convergence rate of Krylov subspace iterative methods, the GDSW preconditioner provides robustness and scalability for the solution of sparse linear systems arising from the discretization of a wide range of partial different equations. In this paper, we present FROSch (Fast and Robust Schwarz), a domain decomposition solver package which implements GDSW-type preconditioners for both CPU and GPU clusters. To improve the solver performance on GPUs, we use a novel decomposition to run multiple MPI processes on each GPU, reducing both solver’s computational and storage costs and potentially improving the convergence rate. This allowed us to obtain competitive or faster performance using GPUs compared to using CPUs alone. We demonstrate the performance of FROSch on the Summit supercomputer with NVIDIA V100 GPUs, where we used NVIDIA Multi-Process Service (MPS) to implement our decomposition strategy.The solver has a wide variety of algorithmic and implementation choices, which poses both opportunities and challenges for its GPU implementation. We conduct a thorough experimental study with different solver options including the exact or inexact solution of the local overlapping subdomain problems on a GPU. We also discuss the effect of using the iterative variant of the incomplete LU factorization and sparse-triangular solve as the approximate local solver, and using lower precision for computing the whole FROSch preconditioner. Overall, the solve time was reduced by factors of about 2× using GPUs, while the GPU acceleration of the numerical setup time depend on the solver options and the local matrix sizes.
@inproceedings{Yamazaki:2023:EST, author = {Yamazaki, Ichitaro and Heinlein, Alexander and Rajamanickam, Sivasankaran}, booktitle = {2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS)}, doi = {10.1109/IPDPS54959.2023.00073}, pages = {680-689}, title = {An Experimental Study of Two-level Schwarz Domain-Decomposition Preconditioners on GPUs}, url = {https://ieeexplore.ieee.org/document/10177474}, year = {2023} }
PAMM
Reduced order fluid modeling with generative adversarial networks

Mirko Kemna, Alexander Heinlein, and Cornelis Vuik

In Proc. Appl. Math. Mech., Sep 2023

Abs Bib doi URL

Surrogate models based on convolutional neural networks (CNNs) for computational fluid dynamics (CFD) simulations are investigated. In particular, the flow field inside two-dimensional channels with a sudden expansion and an obstacle is predicted using an image representation of the geometry as the input. Generative adversarial neural networks (GANs) have been shown to excel at such image-to-image translation tasks. This motivates the focus of this work on investigating the specific effect of adversarial training on model performance. Numerical results show that the overall accuracy of the GANs is generally lower compared to an identical generator model trained directly on the ground truth using an L1 data loss. On the other hand, GAN predictions are often visually more convincing and exhibit a lower continuity residual.
@inproceedings{Kemna:2023:FST, author = {Kemna, Mirko and Heinlein, Alexander and Vuik, Cornelis}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.202200241}, number = {1}, pages = {e202200241}, title = {Reduced order fluid modeling with generative adversarial networks}, url = {https://onlinelibrary.wiley.com/doi/10.1002/pamm.202200241}, volume = {23}, year = {2023} }
PAMM
First steps towards modeling the interaction of cardiovascular agents and smooth muscle activation in arterial walls

Sharan Nurani Ramesh, Klemens Uhlmann, Lea Saßmannshausen, and 4 more authors

In Proc. Appl. Math. Mech., Sep 2023

Abs Bib doi URL

A temporal homogenization approach for the numerical simulation of atherosclerotic plaque growth is extended to fully coupled fluid-structure interaction (FSI) simulations. The numerical results indicate that the two-scale approach yields significantly different results compared to a simple heuristic averaging, where only stationary long-scale FSI problems are solved, confirming the importance of incorporating stress variations on small time-scales. In the homogenization approach, a periodic fine-scale problem, which is periodic with respect to the heart beat, has to be solved for each long-scale time step. Even if no exact initial conditions are available, periodicity can be achieved within only 2–3 heart beats by simple time-stepping.
@inproceedings{Ramesh:2023:FST, author = {Ramesh, Sharan Nurani and Uhlmann, Klemens and Sa{\ss}mannshausen, Lea and Rheinbach, Oliver and Klawonn, Axel and Heinlein, Alexander and Balzani, Daniel}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.202200133}, number = {1}, pages = {e202200133}, title = {First steps towards modeling the interaction of cardiovascular agents and smooth muscle activation in arterial walls}, url = {https://onlinelibrary.wiley.com/doi/10.1002/pamm.202200133}, volume = {22}, year = {2023} }
Springer LNCSE
A Three-Level Extension for Fast and Robust Overlapping Schwarz (FROSch) Preconditioners with Reduced Dimensional Coarse Space

Alexander Heinlein, Axel Klawonn, Oliver Rheinbach, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXVI, Sep 2023

Abs Bib doi URL Preprint

The Fast and Robust Overlapping Schwarz (FROSch) preconditioner framework is part of the Trilinos software library and contains parallel implementations of the Generalized-Dryja-Smith-Widlund (GDSW) type overlapping Schwarz domain decomposition preconditioners. It provides implementations of the classical GDSW coarse space as well as of reduced dimensional GDSW coarse spaces, where the coarse problem is smaller compared to the classical approach. To extend the parallel scalability of these approaches, a three-level extension has recently been introduced into the framework. In this paper, we present results, obtained on the SuperMUC-NG supercomputer using up to 85K MPI ranks. The results indicate superior weak parallel scalability of the three-level method compared to the two-level method.
@inproceedings{Heinlein:2023:TLE, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and R\"over, Friederike}, booktitle = {Domain Decomposition Methods in Science and Engineering XXVI}, doi = {10.1007/978-3-030-95025-5_54}, pages = {505-513}, publisher = {Springer International Publishing}, title = {A Three-Level Extension for Fast and Robust Overlapping Schwarz (FROSch) Preconditioners with Reduced Dimensional Coarse Space}, url = {https://link.springer.com/chapter/10.1007/978-3-030-95025-5_54}, year = {2023} }
Springer LNCSE
Predicting the geometric location of critical edges in adaptive GDSW overlapping domain decomposition methods using deep learning

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXVI, Sep 2023

Abs Bib doi URL Preprint

Overlapping GDSW domain decomposition methods are considered for diffusion problems in two dimensions discretized by finite elements. For a diffusion coefficient with high contrast, the condition number is usually dependent on it. A remedy is given by adaptive domain decomposition methods, where the coarse space is enhanced by additional coarse basis functions. These are chosen problem-dependently by solving small local eigenvalue problems. Here, the eigenvalue problems (EVPs) are associated with the edges of the domain decomposition interface; edges, where these EVPs have to be solved are denoted as critical edges. For many applications, not all edges are critical and the solution of the EVPs is not necessary. In an earlier work, a strategy to predict the location of critical edges, based on deep learning, has been proposed for adaptive FETI-DP, a class of nonoverlapping methods. In the present work, this strategy is successfully applied to adaptive GDSW; differences in the classification process for this overlapping method are described. Choosing the classification threshold has been a challenge in all these approaches. Here, for the first time, a heuristic based on the receiver operating characteristic (ROC) curve and the precision-recall graph is discussed. Results for a challenging realistic coefficient function are presented.
@inproceedings{Heinlein:2023:PGL, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, booktitle = {Domain Decomposition Methods in Science and Engineering XXVI}, doi = {10.1007/978-3-030-95025-5_32}, pages = {307-315}, publisher = {Springer International Publishing}, title = {Predicting the geometric location of critical edges in adaptive GDSW overlapping domain decomposition methods using deep learning}, url = {https://link.springer.com/chapter/10.1007/978-3-030-95025-5_32}, year = {2023} }

2022

WCCM-ECCOMAS
Efficient coarse correction for parallel time-stepping in plaque growth simulations

Stefan Frei, and Alexander Heinlein

In ECCOMAS2022, Sep 2022

Abs Bib doi URL Preprint

In order to make the numerical simulation of atherosclerotic plaque growth feasible, a temporal homogenization approach is employed. The resulting macro-scale problem for the plaque growth can be further accelerated by using parallel time integration schemes, such as the parareal algorithm. However, the parallel scalability is dominated by the computational cost of the coarse propagator. Therefore, in this paper, an interpolation-based coarse propagator, which uses growth values from previously computed micro-scale problems, is introduced. For a simple model problem, it is shown that this approach reduces both the computational work for a single parareal iteration as well as the required number of parareal iterations.
@inproceedings{Frei:2022:ECC, author = {Frei, Stefan and Heinlein, Alexander}, booktitle = {ECCOMAS2022}, doi = {10.23967/eccomas.2022.015}, title = {Efficient coarse correction for parallel time-stepping in plaque growth simulations}, url = {https://www.scipedia.com/public/Frei_et_al_2022a}, year = {2022} }

2021

PAMM
On temporal homogenization in the numerical simulation of atherosclerotic plaque growth

Stefan Frei, Alexander Heinlein, and Thomas Richter

In Proc. Appl. Math. Mech., Sep 2021

Abs Bib doi URL Preprint

A temporal homogenization approach for the numerical simulation of atherosclerotic plaque growth is extended to fully coupled fluid-structure interaction (FSI) simulations. The numerical results indicate that the two-scale approach yields significantly different results compared to a simple heuristic averaging, where only stationary long-scale FSI problems are solved, confirming the importance of incorporating stress variations on small time-scales. In the homogenization approach, a periodic fine-scale problem, which is periodic with respect to the heart beat, has to be solved for each long-scale time step. Even if no exact initial conditions are available, periodicity can be achieved within only 2–3 heart beats by simple time-stepping.
@inproceedings{Frei:2021:THN, author = {Frei, Stefan and Heinlein, Alexander and Richter, Thomas}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.202100055}, number = {1}, pages = {e202100055}, title = {On temporal homogenization in the numerical simulation of atherosclerotic plaque growth}, url = {https://onlinelibrary.wiley.com/doi/10.1002/pamm.202100055}, volume = {21}, year = {2021} }
Springer LNCSE
Stationary flow predictions using convolutional neural networks

Matthias Eichinger, Alexander Heinlein, and Axel Klawonn

In Numerical Mathematics and Advanced Applications ENUMATH 2019, Sep 2021

Abs Bib doi Preprint

Computational Fluid Dynamics (CFD) simulations are a numerical tool to model and analyze the behavior of fluid flow. However, accurate simulations are generally very costly because they require high grid resolutions. In this paper, an alternative approach for computing flow predictions using Convolutional Neural Networks (CNNs) is described; in particular, a classical CNN as well as the U-Net architecture are used. First, the networks are trained in an expensive offline phase using flow fields computed by CFD simulations. Afterwards, the evaluation of the trained neural networks is very cheap. Here, the focus is on the dependence of the stationary flow in a channel on variations of the shape and the location of an obstacle. CNNs perform very well on validation data, where the averaged error for the best networks is below 3%. In addition to that, they also generalize very well to new data, with an averaged error below 10%.
@inproceedings{Eichinger:2021:SFP, author = {Eichinger, Matthias and Heinlein, Alexander and Klawonn, Axel}, booktitle = {Numerical Mathematics and Advanced Applications ENUMATH 2019}, doi = {10.1007/978-3-030-55874-1_53}, pages = {541--549}, publisher = {Springer International Publishing}, title = {Stationary flow predictions using convolutional neural networks}, year = {2021} }
Springer LNCSE
Fully algebraic two-level overlapping Schwarz preconditioners for elasticity problems

Alexander Heinlein, Christian Hochmuth, and Axel Klawonn

In Numerical Mathematics and Advanced Applications ENUMATH 2019, Sep 2021

Abs Bib doi Preprint

Different parallel two-level overlapping Schwarz preconditioners with Generalized Dryja-Smith-Widlund (GDSW) and Reduced dimension GDSW (RGDSW) coarse spaces for elasticity problems are considered. GDSW type coarse spaces can be constructed from the fully assembled system matrix, but they additionally need the index set of the interface of the corresponding nonoverlapping domain decomposition and the null space of the elasticity operator, i.e., the rigid body motions. In this paper, fully algebraic variants, which are constructed solely from the uniquely distributed system matrix, are compared to the classical variants which make use of this additional information; the fully algebraic variants use an approximation of the interface and an incomplete algebraic null space. Nevertheless, the parallel performance of the fully algebraic variants is competitive compared to the classical variants for a stationary homogeneous model problem and a dynamic heterogeneous model problem with coefficient jumps in the shear modulus; the largest parallel computations were performed on 4,096 MPI (Message Passing Interface) ranks. The parallel implementations are based on the Trilinos package FROSch.
@inproceedings{Heinlein:2021:FAT, author = {Heinlein, Alexander and Hochmuth, Christian and Klawonn, Axel}, booktitle = {Numerical Mathematics and Advanced Applications ENUMATH 2019}, doi = {10.1007/978-3-030-55874-1_52}, pages = {531--539}, publisher = {Springer International Publishing}, title = {Fully algebraic two-level overlapping {S}chwarz preconditioners for elasticity problems}, year = {2021} }
Springer LNCSE
Machine Learning in Adaptive Domain Decomposition Methods - Reducing the Effort in Sampling

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

In Numerical Mathematics and Advanced Applications ENUMATH 2019, Sep 2021

Abs Bib doi Preprint

The convergence rate of classic domain decomposition methods in general deteriorates severely for large discontinuities in the coefficient functions of the considered partial differential equation. To retain the robustness for such highly heterogeneous problems, the coarse space can be enriched by additional coarse basis functions. These can be obtained by solving local generalized eigenvalue problems on subdomain edges. In order to reduce the number of eigenvalue problems and thus the computational cost, we use a neural network to predict the geometric location of critical edges, i.e., edges where the eigenvalue problem is indispensable. As input data for the neural network, we use function evaluations of the coefficient function within the two subdomains adjacent to an edge. In the present article, we examine the effect of computing the input data only in a neighborhood of the edge, i.e., on slabs next to the edge. We show numerical results for both the training data as well as for a concrete test problem in form of a microsection subsection for linear elasticity problems. We observe that computing the sampling points only in one half or one quarter of each subdomain still provides robust algorithms.
@inproceedings{Heinlein:2021:MLA, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, booktitle = {Numerical Mathematics and Advanced Applications ENUMATH 2019}, doi = {10.1007/978-3-030-55874-1_58}, pages = {593--603}, publisher = {Springer International Publishing}, title = {Machine Learning in Adaptive Domain Decomposition Methods - Reducing the Effort in Sampling}, year = {2021} }
WCCM-ECCOMAS
Choosing the Subregions in Three-Level FROSch Preconditioners

Alexander Heinlein, Oliver Rheinbach, and Friederike Röver

In WCCM-ECCOMAS2020, Sep 2021

Abs Bib URL Preprint

Different graph partitioning methods, i.e., linear partioning, parallel hypergraph (PHG) partioning, and two approaches using ParMETIS, are considered to generate an unstructured decomposition of the second-level coarse operator of three-level FROSch (Fast and Robust Overlapping Schwarz) preconditioners in the Trilinos software library. In our context, the parallel hypergraph method shows the most consistent results.
@inproceedings{Heinlein:2021:CST, author = {Heinlein, Alexander and Rheinbach, Oliver and R\"over, Friederike}, booktitle = {WCCM-ECCOMAS2020}, title = {Choosing the Subregions in Three-Level FROSch Preconditioners}, url = {https://www.scipedia.com/public/Hainlein_et_al_2021}, year = {2021} }

2020

Springer LNCSE
Coarse Spaces for Nonlinear Schwarz Methods on Unstructured Grids

Alexander Heinlein, and Martin Lanser

In Domain Decomposition Methods in Science and Engineering XXV, Sep 2020

Abs Bib doi URL Preprint

In recent years, nonlinear domain decomposition (DD) methods for the solution of nonlinear partial differential equations as, e.g., ASPIN (Additive Schwarz Preconditioned Inexact Newton) or Nonlinear-FETI-DP (Nonlinear - Finite Element Tearing and Interconnecting - Dual-Primal), became popular. For several model problems, these approaches outperform classical inexact Newton methods, where a corresponding linear DD method is used to solve the linearized problems, in terms of linear and nonlinear iteration counts and time to solution. As in the linear case, in nonlinear DD methods, an appropriate coarse space is often necessary for robustness and numerical scalability. In this paper, a new multiplicative implementation of a coarse space for ASPIN as well as the related RASPEN (Restricted Additive Schwarz Preconditioned Exact Newton) method is suggested. Additionally, several coarse spaces, which are also applicable for unstructured meshes and domain decompositions, are suggested. Robustness and numerical scalability is shown for different homogeneous and heterogeneous p-Laplace problems in two spatial dimensions.
@inproceedings{Heinlein:2020:CSN, author = {Heinlein, Alexander and Lanser, Martin}, booktitle = {Domain Decomposition Methods in Science and Engineering XXV}, doi = {10.1007/978-3-030-56750-7_30}, pages = {268--276}, publisher = {Springer International Publishing}, title = {Coarse Spaces for Nonlinear {S}chwarz Methods on Unstructured Grids}, url = {https://link.springer.com/chapter/10.1007/978-3-030-56750-7_30}, year = {2020} }
Springer LNCSE
Machine Learning in Adaptive FETI-DP - A Comparison of Smart and Random Training Data

Alexander Heinlein, Axel Klawonn, Martin Lanser, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXV, Sep 2020

Abs Bib doi URL Preprint

Adaptive FETI-DP (Finite Element Tearing and Interconnecting - Dual-Primal) methods are considered for the solution of two-dimensional scalar elliptic model problems with complex coefficient distributions where large coefficient jumps can occur along or across the domain decomposition interface. The adaptive coarse space is obtained by solving certain generalized eigenvalue problems on subdomain edges. In order to reduce the number of eigenvalue problems, a machine learning based strategy using a neural network to predict the geometric location of critical edges can be applied in a preprocessing step. Here, the effect of different types of training data sets on the robustness of the machine learning adaptive FETI-DP algorithm is investigated. Therefore, the neural network is first trained on different data sets and then the machine learning model is evaluated for a coefficient distribution obtained from a realistic dual-phase steel microstructure. It can be observed that the best results are obtained using a priori knowledge (smart data), whereas purely random data yields bad results. However, by imposing some structure on the random data and increasing the size of the data set, the performance is comparable to the smart data.
@inproceedings{Heinlein:2020:MLA, author = {Heinlein, Alexander and Klawonn, Axel and Lanser, Martin and Weber, Janine}, booktitle = {Domain Decomposition Methods in Science and Engineering XXV}, doi = {10.1007/978-3-030-56750-7_24}, pages = {218--226}, publisher = {Springer International Publishing}, title = {Machine Learning in Adaptive FETI-DP - A Comparison of Smart and Random Training Data}, url = {https://link.springer.com/chapter/10.1007/978-3-030-56750-7_24}, year = {2020} }
Springer LNCSE
Local spectra of adaptive domain decomposition methods

Alexander Heinlein, Axel Klawonn, and Martin Kühn

In Domain Decomposition Methods in Science and Engineering XXV, Sep 2020

Abs Bib doi URL Preprint

We compare the spectra of local generalized eigenvalue problems in different adaptive coarse spaces for overlapping and nonoverlapping domain decomposition methods. In particular, we compare the AGDSW (Adaptive Generalized Dryja-Smith-Widlund), the OS-ACMS (Overlapping Schwarz-Approximate Component Mode Synthesis), and the SHEM (Spectral Harmonically Enriched Multiscale) coarse spaces for overlapping Schwarz methods, the GenEO (Generalized Eigenproblems in the Overlaps) coarse space for FETI-1 and BDD methods, and two approaches based on estimates for the P_D operator for FETI-DP and BDDC methods. Therefore, we consider eight different two-dimensional coefficient functions with jumps ranging from simple channels to a realistic microstructure of a dual-phase steel. We observe significant differences in the width of the gap between good and bad eigenvalues depending on the coefficient distribution. In addition to that, eigenvalue problems involving sophisticated but more expensive harmonic extensions or deluxe-scaling can reduce the number of bad eigenvalues.
@inproceedings{Heinlein:2020:LSA, author = {Heinlein, Alexander and Klawonn, Axel and K\"uhn, Martin}, booktitle = {Domain Decomposition Methods in Science and Engineering XXV}, doi = {10.1007/978-3-030-56750-7_18}, pages = {167--175}, publisher = {Springer International Publishing}, title = {Local spectra of adaptive domain decomposition methods}, url = {https://link.springer.com/chapter/10.1007/978-3-030-56750-7_18}, year = {2020} }
Springer LNCSE
FROSch: A Fast And Robust Overlapping Schwarz Domain Decomposition Preconditioner Based on Xpetra in Trilinos

Alexander Heinlein, Axel Klawonn, Sivasankaran Rajamanickam, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXV, Sep 2020

Abs Bib doi URL Preprint

A parallel two-level overlapping Schwarz domain decomposition preconditioner has been integrated into the Trilinos ShyLU-package. The preconditioner uses an energy-minimizing coarse space and can be constructed from an assembled sparse matrix. The software implements variants of the two-level overlapping Schwarz method from [Dohrmann, Klawonn, Widlund, SINUM 2008], where it was denoted Generalized Dryja, Smith, Widlund (GDSW). The implementation is based on [Heinlein, Klawonn, Rheinbach, SISC 2016] but has been improved significantly with respect to efficiency, generality, e.g., for the use of Tpetra instead of Epetra matrices, and its interface.
@inproceedings{Heinlein:2020:FRO, author = {Heinlein, Alexander and Klawonn, Axel and Rajamanickam, Sivasankaran and Rheinbach, Oliver}, booktitle = {Domain Decomposition Methods in Science and Engineering XXV}, doi = {10.1007/978-3-030-56750-7_19}, pages = {176--184}, publisher = {Springer International Publishing}, title = {{FROSch}: A Fast And Robust Overlapping {S}chwarz Domain Decomposition Preconditioner Based on {X}petra in {T}rilinos}, url = {https://link.springer.com/chapter/10.1007/978-3-030-56750-7_19}, year = {2020} }
Springer LNCSE
A Three-Level Extension of the GDSW Overlapping Schwarz Preconditioner in Three Dimensions

Alexander Heinlein, Axel Klawonn, Oliver Rheinbach, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXV, Sep 2020

Abs Bib doi URL Preprint

A three-level extension of the GDSW overlapping Schwarz preconditioner in three dimensions is presented, constructed by recursively applying the GDSW preconditioner to the coarse problem using a standard and a reduced dimension coarse space. Numerical results, obtained for a parallel implementation using the Trilinos software library, are presented for up to 64,000 cores of the JUQUEEN supercomputer. The superior weak parallel scalability of the three-level method is verified. For large problems and a large number of cores, the three-level method is faster by more than a factor of two, compared to the standard two-level method. The three-level method shows to scale when the classical method is already be out-of-memory.
@inproceedings{Heinlein:2020:TLE, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and R\"over, Friederike}, booktitle = {Domain Decomposition Methods in Science and Engineering XXV}, doi = {10.1007/978-3-030-56750-7_20}, pages = {185--192}, publisher = {Springer International Publishing}, title = {A Three-Level Extension of the {GDSW} Overlapping {S}chwarz Preconditioner in Three Dimensions}, url = {https://link.springer.com/chapter/10.1007/978-3-030-56750-7_20}, year = {2020} }

2019

Springer LNCSE
A Three-Level Extension of the GDSW Overlapping Schwarz Preconditioner in Two Dimensions

Alexander Heinlein, Axel Klawonn, Oliver Rheinbach, and 1 more author

In Advanced Finite Element Methods with Applications: Selected Papers from the 30th Chemnitz Finite Element Symposium 2017, Sep 2019

Abs Bib doi URL Preprint

A three-level extension of the GDSW overlapping Schwarz preconditioner in two dimensions is presented, constructed by recursively applying the GDSW preconditioner to the coarse problem. Numerical results, obtained for a parallel implementation using the Trilinos software library, are presented for up to 90,000 cores of the JUQUEEN supercomputer. The superior weak parallel scalability of the three-level method is verified. For large problems and a large number of cores, the three-level method is faster by more than a factor of two, compared to the standard two-level method . The three-level method can also be expected to scale when the classical method will already be out-of-memory.
@inproceedings{Heinlein:2019:TLE, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and R{\"o}ver, Friederike}, booktitle = {Advanced Finite Element Methods with Applications: Selected Papers from the 30th Chemnitz Finite Element Symposium 2017}, doi = {10.1007/978-3-030-14244-5_10}, pages = {187--204}, publisher = {Springer International Publishing}, title = {A Three-Level Extension of the {GDSW} Overlapping {S}chwarz Preconditioner in Two Dimensions}, url = {https://link.springer.com/chapter/10.1007%2F978-3-030-14244-5_10}, year = {2019} }
Springer LNCSE
Improving the Parallel Performance of Overlapping Schwarz Methods by Using a Smaller Energy Minimizing Coarse Space

Alexander Heinlein, Axel Klawonn, Oliver Rheinbach, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXIV, Sep 2019

Abs Bib doi URL

We consider a recent overlapping Schwarz method with an energy-minimizing coarse space of reduced size. In numerical experiments for up to 64,000 cores, we show that the parallel efficiency and the total time to solution is improved significantly, compared to our previous overlapping Schwarz method using an alternative energy-minimizing coarse space.
@inproceedings{Heinlein:2019:IPP, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and Widlund, Olof}, booktitle = {Domain Decomposition Methods in Science and Engineering XXIV}, doi = {10.1007/978-3-319-93873-8_36}, pages = {383--392}, publisher = {Springer International Publishing}, title = {Improving the Parallel Performance of Overlapping {S}chwarz Methods by Using a Smaller Energy Minimizing Coarse Space}, url = {https://link.springer.com/chapter/10.1007%2F978-3-319-93873-8_36}, year = {2019} }
Springer LNCSE
An Adaptive GDSW Coarse Space for Two-Level Overlapping Schwarz Methods in Two Dimensions

Alexander Heinlein, Axel Klawonn, Jascha Knepper, and 1 more author

In Domain Decomposition Methods in Science and Engineering XXIV, Sep 2019

Abs Bib doi URL Preprint

We propose robust coarse spaces for two-level overlapping Schwarz preconditioners, which are extensions of the energy minimizing coarse space known as GDSW (Generalized Dryja, Smith, Widlund). The resulting two-level methods with adaptive coarse spaces are robust for second order elliptic problems in two dimensions, even in presence of a highly heterogeneous coefficient function, and reduce to the standard GDSW algorithm if no additional coarse basis functions are used.
@inproceedings{Heinlein:2019:AGD2, author = {Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Rheinbach, Oliver}, booktitle = {Domain Decomposition Methods in Science and Engineering XXIV}, doi = {10.1007/978-3-319-93873-8_35}, pages = {373--382}, publisher = {Springer International Publishing}, title = {An Adaptive {GDSW} Coarse Space for Two-Level Overlapping {S}chwarz Methods in Two Dimensions}, url = {https://link.springer.com/chapter/10.1007/978-3-319-93873-8_35}, year = {2019} }
PAMM
Applying the FROSch Overlapping Schwarz Preconditioner to Dislocation Mechanics in Deal.II

Alexander Heinlein, Oliver Rheinbach, Friederike Röver, and 2 more authors

In Proc. Appl. Math. Mech., Sep 2019

Abs Bib doi URL Preprint

In this contribution, results regarding fluid-structure interaction (FSI) simulations for three-dimensional arterial walls are presented. In detail, a benchmark problem for FSI simulations in arteries of sufficient complexity, which combines sophisticated nonlinear models for the fluid and the structure, cf. [1], as well as a short segment from a patient-specific arterial geometry are considered. For the patient-specific arterial geometry a specific inflow profile suited for realistic geometries and simplified boundary conditions for the outflow are taken into account.
@inproceedings{Heinlein:2019:AFR, author = {Heinlein, Alexander and Rheinbach, Oliver and R\"over, Friederike and Sandfeld, Stefan and Steinberger, Dominik}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201900337}, number = {1}, pages = {e201900337}, title = {Applying the FROSch Overlapping Schwarz Preconditioner to Dislocation Mechanics in Deal.II}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201900337}, volume = {19}, year = {2019} }
PAMM
Comparison of MRI measurements and CFD simulations of hemodynamics in intracranial aneurysms using a 3D printed model - A benchmark problem

Daniel Giese, Alexander Heinlein, Axel Klawonn, and 2 more authors

In Proc. Appl. Math. Mech., Sep 2019

Abs Bib doi URL Preprint

A benchmark for the comparison of MRI (Magnetic Resonance Imaging) measurements and CFD (Computational Fluid Dynamics) simulations for blood flow in intracranial aneurysms is presented. The benchmark setting is designed to allow for CFD simulations that are completely independent of the MRI measurements. This facilitates a fair comparison of both methods. Furthermore, results showing the good agreement of MRI and CFD are presented.
@inproceedings{Giese:2019:CMRa, author = {Giese, Daniel and Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Sonnabend, Kristina}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201900398}, number = {1}, pages = {e201900398}, title = {Comparison of MRI measurements and CFD simulations of hemodynamics in intracranial aneurysms using a 3D printed model - A benchmark problem}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201900398}, volume = {19}, year = {2019} }
PAMM
Comparison of MRI measurements and CFD simulations of hemodynamics in intracranial aneurysms using a 3D printed model - Influence of noisy MRI measurements

Daniel Giese, Alexander Heinlein, Axel Klawonn, and 2 more authors

In Proc. Appl. Math. Mech., Sep 2019

Abs Bib doi URL Preprint

MRI (Magnetic Resonance Imaging) measurements and CFD (Computational Fluid Dynamics) simulations for blood flow in intracranial aneurysms are compared for a benchmark problem. In particular, it is shown that noise and other artifacts in the MRI measurements have an influence on certain properties of the flow field, e.g., on the boundary flow and mass conservation.
@inproceedings{Giese:2019:CMRb, author = {Giese, Daniel and Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Sonnabend, Kristina}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201900401}, number = {1}, pages = {e201900401}, title = {Comparison of MRI measurements and CFD simulations of hemodynamics in intracranial aneurysms using a 3D printed model - Influence of noisy MRI measurements}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201900401}, volume = {19}, year = {2019} }

2018

PAMM
Remarks on Fluid-Structure Interaction Simulations in Realistic Arterial Geometries with regard to the Transmural Stress Distribution

Simon Fausten, Daniel Balzani, Alexander Heinlein, and 3 more authors

In Proc. Appl. Math. Mech., Sep 2018

Abs Bib doi URL

In this contribution, Fluid-Structure-Interaction (FSI) in blood vessels, in detail the simulation of realistic arterial geometries, where the interaction of the blood flow and the vessel wall is of special interest, is considered. Based on pervious research, cf. [1], our existing framework for FSI-simulations is extended towards realistic arterial geometries. The inflow and outflow boundary conditions for the fluid, as well as the boundary conditions for the structure are enhanced and adjusted to the chosen patient-specific geometry. In detail, an inflow profile for arbitrary shaped inflow cross-sections and a zero pressure boundary condition at the outflow are applied. Furthermore, the vessel wall is discretized using realistic material parameters of the media layer. The geometry and material parameters are adopted from [2]. In order to deal with the increasing complexity of the boundary value problem parallel computing and a two-level overlapping Schwarz method with energy-minimizing coarse space are applied; cf. [3]. The numerical simulations are performed using the Open-Source software LifeV, in particular a code which has been developed in cooperation with the group of Prof. Quarteroni from the EPF Lausanne.
@inproceedings{Fausten:2018:RFS, author = {Fausten, Simon and Balzani, Daniel and Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and Schr\"oder, J\"org}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201800312}, number = {1}, pages = {e201800312}, title = {Remarks on Fluid-Structure Interaction Simulations in Realistic Arterial Geometries with regard to the Transmural Stress Distribution}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201800312}, volume = {18}, year = {2018} }

2017

Springer LNCSE
Parallel Overlapping Schwarz with an Energy-Minimizing Coarse Space

Alexander Heinlein, Axel Klawonn, and Oliver Rheinbach

In Domain Decomposition Methods in Science and Engineering XXIII, Sep 2017

Abs Bib doi URL Preprint

Parallel results obtained with a new implementation of an overlapping Schwarz method using an energy minimizing coarse space are presented. We consider structured and unstructured domain decompositions for scalar elliptic and linear elasticity model problems in two dimensions. In particular, strong and weak parallel scalability studies for up to 1024 processor cores are presented for both types of problems. Additionally, weak scalability results for a three-dimensional linear elasticity model problem using up to 4096 processor cores are discussed. Finally, an application from fully-coupled fluid-structure interaction using a nonlinear hyperelastic material model for the structure is shown.
@inproceedings{Heinlein:2017:POS, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver}, booktitle = {Domain Decomposition Methods in Science and Engineering XXIII}, doi = {10.1007/978-3-319-52389-7_36}, pages = {353--360}, publisher = {Springer International Publishing}, title = {Parallel Overlapping Schwarz with an Energy-Minimizing Coarse Space}, url = {https://link.springer.com/chapter/10.1007%2F978-3-319-52389-7_36}, year = {2017} }
PAMM
Steps Towards More Realistic FSI Simulations of Coronary Arteries

Simon Fausten, Daniel Balzani, Alexander Heinlein, and 3 more authors

In Proc. Appl. Math. Mech., Sep 2017

Abs Bib doi URL

In this contribution, results regarding fluid-structure interaction (FSI) simulations for three-dimensional arterial walls are presented. In detail, a benchmark problem for FSI simulations in arteries of sufficient complexity, which combines sophisticated nonlinear models for the fluid and the structure, cf. [1], as well as a short segment from a patient-specific arterial geometry are considered. For the patient-specific arterial geometry a specific inflow profile suited for realistic geometries and simplified boundary conditions for the outflow are taken into account.
@inproceedings{Fausten:2017:STM, author = {Fausten, Simon and Balzani, Daniel and Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver and Schr\"oder, J\"org}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201710063}, number = {1}, pages = {187-188}, title = {Steps Towards More Realistic FSI Simulations of Coronary Arteries}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201710063}, volume = {17}, year = {2017} }

2016

Springer LNCSE
Parallel Two-Level Overlapping Schwarz Methods in Fluid-Structure Interaction

Alexander Heinlein, Axel Klawonn, and Oliver Rheinbach

In Numerical Mathematics and Advanced Applications ENUMATH 2015, Sep 2016

Abs Bib doi URL Preprint

Parallel overlapping Schwarz preconditioners are considered and applied to the structural block in monolithic fluid-structure interaction (FSI). The two-level overlapping Schwarz method uses a coarse level based on energy minimizing functions. Linear elastic as well as nonlinear, anisotropic hyperelastic structural models are considered in an FSI problem of a pressure wave in a tube. Using our recent parallel implementation of a two-level overlapping Schwarz preconditioner based on the Trilinos library, the total computation time of our FSI benchmark problem was reduced by more than a factor of two compared to the algebraic one-level overlapping Schwarz method used previously. Finally, also strong scalability for our FSI problem is shown for up to 512 processor cores.
@inproceedings{Heinlein:2016:PTO, author = {Heinlein, Alexander and Klawonn, Axel and Rheinbach, Oliver}, booktitle = {Numerical Mathematics and Advanced Applications ENUMATH 2015}, doi = {10.1007/978-3-319-39929-4_50}, pages = {521--530}, publisher = {Springer International Publishing}, title = {Parallel Two-Level Overlapping Schwarz Methods in Fluid-Structure Interaction}, url = {https://link.springer.com/chapter/10.1007/978-3-319-39929-4_50}, year = {2016} }

2015

PAMM
A Comparison of Preconditioners for the Steklov–Poincaré Formulation of the Fluid-Structure Coupling in Hemodynamics

Simone Deparis, Davide Forti, Alexander Heinlein, and 3 more authors

In Proc. Appl. Math. Mech., Sep 2015

Abs Bib doi URL

A Fluid-Structure Interaction (FSI) problem can be reinterpreted as a heterogeneous problem with two subdomains. It is possible to describe the coupled problem at the interface between the fluid and the structure, yielding a nonlinear Steklov-Poincaré problem. The linear system can be linearized by Newton iterations on the interface and the resulting linear problem can be solved by the preconditioned GMRES method. In this work we investigate the behavior of preconditioners of Neumann-Neumann and Dirichlet-Neumann type. We find that, in the context of hemodynamics, the Dirichlet-Neumann, i.e., using Dirichlet boundary conditions on the fluid side and Neumann on the structure side, outperforms the Neumann-Neumann method, except when a weighting is used such that it basically reduces to the Dirichlet-Neumann method.
@inproceedings{Deparis:2015:CPS, author = {Deparis, Simone and Forti, Davide and Heinlein, Alexander and Klawonn, Axel and Quarteroni, Alfio and Rheinbach, Oliver}, booktitle = {Proc. Appl. Math. Mech.}, doi = {10.1002/pamm.201510037}, number = {1}, pages = {93-94}, title = {A Comparison of Preconditioners for the Steklov--Poincar\'e Formulation of the Fluid-Structure Coupling in Hemodynamics}, url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/pamm.201510037}, volume = {15}, year = {2015} }

2014

WCCM XI

Aspects of Arterial Wall Simulations: Nonlinear Anisotropic Material Models and Fluid Structure Interaction

Daniel Balzani, Simone Deparis, Simon Fausten, and 6 more authors

In Proceedings of the WCCM XI, Sep 2014

Bib

@inproceedings{Balzani:2014:AAW,
  author = {Balzani, Daniel and Deparis, Simone and Fausten, Simon and Forti, Davide and Heinlein, Alexander and Klawonn, Axel and Quarteroni, Alfio and Rheinbach, Oliver and Schr\"oder, J\"org},
  booktitle = {Proceedings of the WCCM XI},
  title = {Aspects of Arterial Wall Simulations: Nonlinear Anisotropic Material Models and Fluid Structure Interaction},
  year = {2014}
}

Theses

2016

PhD Thesis
Parallel Overlapping Schwarz Preconditioners and Multiscale Discretizations with Applications to Fluid-Structure Interaction and Highly Heterogeneous Problems

Alexander Heinlein

Universität zu Köln, Sep 2016

Abs Bib URL

Accurate simulations of transmural wall stresses in artherosclerotic coronary arteries may help to predict plaque rupture. Therefore, a robust and efficient numerical framework for Fluid-Structure Interaction (FSI) of the blood flow and the arterial wall has to be set up, and suitable material laws for the modeling of the fluid and the structural response have to be incorporated. In this thesis, monolithic coupling algorithms and corresponding monolithic preconditioners are used to simulate FSI using highly nonlinear anisotropic polyconvex hyperelastic and anisotropic viscoelastic material models for the arterial wall. An MPI-parallel FSI software from the LifeV library is coupled to the software FEAP in order to enable access to the structural material models implemented in FEAP. To define a benchmark test for highly nonlinear material models in FSI, a simple geometry corresponding to a section of an idealized coronary artery, suitable boundary conditions, and material parameters adapted to experimental data are used. In particular, the geometry is chosen to be nonsymmetric to make effects due to the anisotropy of the structure visible. An initialization phase and several heartbeats are simulated, and systematical studies with meshes of increasing refinement and different space discretizations are carried out. The results indicate that, for the highly nonlinear material models, piecewise quadratic or F-bar element discretizations lead to significantly better results than piecewise linear shape functions. The results using piecewise linear shape functions are less accurate with respect to the displacements and, in particular, to the approximation of the stresses. To improve the performance of the FSI simulations, a more robust preconditioner for the highly nonlinear structural material models has to be used. Therefore, a parallel implementation of the GDSW (Generalized Dryja-Smith-Widlund) preconditioner, which is a geometric two-level overlapping Schwarz preconditioner with energy-minimizing coarse space, is presented. The implementation, which is based on the software library Trilinos, is held flexible to make further extensions of the preconditioner possible. Even though the dimension of its coarse space is comparably large, parallel scalability for two and three dimensional scalar elliptic and linear elastic problems for thousands of cores is demonstrated. Also for unstructured domain decompositions and for a hybrid version of the preconditioner, convincing scalability is presented. When used as a preconditioner for the structure block in FSI simulations, the GDSW preconditioner shows excellent performance as well: scalability for up to 512 cores and a significant reduction of the simulation time and of the number of iterations with respect to the previously used preconditioner, IFPACK, are observed. IFPACK is an algebraic one-level overlapping Schwarz preconditioner. Finally, highly heterogeneous (multiscale) problems are investigated. Since the GDSW coarse space is not robust for general problems of this type, spaces based on Approximate Component Mode Synthesis (ACMS) are considered. On the basis of the ACMS space, coarse spaces for overlapping Schwarz methods are constructed, and a parallel implementation of a special finite element method is presented. For the coarse spaces, preliminary results indicating numerical scalability and robustness are discussed. For the parallel implementation of the special finite element method, very good parallel weak scalability is observed with respect to the construction of the basis functions and to the solution of the resulting linear system using the FETI-DP (Finite Element Tearing and Interconnecting - Dual Primal) method.
@phdthesis{Heinlein:2016:POS, author = {Heinlein, Alexander}, school = {Universit{\"a}t zu K{\"o}ln}, title = {Parallel Overlapping Schwarz Preconditioners and Multiscale Discretizations with Applications to Fluid-Structure Interaction and Highly Heterogeneous Problems}, url = {https://kups.ub.uni-koeln.de/6841}, year = {2016} }

Technical Reports

2026

AI&PDE
Resolving Extreme Data Scarcity by Explicit Physics Integration: An Application to Groundwater Heat Transport

Julia Pelzer, Corné Verburg, Alexander Heinlein, and 1 more author

Mar 2026

Accepted for poster presentation at AI&PDE

Abs Bib Preprint

Machine learning methods often struggle with real-world applications in science and engineering due to an insufficient amount or quality of training data. In this work, the example of subsurface porous media flow is considered; this corresponds to advection-diffusion processes under heterogeneous flow conditions, i.e., for spatially varying material parameters, and a large number of spatially distributed source terms. This challenge comes at high computing costs for classical simulation methods due to the required high spatio-temporal resolution and large domains. Machine learning-based surrogate models seem to offer a computationally efficient alternative. However, faced with real-world data-limitations, purely data-driven approaches face difficulties in predicting the advection process, which is highly sensitive to input variations and involves long-range interactions. Therefore, in this work, a Local-Global Convolutional Neural Network (LGCNN) approach is introduced, that combines a lightweight numerical surrogate for the global transport process with convolutional neural networks (CNNs) for the remaining local processes. With the LGCNN, we model a city-wide subsurface temperature field, involving a heterogeneous groundwater flow field and one hundred groundwater heat pump injection points forming interacting heat plumes over long distances. In order to first systematically analyze the method, random subsurface input fields are employed. Then, the model is trained on a single cut-out from a real-world subsurface map of the Munich region in Germany. Our model scales to larger cut-outs without retraining by accounting for the global effects with numerical physics models. All datasets, our code, and trained models are published for reproducibility.
@techreport{Pelzer:2025:FLE, author = {Pelzer, Julia and Verburg, Corné and Heinlein, Alexander and Schulte, Miriam}, month = mar, title = {Resolving Extreme Data Scarcity by Explicit Physics Integration: An Application to Groundwater Heat Transport}, year = {2026} }
ACM TOMS
Trilinos: Enabling Scientific Computing Across Diverse Hardware Architectures at Scale

Matthias Mayr, Alexander Heinlein, Christian Glusa, and 30 more authors

Mar 2026

Accepted for publication in ACM Transactions on Mathematical Software

Abs Bib Preprint

Trilinos is a community-developed, open-source software framework that facilitates building large-scale, complex, multiscale, multiphysics simulation code bases for scientific and engineering problems. Since the Trilinos framework has undergone substantial changes to support new applications and new hardware architectures, this document is an update to “An Overview of the Trilinos project” by Heroux et al. (ACM Transactions on Mathematical Software, 31(3):397–423, 2005). It describes the design of Trilinos, introduces its new organization in product areas, and highlights established and new features available in Trilinos. Particular focus is put on the modernized software stack based on the Kokkos ecosystem to deliver performance portability across heterogeneous hardware architectures. This paper also outlines the organization of the Trilinos community and the contribution model to help onboard interested users and contributors.
@techreport{Mayr:2025:TES, author = {Mayr, Matthias and Heinlein, Alexander and Glusa, Christian and Rajamanickam, Sivasankaran and Arnst, Maarten and Bartlett, Roscoe A. and Berger-Vergiat, Luc and Boman, Erik G. and Devine, Karen and Harper, Graham and Heroux, Michael and Hoemmen, Mark and Hu, Jonathan and Kelley, Brian and Kim, Kyungjoo and Kouri, Drew P. and Kuberry, Paul and Liegeois, Kim and Ober, Curtis C. and Pawlowski, Roger and Pearson, Carl and Perego, Mauro and Phipps, Eric and Ridzal, Denis and Roberts, Nathan V. and Siefert, Christopher M. and Thornquist, Heidi K. and Tomasetti, Romin and Trott, Christian R. and Tuminaro, Raymond S. and Willenbring, James M. and Wolf, Michael M. and Yamazaki, Ichitaro}, month = mar, note = {Accepted for publication in ACM Transactions on Mathematical Software}, title = {Trilinos: Enabling Scientific Computing Across Diverse Hardware Architectures at Scale}, year = {2026} }
Springer LNCSE
Sharpened PCG Iteration Bound for High-Contrast Heterogeneous Scalar Elliptic PDEs

Philip Soliman, Filipe Cumaru, and Alexander Heinlein

Oct 2026

Accepted for publication in Springer Lecture Notes in Computational Science and Engineering

Abs Bib Preprint

We present a new iteration bound for the preconditioned conjugate gradient (PCG) method that accurately captures convergence for systems with clustered eigenspectra, where the classical condition number-based bound is too pessimistic. By using the extremal values of each subcluster in the spectral distribution, our bound is orders of magnitude sharper than the classical bound. We demonstrate its effectiveness on a high-contrast elliptic PDE preconditioned with a two-level overlapping Schwarz method, where it successfully distinguishes between the performance of different (algebraic) preconditioners. A key contribution is theoretical evidence that, for certain high-contrast problems, PCG convergence can make simpler coarse spaces competitive. Conversely, more complex preconditioners are not always necessary. Finally, we show that our bound can be estimated effectively from Ritz values computed during early PCG iterations.
@techreport{Soliman:2026:SCG, author = {Soliman, Philip and Cumaru, Filipe and Heinlein, Alexander}, month = oct, note = {Accepted for publication in Springer Lecture Notes in Computational Science and Engineering}, title = {Sharpened PCG Iteration Bound for High-Contrast Heterogeneous Scalar Elliptic PDEs}, year = {2026} }
The Error of Deep Operator Networks Is the Sum of Its Parts: Branch-Trunk and Mode Error Decompositions

Alexander Heinlein, and Johannes Taraz

Feb 2026

Submitted

Abs Bib Preprint

Operator learning has the potential to strongly impact scientific computing by learning solution operators for differential equations, potentially accelerating multi-query tasks such as design optimization and uncertainty quantification by orders of magnitude. Despite proven universal approximation properties, deep operator networks (DeepONets) often exhibit limited accuracy and generalization in practice, which hinders their adoption. Understanding these limitations is therefore crucial for further advancing the approach. This work analyzes performance limitations of the classical DeepONet architecture. It is shown that the approximation error is dominated by the branch network when the internal dimension is sufficiently large, and that the learned trunk basis can often be replaced by classical basis functions without a significant impact on performance. To investigate this further, a modified DeepONet is constructed in which the trunk network is replaced by the left singular vectors of the training solution matrix. This modification yields several key insights. First, a spectral bias in the branch network is observed, with coefficients of dominant, low-frequency modes learned more effectively. Second, due to singular-value scaling of the branch coefficients, the overall branch error is dominated by modes with intermediate singular values rather than the smallest ones. Third, using a shared branch network for all mode coefficients, as in the standard architecture, improves generalization of small modes compared to a stacked architecture in which coefficients are computed separately. Finally, strong and detrimental coupling between modes in parameter space is identified.
@techreport{Heinlein:2026:EON, author = {Heinlein, Alexander and Taraz, Johannes}, month = feb, note = {Submitted}, title = {The Error of Deep Operator Networks Is the Sum of Its Parts: Branch-Trunk and Mode Error Decompositions}, year = {2026} }
Are Deep Learning Based Hybrid PDE Solvers Reliable? Why Training Paradigms and Update Strategies Matter

Yuhan Wu, Jan Willem Beek, Victorita Dolean, and 1 more author

Feb 2026

Submitted

Abs Bib Preprint

Deep learning-based hybrid iterative methods (DL-HIMs) integrate classical numerical solvers with neural operators, utilizing their complementary spectral biases to accelerate convergence. Despite this promise, many DL-HIMs stagnate at false fixed points where neural updates vanish while the physical residual remains large, raising questions about reliability in scientific computing. In this paper, we provide evidence that performance is highly sensitive to training paradigms and update strategies, even when the neural architecture is fixed. Through a detailed study of a DeepONet-based hybrid iterative numerical transferable solver (HINTS) and an FFT-based Fourier neural solver (FNS), we show that significant physical residuals can persist when training objectives are not aligned with solver dynamics and problem physics. We further examine Anderson acceleration (AA) and demonstrate that its classical form is ill-suited for nonlinear neural operators. To overcome this, we introduce physics-aware Anderson acceleration (PA-AA), which minimizes the physical residual rather than the fixed-point update. Numerical experiments confirm that PA-AA restores reliable convergence in substantially fewer iterations. These findings provide a concrete answer to ongoing controversies surrounding AI-based PDE solvers: reliability hinges not only on architectures but on physically informed training and iteration design.
@techreport{Wu:2026:DLHIM, author = {Wu, Yuhan and van Beek, Jan Willem and Dolean, Victorita and Heinlein, Alexander}, note = {Submitted}, title = {Are Deep Learning Based Hybrid {PDE} Solvers Reliable? Why Training Paradigms and Update Strategies Matter}, year = {2026} }

2025

Springer LNCSE
Nonlinear Two-Level Schwarz Methods: A Parallel Implementation in FROSch

Alexander Heinlein, Kyrill Ho, Axel Klawonn, and 1 more author

Jul 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib

Owing to the ability of nonlinear domain decomposition methods to improve the nonlinear convergence behavior of Newton’s method, they have experienced a rise in popularity recently in the context of problems for which Newton’s method converges slowly or not at all. This article introduces a novel parallel implementation of a two-level nonlinear Schwarz solver based on the FROSch (Fast and Robust Overlapping Schwarz) solver framework, part of Sandia’s Trilinos library. First, an introduction to the key concepts underlying two-level nonlinear Schwarz methods is given, including a brief overview of the coarse space used to build the second level. Next, the parallel implementation is discussed, followed by preliminary parallel results for a scalar nonlinear diffusion problem and a 2D nonlinear plane-stress Neo-Hooke elasticity problem with large deformations.
@techreport{Heinlein:2025:NTL, author = {Heinlein, Alexander and Ho, Kyrill and Klawonn, Axel and Lanser, Martin}, month = jul, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {Nonlinear Two-Level Schwarz Methods: A Parallel Implementation in FROSch}, year = {2025} }
Springer LNCSE
Domain decomposition architectures and Gauss–Newton training for physics-informed neural networks

Alexander Heinlein, and Taniya Kapoor

Feb 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib Preprint

Approximating the solutions of boundary value problems governed by partial differential equations with neural networks is challenging, largely due to the difficult training process. This difficulty can be partly explained by the spectral bias, that is, the slower convergence of high-frequency components, and can be mitigated by localizing neural networks via (overlapping) domain decomposition. We combine this localization with the Gauss–Newton method as the optimizer to obtain faster convergence than gradient-based schemes such as Adam; this comes at the cost of solving an ill-conditioned linear system in each iteration. Domain decomposition induces a block-sparse structure in the otherwise dense Gauss–Newton system, reducing the computational cost per iteration. Our numerical results indicate that combining localization and Gauss–Newton optimization is promising for neural network-based solvers for partial differential equations.
@techreport{Heinlein:2025:DDA, author = {Heinlein, Alexander and Kapoor, Taniya}, month = feb, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {Domain decomposition architectures and Gauss--Newton training for physics-informed neural networks}, year = {2025} }
Springer LNCSE
Two-level deep domain decomposition method

Victorita Dolean, Serge Gratton, Alexander Heinlein, and 1 more author

Jul 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib Preprint

This study presents a two-level Deep Domain Decomposition Method (Deep-DDM) augmented with a coarse-level network for solving boundary value problems using physics-informed neural networks (PINNs). The addition of the coarse level network improves scalability and convergence rates compared to the single level method. Tested on a Poisson equation with Dirichlet boundary conditions, the two-level deep DDM demonstrates superior performance, maintaining efficient convergence regardless of the number of subdomains. This advance provides a more scalable and effective approach to solving complex partial differential equations with machine learning.
@techreport{Dolean:2025:TDD, author = {Dolean, Victorita and Gratton, Serge and Heinlein, Alexander and Mercier, Valentin}, month = jul, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {Two-level deep domain decomposition method}, year = {2025} }
Springer LNCSE
Coarse Spaces Based on Higher-Order Interpolation for Schwarz Preconditioners for Helmholtz Problems

Erik Sieburgh, Alexander Heinlein, Vandana Dwarka, and 1 more author

Jul 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib Preprint

The development of scalable and wavenumber-robust iterative solvers for Helmholtz problems is challenging but also relevant for various application fields. In this work, two-level Schwarz domain decomposition preconditioners are enhanced by coarse space constructed using higher-order Bézier interpolation. The numerical results indicate numerical scalability and robustness with respect the wavenumber, as long as the wavenumber times the element size of the coarse mesh is sufficiently low.
@techreport{Sieburgh:2025:CSB, author = {Sieburgh, Erik and Heinlein, Alexander and Dwarka, Vandana and Vuik, Cornelis}, month = jul, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {Coarse Spaces Based on Higher-Order Interpolation for Schwarz Preconditioners for Helmholtz Problems}, year = {2025} }
Springer LNCSE
Domain decomposition method with randomized neural networks

Yong Shang, Alexander Heinlein, Siddhartha Mishra, and 1 more author

Jul 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib

An approach for integrating randomized neural networks (RaNNs) with an overlapping Schwarz domain decomposition technique is developed to address multi-scale problems. Neural networks within each subdomain are interconnected using local window functions, enabling the construction of a global solution that serves as an approximation to the solution of the partial differential equation (PDE). Via a constraining operator the boundary conditions are enforced in a straightforward manner. The optimization of the parameters in the last layer of the RaNN yields a least-squares problem, leading to accurate solutions for the PDE. The introduction of a two-level domain decomposition method allows for capturing diverse scaling information, thereby enhancing the performance of the method.
@techreport{Shang:2025:DDN, author = {Shang, Yong and Heinlein, Alexander and Mishra, Siddhartha and Wang, Fei}, month = jul, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {Domain decomposition method with randomized neural networks}, year = {2025} }
ETNA
Monolithic and Block Overlapping Schwarz Preconditioners for the Incompressible Navier–Stokes Equations

Alexander Heinlein, Axel Klawonn, Jascha Knepper, and 1 more author

Feb 2025

Accepted for publication in Electronic Transactions on Numerical Analysis (ETNA)

Abs Bib Preprint

Monolithic preconditioners applied to the linear systems arising during the solution of the discretized incompressible Navier–Stokes equations are typically more robust than preconditioners based on incomplete block factorizations. Lower number of iterations and a reduced sensitivity to parameters like velocity and viscosity can significantly outweigh the additional cost for their setup. Different monolithic preconditioning techniques are introduced and compared to a selection of block preconditioners. In particular, two-level additive overlapping Schwarz methods (OSM) are used to set up monolithic preconditioners and to approximate the inverses arising in the block preconditioners. GDSW-type (Generalized Dryja–Smith–Widlund) coarse spaces are used for the second level. These highly scalable, parallel preconditioners have been implemented in the solver framework FROSch (Fast and Robust Overlapping Schwarz), which is part of the software library Trilinos. The new GDSW-type coarse space GDSW* is introduced; combining it with other techniques results in a robust algorithm. The block preconditioners PCD (Pressure Convection–Diffusion), SIMPLE (Semi-Implicit Method for Pressure Linked Equations), and LSC (Least-Squares Commutator) are considered to various degrees. The OSM for the monolithic as well as the block approach allows the optimized combination of different coarse spaces for the velocity and pressure component, enabling the use of tailored coarse spaces. The numerical and parallel performance of the different preconditioning methods for finite element discretizations of stationary as well as time-dependent incompressible fluid flow problems is investigated and compared. Their robustness is analyzed for a range of Reynolds and Courant-Friedrichs-Lewy (CFL) numbers with respect to a realistic problem setting.
@techreport{Heinlein:2025:MBO, author = {Heinlein, Alexander and Klawonn, Axel and Knepper, Jascha and Sa{\ss}mannshausen, Lea}, month = feb, note = {Accepted for publication in Electronic Transactions on Numerical Analysis (ETNA)}, title = {Monolithic and Block Overlapping Schwarz Preconditioners for the Incompressible Navier--Stokes Equations}, year = {2025} }
Springer LNCSE
A computational study of algebraic coarse spaces for two-level overlapping additive Schwarz preconditioners

Filipe A. C. S. Alves, Alexander Heinlein, and Hadi Hajibeygi

May 2025

Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)

Abs Bib Preprint

The two-level overlapping additive Schwarz method offers a robust and scalable preconditioner for various linear systems resulting from elliptic problems. One of the key to these properties is the construction of the coarse space used to solve a global coupling problem, which traditionally requires information about the underlying discretization. An algebraic formulation of the coarse space reduces the complexity of its assembly. Furthermore, well-chosen coarse basis functions within this space can better represent changes in the problem’s properties. Here we introduce an algebraic formulation of the multiscale finite element method (MsFEM) based on the algebraic multiscale solver (AMS) in the context of the two-level Schwarz method. We show how AMS is related to other energy-minimizing coarse spaces. Furthermore, we compare the AMS with other algebraic energy-minimizing spaces: the generalized Dryja–Smith–Widlund (GDSW), and the reduced dimension GDSW (RGDSW).
@techreport{Alves:2024:CSA, author = {Alves, Filipe A. C. S. and Heinlein, Alexander and Hajibeygi, Hadi}, month = may, note = {Accepted for publication in the proceedings of the 28th International Conference on Domain Decomposition Methods (DD28)}, title = {A computational study of algebraic coarse spaces for two-level overlapping additive Schwarz preconditioners}, year = {2025} }
Two-level additive Schwarz preconditioners for reduced integration methods

Filipe Cumaru, Alexander Heinlein, and Joachim Schöberl

Nov 2025

Submitted

Abs Bib Preprint

Incompressible fluid flow problems appear frequently in different applications. The discretization of such problems may result in large and ill-conditioned systems of linear equations. We consider the case of the Stokes equations discretized using a reduced integration method which approximates the incompressibility constraint by a penalty term thus allowing the problem to be solved only in terms of the velocity unknowns. We investigate the numerical scalability of a two-level overlapping additive Schwarz method with a reduced dimension generalized Dryja-Smith-Widlund (RGDSW) coarse space. In addition, we discuss the parallel implementation of the examples using the Fast and Robust Overlapping Schwarz (FROSch) package for additive Schwarz preconditioners and the NGSolve library, which implements multiple finite element space formulations.
@techreport{Cumaru:2025:DDG, author = {Cumaru, Filipe and Heinlein, Alexander and Sch\"oberl, Joachim}, month = nov, note = {Submitted}, title = {Two-level additive Schwarz preconditioners for reduced integration methods}, year = {2025} }
Domain-Decomposed Graph Neural Network Surrogate Modeling for Ice Sheets

Adrienne M. Propp, Mauro Perego, Eric C. Cyr, and 5 more authors

Nov 2025

Abs Bib Preprint

Accurate yet efficient surrogate models are essential for large-scale simulations of partial differential equations (PDEs), particularly for uncertainty quantification (UQ) tasks that demand hundreds or thousands of evaluations. We develop a physics-inspired graph neural network (GNN) surrogate that operates directly on unstructured meshes and leverages the flexibility of graph attention. To improve both training efficiency and generalization properties of the model, we introduce a domain decomposition (DD) strategy that partitions the mesh into subdomains, trains local GNN surrogates in parallel, and aggregates their predictions. We then employ transfer learning to fine-tune models across subdomains, accelerating training and improving accuracy in data-limited settings. Applied to ice sheet simulations, our approach accurately predicts full-field velocities on high-resolution meshes, substantially reduces training time relative to a single global surrogate model, and provides a ripe foundation for UQ objectives. Our results demonstrate that graph-based DD, combined with transfer learning, provides a scalable and reliable pathway for training GNN surrogates on massive PDE-governed systems, with broad potential for application beyond ice sheet dynamics.
@techreport{Propp:2025:DDG, author = {Propp, Adrienne M. and Perego, Mauro and Cyr, Eric C. and Gruber, Anthony and Howard, Amanda and Heinlein, Alexander and Stinis, Panos and Tartakovsky, Daniel}, month = nov, note = {}, title = {Domain-Decomposed Graph Neural Network Surrogate Modeling for Ice Sheets}, year = {2025} }
The Mie Scattering Theory as Benchmark Problem for the Time-Harmonic Maxwell’s Equations

Alexander Heinlein, Sebastian Kinnewig, and Thomas Wick

Oct 2025

Submitted

Abs Bib Preprint

We introduce a near-field problem as a benchmark for the time-harmonic Maxwell (THM) equations. Therein, the near-field, i.e., the electric field directly around a nanoparticle, is computed using the finite element method (FEM) with Nédélec elements. Afterwards, this is compared to results with analytical solutions obtained from Mie’s scattering theory. By establishing this benchmark, we aim to provide a reliable and reproducible standard for validating numerical methods for the THM equations.
@techreport{Heinlein:2025:MST, author = {Heinlein, Alexander and Kinnewig, Sebastian and Wick, Thomas}, month = oct, note = {Submitted}, title = {The Mie Scattering Theory as Benchmark Problem for the Time-Harmonic Maxwell's Equations}, year = {2025} }
Numerical study on hyper parameter settings for neural network approximation to partial differential equations

Hee Jun Yang, Alexander Heinlein, and Hyea Hyun Kim

Mar 2025

Submitted

Abs Bib Preprint

Approximate solutions of partial differential equations (PDEs) obtained by neural networks are highly affected by hyper parameter settings. For instance, the model training strongly depends on loss function design, including the choice of weight factors for different terms in the loss function, and the sampling set related to numerical integration; other hyper parameters, like the network architecture and the optimizer settings, also impact the model performance. On the other hand, suitable hyper parameter settings are known to be different for different model problems and currently no universal rule for the choice of hyper parameters is known. In this paper, for second order elliptic model problems, various hyper parameter settings are tested numerically to provide a practical guide for efficient and accurate neural network approximation. While a full study of all possible hyper parameter settings is not possible, we focus on studying the formulation of the PDE loss as well as the incorporation of the boundary conditions, the choice of collocation points associated with numerical integration schemes, and various approaches for dealing with loss imbalances will be extensively studied on various model problems; in addition to various Poisson model problems, also a nonlinear and an eigenvalue problem are considered.
@techreport{Yang:2025:NSH, author = {Yang, Hee Jun and Heinlein, Alexander and Kim, Hyea Hyun}, month = mar, note = {Submitted}, title = {Numerical study on hyper parameter settings for neural network approximation to partial differential equations}, year = {2025} }
Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems

Amanda A. Howard, Bruno Jacob, Sarah H. Murphy, and 2 more authors

Sep 2025

Abs Bib Preprint

Kolmogorov-Arnold networks (KANs) have attracted attention recently as an alternative to multilayer perceptrons (MLPs) for scientific machine learning. However, KANs can be expensive to train, even for relatively small networks. Inspired by finite basis physics-informed neural networks (FBPINNs), in this work, we develop a domain decomposition-based architecture for KANs that allows for several small KANs to be trained in parallel to give accurate solutions for multiscale problems. We denote this new approach as finite basis KANs (FBKANs) and show that it yields accurate results for noisy data as well as physics-informed training.
@techreport{Howard:2025:FBK, author = {Howard, Amanda A. and Jacob, Bruno and Murphy, Sarah H. and Heinlein, Alexander and Stinis, Panos}, month = sep, title = {Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems}, year = {2025} }

2024

Predicting Coarse Basis Functions for Two-Level Domain Decomposition Methods Using Graph Neural Networks

Ichitaro Yamazaki, Alexander Heinlein, and Sivasankaran Rajamanickam

Jul 2024

Submitted

Abs Bib

For the robustness and numerical scalability of domain decomposition-based linear solvers, the incorporation of a coarse level, which provides global transport of information, is crucial. State-of-the-art spectral, or adaptive, methods can generate the basis functions of the coarse space, which are adapted to the specific properties of the target problem, and yield provably robust convergence for certain classes of problems. However, their construction is computationally expensive and requires non-algebraic information. To improve the practicability of the solver, in this paper, we design a hierarchical math-informed local Graph Neural Network (GNN) to generate effective coarse-space basis functions. Our GNN uses only the local subdomain matrices available as the input to the algebraic linear solvers. This approach has several advantages including: 1) it is algebraic; 2) it is local and therefore as scalable as the classical domain decomposition solvers; and 3) the cost for training, inference, and generating data sets is much lower than that needed for approaches relying on the global matrix. To study the potential of our GNN architecture, we present numerical results with homogeneous and heterogeneous problems.
@techreport{Yamazaki:2024:PCB, author = {Yamazaki, Ichitaro and Heinlein, Alexander and Rajamanickam, Sivasankaran}, month = jul, note = {Submitted}, title = {Predicting Coarse Basis Functions for Two-Level Domain Decomposition Methods Using Graph Neural Networks}, year = {2024} }

Book Chapters

2025

MathOptML
Multifidelity domain decomposition-based physics-informed neural networks and operators for time-dependent problems

Alexander Heinlein, Amanda A. Howard, Damien Beecroft, and 1 more author

In Mathematical Optimization for Machine Learning, Jul 2025

Section: Mathematical Optimization for Machine Learning

Abs Bib doi URL Preprint

Multiscale problems are challenging for neural network-based discretizations of differential equations, such as physics-informed neural networks (PINNs) and opera-tor networks. This can be (partly) attributed to the so-called spectral bias of neural net-works. To improve the performance of PINNs for time-dependent problems, a combination of multifidelity stacking PINNs and domain decomposition-based finite basis PINNsis employed. Inparticular, to learn thehigh-fidelity part of the multifidelity model, a do-main decomposition in time is employed. The performance is investigated for a pendulum and a two-frequency problem as well as the Allen–Cahn equation. It can be observed that the domain decomposition approach clearly improves the PINN and stacking PINN approaches. Finally, it is demonstrated that the FBPINN approach can be extended tomultifidelity physics-informed deep operator networks.
@incollection{Heinlein:2025:MDD, author = {Heinlein, Alexander and Howard, Amanda A. and Beecroft, Damien and Stinis, Panos}, booktitle = {Mathematical {Optimization} for {Machine} {Learning}}, doi = {10.1515/9783111376776-006}, note = {Section: Mathematical Optimization for Machine Learning}, pages = {79--92}, publisher = {De Gruyter}, title = {Multifidelity domain decomposition-based physics-informed neural networks and operators for time-dependent problems}, year = {2025} }