-
Notifications
You must be signed in to change notification settings - Fork 42
Numerical Implementation
This page has moved to
https://pages.nist.gov/pf-recommended-practices/bp-guide-gh/ch2-numerical-implementation.html
When writing your own phase-field simulation code or when choosing among existing codes, the numerical implementation of the model is an important consideration. This section of Phase-Field Recommended Practices Guide is split into two subsections -- one focused on choosing appropriate numerical methods and one focused on choosing whether to code your own solvers or use existing libraries (and how to choose between existing libraries). Although these subsections are presented sequentially, these choices are more iterative than sequential. For example, you may start out deciding that a certain set of numerical methods are promising, but then switch to others after considering the availability of libraries.
There is no consensus on what the "best" numerical methods for phase-field modeling are. In fact, given the diversity of the models and settings under the phase field umbrella, it is unlikely that any single collection of methods would be optimal for all phase-field modeling. Instead, a wide variety of numerical methods are used in the literature. For most problems of interest there are multiple numerical methods that can be used to obtain a solution to the model equations (and perhaps some methods that cannot obtain a solution). Several factors to consider when choosing between numerical methods are:
- General computational performance
- Appropriateness for the planned computing hardware (e.g., GPUs)
- Scalability, ease of implementation
- Availability of libraries
Each researcher may have their own unique balance of these factors that leads them to choose a numerical approach.
These choices can be very important. The performance of different methods for a given problem can vary by orders of magnitude. Likewise the complexity to implement the methods can vary by orders of magnitude -- some solvers can be written in a few tens of lines of code; some may take tens of thousands of lines. The path-dependency of the choices are also variable. Transitioning from a spectral code to a finite element code might necessitate an entire rewrite. Transitioning from one implicit time integration method to another may take a few hours.
A phase-field code involves a few different types of numerical methods. Different choices for each type are possible, but not all combinations are possible or advisable. In the end, to determine the numerical implementation for your phase-field model you will need to decide on a collection of numerical methods and approaches that are suitable for your problem and work well together.
Before deciding on a strategy for a numerical implementation, we strongly recommend that you put thought into your problem(s) of interest. Understanding the governing equations, boundary conditions, initial conditions, coupled physics, expected symmetries and other factors discussed in the Model Formulation section will aid you in deciding on how to implement your model numerically.
In general, we recommend that you start with a simple method and then add complexity selectively where justified. There are a few benefits to this approach.
- Simpler methods are usually easier to implement, decreasing the time from starting to running your first test simulation. This gives you an earlier opportunity to identify unexpected issues in your model formulation that you didn't catch before implementing it. Part of what makes a simpler method easier to implement is that a simpler method is often easier to debug and/or there are fewer ways for it to fail.
- The simpler implementation can be a backup option if the development of a more complex option takes longer than expected. Even following the recommended practices for software development, code development times can be unpredictable. A suboptimal backup implementation might get you through your first milestones while you're debugging a more advanced method.
- You may learn that the simpler option fulfills your needs. Complex methods aren't always better; the problem of interest might not need or justify investment in more complex methods.
- Even if the simpler method doesn't fulfill your needs, it may illuminate where added complexity is worth investigating ("these small explicit time steps are bogging me down, I should try an implicit time integrator", "I'm wasting so many resources in this area where nothing happens, I should try adaptive meshing").
- Solutions obtained with the simpler method can be used as reference solutions as part of your code verification strategy. Obtaining the same solution with two methods is no guarantee that both are correct, but it is an encouraging sign. And certainly, if two methods yield different solutions, the implementations require attention.
This all said, there may be some compelling reasons to start with more complex methods. One such reason is if the method is already implemented in an off-the-shelf package. Other numerical considerations could come into play: an unpreconditioned Krylov solver is simpler than a preconditioned one, but the preconditioner stabilizes the solution and may be more forgiving for non-optimized simulation parameters in an initial exploratory phase for a project and may be simpler to use.
Additionally, we also recommend initially implementing a simpler form of your model and the building up to the full model, particularly if you are new to the toolchain that you're using. There are classes of issues that with your numerical implementation that show up in even very simple calculations (MPI communication issues, stencil issues, scaling issues, etc.), and it's best to deal with those as soon as possible. Note: this approach works best when the simplified model is a simplified version of the PDE, not simply a simplified version of the physics. Sometimes models at different levels of complexity have a very different structure. A few examples of simpler starting models are:
Target Model | Simplified Variant |
---|---|
BM1 (Cahn-Hilliard) | Diffusion equation, then add the higher order term |
BM2 (Coupled Cahn-Hilliard-Allen-Cahn) | Cahn-Hilliard by itself, add one Allen-Cahn equation, then two, then all of them |
BM4 (Elastic Precipitate) | Cahn-Hilliard by itself, then add elasticity |
Four main types of spatial discretizations are used for phase-field modeling: finite difference, finite volume, finite element, and Fourier pseudospectral. Each of these methods has been used for simulation results uploaded to PFHub. Examples for each method are: finite difference - Memphis, HiPerC, finite volume - FiPy, finite element - MOOSE, PRISMS-PF, Fenics, Fourier pseudospectral - MEUMAPPS).
Here we give a brief overview of the strengths and weaknesses of each of these spatial discretization approaches. Aiming for brevity over completeness, this discussion skips over the considerable nuance in the capabilities of these methods.
Finite difference discretizations are one of the most common spatial discretizations for phase-field codes. The field data is stored on the vertices of the mesh and derivatives are approximated using variable values at the point of interest and/or neighboring points. See the Wikipedia page for a general introduction. While many flavors of finite difference methods exist, most finite difference phase-field codes use structured, uniform grids.
With finite volume methods, the field data is stored for each element in the mesh (i.e., the "finite volume"), and changes to the field data are calculated by summing the fluxes through the faces of the element. Therefore, finite-volume methods are inherently conservative. Because the flux calculations are done on the faces, finite volume methods can be used on unstructured meshes. Also, as a result of the emphasis on fluxes, finite volume methods solve the weak form of a PDE. See the Wikipedia page for a general introduction.
Finite element methods are the most complex of the common spatial discretization methods for phase-field codes. In a finite element method, field values are stored on pre-defined places of each element (the "nodes"). The field variables in the element are continuously represented by a function of pre-defined type (e.g., bilinear) that passes through the field values at each node (the "shape function"). Derivatives are calculated through derivatives of the shape function. The field variables are evolved using volumetric integrals over the element, where the integrals are calculated using quadrature. Therefore, although field variables are stored at the nodes for the element, the evolution expressions are calculated on "quadrature points", which often are not in the same locations as the nodes. Due to the expressions being integral expressions, finite element methods operate on the weak form of a PDE. See the Wikipedia page for a general introduction.
Along with finite difference methods, Fourier pseudospectral methods are historically a common spatial discretization for phase-field codes. Although Fourier pseudospectral methods have similar theoretical underpinnings as finite element methods, in practice, implementing a Fourier pseudospectral code is very different from the other methods listed here. In comparison to the other methods with a local representation of the fields, the Fourier pseudospectral method has a global representation of the fields using sines and cosines. As a result, derivatives of fields can be calculated using fast Fourier transforms (FFTs); in the reciprocal space, a spatial derivative becomes a multiplication with a wavenumber. This FFT-centric approach avoids the need for linear solvers that the other spatial discretization may need, leading to a popular semi-implicit time integration scheme (see below for more details). Fourier pseudospectral methods have the fastest spatial convergence of any scheme here, exponential convergence (as compared to polynomial convergence for the other methods). To balance these advantages, Fourier pseudospectral methods have restrictions that the other methods do not. The reliance on FFTs means that periodic boundary conditions are the only type of boundary conditions possible. The FFTs also require a uniform, structured mesh, so no adaptive meshing is possible in most situations (e.g., unless a coordinate tranformation is possible to project onto a uniform, structured mesh).
Strengths | Weaknesses | |
---|---|---|
Finite difference | Simple to code Minimal computational overhead Adaptive meshing is possible |
Conservation is not guaranteed depending on the stencil Difficult to use on complex geometries (more complicated on non-uniform grids and much more complicated (impossible?) on unstructured grids) Adaptive meshing is constrained by need for a structured mesh |
Finite volume | Naturally conservative Adaptive meshing is possible Can be used for arbitrary geometries (via unstructured meshes) Simpler to implement than finite element methods |
More complicated to implement than finite difference or Fourier-pseudospectral Orders of accuracy higher than two substantially increase complexity |
Finite element | Adaptive meshing is possible Can be used for arbitrary geometries (via unstructured meshes) Fairly straightforward extension to higher orders of accuracy Popularity in other domains may aid multiphysics coupling |
Most complicated to code Most computational overhead |
Fourier pseudospectral | Simple to code Highest spatial error convergence rate "Free" implicit handling of linear terms |
Limited to periodic boundary conditions Limited to uniform, structured meshes |
Adaptive mesh refinement (AMR) is the process of changing the mesh for a calculation so that the mesh is finer in regions where the spatial error is expected to be high (e.g., where fields are changing rapidly) and coarser where the spatial error is expected to be low (e.g., where fields are changing slowly). An example of adaptive mesh refinement for the PFHub Benchmark 3 problem is shown below:
(Insert BM3 AMR example)
Note that in this example, the mesh is the finest near the solid-liquid interface and coarsest far from the interface in the liquid. PFHub Benchmark 3 is an example of a problem where adaptive meshing is particularly useful -- at any given time, the area of the interfacial region is a small fraction of the total area. The interfacial region needs a fine mesh to resolve the transition of the order parameter, but a coarse mesh can be used far into the liquid where only thermal diffusion is active. The uploaded results with by far the fastest run times for this simulations, using MOOSE and PRISMS-PF, both take advantage of adaptive meshing.
In other cases, the benefits of adaptive mesh refinement may not be worth the computation overhead and increase in complexity. The early stages of evolution for PFHub Benchmark 1 are an example of this case. The composition field is rapidly varying almost everywhere in the domain, and coarsening the mesh in any location would likely introduce substantial error.
Three main adaptive meshing schemes are:
- Adaptive unstructured meshes (full freedom of mesh shape, but high overhead)
- Block-structured meshes (a mesh made of rectangular regions at different levels of refinement)
- Quadtree/octree meshes (hierarchical mesh where refinement occurs by bisecting parent elements)
Although many phase-field problems of interest can be solved in a rectangular domain with boundary conditions applied on the boundaries of the computational domain, in some cases, you may want to apply boundary conditions along an internal boundary. There are two typical circumstances where this occurs. One circumstance is where the domain of interest has some complex (perhaps dynamic) shape but you do not want to have a body-fitting mesh (e.g., due to code or method limitations). The second circumstance is that you are using a Fourier pseudospectral method and want to impose non-periodic boundary conditions.
Two methods have been developed to handle this issue -- to impose boundary conditions internally in a computational domain that decouple the solution domain from the computational domain. The immersed interface method is a sharp-interface method for imposing internal boundary conditions. The smoothed boundary method is a diffuse interface method for imposing internal boundary conditions. In some circumstances, these methods may provide a performant way to side-step the restrictions on the meshes for Fourier pseudospectral or finite difference phase-field codes.
To date, none of the PFHub uploads have used these methods, but Benchmark 6b is an example of where it may be appropriate.
If your model is time dependent, you will need to choose a scheme for integrating in time (or time stepping). A time integrator is typically described in terms of its order of accuracy and whether it is explicit or implicit. Classic examples include:
- Forward Euler (FE) method, a first-order explicit method
- Backward Euler (BE) method, a first-order implicit method
- Crank-Nicolson (CN) method, a second-order semi-implicit method
These methods often serve as prototypes for other families of general-purpose time integrators that have successfully been employed in phase-field codes. Many of the community codes that participate in PFHub already offer selections of time integrators; the documentation of those packages will often include suggestions on what to choose for your model.
If you are using a code that does not provide its own time integrators or you are writing your own code, there are many factors to consider when picking a scheme. The "best" time integrator for your model will depend upon your governing equations, the spatial discretization, any required linear and nonlinear solvers, and even the computational hardware you have available. You may need to experiment with different schemes to find the right balance of performance and accuracy for your needs. This last step can require significant amounts of time and coding.
If you need to implement your own time integrator, we provide a brief overview of integration schemes and their strengths and weaknesses, but this discussion serves as a relatively high-level overview. Time integration is an active area of research across the phase-field, applied math, and computational science communities, and it is impossible to cover every consideration.
In explicit methods, the solution at a future time step is only a function of previous time steps.
By comparison, for implicit methods, the solution at a future time step is a function of the future and previous time steps.
Forward Euler, for example, has the form
Explicit | Implicit | |
---|---|---|
Code complexity | + | - |
Time step cost | + | - |
Time step size | - | + |
Memory usage | + | - |
However, there are notable exceptions for each row. Additionally, choices that influence each row of the above table also affect the overall consistency of a scheme.
Complexity: In general, explicit solvers do not require nonlinear solvers for the time-dependent portions of the model, and linear solvers are either not required (finite difference methods, many finite volume methods) or only require the inversion of the mass matrix (finite element methods). This reduction in necessary solver components can reduce overall development time (less code, less debugging), which may allow you to start running simulations sooner. Additionally, from a working FE code, it is straightforward to implement either explicit/embedded Runge-Kutta methods, Adams-Bashforth methods, or Predictor-Corrector (e.g., Milne) methods.
By comparison, fully implicit methods will usually require the implementation of nonlinear and linear solvers. Some methods might allow for one of these eliminated: for example, the linearly implicit Rosenbrock Methods do not require a nonlinear solver. However, most codes will adopt a modified Newton method where a linear solver is required at each nonlinear iteration. The BE method can be a prototype for various flavors of implicit Runge-Kutta methods (IRK, DIRK, SDIRK, ESDIRK, etc.), Adams-Moulton methods, or Backward Differentiation Formula (BDF) methods, but there are fewer similarities between these methods than there is between the explicit methods.
Time step cost and size: Due to the need to solve a (non-)linear system of equations, one time step of an implicit method can be orders of magnitude more expensive than one time step of an explicit method. If iterative solvers are employed, the cost of each step will also vary with the overall conditioning of the system from one time step to the next.
However, it is well-known that explicit solvers often have strict requirements for the stable time step size (e.g.,
Memory usage:
Explicit and implicit schemes can have different requirements on the amount of RAM that is needed for the same overall problem size.
If we consider the FE and BE methods, both can be implemented with two vectors: one for
While not universally true, this leads to a rule of thumb that for equal numbers of stages or steps, an explicit scheme will require less memory than an implicit scheme. However, low-order implicit schemes may be able to take significantly larger time steps than even high-order explicit schemes, which can equalize the overall cost.
Comment about how hard it can be to solve multiple equations simultaneously. Do you solve multiple small systems or one large system?
In semi-implicit and implicit-explicit (ImEx) methods, different terms in the governing equations are approximated by different implicit or explicit schemes. These methods have a long history in the phase-field community and, as suggested by their name, behave "in between" explicit and implicit schemes. For example, convex splitting methods for the Cahn-Hilliard equation naturally result in implicit and explicit terms. Semi-implicit methods are also the default choice in Fourier pseudospectral methods. In more general models, physics with "fast" time scales may be solved implicitly, while physics with "slow" time scales may be solved explicitly.
The resulting schemes are usually much cheaper than fully implicit methods. The largest stable or accurate time step size will typically be reduced compared to an implicit scheme, but is usually much larger than an explicit scheme.
When choosing the time step size of an integrator, there is a need to balance computational cost (i.e., larger step size, fewer time steps, lower cost) and the numerical accuracy (smaller step size, better approximation of time derivatives, lower error, and improved solvability). Just as we can adaptively refine and coarsen the mesh of a spatial discretization to improve accuracy or reduce the computational cost, we can also employ adaptive methods when choosing a time step. In many Runge-Kutta and Rosenbrock methods, the scheme provides an embedded estimate of the error in the solution. A weighted norm of this error can be compared to a specified tolerance, which decides if the time step is accepted or rejected, and then the step size can be adjusted by a PID controller or some other filter. In multistep methods, we can derive estimates of the leading truncation error, which then allows us to check the accuracy of a step. However, different methods will have different rules about how to change step size: Runge-Kutta methods can usually change the step size by large amounts every time step, but multistep methods often have to change by smaller amounts or only every few time steps.
When properly implemented, adaptive time stepping improves confidence in solution quality, minimizes the chance of your solution "exploding" from too large of a time step, and can save time with expensive schemes by reducing step size in fast-changing regions of time and increasing it when the dynamics are slower.
- For initial development, it is best to start with simpler schemes like Forward Euler. This will allow you to gain an understanding of your model's behavior, check for bugs in the basic mechanics of the code, and start estimating the cost of simulations. If the model is inexpensive or you don't plan to use the code more than a few times, a basic integrator may be all you need.
- If you are forced to use small time steps on the scale of time you want to simulate, you might consider an implicit integrator (again, start with simpler schemes like BE or CN).
- If your model contains varying time scales (e.g. from coupled physics, concentration-dependent mobility) and you have difficulty maintaining a stable time step or find that the convergence is unpredictable, adaptive time stepping will likely be helpful. If you are using explicit schemes, consider trying an embedded Runge-Kutta method. For implicit schemes, you might try a higher-order BDF method if your right-hand-side function is expensive to compute or a diagonally implicit Runge-Kutta method.
- If you need to solve an elliptic governing equation simultaneously (as in Benchmark 6), the resulting system of differential-algebraic equations (DAEs) is likely to be very stiff. Even if the time-dependent portion is not expensive to evaluate, the cost of constantly performing the elliptic solve can be significant with explicit methods. Consider BDF methods, diagonally implicit or additive Runge-Kutta methods, or Rosenbrock methods.
- For coupled physics, carefully examine how each equation is related. Some time-dependent terms may be less strongly coupled than others or be more costly to evaluate in a fully implicit scheme, so you might consider a semi-implicit or ImEx scheme such as an additive Runge-Kutta method.
Many families of time integrators have variants that can choose the time step size adaptively. Additionally, the Adams and BDF methods have implementations with variable order of accuracy. If your code does not already provide time integration, many packages exist for various flavors of time steppers, including but not limited to SUNDIALS, PETSc TS, Tempus, and DifferentialEquations.jl.
Depending on your spatial/temporal discretizations and whether you need to solve any elliptic governing equations, you may need to implement or choose a set of appropriate linear and nonlinear solvers. Many codes will already provide solvers that work well for their packages. For example, if you were implementing a model in FEniCS, the developers have already exposed options from PETSC's nonlinear and linear solver interfaces and you can easily switch between different dense, sparse, and iterative methods. Even if you are implementing your own code, solvers may already be available from the libraries you link to and you may not need to write your own. For example, SUNDIALS provides a set of linear and nonlinear solvers that are already optimized for its time-stepping routines.
If you need to write your own solvers, we provide some general guidelines below.
The need for (non)linear solvers in a given code will depend on the equations you are solving and your choices of spatiotemporal discretizations. As a simple example, we can consider incremental changes to a 1D Allen-Cahn-type model for a driven phase transformation (i.e., a simplified form of BM3). For a spatial discretization with centered finite differences and time integration by Forward Euler, the equation at each point of the mesh will be of the form
where
A common approach to improve time step stability of a PFM is to employ a convex split of the governing equations where we decompose the nonlinear terms as
This system of equations is now semi-implicit in nature, but we can form the convex split such that the left-hand side is fully linear, e.g.,
Lastly, we may choose to discretize this system by the Backward Euler method:
The left-hand side is now fully implicit and nonlinear. We will need to perform a nonlinear solve at each time step, and this may or may not require the solution of additional linear systems.
Two of the most common types of nonlinear methods you will encounter are fixed-point methods, which solve problems of the form
While not discussed here, another important class of nonlinear methods is Full Approximation Scheme (FAS) multigrid solvers.
In FAS, we obtain solutions of the system
Fixed-Point Methods: A basic fixed-point method will often adopt the following approach:
- For the current guess of
$u$ , compute the nonlinear function,$G(u)$ - Calculate a new guess by
$\tilde{u} = G(u)$ - Converged if
$||u - \tilde{u}|| < tol$ - Assign
$u = \tilde{u}$ - If not converged, go to (1)
This is often referred to as functional iteration or Picard iteration.
While simple to implement, this basic scheme may converge slowly or fail to converge at all.
There are schemes to accelerate the convergence of the series (e.g., Anderson Mixing, Aitken's Delta-Squared Method), or we can linearize the nonlinear function about
- For the current guess of
$u$ , compute the nonlinear function,$G(u)$ - Solve the linear system
$(\mathbf{I} - \mathbf{J})\tilde{u} = G(u) - \mathbf{J}u$ - Converged if
$||u - \tilde{u}|| < tol$ - Assign
$u = \tilde{u}$ - If not converged, go to (1)
As a note, we may not need to explicitly assemble
where
Root-Finding Methods: In many cases, a root-finding method will often adopt some sort of Newton-type method where we linearize the function through a Taylor expansion, obtain a correction to the solution, and then iterate until convergence. For example, the classic Newton-Raphson method can be written as:
- For the current guess of
$u$ , calculate$F(u)$ and its Jacobian matrix$\mathbf{J}$ - Correct the solution by solving the linear system
$\delta u = - \mathbf{J}^{-1} F(u)$ - Converged if
$||\delta u|| < tol$ - Assemble the next guess as
$\tilde{u} = u + \delta u$ - Assign
$u = \tilde{u}$ - If not converged, go to (1)
Immediately, we can recognize a similarity between the Newton-Raphson approach and the linearized fixed-point iteration: both perform a linearization about the current value of the function for a guess of
Assembling
- For the current guess of
$u$ , calculate$F(u)$ and its Jacobian matrix$\mathbf{J}$ - Perform
$k$ iterations of an iterative linear solver to approximately solve$\mathbf{J} \delta u = - F(u)$ - Converged if
$||\delta u|| < tol$ - Assemble the next guess as
$\tilde{u} = u + \delta u$ - Assign
$u = \tilde{u}$ - If not converged, go to (1)
We may make further modifications to this algorithm to potentially improve performance, such as only evaluating
If we examine the pointwise Jacobi scheme for the fully-implicit Allen-Cahn, we see that this is essentially an inexact Newton method that does one inner iteration of Jacobi relaxation at each outer Newton iteration. Thus, we can see that there are some general similarities between these broad classes of nonlinear methods. However, it is rare that the resulting algorithms are exactly equivalent to each other, and it can be nontrivial to switch from a fixed-point to a root-finding solver.
In the above discussion, we've observed that linear solvers are necessary as a result of the discretization of the governing equations or because we need to solve a nonlinear system of equations. The choice of solver will depend on the size of your simulation and whether you can easily evaluate a Jacobian matrix of your system. Up to a few thousand unknowns, direct linear solvers (e.g. LAPACK, UMFPACK, cuSolver, SuperLU) that offer dense or sparse variants of typical methods like LU decomposition and QR factorization will often be efficient, provided it is easy to evaluate the entries of the Jacobian matrix. However, it is important to remember that dense direct solvers typically scale O(n^2) for required memory and O(n^3) for the number of operations and quickly become unsolvable. The scalability of sparse direct solvers will depend on the particular method and the implementing library. For millions or billions of unknowns across large numbers of CPUs or GPUs, iterative solvers are often necessary. Here, a wide range of options exists; in particular, relaxation, multigrid, and Krylov methods all parallelize well and have been employed in a variety of community codes and numerical libraries.
Some of the simplest iterative methods are stationary relaxation methods such as the Jacobi, Gauss-Seidel, and (Symmetric) Successive Over-Relaxation methods.
Classically, these are derived by splitting the Jacobian matrix as
Krylov methods (e.g. Conjugate Gradient, GMRES, BiCGSTAB, TFQMR) are a powerful family of linear solvers that form a subspace from a linear operator (such as the Jacobian) and the residual vectors of the system (this is a very simplified explanation). While these methods can be formulated to directly use the Jacobian, most only require the evaluation of Jacobian-vector products, which is what leads to the "Jacobian-free" nature of JFNK methods. This Jacobian-vector product is typically evaluated in either a functional form if the Jacobian can be easily obtained (e.g., the kernel of a linear heat equation), but for methods such as JFNK, the Jacobian-vector product is readily approximated by the Gateaux derivative. These methods can, therefore, be straightforward to implement, even with intricate linear functions. However, convergence is strongly dependent on the condition number of the operator matrix.
The performance of Krylov methods can be improved through preconditioning.
Here, we form an approximation to the overall Jacobian matrix,
In summary: when choosing linear solvers, it is not trivial to switch from a relaxation or multigrid method to a Krylov method (or vice versa), although the former methods can be used as preconditioners for the latter. Changing from one Krylov method to another can be relatively straightforward, but some Krylov methods only apply to symmetric positive definite matrices. Modifying the preconditioner can have varying difficulties: switching from a Jacobi to a Gauss-Seidel preconditioner is trivial, but switching to an ILU preconditioner might require much more effort.
(Discuss distributed vs shared, CPU vs GPU)
(pretty much just MPI, also mention UPC or Legion or Charm++?)
CPU/GPU, performance portability layers
The phase-field and broader scientific computing community have developed many libraries that you may choose to use as you implement your phase-field model. As you decide which methods you want to use, an important consideration is whether you intend to write code for certain functionality yourself or whether you want to use an existing library/framework for that functionality.
Before coding your own functionality, it is always a good idea to explore what solutions already exist. Researchers have devoted their careers to developing some scientific computing libraries and frameworks, and they may be better for your application than something you write yourself. Even in the cases where you could write code for some functionality yourself that is an improvement over existing solutions, it may not be worth your time to actually do so. Finally, even if you decide that your best course is to write your own functionality, learning about other solutions may inform your approach and/or help you frame to potential users what distinguishes your implementation.
PFHub maintains a list of open source phase-field codes and frameworks, one or more of which may fit your needs.
Some libraries are packages in a larger collection where interfaces for interoperability have already been developed. Such large collections include Trilinos, PETSc, and hypre/SAMRAI/SUNDIALS. Staying within an ecosystem may reduce your time to solution and will keep you on a well-traveled path if you reach out to developers or a user community for help.
An important consideration for choosing a library or framework is who else uses it. If many other researchers in your organization use a particular library/framework, you may want to as well -- when you run into issues, they might be able to help you (and vice versa).
Another consideration for choosing a library or framework is what the license terms are. For commercial software, license terms may dictate that a new license must be bought for each CPU core being used. With paid software, you may also want to consider the future of your work, even if you have access to that software now, that may not be true if you move institutions, your institution changes what they buy, etc. Other people who would be interested in using code that you develop would also have to pay for the underlying software.
For open-source libraries and frameworks, a wide variety of open-source licenses exist. While a discussion of the differences between these licenses is out of the scope of this document, the details of the license may constrain how you can use the library/framework and how you can distribute derivative works. One reference for open-source licenses is the Open Source Initiative. The terms of these licenses can be confusing and disputed. The legal department at your institution may be able to offer advice.
- Y. Saad. Iterative Methods for Sparse Linear Systems, Second Edition. Society for Industrial and Applied Mechanics, 2003.
- W.L. Briggs, V. Emden Henson, S.F. McCormick. A Multigrid Tutorial, Second Edition. Society for Industrial and Applied Mechanics, 2000.
- D.A. Knoll, D.E. Keyes. "Jacobian-free Newton-Krylov methods: a survey of approaches and applications." J. Comput. Phys. 193 (2004) 357-397.
Topics we want to move to other sections:
- Conservation laws and symmetries (Model Formulation)
- Have an expectation going in (Model Formulation)
- Add pieces of the model one part at a time during debugging (Software Development)
- Ensure physical expectations are respected (Software Development)
OLD TEXT
Contributors: Stephen DeWitt, Alex Chadwick
Alex Chadwick, Steve Dewitt
- Carefully consider your spatial discretization scheme (FFT, FD, FV, FEM)
- Consider your temporal discretization scheme and use temporal error estimation when possible (implicit, explicit, etc.)
- Identify relevant conservation laws and symmetries for your model and carefully ensure they are respected even after long times
- Start small and add complexity. Add pieces to the model one part at a time during debugging. Check to see if it is working, then add more, then test again.
- Have an expectation going in (back of the envelope, consider basic physics), and if it doesn’t do that, figure out why.
- Home
- Benchmark Presentations
- Peter Voorhees Phase Field Lectures
- Phase-Field Method Recommended Practices
- Workshop Presentations
- Miscellaneous