A 3D multigrid solver #
The submission deadline for this work is 13th May 2021.
See below for submission details.
Updates
2021/03/31
- Fixed sign error in forcing term for Part 1. The $f$ needs a minus sign out the front for the exact solution to be a positive product of sines.
- Fixed instructions for obtaining petsc4py on Hamilton.
Introduction #
In this coursework, we’re going to implement a parallel multigrid solver in three dimensions for the variable-coefficient Laplacian.
We are using PETSc, via petsc4py, to provide the parallel data structures.
There’s a skeleton Python package that provides a lot of the infrastructure that you will build on to develop your solver.
To get going, you'll need to install petsc4py in your virtual environment. On your own machine, pip install petsc4py should be sufficient. This will go away and build PETSc, followed by petsc4py. If you're using conda, you can install petsc4py in your conda environment with conda install -c conda-forge petsc4py.
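To check that the install worked, a quick sanity check (nothing coursework-specific, and independent of the mgsolver package) is:
# Confirm petsc4py imports and PETSc initialises
from petsc4py import PETSc
print(PETSc.Sys.getVersion())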
PETSc on Hamilton
On Hamilton, I provide a usable version of PETSc. To use it, you'll need to load the following modules
gcc/9.3.0 intelmpi/gcc/2019.6
and set the environment variables
export PETSC_DIR=/ddn/data/vtdb72/petsc
export PETSC_ARCH=arch-linux2-c-opt
After that, to get a petsc4py install, you should install from the PETSc bindings directory. So do pip install /ddn/data/vtdb72/petsc/src/binding/petsc4py in your virtual environment. Don't forget to load those modules and export those environment variables every time you log in.
Building PETSc is sometimes problematic. If the pip install route fails for any reason, GET IN TOUCH and we'll figure it out. The best way to do this is via the discussion forum.
Getting and installing the mgsolver package #
We will use GitHub classroom to manage the submissions. To set up and fork the template repository, follow this link.
You should work in your fork and push your code regularly. Having forked the repository, you can clone it locally and install the package. I recommend using an editable install since you’ll be developing new code in the package.
$ git clone git@github.com:Durham-COMP4187/your-repo-name-here.git comp4187-coursework
$ pip install -e comp4187-coursework/
After doing this, you should be able to run the tests (they will all fail at this point) with
$ pytest tests
If you can’t get this far for whatever reason, GET IN TOUCH.
Package layout #
The mgsolver package contains a number of classes, some of which are missing functionality that you are to implement. The main classes that we need are:
- grid.Grid3D and grid.GridHierarchy: These provide a coarse grid and a hierarchy of regularly refined grids. We'll just need to construct these.
- mgsolver.MGSolver: This class manages the multigrid solver. It has unimplemented methods for Jacobi iteration (jacobi), a V-cycle (vcycle), and a W-cycle (wcycle), which you will need to implement.
- operator.AbstractOperator: You should produce a subclass of this for your operators and implement the requisite abstract methods:
  - mult: do a matrix-vector multiply;
  - diagonal: a property that returns the diagonal of the operator;
  - as_sparse_matrix: return the operator as a sparse matrix.
Additionally, visualisation output of solution vectors to VTK files, viewable in Paraview, can be produced using the write_output function.
Here is a sketch of how you would use the package (eliding details of the implementation of the operator’s methods).
from functools import cached_property
from mgsolver import AbstractOperator, Grid3D, GridHierarchy, MGSolver, PETSc
from mpi4py import MPI
# Define the operator we want to apply
# Must inherit from AbstractOperator
class Poisson7pt(AbstractOperator):
    def __init__(self, grid):
        # Can do some stuff here, but remember to always do
        super().__init__(grid)

    # We need to implement this property. A @cached_property is like
    # a @property, but only gets evaluated once.
    @cached_property
    def diagonal(self):
        pass

    # We need to implement this method
    def mult(self, x, y):
        # x is the input vector, y is the output vector.
        pass

    # Finally, we need to implement this method
    def as_sparse_matrix(self):
        # Return the operator as a sparse matrix
        pass
# Create a coarse grid with 4 vertices in each direction
coarse_grid = Grid3D(4, 4, 4, comm=MPI.COMM_WORLD)
# Create a hierarchy with two refinements (3 levels in total)
hierarchy = GridHierarchy(coarse_grid, nrefinements=2)
# You can index the GridHierarchy like a normal list.
fine_grid = hierarchy[-1]
# Now we can build a solver
solver = MGSolver(hierarchy, Poisson7pt)
# Create vectors to hold the solution and right hand side
x = fine_grid.createGlobalVector()
b = fine_grid.createGlobalVector()
# To get an operator on a given level
A_fine = solver.get_operator(len(hierarchy)-1)
# The solver also provides storage for vectors of residuals and so
# forth on each level. The jacobi iteration needs somewhere to store
# the residual.
r = solver.residuals[len(hierarchy)-1]
# To run 10 iterations of Jacobi
solver.jacobi(A_fine, x, b, r, niter=10)
# To solve using a V-cycle using 1 iteration of pre- and post-smoothing
# to a relative tolerance of 1e-5
solver.solve(x, b, rtol=1e-5, presmooth=1, postsmooth=1,
             cycle_type=solver.Type.V)
# To solve using a W-cycle using 1 iteration of pre-smoothing and 2
# iterations of post-smoothing to a relative tolerance of 1e-8
solver.solve(x, b, rtol=1e-8, presmooth=1, postsmooth=2,
             cycle_type=solver.Type.W)
For a complete example, see the tests in tests/test_one_dim.py, which implement a one-dimensional example. The tests themselves do not pass, since the various bits of the MGSolver class are not completed. You can run the tests with pytest, using pytest tests. After successfully implementing the Jacobi iteration and V-cycle, the test run looks something like
$ pytest tests/test_one_dim.py -v
================================== test session starts ===================================
platform darwin -- Python 3.8.6, pytest-6.2.2, py-1.10.0, pluggy-0.13.1 -- pscii/bin/python3
cachedir: .pytest_cache
rootdir: XXX, configfile: setup.cfg
collected 3 items
tests/test_one_dim.py::test_mms_convergence PASSED [ 33%]
tests/test_one_dim.py::test_two_grid[Jacobi coarse grid] PASSED [ 66%]
tests/test_one_dim.py::test_two_grid[Exact coarse grid] PASSED [100%]
=================================== 3 passed in 0.34s ====================================
You can also run the tests in parallel by doing
$ mpiexec -n 4 pytest tests/test_one_dim.py -v
Part 1: an explicit solver #
Part 1a: timestepping #
You should do your implementation for this part in a file called part1_explicit_euler.py placed in the root of the repository.
Discretise the equation $$ \partial_t u - \nabla \cdot K(x, y, z) \nabla u = f(x, y, z) $$ on the cubic domain $\Omega = [0, 1] \times [0, 1] \times [0, 1]$, using forward Euler as a timestepping scheme.
To do this, create a class Poisson7pt that discretises the spatial operator $$ -\nabla \cdot K(x, y, z) \nabla u $$ using a 2nd order accurate 7-point stencil (as derived in lectures).
Use $$ K(x, y, z) = x, $$ $$ f(x, y, z) = -(\pi \cos(\pi x)-3\pi^2x\sin(\pi x))\sin(\pi y)\sin(\pi z), $$ with homogeneous Dirichlet boundary conditions $$ u(x, y, z) = 0 \text{ on } \partial\Omega. $$ The exact solution for this problem is $$ u^*(x, y, z) = \sin(\pi x)\sin(\pi y)\sin(\pi z). $$
Ensure that your implementation works correctly in parallel as well as serial (when run with MPI).
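As a rough guide, the forward Euler update can be written with the operator interface shown earlier. The sketch below is a minimal outline, not a complete solution: Poisson7pt is your operator from above, the grid size, dt, and nsteps are placeholder values, and filling b with $f$ at the grid points and enforcing the boundary conditions are left to you.
from mpi4py import MPI
from mgsolver import Grid3D

grid = Grid3D(33, 33, 33, comm=MPI.COMM_WORLD)
A = Poisson7pt(grid)             # your spatial operator from Part 1a
u = grid.createGlobalVector()    # solution, initially zero
b = grid.createGlobalVector()    # to be filled with f at the grid points
Au = grid.createGlobalVector()   # scratch space for A u

dt = 1e-4        # placeholder; see the timestep-restriction question below
nsteps = 1000    # placeholder
for step in range(nsteps):
    # Forward Euler: u <- u + dt * (b - A u)
    A.mult(u, Au)        # Au <- A u
    Au.aypx(-1.0, b)     # Au <- b - A u
    u.axpy(dt, Au)       # u  <- u + dt * (b - A u)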
Part 1a questions
- How does the error in your numerical solution behave under grid refinement? Can you explain what you see?
- What restriction, if any, is there on the size of the timestep you can choose?
Part 1b: A higher-order scheme #
This part doesn't need you to write any code.
Part 1b questions
- Derive, but do not implement, the 4th order accurate stencil for a 5-point discretisation of the Laplace operator in one dimension.
- How do you have to modify the stencil at the boundary to maintain the accuracy for:
- Dirichlet conditions $u = g$?
- Neumann conditions $\nabla u \cdot n = h$?
- Would this spatial discretisation have the same timestep restriction as the 2nd order operator, or a different one? Explain your answer.
Part 2: multigrid #
Part 2a: completing the multigrid solver #
Implement the missing pieces in the MGSolver class, namely:
- MGSolver.jacobi
- MGSolver.vcycle
- MGSolver.wcycle
You should do this directly in the mgsolver/mgsolver.py file (don't forget to commit it!). If you do this correctly, the one-dimensional tests should now pass.
Ensure that your implementation is correct in both serial and parallel. Up to round-off error, you should get the same results independent of the number of processes.
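For orientation, a standard (undamped) Jacobi sweep on a level with operator $A$, right-hand side $b$, and operator diagonal $D$ updates $$ x \leftarrow x + D^{-1}(b - A x), $$ where $r = b - A x$ is the residual for which the solver provides storage; if your lectures derived a damped variant, use that convention instead.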
Part 2b: solving for a steady state #
You should do your implementation for this part in a file called part2_multigrid.py placed in the root of the repository.
Using the same Poisson7pt operator that you implemented for Part 1a, we will now solve for the steady state directly (rather than timestepping towards it).
Confirm that your implementation is correct by doing an MMS convergence test. For large problems you will probably want to run in parallel.
If your operator definition was correct in parallel in Part 1, you should not have to worry very hard about parallelism in this part, since everything is done with “collective” operations.
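Here is a sketch of the error measurement for the MMS test, assuming exact is a vector you have filled with the exact solution $u^*$ at the grid points (that helper is yours to write) and x holds the computed solution:
from mgsolver import PETSc

error = x.copy()                                  # duplicate the computed solution
error.axpy(-1.0, exact)                           # error <- x - u*
err = error.norm(PETSc.NormType.NORM_INFINITY)    # max-norm error, collective
PETSc.Sys.Print(f"max error: {err}")              # prints once, on rank 0
For the 2nd order accurate discretisation you should see this error drop by roughly a factor of four each time the grid spacing is halved.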
Part 2b questions
- What mesh convergence do you get for this problem? Do you have to adjust the tolerance to which you solve the problem as you add more grid levels?
- For this problem, which method (Jacobi, V-cycles, W-cycles) works best when you add more grid levels? Consider both algorithmic convergence and time to solution.
- Play around with the number of smoothing steps; does that change your conclusions?
Performance hint
You may wish to pull the body of the matrix-vector mult method out and try JIT-compiling it with numba. I found that made a big (positive) difference in the performance of my code.
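As an illustration of that hint (not the required approach), here is a sketch for a constant-coefficient 7-point stencil; the array layout, boundary handling, and how you obtain the local arrays from the PETSc vectors are assumptions you will need to adapt, and the variable coefficient is omitted.
import numba

@numba.njit
def apply_stencil(xin, yout, hx, hy, hz):
    # Interior points of a constant-coefficient 7-point Laplacian.
    nx, ny, nz = xin.shape
    for i in range(1, nx - 1):
        for j in range(1, ny - 1):
            for k in range(1, nz - 1):
                yout[i, j, k] = (
                    (2*xin[i, j, k] - xin[i-1, j, k] - xin[i+1, j, k]) / hx**2
                    + (2*xin[i, j, k] - xin[i, j-1, k] - xin[i, j+1, k]) / hy**2
                    + (2*xin[i, j, k] - xin[i, j, k-1] - xin[i, j, k+1]) / hz**2)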
Part 3: robustness #
You should do your implementation for this part in a file called part3_variable_coefficient.py placed in the root of the repository.
We will now look at how robust your solver is in the face of coefficient variation that does not align well with the grids.
For this setup, we'll solve the following problem, meant to be an idealised case of a machine room with hot and cold areas, along with heat extraction through the floor. Again, the domain is the cube $\Omega = [0, 1] \times [0, 1] \times [0, 1]$.
$$ K(x,y,z) = \begin{cases} 1.0 & \text{if } y<0.9 \text{ and }x\in[0.1,0.3],~y\in[0.1,0.9]\\ & \text{or } y<0.9 \text{ and }x\in[0.7,0.9],~y\in[0.1,0.9]\\ 0.001 & \text{else} \end{cases} $$
A “door” is modelled using Dirichlet boundary conditions $$ u = 1.0 \text{ if } x=0,~y\in[0.4,0.7],~z<0.8 $$ and a vent in the floor cools the room $$ u = 0.0 \text{ if } x\in[0.4,0.6],~y\in[0.1,0.9],~z=0. $$
And on the remaining boundary we assume (as in the lecture) “perfect” walls $$ \nabla u \cdot n = 0. $$
In an overly simplified way we add heat sources coming from the machines $$ f(x,y,z) = -1+e^{-\frac{(x-0.2)^2+(y-0.5)^2+(z-0.5)^2}{0.02}}+e^{-\frac{(x-0.8)^2+(y-0.5)^2+(z-0.5)^2}{0.02}} $$
You can again adapt your Poisson7pt stencil to incorporate the new coefficient variation.
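For concreteness, the coefficient and forcing above transcribe directly into Python. This is only a sketch (the names K and f, and the use of pointwise functions, are illustrative), with the piecewise conditions collapsed into equivalent range checks:
import numpy as np

def K(x, y, z):
    # High-conductivity regions as in the cases above; 0.001 elsewhere.
    machine_1 = 0.1 <= x <= 0.3 and 0.1 <= y < 0.9
    machine_2 = 0.7 <= x <= 0.9 and 0.1 <= y < 0.9
    return 1.0 if (machine_1 or machine_2) else 0.001

def f(x, y, z):
    # Two Gaussian heat sources plus a constant heat sink of -1.
    return (-1.0
            + np.exp(-((x - 0.2)**2 + (y - 0.5)**2 + (z - 0.5)**2) / 0.02)
            + np.exp(-((x - 0.8)**2 + (y - 0.5)**2 + (z - 0.5)**2) / 0.02))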
Recall from lectures that when we have this kind of coefficient variation, we can regain some robustness by using Galerkin coarse grid operators. The MGSolver class supports this (say galerkin=True when constructing an instance).
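For example, reusing the hierarchy and operator class from the earlier sketch (assuming the keyword is accepted alongside those arguments):
# Request Galerkin coarse-grid operators instead of rediscretising
solver = MGSolver(hierarchy, Poisson7pt, galerkin=True)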
Part 3 questions
- Compare the convergence behaviour of your multigrid scheme using rediscretised coarse grids (the default) and Galerkin coarse grids.
- Is there a setup with which you can regain the nice multigrid efficiency that we saw previously? Does your coarsest grid have to be larger?
- Discuss your findings.
Submission and mark scheme #
The work will be marked on the basis of three things:
- Your submitted code;
- A short report discussing answers to the questions and your findings;
- A brief (10 min) oral exam with the lecturers. We will use this to discuss your implementation choices and code, and any interesting things you found in your numerical experiments. There is no need to prepare anything specific.
You should submit to DUO a zip file containing
- A PDF of your writeup (max 4 pages), named using your Z-code as ZCODE.pdf;
- A text file ZCODE.txt containing the commit hash of the code on GitHub that you want us to mark.
After submission, please contact the lecturers to arrange a time for the oral exam. Please do so within 5 days of the submission deadline.
Mark scheme #
- Part 1 [35 marks]
  - Part 1a [25 marks]
    - implementation [15 marks]
    - questions/writeup [10 marks]
  - Part 1b [10 marks]
- Part 2 [25 marks]
  - Part 2a: implementation [15 marks]
  - Part 2b: questions/writeup [10 marks]
- Part 3 [25 marks]
  - implementation [5 marks]
  - questions/writeup [20 marks]
- Code formatting (tested via flake8) [5 marks]
- Brief oral exam [10 marks]