Matrix-free methods using CUDA and MPI.
