lp:~florian-rathgeber/dolfin/gpu-backend

Created by Florian Rathgeber and last modified
Get this branch:
bzr branch lp:~florian-rathgeber/dolfin/gpu-backend
Only Florian Rathgeber can upload to this branch. If you are Florian Rathgeber please log in for upload directions.

Branch merges

Related bugs

Related blueprints

Branch information

Owner:
Florian Rathgeber
Project:
DOLFIN
Status:
Experimental

Recent revisions

5513. By Florian Rathgeber

Add CMake build script for debugging builds

5512. By Florian Rathgeber

CUDAKrylovSolver: compute convergence i.t.o. residual norm (instead of sqrt(<r,d>)) for better comparability to Cusp and PETSc solvers; bugfix: changed GRID_DIM to power of 2 (otherwise vecdot kernels fail)

5511. By Florian Rathgeber

added positive definite Helmholtz demo for both CPU and GPU; STATUS: Cusp GPU solver works, CUDA GPU solver produces wrong results for all but very small problems

5510. By Florian Rathgeber

CuspKrylovSolver: const arguments to solve functions (as in GenericLinearSolver)
moved parameter "fixed_iteration_number" to KrylovSolver in order to be able to set it from global parameters

5509. By Florian Rathgeber

CUDAKrylovSolver: convergence check uses residual instead of squared residual (squared residual might easily get below floating point precision and hence the check could become inaccurate)

5508. By Florian Rathgeber

added advection-diffusion GPU demos using CUDA backend with both assembled matrix and local-matrix-approach

5507. By Florian Rathgeber

refactored GPUSolver:
* renamed to CUDAKrylovSolver for consistency
* derives from GenericLinearSolver for compatibility with DefaultFactory, but does not implement full interface (set_operator etc.)
* inherits default_parameters from KrylovSolver
* residual is reset (r = b - Ax) every 50 iterations, added kernels to compute this for A both GPUMatrix and GPULocalMatrix
CuspKrylovSolver: output final residual after convergence
STATUS: CUDAKrylovSolver seems to produce wrong results

5506. By Florian Rathgeber

added check for CUDA-capable GPUs present and their compute capability and set code generation accordingly in FindCUDA.cmake

5505. By Florian Rathgeber

updated dolfin-config.cmake.in, seems to correctly pass compiler flags now

5504. By Florian Rathgeber

added GPUAssembler parameters to GlobalParameters, updated parameters in CuspKrylovSolver to be in sync with KrylovSolver

Branch metadata

Branch format:
Branch format 7
Repository format:
Bazaar repository format 2a (needs bzr 1.16 or later)
Stacked on:
lp:dolfin/1.0.x
This branch contains Public information 
Everyone can see this information.