lp:~florian-rathgeber/dolfin/gpu-backend
- Get this branch:
- bzr branch lp:~florian-rathgeber/dolfin/gpu-backend
Recent revisions
- 5512. By Florian Rathgeber
-
CUDAKrylovSolver: compute convergence in terms of the residual norm (instead of sqrt(<r,d>)) for better comparability with the Cusp and PETSc solvers; bugfix: changed GRID_DIM to a power of 2 (otherwise the vecdot kernels fail)
- 5511. By Florian Rathgeber
-
added positive definite Helmholtz demo for both CPU and GPU; STATUS: Cusp GPU solver works, CUDA GPU solver produces wrong results for all but very small problems
- 5510. By Florian Rathgeber
-
CuspKrylovSolver: const arguments to solve functions (as in GenericLinearSolver)
moved parameter "fixed_iteration_number" to KrylovSolver in order to be able to set it from global parameters
- 5509. By Florian Rathgeber
-
CUDAKrylovSolver: convergence check uses residual instead of squared residual (squared residual might easily get below floating point precision and hence the check could become inaccurate)
- 5508. By Florian Rathgeber
-
added advection-diffusion GPU demos using CUDA backend with both assembled matrix and local-matrix-approach
- 5507. By Florian Rathgeber
-
refactored GPUSolver:
* renamed to CUDAKrylovSolver for consistency
* derives from GenericLinearSolver for compatibility with DefaultFactory, but does not implement full interface (set_operator etc.)
* inherits default_parameters from KrylovSolver
* residual is reset (r = b - Ax) every 50 iterations; added kernels to compute this when A is either a GPUMatrix or a GPULocalMatrix
CuspKrylovSolver: output final residual after convergence
STATUS: CUDAKrylovSolver seems to produce wrong results
- 5506. By Florian Rathgeber
-
added check for CUDA-capable GPUs present and their compute capability and set code generation accordingly in FindCUDA.cmake
- 5505. By Florian Rathgeber
-
updated dolfin-config.cmake.in, seems to correctly pass compiler flags now
- 5504. By Florian Rathgeber
-
added GPUAssembler parameters to GlobalParameters, updated parameters in CuspKrylovSolver to be in sync with KrylovSolver
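Revisions 5507, 5509 and 5512 describe two solver techniques: testing convergence on the residual norm rather than on the squared residual, and recomputing the residual from scratch (r = b - Ax) every 50 iterations. The following is a minimal CPU-side sketch of a conjugate gradient loop with both features, written in plain Python for illustration only. It is not the branch's CUDA code. The function names, the relative tolerance `rtol`, and the test matrix built by `helmholtz_1d` are assumptions made for this example.

```python
import math


def matvec(A, x):
    """Dense matrix-vector product for a list-of-rows matrix."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def helmholtz_1d(n):
    """Hypothetical test matrix: 1D Laplacian stencil plus identity (SPD)."""
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        A[i][i] = 3.0
        if i > 0:
            A[i][i - 1] = -1.0
        if i < n - 1:
            A[i][i + 1] = -1.0
    return A


def cg(A, b, rtol=1e-10, max_iter=1000, reset_every=50):
    """Conjugate gradients for a symmetric positive definite A.

    Returns (x, iterations, final residual norm).
    """
    x = [0.0] * len(b)
    r = list(b)                      # r = b - A x with x = 0
    d = list(r)                      # initial search direction
    rr = dot(r, r)
    b_norm = math.sqrt(dot(b, b)) or 1.0
    if math.sqrt(rr) <= rtol * b_norm:
        return x, 0, math.sqrt(rr)
    for it in range(1, max_iter + 1):
        Ad = matvec(A, d)
        alpha = rr / dot(d, Ad)
        x = [xi + alpha * di for xi, di in zip(x, d)]
        if it % reset_every == 0:
            # Periodic reset: recompute the true residual to limit drift
            # accumulated by the recurrence below.
            r = [bi - axi for bi, axi in zip(b, matvec(A, x))]
        else:
            r = [ri - alpha * adi for ri, adi in zip(r, Ad)]
        rr_new = dot(r, r)
        # Convergence is tested on ||r||, not on the squared value <r, r>,
        # which would reach the limits of floating point precision sooner.
        if math.sqrt(rr_new) <= rtol * b_norm:
            return x, it, math.sqrt(rr_new)
        d = [ri + (rr_new / rr) * di for ri, di in zip(r, d)]
        rr = rr_new
    return x, max_iter, math.sqrt(rr)
```

Calling `cg(helmholtz_1d(20), [1.0] * 20, reset_every=5)` exercises the reset branch on a small problem. The value 50 from the commit message is kept as the default.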
Branch metadata
- Branch format:
- Branch format 7
- Repository format:
- Bazaar repository format 2a (needs bzr 1.16 or later)
- Stacked on:
- lp:dolfin/1.0.x