howto:compile_with_cuda
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
howto:compile_with_cuda [2018/10/08 20:02] – oschuett | howto:compile_with_cuda [2019/04/09 10:31] – alazzaro | ||
---|---|---|---|
Line 4: | Line 4: | ||
* Anything that uses '' | * Anything that uses '' | ||
* FFTs, when compiled with '' | * FFTs, when compiled with '' | ||
- | * If linked against an accelerated scalapack/ | + | * If linked against an accelerated scalapack/ |
To enable all CUDA acceleration options the following lines have to be added to the ARCH-file: | To enable all CUDA acceleration options the following lines have to be added to the ARCH-file: | ||
Line 13: | Line 13: | ||
</ | </ | ||
+ | See [[https:// | ||
As a prerequisite the [[https:// | As a prerequisite the [[https:// | ||
+ | |||
===== Libcusmm ===== | ===== Libcusmm ===== | ||
- | The acceleration of DBCSR is performed by libcusmm. This library provides a number of kernels. Each of these kernels can multiply blocks of specific blocksizes. The blocksizes of a simulation are determined by the employed basis-set. | + | The acceleration of DBCSR is performed by libcusmm. This library provides a number of kernels. Each of these kernels can multiply blocks of specific blocksizes. The blocksizes of a simulation are determined by the employed basis-set. |
- | In the following example the kernel for 13x13x15 was missing: | ||
< | < | ||
| | ||
Line 30: | Line 31: | ||
| | ||
| | ||
- | | ||
| | ||
... | ... | ||
Line 38: | Line 38: | ||
</ | </ | ||
- | + | More supported GPUs can be added, please refer to [[howto: | |
- | There are over 2300 readily optimized kernel-parameters available in [[src> | + | |
- | If the desired kernel is already listed in one of the '' | + | |
===== Profiling ===== | ===== Profiling ===== | ||
If you are interested in profiling CP2K with nvprof have a look at [[dev: | If you are interested in profiling CP2K with nvprof have a look at [[dev: |
howto/compile_with_cuda.txt · Last modified: 2020/08/21 10:15 by 127.0.0.1