Cycles: Workaround for performance loss with the CUDA 9.0 SDK.

CUDA 9.0.176 apparently caused some slow down on high-end Pascal cards that can be mitigated by increasing the number of registers. See https://developer.blender.org/F1142667 for a detailed comparison.
This commit is contained in:
Stefan Werner 2017-11-21 10:29:11 +01:00 committed by Brecht Van Lommel
parent 30a0459f2c
commit b3adce7766
1 changed files with 5 additions and 0 deletions

View File

@ -76,6 +76,11 @@
# define CUDA_KERNEL_MAX_REGISTERS 48
# define CUDA_KERNEL_BRANCHED_MAX_REGISTERS 63
/* CUDA 9.0 seems to cause slowdowns on high-end Pascal cards unless we increase the number of registers */
# if __CUDACC_VER_MAJOR__ == 9 && __CUDA_ARCH__ >= 600
# #define CUDA_KERNEL_MAX_REGISTERS 64
# endif
/* unknown architecture */
#else
# error "Unknown or unsupported CUDA architecture, can't determine launch bounds"