Cycles fails to compile opencl kernel
Closed, Resolved | Public | BUG

Description

System Information
Operating system: Linux-5.6.10-custom-x86_64-AMD_A8-5600K_APU_with_Radeon-tm-_HD_Graphics-with-slackware-14.2 64 Bits
Graphics card: AMD Radeon (TM) RX 480 Graphics (POLARIS10, DRM 3.36.0, 5.6.10-custom, LLVM 10.0.0) X.Org 4.6 (Core Profile) Mesa 20.0.2

Blender Version
Broken: version: 2.83 (sub 15), branch: master, commit date: 2020-05-05 22:28, hash: rBc036ef136960
Worked: a build from a couple of days ago worked fine; version 2.82 works as well.

Short description of error
Cycles fails to compile opencl kernel.

Exact steps for others to reproduce the error
It fails on the default cube scene.

Logs

./blender --debug-cycles
Read prefs: /home/sanjuro/.config/blender/2.83/config/userpref.blend
found bundled python: /opt/applications/blender-2.83-c036ef136960-linux64/2.83/python
I0506 10:50:55.780392 2271 blender_python.cpp:191] Debug flags initialized to:
CPU flags:
AVX2 : True
AVX : True
SSE4.1 : True
SSE3 : True
SSE2 : True
BVH layout : BVH8
Split : False
CUDA flags:
Adaptive Compile : False
OptiX flags:
CUDA streams : 1
OpenCL flags:
Device type : ALL
Debug : False
Memory limit : 0
I0506 10:51:14.656942 2271 device_cuda.cpp:56] CUEW initialization failed: Error opening the library
I0506 10:51:16.374800 2271 device_opencl.cpp:48] CLEW initialization succeeded.
I0506 10:51:16.448984 2271 opencl_util.cpp:945] Enumerating devices for platform AMD Accelerated Parallel Processing.
I0506 10:51:16.449033 2271 opencl_util.cpp:981] Using more readable device name: AMD Radeon (TM) RX 480 Graphics
I0506 10:51:16.449050 2271 opencl_util.cpp:983] Adding new device AMD Radeon (TM) RX 480 Graphics.
I0506 10:51:17.432133 2271 util_task.cpp:329] Creating pool of 4 threads.
I0506 10:51:17.432173 2271 util_task.cpp:241] Detected 4 processors in active group.
I0506 10:51:17.432196 2271 util_task.cpp:251] Not setting thread group affinity.
I0506 10:51:17.433156 2271 device_opencl_impl.cpp:639] Creating new Cycles device for OpenCL platform AMD Accelerated Parallel Processing, device AMD Radeon (TM) RX 480 Graphics.
I0506 10:51:17.583458 2271 opencl_util.cpp:298] OpenCL program split_subsurface_scatter not found in cache.
I0506 10:51:17.675879 2271 opencl_util.cpp:298] OpenCL program split_subsurface_scatter not found on disk.
I0506 10:51:17.675966 2271 opencl_util.cpp:298] OpenCL program split_direct_lighting not found in cache.
I0506 10:51:17.676230 2356 opencl_util.cpp:298] OpenCL program split_subsurface_scatter not found in cache.
I0506 10:51:17.755847 2271 opencl_util.cpp:298] OpenCL program split_direct_lighting not found on disk.
I0506 10:51:17.755939 2271 opencl_util.cpp:298] OpenCL program split_indirect_background not found in cache.
I0506 10:51:17.756111 2371 opencl_util.cpp:298] OpenCL program split_direct_lighting not found in cache.
Cycles: compiling OpenCL program split_subsurface_scatter...
I0506 10:51:17.776859 2356 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
I0506 10:51:17.830255 2271 opencl_util.cpp:298] OpenCL program split_indirect_background not found on disk.
I0506 10:51:17.830617 2380 opencl_util.cpp:298] OpenCL program split_indirect_background not found in cache.
I0506 10:51:17.840534 2271 opencl_util.cpp:298] OpenCL program split_do_volume not found in cache.
Cycles: compiling OpenCL program split_direct_lighting...
I0506 10:51:17.880977 2371 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
I0506 10:51:17.938980 2271 opencl_util.cpp:298] OpenCL program split_do_volume not found on disk.
I0506 10:51:17.939067 2271 opencl_util.cpp:298] OpenCL program split_shader_eval not found in cache.
I0506 10:51:17.939131 2381 opencl_util.cpp:298] OpenCL program split_do_volume not found in cache.
I0506 10:51:18.016666 2271 opencl_util.cpp:298] OpenCL program split_shader_eval not found on disk.
I0506 10:51:18.016773 2271 opencl_util.cpp:298] OpenCL program split_lamp_emission not found in cache.
I0506 10:51:18.111045 2271 opencl_util.cpp:298] OpenCL program split_lamp_emission not found on disk.
I0506 10:51:18.111166 2271 opencl_util.cpp:298] OpenCL program split_holdout_emission_blurring_pathtermination_ao not found in cache.
I0506 10:51:18.183953 2271 opencl_util.cpp:298] OpenCL program split_holdout_emission_blurring_pathtermination_ao not found on disk.
I0506 10:51:18.184069 2271 opencl_util.cpp:298] OpenCL program split_shadow_blocked_dl not found in cache.
I0506 10:51:18.251524 2271 opencl_util.cpp:298] OpenCL program split_shadow_blocked_dl not found on disk.
I0506 10:51:18.251632 2271 opencl_util.cpp:298] OpenCL program split_shadow_blocked_ao not found in cache.
I0506 10:51:18.330969 2271 opencl_util.cpp:298] OpenCL program split_shadow_blocked_ao not found on disk.
I0506 10:51:18.331269 2271 opencl_util.cpp:298] OpenCL program split_bundle not found in cache.
I0506 10:51:18.421718 2271 opencl_util.cpp:298] OpenCL program split_bundle not found on disk.
I0506 10:51:18.422175 2271 device_opencl_impl.cpp:920] Buffer allocate: RenderBuffers, 43,206,144 bytes. (41.20M)
I0506 10:51:18.469211 2271 device_opencl_impl.cpp:920] Buffer allocate: display buffer half, 21,603,072 bytes. (20.60M)
I0506 10:51:18.936722 2440 session.cpp:803] Requested features:
Experimental features: Off
Max nodes group: 0
Nodes features: 0
Use Hair: False
Use Object Motion: False
Use Camera Motion: False
Use Baking: False
Use Subsurface: False
Use Volume: False
Use Branched Integrator: False
Use Patch Evaluation: False
Use Transparent Shadows: False
Use Principled BSDF: True
Use Denoising: False
Use Displacement: False
Use Background Light: True
I0506 10:51:18.937978 2440 device_opencl_impl.cpp:759] Loading kernels for platform AMD Accelerated Parallel Processing, device AMD Radeon (TM) RX 480 Graphics.
I0506 10:51:18.938197 2440 opencl_util.cpp:298] OpenCL program base not found in cache.
I0506 10:51:18.974778 2440 opencl_util.cpp:325] Build options passed to clBuildProgram: '-cl-no-signed-zeros -cl-mad-enable -D__KERNEL_OPENCL_AMD__ -D__KERNEL_CL_KHR_FP16__ '.
I0506 10:51:18.980587 2440 opencl_util.cpp:298] Loaded program from /home/sanjuro/.cache/cycles/kernels/cycles_kernel_base_3E07356B4B46689D30DE1E70918B033E_E853A8238BDD0E253C66384B1C685DD7.clbin.
I0506 10:51:18.980692 2440 opencl_util.cpp:298] OpenCL program background not found in cache.
I0506 10:51:19.082490 2440 opencl_util.cpp:298] OpenCL program background not found on disk.
I0506 10:51:19.082655 2440 opencl_util.cpp:298] OpenCL program split_shader_eval not found in cache.
Cycles: compiling OpenCL program split_indirect_background...
I0506 10:51:19.091385 2371 opencl_util.cpp:298] Separate-process building of /home/sanjuro/.cache/cycles/kernels/cycles_kernel_split_direct_lighting_4A526F5A1C1A408546E0B35129173E30_4A9D816F3C0CD3D5C15703F8E549BE09.clbin failed, will fall back to regular building.
I0506 10:51:19.091435 2380 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
Cycles: compiling OpenCL program split_do_volume...
I0506 10:51:19.146499 2356 opencl_util.cpp:298] Separate-process building of /home/sanjuro/.cache/cycles/kernels/cycles_kernel_split_subsurface_scatter_4A526F5A1C1A408546E0B35129173E30_3ECCEE227BE1374E6C35F161F5BA9E5B.clbin failed, will fall back to regular building.
I0506 10:51:19.146566 2381 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
Cycles: compiling OpenCL program split_direct_lighting...
I0506 10:51:19.216847 2371 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
I0506 10:51:19.216900 2371 opencl_util.cpp:325] Build options passed to clBuildProgram: '-cl-no-signed-zeros -cl-mad-enable -D__KERNEL_OPENCL_AMD__ -D__KERNEL_CL_KHR_FP16__ -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__'.
Cycles: compiling OpenCL program split_subsurface_scatter...
I0506 10:51:19.239979 2356 opencl_util.cpp:298] Build flags: -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__
I0506 10:51:19.240196 2356 opencl_util.cpp:325] Build options passed to clBuildProgram: '-cl-no-signed-zeros -cl-mad-enable -D__KERNEL_OPENCL_AMD__ -D__KERNEL_CL_KHR_FP16__ -D__SPLIT_KERNEL__ -D__COMPUTE_DEVICE_GPU__ -D__KERNEL_AO_PREVIEW__ -D__NODES_MAX_GROUP__=0 -D__NODES_FEATURES__=0 -D__NO_OBJECT_MOTION__ -D__NO_CAMERA_MOTION__ -D__NO_BAKING__ -D__NO_VOLUME__ -D__NO_SUBSURFACE__ -D__NO_BRANCHED_PATH__ -D__NO_PATCH_EVAL__ -D__NO_TRANSPARENT__ -D__NO_SHADOW_TRICKS__ -D__NO_PRINCIPLED__ -D__NO_DENOISING__ -D__NO_SHADER_RAYTRACE__'.
OpenCL build failed with error CL_BUILD_PROGRAM_FAILURE, errors in console.
OpenCL program split_direct_lighting build output: source/kernel/svm/svm_voronoi.h:687:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:726:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -2; j <= 2; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:770:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:809:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:829:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:859:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
source/kernel/svm/svm_voronoi.h:880:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
ccl_loop_no_unroll for (int j = -1; j <= 1; j++)
^
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
#define ccl_loop_no_unroll __attribute__((opencl_unroll_hint(1)))
^
7 errors generated.

error: Clang front-end compilation failed!
Frontend phase failed compilation.
Error: Compiling CL to IR
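
The build output above repeats one diagnostic seven times, with the macro expansion note after each. As a reading aid, here is a minimal Python sketch that groups clang-style diagnostics by message and lists where each occurs. The function name `summarize`, the regular expression, and the shortened `sample` text are illustrative assumptions and are not part of Blender or Cycles.

```python
import re
from collections import defaultdict

# Matches clang-style "path:line:col: severity: message" anywhere in a line.
# "note:" lines (macro expansion hints) are skipped on purpose.
DIAG = re.compile(r"(\S+):(\d+):(\d+): (error|warning): (.*)")


def summarize(log_text):
    """Group diagnostics by message, mapping each message to its file:line sites."""
    groups = defaultdict(list)
    for line in log_text.splitlines():
        match = DIAG.search(line)
        if match:
            path, lineno, _col, _severity, message = match.groups()
            groups[message].append(f"{path}:{lineno}")
    return dict(groups)


# Shortened excerpt of the build output quoted in this report.
sample = """OpenCL program split_direct_lighting build output: source/kernel/svm/svm_voronoi.h:687:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
source/kernel/kernel_compat_opencl.h:46:43: note: expanded from macro 'ccl_loop_no_unroll'
source/kernel/svm/svm_voronoi.h:726:7: error: 'opencl_unroll_hint' attribute requires OpenCL version 2.0 or above
"""

summary = summarize(sample)
for message, sites in summary.items():
    print(f"{len(sites)}x {message}")
    for site in sites:
        print(f"    {site}")
```

Run against the full log, this should collapse the seven errors into a single entry, since they all come from the `ccl_loop_no_unroll` macro used in `svm_voronoi.h`.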

clinfo

Number of platforms:                             1
Platform Profile:                              FULL_PROFILE
Platform Version:                              OpenCL 2.1 AMD-APP (3075.10)
Platform Name:                                 AMD Accelerated Parallel Processing
Platform Vendor:                               Advanced Micro Devices, Inc.
Platform Extensions:                           cl_khr_icd cl_amd_event_callback cl_amd_offline_devices 


Platform Name:                                 AMD Accelerated Parallel Processing
Number of devices:                               1
Device Type:                                   CL_DEVICE_TYPE_GPU
Vendor ID:                                     1002h
Board name:                                    AMD Radeon (TM) RX 480 Graphics
Device Topology:                               PCI[ B#1, D#0, F#0 ]
Max compute units:                             36
Max work items dimensions:                     3
  Max work items[0]:                           1024
  Max work items[1]:                           1024
  Max work items[2]:                           1024
Max work group size:                           256
Preferred vector width char:                   4
Preferred vector width short:                  2
Preferred vector width int:                    1
Preferred vector width long:                   1
Preferred vector width float:                  1
Preferred vector width double:                 1
Native vector width char:                      4
Native vector width short:                     2
Native vector width int:                       1
Native vector width long:                      1
Native vector width float:                     1
Native vector width double:                    1
Max clock frequency:                           1303Mhz
Address bits:                                  64
Max memory allocation:                         4244635648
Image support:                                 Yes
Max number of images read arguments:           128
Max number of images write arguments:          8
Max image 2D width:                            16384
Max image 2D height:                           16384
Max image 3D width:                            2048
Max image 3D height:                           2048
Max image 3D depth:                            2048
Max samplers within kernel:                    16
Max size of kernel argument:                   1024
Alignment (bits) of base address:              2048
Minimum alignment (bytes) for any datatype:    128
Single precision floating point capability
  Denorms:                                     No
  Quiet NaNs:                                  Yes
  Round to nearest even:                       Yes
  Round to zero:                               Yes
  Round to +ve and infinity:                   Yes
  IEEE754-2008 fused multiply-add:             Yes
Cache type:                                    Read/Write
Cache line size:                               64
Cache size:                                    16384
Global memory size:                            7852957696
Constant buffer size:                          4244635648
Max number of constant args:                   8
Local memory type:                             Scratchpad
Local memory size:                             32768
Max pipe arguments:                            0
Max pipe active reservations:                  0
Max pipe packet size:                          0
Max global variable size:                      0
Max global variable preferred total size:      0
Max read/write image args:                     0
Max on device events:                          0
Queue on device max size:                      0
Max on device queues:                          0
Queue on device preferred size:                0
SVM capabilities:                              
  Coarse grain buffer:                         No
  Fine grain buffer:                           No
  Fine grain system:                           No
  Atomics:                                     No
Preferred platform atomic alignment:           0
Preferred global atomic alignment:             0
Preferred local atomic alignment:              0
Kernel Preferred work group size multiple:     64
Error correction support:                      0
Unified memory for Host and Device:            0
Profiling timer resolution:                    1
Device endianess:                              Little
Available:                                     Yes
Compiler available:                            Yes
Execution capabilities:                                
  Execute OpenCL kernels:                      Yes
  Execute native function:                     No
Queue on Host properties:                              
  Out-of-Order:                                No
  Profiling :                                  Yes
Queue on Device properties:                            
  Out-of-Order:                                No
  Profiling :                                  No
Platform ID:                                   0x7f48cb9c4e50
Name:                                          Ellesmere
Vendor:                                        Advanced Micro Devices, Inc.
Device OpenCL C version:                       OpenCL C 1.2 
Driver version:                                3075.10
Profile:                                       FULL_PROFILE
Version:                                       OpenCL 1.2 AMD-APP (3075.10)
Extensions:                                    cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event
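
The clinfo output above reports a platform version of OpenCL 2.1 but a device OpenCL C version of 1.2, while the compiler error says `opencl_unroll_hint` needs OpenCL 2.0 or above. A minimal Python sketch for pulling both versions out of clinfo text and comparing the device value against that 2.0 requirement follows. The helper names are hypothetical, and treating the reported device OpenCL C version as the language version the compiler enforces is an assumption, not something the log confirms.

```python
import re


def _version(match):
    """Turn a regex match with (major, minor) groups into a tuple, or None."""
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def opencl_versions(clinfo_text):
    """Return (platform_version, device_c_version) parsed from clinfo output."""
    platform = re.search(r"Platform Version:\s+OpenCL (\d+)\.(\d+)", clinfo_text)
    device_c = re.search(
        r"Device OpenCL C version:\s+OpenCL C (\d+)\.(\d+)", clinfo_text
    )
    return _version(platform), _version(device_c)


# Two lines copied from the clinfo output in this report.
clinfo_excerpt = """\
Platform Version:                              OpenCL 2.1 AMD-APP (3075.10)
Device OpenCL C version:                       OpenCL C 1.2
"""

platform_version, device_c_version = opencl_versions(clinfo_excerpt)

# The error text states the attribute requires OpenCL version 2.0 or above.
supports_unroll_hint = device_c_version is not None and device_c_version >= (2, 0)

print("platform:", platform_version)
print("device OpenCL C:", device_c_version)
print("meets 2.0 requirement:", supports_unroll_hint)
```

The check uses the device value rather than the platform value because the two differ on this system, and only the device value falls below 2.0.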