Targeting AVX2 and full SIMD width,
Performance measured with up to 10-15% improvements on complex scenes.
Summer School/KapiWow <email@example.com>, thanks.
are the 15-10% for the total rendering time of production scenes or just the intersection code?
Very interesting! Here's some tests on i7-4790K, Linux, GCC 7.2, BVH build time excluded.
It's faster though only a few %. Maybe it depends on the CPU, compiler or tests scenes?
I made some tests: http://blender.it4i.cz/research/bounding-volume-hierarchy-bvh .