Fix crashes when system has 2 or more OpenCL platforms. Add async copy to device interface. Fixed completely broken Multi device. Add command throttle. #31836

Doug Gale · 2012-06-14T19:50:22+02:00

Doug Gale commented

2012-06-14 19:50:22 +02:00

Almost everywhere in Blender, the code assumes there is only one OpenCL platform and blindly uses the first one. With the Intel OpenCL SDK installed alongside a non-Intel GPU, Blender always crashes when showing the "System" tab of User Preferences. The same assumption also caused a crash when running in background mode with multiple OpenCL platforms present. This patch fixes both.
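A minimal sketch of the "first platform only" problem, written without the OpenCL API so it stays self-contained. `PlatformInfo`, `DeviceRef`, `collect_gpu_devices` and `first_gpu_platform` are hypothetical names for illustration, not code from the patch. In real OpenCL, `clGetPlatformIDs` can return several platforms, and `clGetDeviceIDs` reports `CL_DEVICE_NOT_FOUND` for a platform that has no device of the requested type, so the first platform may simply have no GPU.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the result of querying one OpenCL platform.
struct PlatformInfo {
    std::string name;
    std::vector<std::string> gpu_devices;  // empty for a CPU-only platform
};

// A device together with the platform it was found on.
struct DeviceRef {
    std::size_t platform_index;
    std::string device_name;
};

// Walk every platform instead of assuming platforms[0] holds the GPU.
std::vector<DeviceRef> collect_gpu_devices(const std::vector<PlatformInfo> &platforms)
{
    std::vector<DeviceRef> result;
    for (std::size_t i = 0; i < platforms.size(); ++i) {
        for (const std::string &device : platforms[i].gpu_devices) {
            result.push_back({i, device});
        }
    }
    return result;
}

// Index of the first platform that exposes a GPU, or -1 if none does.
int first_gpu_platform(const std::vector<PlatformInfo> &platforms)
{
    for (std::size_t i = 0; i < platforms.size(); ++i) {
        if (!platforms[i].gpu_devices.empty()) {
            return static_cast<int>(i);
        }
    }
    return -1;
}
```

With a CPU-only platform listed first and a GPU platform second, `first_gpu_platform` returns 1 rather than 0, which is the case the original code did not handle.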

The Multi device was utterly broken: it tried to reuse a single device_memory object across subdevices, which cannot work because named const vars store refs to those objects. Each subdevice of a multi device must therefore have its own device_memory object. This patch does that, which fixes the Multi device.

The multi device did each memory copy using blocking writes, so the copies ran sequentially, one subdevice after another. The device interface now has an asynchronous memory copy capability (it falls back to synchronous copies if the target implementation doesn't implement the new *_async extensions). Each async call returns an integer, which can later be used to wait for that operation. You can also implicitly gather the operations and wait for them all later. The multi device uses this to convert each synchronous operation into a pair of concurrent async operations.
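The handle-based scheme described above can be sketched roughly as follows. This is not the patch's interface: `CopyQueue`, `copy_async`, `wait` and `wait_all` are hypothetical names, and `std::async` with `std::memcpy` stands in for a real device transfer. It only shows the shape of the idea: each copy returns an integer, a backend without async support copies immediately, and the caller can wait on one handle or on everything outstanding.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <future>
#include <map>
#include <vector>

// Hypothetical copy interface; names are illustrative, not the patch's API.
class CopyQueue {
public:
    explicit CopyQueue(bool supports_async) : supports_async_(supports_async) {}

    // Start a copy and return an integer handle that can be waited on later.
    int copy_async(void *dst, const void *src, std::size_t size)
    {
        const int handle = next_handle_++;
        if (supports_async_) {
            pending_[handle] = std::async(std::launch::async, [dst, src, size] {
                std::memcpy(dst, src, size);
            });
        }
        else {
            // Fallback: copy synchronously; the handle is already complete.
            std::memcpy(dst, src, size);
        }
        return handle;
    }

    // Block until the copy identified by `handle` has finished.
    void wait(int handle)
    {
        auto it = pending_.find(handle);
        if (it != pending_.end()) {
            it->second.get();
            pending_.erase(it);
        }
    }

    // Block until every outstanding copy has finished.
    void wait_all()
    {
        for (auto &entry : pending_) {
            entry.second.get();
        }
        pending_.clear();
    }

    std::size_t pending_count() const { return pending_.size(); }

private:
    bool supports_async_;
    int next_handle_ = 1;
    std::map<int, std::future<void>> pending_;
};
```

A multi device wrapper could issue one `copy_async` per subdevice and call `wait_all` once, so the transfers overlap instead of running back to back.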

Blender was completely overloading the GPU queue by putting far too much work into the OpenCL work queue, causing GUI slowdown severe enough to lose control of the machine. A self-tuning throttling mechanism now manages queue depth, greatly improving GUI responsiveness while still keeping the GPU full with work.
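The description does not say how the throttle tunes itself, so the following is only one plausible shape for such a mechanism, not the patch's algorithm. It assumes a hypothetical `QueueThrottle` that is told how long the last batch of queued commands took: if that exceeds a target time the allowed depth is halved, otherwise it grows by one.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical self-tuning queue depth limiter (assumed design, see lead-in).
class QueueThrottle {
public:
    QueueThrottle(int min_depth, int max_depth, double target_ms)
        : min_depth_(min_depth), max_depth_(max_depth), target_ms_(target_ms), depth_(min_depth)
    {
    }

    // Number of commands that may currently be in flight.
    int depth() const { return depth_; }

    // Report how long the last batch of depth() commands took to complete.
    void report_batch_time(double elapsed_ms)
    {
        if (elapsed_ms > target_ms_) {
            // Too slow for a responsive GUI: back off quickly.
            depth_ = std::max(min_depth_, depth_ / 2);
        }
        else {
            // Fast enough: probe for more throughput one step at a time.
            depth_ = std::min(max_depth_, depth_ + 1);
        }
    }

private:
    int min_depth_;
    int max_depth_;
    double target_ms_;
    int depth_;
};
```

Halving on overload and growing by one otherwise backs off fast when the GUI starts to stall and recovers gradually, while the minimum depth keeps some work queued at all times.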

I made the device_update function do all of its transfers asynchronously using the new functionality; it synchronizes at the end of the top-level update. This allows nearly all (all?) memory transfers to proceed asynchronously, overlapping with subsequent processing.

Doug Gale commented

2012-06-14 19:50:22 +02:00

Changed status to: 'Open'

Doug Gale commented

2012-06-14 20:03:11 +02:00

Please disregard doug65536_opencl_patch.diff; I accidentally uploaded an old diff I had lying around. Use doug65536_opencl_patch2_IGNORE_OTHER_ONE.diff instead.

Sergey Sharybin self-assigned this 2015-04-03 12:51:22 +02:00

Sergey Sharybin commented

2015-04-03 12:51:22 +02:00

Most of the compilation error fixes and the multi device issues are likely solved by now anyway. There are still some interesting parts about async API usage, but before working on that I think it makes sense to finish the kernel split first, which is happening in [D1200](https://archive.blender.org/developer/D1200).

Assigning to myself so the work does not get totally lost from the radar.

tyoc213 commented

2015-08-01 22:38:40 +02:00

Added subscriber: @tyoc213

Aaron Carlisle commented

2015-08-01 22:55:46 +02:00

Added subscriber: @Blendify

Aaron Carlisle commented

2015-08-01 22:55:46 +02:00

poke

Aaron Carlisle commented

2015-08-24 02:21:36 +02:00

Has this gotten lost in the radar?

Sergey Sharybin commented

2015-08-24 13:06:21 +02:00

@Blendify, it's not lost, but the split kernel work changed priorities. I would also ask that priority triaging of reports be left to developers, especially when the priority was explicitly set by a developer.

Aaron Carlisle commented

2017-06-01 22:27:33 +02:00

Added subscriber: @MaiLavelle

Aaron Carlisle commented

2017-06-01 22:27:33 +02:00

@MaiLavelle can this be closed?

Dalai Felinto commented

2019-12-23 18:55:36 +01:00

Added subscriber: @dfelinto

Dalai Felinto commented

2019-12-23 18:55:36 +01:00

Changed status from 'Confirmed' to: 'Archived'

Dalai Felinto closed this issue

2019-12-23 18:55:36 +01:00

Dalai Felinto commented

2019-12-23 18:55:36 +01:00

Hi, thanks for your patch.

We are undergoing a [Tracker Curfew](https://code.blender.org/?p=3861) where we are automatically closing old patches.

If you think the patch is still relevant please update and re-submit it. For new features make sure there is a clear design from the user level perspective.
