mathieu menuet (bliblubli)
User

Projects

User does not belong to any projects.

User Details

User Since
Jun 2 2013, 4:47 PM (267 w, 3 d)

Recent Activity

Apr 12 2018

mathieu menuet (bliblubli) created T54578: regression python crash modifying text block.
Apr 12 2018, 7:42 AM · Python, BF Blender: Regressions, BF Blender

Apr 3 2018

mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

@Brecht Van Lommel (brecht) yep, so we agree: it's the same bug that makes the wood floor look strange and adds the noise in the other scene.

Apr 3 2018, 6:16 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

The differences in noise level in https://developer.blender.org/T54486#491645 are really due to the windows, which should be completely ignored because their visibility is set to camera rays only. 2.79a ignores them; latest master does not. When I render without the windows, both stable and master render the same image.

Apr 3 2018, 5:47 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

@Brecht Van Lommel (brecht) I changed line 235 of kernel_path_state.h from

Apr 3 2018, 5:45 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

I tried to reproduce it in a simple scene (a box with principled on multiscatter GGX, one opening, 1 portal and one window with only camera visibility), but somehow the bug doesn't show up in master with this file. Making it a bit more complex, with some textures and objects to occlude and produce more bounces, shows a clear difference in noise pattern, but not really in noise level. Complex, real-use-case scenes, on the other hand, clearly show that 2.79a has a much better noise level, but I couldn't find the reason yet. Keeping only the walls, windows and lights in the living room/kitchen scene above rendered significantly darker in master compared to 2.79a.

Apr 3 2018, 11:10 AM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

@Brecht Van Lommel (brecht) so if I understand you right, this also explains the much higher noise levels in this scene (see pictures). This scene is lit by portals, with windows set to be visible only to camera rays. But indeed, although those windows are only hit by non-camera rays, removing the windows completely greatly improves lighting and noise in latest master. So it showcases what you said about ray visibility. Here is a comparison of scene 10 from archinteriors 43 for Blender in master and stable:
2.79a = low level noise:


buildbot = very high noise level:

Apr 3 2018, 10:19 AM · BF Blender, Cycles

Apr 2 2018

mathieu menuet (bliblubli) added a comment to D3109: Blender: Patch for Intel Compiler ( 2016, 2017, 2018 ).

@Milan Jaros (jar091) could you provide a windows or Linux build of latest master using intel compiler 2018 to compare performance in different use cases?

Apr 2 2018, 6:15 PM

Apr 1 2018

mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

As this pack https://evermotion.org/shop/show_product/archinteriors-vol-43-for-blender/14563 has just been released this week, it would be great if users had an option so that it's not already broken in buildbot/the next release. Blender is just starting to appear on one of the most renowned visualization websites.

Apr 1 2018, 10:29 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

@Brecht Van Lommel (brecht), it's much more subtle in this case, but should be the same thing happening. The master render has a much darker specularity than stable, like in the wood renders in first post.

Apr 1 2018, 8:25 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

ok, I hope I nailed it.
stable:


Master:

Apr 1 2018, 8:23 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

@Brecht Van Lommel (brecht) which bugfix commits touched the specular part of the principled shader?

Apr 1 2018, 7:53 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T54486: Render difference with big quad light and ray visibility.

OS is Linux. If the difference seen in the attached blend is not enough, I'll try to find a free wood texture alternative. The materials in the renders above are copyrighted by evermotion.

Apr 1 2018, 7:52 PM · BF Blender, Cycles
mathieu menuet (bliblubli) renamed T54486: Render difference with big quad light and ray visibility from principled shader looks like wet in latest master to Regression: principled shader looks like wet in latest master.
Apr 1 2018, 5:50 PM · BF Blender, Cycles
mathieu menuet (bliblubli) updated the task description for T54486: Render difference with big quad light and ray visibility.
Apr 1 2018, 5:48 PM · BF Blender, Cycles
mathieu menuet (bliblubli) created T54486: Render difference with big quad light and ray visibility.
Apr 1 2018, 5:47 PM · BF Blender, Cycles

Mar 27 2018

mathieu menuet (bliblubli) added a comment to T54024: BMesh boolean fails when edge is co-planar to face (under some circumstances).

Addons relying on the Carve modifier are not working in master, like this one: https://blenderartists.org/forum/showthread.php?409837-Destructive-Extrude-BETA/page9 and the results of BMesh are not yet there in some cases (like overlapping/coplanar edges/faces).
I think it was a very good idea to keep only one version of the boolean modifier, and BMesh is really fast, but it should offer as much as the old combination. Or is the plan to only remove functionality in this case?

Mar 27 2018, 11:10 AM · Mesh Modeling, Modifiers, BF Blender

Mar 3 2018

mathieu menuet (bliblubli) added a comment to T54024: BMesh boolean fails when edge is co-planar to face (under some circumstances).

@Campbell Barton (campbellbarton) As the Carve boolean has been removed, would it be possible to finalize the BMesh version? Before, users could use the other one as a fallback, but now we have to use BMesh. I'm happy to have only one variant if it's robust enough.

Mar 3 2018, 5:09 PM · Mesh Modeling, Modifiers, BF Blender

Feb 23 2018

mathieu menuet (bliblubli) added a comment to T54019: Undefined behavior happen by copying objects with Ctrl+C with custom material nodes.

thanks for investigating. Can you please attach a file and the according steps to reproduce the crash?

Feb 23 2018, 1:03 PM · BF Blender

Feb 17 2018

mathieu menuet (bliblubli) added a comment to T52163: carve boolean modifier make some files impossible to open or very unstable..

Well, Carve was actually a good fallback method when the BMesh boolean failed. Wouldn't it be better to remove fallbacks only once the main method is polished and covers all the cases the fallback did?

Feb 17 2018, 10:09 AM · Modifiers, BF Blender

Jan 31 2018

mathieu menuet (bliblubli) added a comment to T53956: 2.8: Proposal: Remove OpenCollada.
  • As @Vuk Gardašević (lijenstina) said, I would keep OpenCollada in any case until another exporter offers the same speed for heavy meshes/scenes and is as memory efficient (see https://developer.blender.org/T53236#469082).
  • Second, it saved my day many times when the FBX exporter fails.
  • Last but not least, regarding the workflow I see the most (archviz), it's a must have to communicate with Sketchup.
Jan 31 2018, 8:00 PM · BF Blender: 2.8

Jan 30 2018

mathieu menuet (bliblubli) created T53948: Addon Copy-settings.
Jan 30 2018, 6:04 PM · Addons, BF Blender

Jan 16 2018

mathieu menuet (bliblubli) added a comment to T40075: Bevel unexpected result with certain geometry.

@Howard Trickey (howardt) ok, I looked at https://developer.blender.org/T48583 and saw it was the last one. So it's soon the last one :) It's still great to see this polishing going on.

Jan 16 2018, 8:18 PM · BF Blender, Mesh Modeling
mathieu menuet (bliblubli) added a comment to T40075: Bevel unexpected result with certain geometry.

Just wanted to report the same bug and noticed it's the last known one for bevel, congrats @Howard Trickey (howardt) for making it so robust :)

Jan 16 2018, 5:22 PM · BF Blender, Mesh Modeling

Jan 3 2018

mathieu menuet (bliblubli) added a comment to T53683: 2.79a release.

rB5aa08eb3cc7 is safe to include.

Jan 3 2018, 1:56 PM · BF Blender

Dec 29 2017

mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).

Could it be a question of complexity too?

Yes it is, although what is difficult for one GPU may be easier for another, so there is no universal definition of complexity for all GPUs. See http://download.blender.org/institute/benchmark171221/latest_snapshot.html and this selection:


For Classroom, all GPUs render about equally fast, while for Koro, the Vega and TitanV are much faster.

Another question:
Is it possible somehow to render a scene on a nvidia card by using OpenCL and not CUDA for GPU?

Dec 29 2017, 4:57 PM · BF Blender, Rendering, Cycles
mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).

By the way, to avoid users fiddling with their registry, we could just send fewer samples to the GPU at once for CUDA. OpenCL can take anything because there is no such timeout, but for CUDA I don't see an easy way to send the right number of samples to the GPU.

  • A static approach based on the number of CUDA cores wouldn't adapt to scene complexity, so we would have to be very conservative.
  • A dynamic solution, for example taking the mean samples/sec of the n last tiles as reference, may lead to very high render times if rendering starts with sky and then goes from a full-sky tile to one with 10% sky and fur behind a tree with transparency, translucency, etc., which may render 100 or even 1000x slower than pure sky, triggering the timeout.
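
The failure mode of the dynamic approach can be sketched in a few lines. This is only an illustration, not Cycles code: the function name, the 2-second timeout and the safety factor are assumptions made up for the example, and the rates are invented numbers.

```python
from collections import deque


def samples_per_launch(recent_rates, timeout_s=2.0, safety=0.5):
    """Pick a sample count so one launch should finish within the timeout.

    recent_rates: samples/sec measured on the n last tiles (hypothetical input).
    timeout_s and safety are illustrative values, not Cycles settings.
    """
    mean_rate = sum(recent_rates) / len(recent_rates)
    return max(1, int(mean_rate * timeout_s * safety))


# Rates measured on pure-sky tiles: fast and uniform (invented numbers).
history = deque([1000.0, 1000.0, 1000.0], maxlen=3)
batch = samples_per_launch(history)

# The next tile (fur, transparency, ...) renders 100x slower than sky.
actual_rate = 1000.0 / 100
launch_time = batch / actual_rate
exceeds_timeout = launch_time > 2.0
```

The batch is sized from the sky tiles, so the first launch on the slow tile runs far past the assumed timeout before any new measurement can correct the estimate.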
Dec 29 2017, 9:56 AM · BF Blender, Rendering, Cycles
mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).

@Christoph Werner (Taros) you said 500 samples at 16x16 was already triggering the bug. A 1000 spp render at 64x64 with same GPU takes about 8 times longer to render. But I'll try at 1400spp. IIRC @Brecht Van Lommel (brecht) said that this timeout bug was not happening for all users somehow?

Dec 29 2017, 9:37 AM · BF Blender, Rendering, Cycles

Dec 28 2017

mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).
Dec 28 2017, 4:39 PM · BF Blender, Rendering, Cycles

Dec 26 2017

mathieu menuet (bliblubli) lowered the priority of T53635: cycles render crash from High to Incomplete.

Hi, thanks for the report, but please don't set the priority yourself. Everybody finds their own bug to be the most important one.
I couldn't reproduce the crash, neither with 2.79 nor with latest buildbot using driver 17.12.2 on Vega64. So I would recommend to update your driver and test again.

Dec 26 2017, 1:39 PM · BF Blender

Dec 24 2017

mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).

note that I rendered at 1000spp with 64x64 tiles on a 1080Ti too. Windows 7, latest master and driver 388.31

Dec 24 2017, 9:11 PM · BF Blender, Rendering, Cycles
mathieu menuet (bliblubli) added a comment to T53597: CUDA error: Launch failed in cuCtxSynchronize(), line 1372 (SOLUTION: Changing some registry parameters. See the thread for solution details.).

Trying to render with VS2015 builds, I get a similar error (not sure why it's not the same):

Dec 24 2017, 8:36 PM · BF Blender, Rendering, Cycles
mathieu menuet (bliblubli) added a comment to T53517: Cycles lamp "strength" ignores scene's "unit scale", producing inconsistent renders in scenes with consistent scaling.

@Brecht Van Lommel (brecht) adding this information in the tooltip or as a unit like for the distance values would clarify such discrepancies.

Dec 24 2017, 4:50 PM · Cycles

Dec 16 2017

mathieu menuet (bliblubli) added a comment to T52877: OpenCL render somtime idles on one tile with BPT.

Rendering the scene lightning.blend from the link above (https://cloud.blender.org/p/agent-327/591ac3cabb3ea141675bbaf2), tested with latest master and driver 17.11.2 on a Vega 64, OpenCL system memory usage skyrockets to 27.6GB:


while CUDA and CPU both stay at around 16GB system memory usage.

Vega 64 has only 8GB and Cycles reports about 10GB peak memory usage, so I could understand a difference of around 2GB (the maximal size of buffers in latest OpenCL code), showing the driver allocating system memory. But why nearly 12GB? Of course the GPU will then idle most of the time, waiting for the system memory.
@Brecht Van Lommel (brecht) did it successfully render the scene lightning.blend in a normal time on the RX480?
Another point: why is so much system memory kept in use while doing GPU rendering? Couldn't the data be freed once everything has been written to the GPU?

Dec 16 2017, 6:19 PM · BF Blender, Cycles

Dec 11 2017

mathieu menuet (bliblubli) updated subscribers of D2939: temp diff to reduce recompiles in split kernel.

added description. I also don't see the point to do something like that, but @Ton Roosendaal (ton) and @Campbell Barton (campbellbarton) asked me to upload diffs instead of paste last time, so I do it. Otherwise, I would just upload builds with a paste until the patch is stable and well tested by users.

Dec 11 2017, 3:28 PM
mathieu menuet (bliblubli) updated the summary of D2939: temp diff to reduce recompiles in split kernel.
Dec 11 2017, 3:26 PM

Dec 3 2017

mathieu menuet (bliblubli) updated the summary of D2939: temp diff to reduce recompiles in split kernel.
Dec 3 2017, 2:25 PM
mathieu menuet (bliblubli) created D2939: temp diff to reduce recompiles in split kernel.
Dec 3 2017, 8:46 AM

Dec 2 2017

mathieu menuet (bliblubli) added a comment to D2916: Cycles: Speed up split kernel builds by splitting svm_eval_nodes out of direct_emission functions.

with latest version of the branch, barbershop renders correctly now, but the whole PC hangs after the third tile and only a hard reset is possible (windows 7, driver 17.11.4 on Vega64). It would be good to report the bug to AMD.

Dec 2 2017, 11:04 AM

Nov 30 2017

mathieu menuet (bliblubli) added a comment to rBfa3d50af95fd: Cycles: Improve denoising speed on GPUs with small tile sizes.

@Lukas Stockner (lukasstockner97) wouldn't it be possible to let the CPU do the denoising when using GPU rendering? It would save dedicated memory (denoising data and kernel would be in system memory) and let the GPU work only on rendering, which should give some speedup.

Nov 30 2017, 4:19 PM

Nov 27 2017

mathieu menuet (bliblubli) awarded T53405: Dxf support for lwpolyline elevation and small bugfix a Love token.
Nov 27 2017, 8:17 PM · Addons
mathieu menuet (bliblubli) added a comment to rBd75d262318c8: Cycles: Split shader_eval out of the kernel_lamp_emission kernel.

At least on Windows, the barbershop scene renders some tiles like a Z pass with this branch and then crashes.

Nov 27 2017, 6:39 PM
mathieu menuet (bliblubli) added a comment to D2916: Cycles: Speed up split kernel builds by splitting svm_eval_nodes out of direct_emission functions.

kernel build time with this patch on the barbershop scene goes from 98sec (split kernel only) to 89sec. Using the branch, it even goes down to 69sec. Together with kernel_base, it goes from nearly 2 minutes to a bit less than 1 and a half minutes. Still pretty high, but already a good speedup.
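
As a quick check of what those timings mean in relative terms, here is a small sketch using only the numbers quoted above (the helper name is made up for the example):

```python
def reduction_percent(before_s, after_s):
    """Relative reduction in build time, in percent."""
    return 100.0 * (before_s - after_s) / before_s


patch = reduction_percent(98, 89)   # this patch alone: roughly 9%
branch = reduction_percent(98, 69)  # the branch: roughly 30%
```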

Nov 27 2017, 3:04 PM
mathieu menuet (bliblubli) added a comment to D2916: Cycles: Speed up split kernel builds by splitting svm_eval_nodes out of direct_emission functions.

OSL can't build with this patch on windows.

Nov 27 2017, 2:01 PM

Nov 26 2017

mathieu menuet (bliblubli) added a comment to T53349: Cycles - difference between OpenCL and CUDA with AO simplify.

@Brecht Van Lommel (brecht) I don't have commit rights, could you commit it please?

Nov 26 2017, 9:57 AM · BF Blender: Regressions, Cycles

Nov 25 2017

mathieu menuet (bliblubli) added a comment to T53349: Cycles - difference between OpenCL and CUDA with AO simplify.

here is the fix

Nov 25 2017, 9:32 AM · BF Blender: Regressions, Cycles

Nov 19 2017

mathieu menuet (bliblubli) updated the task description for T53349: Cycles - difference between OpenCL and CUDA with AO simplify.
Nov 19 2017, 5:37 PM · BF Blender: Regressions, Cycles
mathieu menuet (bliblubli) updated the task description for T53349: Cycles - difference between OpenCL and CUDA with AO simplify.
Nov 19 2017, 5:24 PM · BF Blender: Regressions, Cycles
mathieu menuet (bliblubli) added a comment to T53349: Cycles - difference between OpenCL and CUDA with AO simplify.

https://git.blender.org/gitweb/gitweb.cgi/blender.git/commit/659ba012b0f30450c6de13f8b1c2fccce32fc461 renders correctly and https://git.blender.org/gitweb/gitweb.cgi/blender.git/commit/f77cdd1d59f6e895b567c4d5fdcc6f2440e03307 renders black.

Nov 19 2017, 5:23 PM · BF Blender: Regressions, Cycles
mathieu menuet (bliblubli) created T53349: Cycles - difference between OpenCL and CUDA with AO simplify.
Nov 19 2017, 4:46 PM · BF Blender: Regressions, Cycles

Nov 14 2017

mathieu menuet (bliblubli) added a comment to T53291: Deleting objects do not delete them from the blend file.

Yeah, that's annoying, and sometimes it's even worse. You have to save/open/save/open/etc. until the whole chain of dependencies has been resolved. Say you delete obj1, which had material1, which had tex1. You have to save, open and save to really get a file without obj1, which leaves material1 with 0 users; then you must save/open to leave tex1 with 0 users, and then save/open again to really have that texture removed as well. When you add linked groups to that story, it's sometimes really time-consuming and cumbersome to get a clean file.
An option "clean all unused datablocks and save" would really help.
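
The chain described above can be modelled with a toy simulation. This is a simplified sketch, not Blender's actual implementation: it assumes each save/reopen cycle drops exactly the datablocks that have 0 users at that moment, and the function and datablock names are made up for the example.

```python
def reload_cycles_to_clean(users, references):
    """Count save/reopen cycles until no zero-user datablock remains.

    users: datablock name -> user count
    references: datablock name -> names of the datablocks it uses
    """
    users = dict(users)  # work on a copy
    cycles = 0
    while True:
        orphans = [name for name, count in users.items() if count == 0]
        if not orphans:
            return cycles
        for name in orphans:
            del users[name]
            # Dropping a datablock releases one user on everything it used.
            for used in references.get(name, []):
                if used in users:
                    users[used] -= 1
        cycles += 1


# obj1 was just deleted (0 users); material1 and tex1 still have one user each.
cycles_needed = reload_cycles_to_clean(
    {"obj1": 0, "material1": 1, "tex1": 1},
    {"obj1": ["material1"], "material1": ["tex1"]},
)
```

In this model the obj1/material1/tex1 chain needs three cycles, one per link. A "clean all unused datablocks and save" option would amount to running this loop to its fixed point in a single step.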

Nov 14 2017, 6:54 PM · BF Blender: 2.8

Nov 12 2017

mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

I rechecked with VS2013 builds. The system memory usage varies a bit (max 500MB compared to many GB with 2015) and the performance also is more stable (max 35% variation during 10 renders).
Could someone confirm those behaviours on Windows and test on Linux?

Nov 12 2017, 1:51 PM · BF Blender, Cycles

Nov 6 2017

mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

P556 seems to limit the slowdown to about 68 seconds from 48, while latest buildbot 8a72be7 goes up to 78 seconds from 45, and its slowdown grows with each new render.

Nov 6 2017, 5:31 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

and that's the log with latest buildbot:

Nov 6 2017, 5:27 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

@Brecht Van Lommel (brecht) thanks for the patch. Latest master with it (I had to apply manually as it seems it was done on a branch?) gives this log on 3 consecutive renders:

Nov 6 2017, 5:10 PM · BF Blender, Cycles
mathieu menuet (bliblubli) updated subscribers of T53249: [regression] OpenCL performance becomes very random with big scenes..
Nov 6 2017, 4:28 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

Where is the "Free mem AMD" print coming from? I can't find that code in master or earlier revisions. In master, the split kernel global size is determined by max_buffer_size and num_elements, which from the logs don't appear to change. Yet the global size is reported as being different.

Yes, I used another version to get the free memory reported and tried to see if limiting global size to make it all fit in memory would solve the problem, but it didn't. I can redo the log with vanilla master if you want. Here is the code:

Nov 6 2017, 2:25 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

@Brecht Van Lommel (brecht) thanks for P555 . Tried it but bug is still there.

Nov 6 2017, 10:24 AM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

@LazyDodo (LazyDodo) the GPU-Z log is wrong somehow, it ignores half of the memory. But it gives the impression that no memory leak happens on the GPU.

Nov 6 2017, 8:38 AM · BF Blender, Cycles

Nov 5 2017

mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

Actually, 2.79 has the bug too; the official build just had the device selection bug and took the 1080Ti instead, which doesn't use system memory.
So it may be a driver bug, but then why does the first render always take 30 seconds?
After some renders, I got up to 114 seconds to render = nearly 3x slower... At this point, however, the GPU was idling a lot, maybe waiting all the time for system memory access?
Here is a picture of the task manager with 2 consecutive renders on the same instance of Blender.


It may be a coincidence, but VS2013 builds had only +/-10% between the first and consecutive renders (I made 5 of them), while VS2015 builds go crazy with up to 3x the render time.
It would help if someone could test on Linux with an RX480 to see whether GCC or the Linux driver handles this differently. As said before, the RX480 can render this scene. On Linux, the Nvidia drivers destroy a part of the AMD driver and I couldn't find a solution to have both drivers side by side yet.

Nov 5 2017, 8:30 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

commit b53e35c655d4 already has the bug, so it's not due to the buffer patch.

Nov 5 2017, 6:44 PM · BF Blender, Cycles

Nov 4 2017

mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

got some explanations on IRC, sorry I didn't know the whole story.

Nov 4 2017, 9:02 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..
Nov 4 2017, 8:19 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

the bug was already there 24.08.2017, so my guess is that rBec8ae4d5e9f7 is the commit we look for.

Nov 4 2017, 8:15 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

just to give an idea of the mess to bisect:

  • CUDA completely disables OpenCL in the majority of revisions, so you have to rebuild without CUDA
  • device selection changed, so user prefs have to be modified depending on the revision you test, and bisecting requires going back and forth in time
  • kernel compilation takes 1min50 for victor
  • scene preparation takes 2min04

So it takes about 5 minutes of VS compile, then manual tweaks to the user prefs, then a 2-minute kernel compile, then 2 renders at 2 (scene prep) + 2 (render) = 8 minutes of rendering. That's a quarter of an hour with 4 user interventions, between which you can't do much.
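
The arithmetic can be written down as a small sketch, using the rounded timings from this comment. The number of candidate commits is a hypothetical figure for illustration, and the time for the manual tweaks is not counted.

```python
import math

vs_compile_min = 5       # Visual Studio build of the revision
kernel_compile_min = 2   # OpenCL kernel compile for victor (rounded)
renders = 2              # first and second render, to see the slowdown
scene_prep_min = 2       # scene preparation per render (rounded)
render_min = 2           # pure render time per render

# Machine time for testing one revision, in minutes.
step_min = vs_compile_min + kernel_compile_min + renders * (scene_prep_min + render_min)


def bisect_minutes(n_commits):
    """Total machine time for a bisect over n_commits candidate commits."""
    return math.ceil(math.log2(n_commits)) * step_min


total_min = bisect_minutes(256)  # 256 commits is a made-up example range
```

With a hypothetical range of 256 commits, that is 8 steps of 15 minutes, so two hours before counting any manual work.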

Nov 4 2017, 7:59 PM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

rBec8ae4d5e9f7 only added support for more than 4GB of textures iirc.
You don't need HBCC support. On win7, even on Vega, there is no HBCC, and my RX480 has also been rendering the full victor scene for a year on Windows and for some months on Linux.
@Brecht Van Lommel (brecht) Is there a simple command to disable the context caching?
I could try to bisect, but @Mai Lavelle (maiself) should have better guesses of what could have introduced this bug. The scene preparation of Victor takes more than 2 minutes on my computer. With compile time on windows on top, shooting in the dark to bisect would take a lot of time.

Nov 4 2017, 7:27 PM · BF Blender, Cycles
mathieu menuet (bliblubli) edited projects for T53249: [regression] OpenCL performance becomes very random with big scenes., added: Cycles, BF Blender: Regressions; removed BF Blender.
Nov 4 2017, 9:49 AM · BF Blender, Cycles
mathieu menuet (bliblubli) renamed T53249: [regression] OpenCL performance becomes very random with big scenes. from OpenCL performance becomes very random with big scenes. to [regression] OpenCL performance becomes very random with big scenes..
Nov 4 2017, 9:49 AM · BF Blender, Cycles
mathieu menuet (bliblubli) updated subscribers of T53249: [regression] OpenCL performance becomes very random with big scenes..
Nov 4 2017, 8:38 AM · BF Blender, Cycles
mathieu menuet (bliblubli) assigned T53249: [regression] OpenCL performance becomes very random with big scenes. to Mai Lavelle (maiself).
Nov 4 2017, 8:37 AM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

The random render times are also in buildbots (but not in 2.79); this patch just makes it even more obvious. Render times vary between 29 and 102 seconds for a small border render of victor. So I reported the bug as T53249.

Nov 4 2017, 8:37 AM
mathieu menuet (bliblubli) added a comment to T53249: [regression] OpenCL performance becomes very random with big scenes..

Note that in all the above cases, system memory is used, as the scene doesn't fit in the dedicated 8GB of memory. So it doesn't seem to come from how the driver allocates the memory between dedicated and system memory.
Also, the power usage of the GPU was reduced to ensure no throttling happens; frequencies were stable during all the tests.

Nov 4 2017, 8:34 AM · BF Blender, Cycles
mathieu menuet (bliblubli) created T53249: [regression] OpenCL performance becomes very random with big scenes..
Nov 4 2017, 8:33 AM · BF Blender, Cycles
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

Can someone else confirm that with victor scene, the second render is much slower with OpenCL?

Nov 4 2017, 7:34 AM
mathieu menuet (bliblubli) added a comment to D2803: Cycles: add bevel shader, for raytrace based rounded edges..

This is quite odd, since the kernel compiles without the bevel node by default. Unless you added a bevel shader into the scene specifically to test?

Yes, I added a bevel shader to test whether we could remove the selective compilation, on all the scenes I tested. It would be OK for scenes that fit in memory, but it slows others down a lot.

Nov 4 2017, 7:32 AM

Nov 3 2017

mathieu menuet (bliblubli) added a comment to D2803: Cycles: add bevel shader, for raytrace based rounded edges..

Some scenes like Barcelona are even slightly faster with bevel, as it seems to remove the slowdowns on simple scenes from D2249. The times in general with latest D2249 + D2803 are the same as with master.
However, victor is 2x slower with bevel and D2249 compared to D2249 alone and 23% slower compared to master.
Could register pressure be further reduced on this patch? It worked really well with bicubic texture filtering.

Nov 3 2017, 8:28 PM
mathieu menuet (bliblubli) added a comment to D2803: Cycles: add bevel shader, for raytrace based rounded edges..

with D2249, render speed is the same with and without bevel on a Vega64 (just forced to be compiled with an unconnected node). I'm still testing Victor to see if scenes that use system memory get slowdowns from it.

Nov 3 2017, 8:08 PM
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

After testing on several scenes in my library, I only got good speedups for those not fitting in memory and slowdowns under 2% on smaller ones. So on the performance side, the benefit really outweighs the very small slowdowns, if any (many scenes were as fast).

Nov 3 2017, 7:42 PM
mathieu menuet (bliblubli) triaged T53246: Carve Boolean solver crash when joining meshes as Confirmed priority.

I had to hit tab a lot (spam it more than 10 times) after joining and got a crash indeed on win7 using latest master.

Nov 3 2017, 6:52 PM · Mesh Modeling, BF Blender
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

I do all my tests with lowered voltage. Temperature never goes above 72°C (max set at 80°C) and frequency is stable. So the timings are also reproducible, with differences under 1%.

Nov 3 2017, 6:32 PM
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

Some further results:

scene       slowdown on Vega64/win7
bmw         1%
classroom   <1%
koro        <1%
Barcelona   1%
Nov 3 2017, 6:16 PM
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

Victor goes from 23minutes to 17min40 to render with the patch. However, rendering a second time takes 43minutes, so something goes wrong. I guess the speedup comes from the reduced memory usage and something is not freed properly until Blender is closed.
Rendering after restarting Blender again gives 17min40 the first time and 43minutes the second time.

Nov 3 2017, 5:59 PM
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

A border render at low resolution of victor takes 1min20 of pure render time with master vs 3min08 with P553 with crimson drivers 17.10.2 on win7 and Vega64.

Nov 3 2017, 9:16 AM
mathieu menuet (bliblubli) added a comment to D2249: Cycles: Experiment with removing closures storage for certain evaluation.

I can also confirm the speed regression on Vega64/win7 with D2249. Where I expected speedups the most is in heavy scenes like victor.blend. But render time went from 23 minutes to 48 minutes. I still have to test Brecht's version.

Nov 3 2017, 8:26 AM

Oct 25 2017

mathieu menuet (bliblubli) added a comment to D2679: Cycles: discard unused Principled BSDF inputs at shader compile time..

This patch makes some scenes with volumes crash, like this one: https://blenderartists.org/forum/showthread.php?439394-The-new-Cycles-GPU-2-79-Benchmark

Oct 25 2017, 5:48 PM

Oct 21 2017

mathieu menuet (bliblubli) added a comment to T53117: Principled BSDF stuck on "updating shaders".

I had the same problem with 1080Ti and win7. It was on classroom scene, so it had nothing to do with principled shader. Killing the process, restarting Blender and relaunching the render was enough. No idea what happened as I was using the release UI without any debug info and I couldn't reproduce it since then. It happened just after opening the scene and hitting F12.

Oct 21 2017, 8:06 AM · Cycles, BF Blender
mathieu menuet (bliblubli) added a comment to T53119: Cycles rendering segfaults with OpenCL on AMD.

There is an AUR package that provides the proprietary OpenCL on top of the Mesa driver: https://aur.archlinux.org/packages/opencl-amd
And if you know what you are doing, you can compile the AMD staging Linux kernel with ROCm and a custom LLVM. It's fully open source.

Oct 21 2017, 8:03 AM · Cycles, BF Blender

Oct 19 2017

mathieu menuet (bliblubli) added a comment to T53097: OpenCL/GPU Memory Managment - Weird Scene, "Too complex to fit in gpu memory". Cycles, AMD GPU..

@Leo (.Pixel) you are welcome. Please report your times with and without HBCC. Some websites report a 15% speedup even when everything fits in VRAM. It would be interesting to have your results to compare.

Oct 19 2017, 2:51 PM · BF Blender
mathieu menuet (bliblubli) added a comment to T53097: OpenCL/GPU Memory Managment - Weird Scene, "Too complex to fit in gpu memory". Cycles, AMD GPU..

@Leo (.Pixel) the latest buildbot builds have the fix for this error. I just tried it and it renders without any tricks or simplification.

Oct 19 2017, 12:11 PM · BF Blender

Oct 12 2017

mathieu menuet (bliblubli) added a comment to D2873: Cycles: Allow rendering with GPUs an CPUs at once.

Viewport rendering of the BMW scene from the official benchmark pack takes 12 seconds on the 1080Ti, 20 seconds on the Vega64 and 16 seconds using both. With F12 render it's the opposite: the Vega is faster with 82 seconds (at 128x128, best time), the 1080Ti takes 93 seconds (at 16x16, best time) and both together take 44 seconds, using latest master with initial_num_samples at 5000.
To sum up:

  • Viewport seems really slow with OpenCL in latest master. 2.78c with selective node compilation for viewport renders nearly 2x faster on Vega 64. It's not due to SSS or volume, as those are not compiled in the viewport kernel either. I can investigate that.
  • Multi-device rendering is slower with viewport/progressive rendering than the fastest device alone. Logically it should only have to wait for the slowest half to finish, which would be around 10 seconds for Vega?
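
As a rough sanity check on these numbers, the sketch below assumes an idealized model in which the work divides perfectly between devices and there is no scheduling or update overhead. `equal_split_time` and `ideal_combined_time` are hypothetical helper names for this illustration, not Cycles functions.

```python
def equal_split_time(*solo_times):
    """Each device gets an equal share of the work; the slowest share decides."""
    n = len(solo_times)
    return max(t / n for t in solo_times)


def ideal_combined_time(*solo_times):
    """Work is split in proportion to device speed, so throughputs simply add."""
    return 1.0 / sum(1.0 / t for t in solo_times)


# Solo times in seconds, taken from the measurements in this comment.
viewport_equal = equal_split_time(12, 20)     # 1080Ti and Vega64, viewport
viewport_ideal = ideal_combined_time(12, 20)
f12_ideal = ideal_combined_time(93, 82)       # 1080Ti and Vega64, F12

print(f"viewport, equal split: {viewport_equal:.1f} s")
print(f"viewport, ideal split: {viewport_ideal:.1f} s")
print(f"F12, ideal split:      {f12_ideal:.1f} s")
```

Under this model the 44 seconds measured for F12 is close to the ideal 43.6 seconds, while the 16 seconds measured in the viewport is well above both the 10-second equal-split figure and the 7.5-second ideal.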
Oct 12 2017, 11:46 AM
mathieu menuet (bliblubli) added a comment to rBf61c340bc15e: Cycles: OpenCL bicubic and tricubic texture interpolation support..

If you or someone at BF/BI has a direct contact at AMD, maybe the best would be to report it as a bug if it works under Linux.

Oct 12 2017, 8:17 AM

Oct 11 2017

mathieu menuet (bliblubli) raised a concern with rBf61c340bc15e: Cycles: OpenCL bicubic and tricubic texture interpolation support..

The OpenCL kernel is about 20% slower on the BMW scene with this commit: from 1min33 to 1min50 on Vega64 using the latest driver on win7.

Oct 11 2017, 6:09 PM

Oct 10 2017

mathieu menuet (bliblubli) added reviewers for D2875: initial BVH8 implementation: Brecht Van Lommel (brecht), Hristo Gueorguiev (nirved), Lukas Stockner (lukasstockner97), Mai Lavelle (maiself).

Are the 10-15% for the total rendering time of production scenes or just for the intersection code?

Oct 10 2017, 7:37 PM
mathieu menuet (bliblubli) added a comment to D2873: Cycles: Allow rendering with GPUs an CPUs at once.

From tests made in the UI, the CPU is indeed always the last one to finish. The more threads a CPU has, the higher the probability that the GPUs will idle, because those 16 or 32 tiles are already being rendered very slowly by the CPU. So if it's possible without too much work, I would say it would be more effective in real scenarios to let all the CPU threads render one tile, just like all the threads of the GPU render one tile. It may also improve cache behaviour and increase render speed. Of course, letting the GPU render several tiles would still be needed to ensure better occupancy.
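
To illustrate the idling argument, here is a toy simulation, not Cycles' actual scheduler. It assumes a greedy tile queue, one GPU at 1 second per tile, and a 16-thread CPU where a single thread needs 32 seconds per tile and all threads together need 2 seconds per tile (perfect scaling). All of these timings and the `makespan` helper are made up for illustration.

```python
import heapq


def makespan(num_tiles, seconds_per_tile):
    """Greedy queue: each worker grabs the next tile as soon as it is free.

    seconds_per_tile holds one entry per worker.
    Returns the time at which the last tile finishes.
    """
    # Heap entries are (time the worker becomes free, its seconds per tile).
    free_at = [(0.0, t) for t in seconds_per_tile]
    heapq.heapify(free_at)
    finish = 0.0
    for _ in range(num_tiles):
        now, per_tile = heapq.heappop(free_at)
        done = now + per_tile
        finish = max(finish, done)
        heapq.heappush(free_at, (done, per_tile))
    return finish


# Hypothetical numbers, chosen only to make the effect visible.
TILES = 64
GPU = 1.0           # seconds per tile on the GPU
CPU_THREAD = 32.0   # seconds per tile on a single CPU thread
CPU_THREADS = 16

gpu_only = makespan(TILES, [GPU])
per_thread = makespan(TILES, [GPU] + [CPU_THREAD] * CPU_THREADS)
whole_cpu = makespan(TILES, [GPU, CPU_THREAD / CPU_THREADS])

print(f"GPU alone:                  {gpu_only:.0f} s")
print(f"GPU + one tile per thread:  {per_thread:.0f} s")
print(f"GPU + all threads per tile: {whole_cpu:.0f} s")
```

With these made-up numbers, the one-tile-per-thread layout finishes at 64 seconds, no faster than the GPU alone, because the last CPU tiles start at 32 seconds and hold the frame open while the GPU idles. Letting all CPU threads share one tile finishes at 43 seconds.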

Oct 10 2017, 8:33 AM

Oct 8 2017

mathieu menuet (bliblubli) added a comment to D2873: Cycles: Allow rendering with GPUs an CPUs at once.

replacing line 1374 with:

Oct 8 2017, 7:12 PM
mathieu menuet (bliblubli) added a comment to T53017: cuda disable openCL on windows.

It seems the multi-device patch works when the CPU is not selected. Note that in this case, selecting the CPU in either the CUDA or the OpenCL tab of the user preferences activates it in the other one, so maybe it conflicts somehow?

Oct 8 2017, 6:21 PM · BF Blender
mathieu menuet (bliblubli) added a comment to T53017: cuda disable openCL on windows.

Indeed, it works now :) Thanks for the solution, I think it's safe for master. Here are the diffs we spoke of on IRC:
To allow OpenCL to be selected when CUDA is present:


To render all samples at once on OpenCL to limit update overhead, but the bottleneck is somewhere else when rendering small tiles:

And the patch you proposed to render with CUDA, OpenCL and CPU together, but only the init phase works.

Oct 8 2017, 6:15 PM · BF Blender
mathieu menuet (bliblubli) added a comment to T53017: cuda disable openCL on windows.

full is made with "make.bat full" = no cubins
release is made with "make.bat release" = with cubins

Oct 8 2017, 9:40 AM · BF Blender

Oct 7 2017

mathieu menuet (bliblubli) created T53017: cuda disable openCL on windows.
Oct 7 2017, 2:55 PM · BF Blender

Oct 3 2017

mathieu menuet (bliblubli) awarded D2864: Sculpt pressure curvemapping a Love token.
Oct 3 2017, 9:00 AM · Sculpting and Painting

Oct 1 2017

mathieu menuet (bliblubli) added a comment to D2862: Cycles: schedule more work and prevent CUDA driver timeouts..

If the same behavior could be obtained on OpenCL, the next benchmark pack could have only one version with 32x32 tiles for CPU, CUDA and OpenCL. That would also spare the end user from playing with tile size. On OpenCL currently, only tile sizes between 64 and 256 render at about the same speed; 32 is much slower, and so is a single tile (by a two-digit percentage). It might also later allow a tile server that renders one frame with the CPU code path for CPUs, the CUDA code path for NVidia and the OpenCL code path for AMD, all using one tile size.

Oct 1 2017, 8:34 AM