Cycles GPU Compute baking often fails #59286
Labels
No Label
Interest
Alembic
Interest
Animation & Rigging
Interest
Asset Browser
Interest
Asset Browser Project Overview
Interest
Audio
Interest
Automated Testing
Interest
Blender Asset Bundle
Interest
BlendFile
Interest
Collada
Interest
Compatibility
Interest
Compositing
Interest
Core
Interest
Cycles
Interest
Dependency Graph
Interest
Development Management
Interest
EEVEE
Interest
EEVEE & Viewport
Interest
Freestyle
Interest
Geometry Nodes
Interest
Grease Pencil
Interest
ID Management
Interest
Images & Movies
Interest
Import Export
Interest
Line Art
Interest
Masking
Interest
Metal
Interest
Modeling
Interest
Modifiers
Interest
Motion Tracking
Interest
Nodes & Physics
Interest
OpenGL
Interest
Overlay
Interest
Overrides
Interest
Performance
Interest
Physics
Interest
Pipeline, Assets & IO
Interest
Platforms, Builds & Tests
Interest
Python API
Interest
Render & Cycles
Interest
Render Pipeline
Interest
Sculpt, Paint & Texture
Interest
Text Editor
Interest
Translations
Interest
Triaging
Interest
Undo
Interest
USD
Interest
User Interface
Interest
UV Editing
Interest
VFX & Video
Interest
Video Sequencer
Interest
Virtual Reality
Interest
Vulkan
Interest
Wayland
Interest
Workbench
Interest: X11
Legacy
Blender 2.8 Project
Legacy
Milestone 1: Basic, Local Asset Browser
Legacy
OpenGL Error
Meta
Good First Issue
Meta
Papercut
Meta
Retrospective
Meta
Security
Module
Animation & Rigging
Module
Core
Module
Development Management
Module
EEVEE & Viewport
Module
Grease Pencil
Module
Modeling
Module
Nodes & Physics
Module
Pipeline, Assets & IO
Module
Platforms, Builds & Tests
Module
Python API
Module
Render & Cycles
Module
Sculpt, Paint & Texture
Module
Triaging
Module
User Interface
Module
VFX & Video
Platform
FreeBSD
Platform
Linux
Platform
macOS
Platform
Windows
Priority
High
Priority
Low
Priority
Normal
Priority
Unbreak Now!
Status
Archived
Status
Confirmed
Status
Duplicate
Status
Needs Info from Developers
Status
Needs Information from User
Status
Needs Triage
Status
Resolved
Type
Bug
Type
Design
Type
Known Issue
Type
Patch
Type
Report
Type
To Do
No Milestone
No project
No Assignees
16 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: blender/blender#59286
Loading…
Reference in New Issue
No description provided.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
System Information
Operating system: Windows 10 Home 64
Graphics card: NVIDIA GeForce GTX 860M
Blender Version
Broken: 2.80 Beta (all)
Worked: 2.79b
Short description of error
When using
CUDA
andCycles
GPU Compute
to bake textures, it usually fails halfway through the process resulting in half-baked textures with horizontal lines through them, like the one displayed below.In each subsequent attempt, the bake fails immediately resulting in errors like below. It won't bake again unless I use
CPU
or restart Blender, but I can't recall it ever baking fully withGPU Compute
even after restarting.Exact steps for others to reproduce the error
CUDA
fromEdit > Preferences > System > General > Cycles Compute Device
Added subscriber: @DanSinni
#62233 was marked as duplicate of this issue
#61882 was marked as duplicate of this issue
#60630 was marked as duplicate of this issue
#61508 was marked as duplicate of this issue
#60121 was marked as duplicate of this issue
#59238 was marked as duplicate of this issue
Added subscriber: @zeauro
I can't reproduce with
21523b5
under ubuntu 16.04 with a titan black. Can you upload a .blend file with the issue ?Sure. Try this one. I also packed the resulting texture. If I try to bake it again, it fails and throws an error.
gpu_bake_bug.blend
No error. Correct baking, here.
It reminds to me problems at first attempts of hybrid rendering.
Does it work if you only use Graphic Card as CUDA device ?
Just gave that a try. No luck. Actually, it failed immediately.
This appears in the taskbar:
Added subscribers: @JamesOvnik, @alaslipknot
Added subscriber: @ostapblender
Added subscriber: @remotecrab131
Win 10 Pro, gtx 1080
First bake give me an image like the one OP demonstrated, with scan-line like effect, bad bake.
Second try give me nothing. And my viewport cycle straight out vanishes, give me only the GL layer where I can see the light objects and cameras, selection outline, etc.
It also throws an error: "CUDA error at cuCtxCreate: Illegal address", but then reports "Baking map saved to internal image, save it..."
Added subscriber: @dev369
Hell on not sure if this its the same issue but looks similar - baking doesnt work gtx 1070
here is the entire error log
https://pastebin.com/JLepKyff
Added subscriber: @Win7yes
Added subscriber: @ixd
Added subscriber: @royvandewater
Reproduced on a GTX 1070 in blender-2.80.0-git.445433a6913f-windows64 using the default cube scene and attempting to do a combined bake onto a 1024x1024 image with all default render settings.
https://pastebin.com/DgAp8hxg
Added subscriber: @John514
Added subscriber: @Minimalistiker
Added subscriber: @Fjury-4
Reproduced this issue in every comitted Blender file, latest blender-2.80.0-git.d3870471edd7-windows64, using my own Shaderstatue for Shader testing. All bakings fail before 10% got actually baked.
Message Bake stored internally please safe and trying to bake again results in the orange CUDA message " CUDA error: illegal adress".
GPU: 1080 GTX latest graphics driver
CPU: i5 4460
Windows 10 latest build{F6590069}
From my research this issue has been present for more then a year now. Its a pretty big issue, id say. So far devss kept complete radio silence on this issue, or i havent been able to find anywhere where it was anything said about it. Could devs say anything about it like, is it getting fixed at all and when? Why has it been neglected for so long, is this that hard to fix? Considering it has been around for such a long time.
So far we've not found a way to reproduce this yet, tested on multiple computers with different graphics cards.
It will get fixed, but it's one of 1000 other bugs in the tracker that we are working through.
Are there any ways to log those things in blender so u can get further details from users?
Added subscriber: @Rawalanche
Added subscribers: @rileybrown704, @WilliamReynish
Added subscriber: @bulbfish
Added subscriber: @poyodiaz
hi everyone , this bug has not been fixed right? is there a possibility in a close future? seems to work ok in ubuntu...is the only reason I need to reboot my pc... to bake a texture,(not trying to offend anyone I do really appreciate your hard work guys) thanks for all the great effort ! :)
Added subscriber: @JAlcaraz
Still can't confirm myself, but marking as high priority.
Added subscribers: @szymi, @brecht, @JacquesLucke
I tred it on
0ba143a1d6
- same thing, and it's also impossible to start baking again due to CUDA error, as I said in #60121 and it works only after restart of a BlenderBUT! New thing appeared - it looks different with different tile size. Looks like it can render only top half of each tile
Target resolution: 1024x1024
Tile 64:
Tile 256:
Tile 512:
Tile 1024 and upwards:
CubeBake.blend1
Added subscribers: @Ureh, @ZedDB
I was able to redo this on a GTX 960 now. The exact cause is not clear to me, but it seems that CUDA 10.1 works correctly. Possibly it's the same compiler bug(s) we've reported to NVIDIA before and that they fixed in 10.0 and 10.1.
This issue was referenced by blender/cycles@bbd8cb7582
This issue was referenced by
65d95879f7
Changed status from 'Open' to: 'Resolved'
@brecht A bit too late, but I did some research on this (just about 2 hours before it got fixed lol) trying to compile bake kernel with various CUDA versions (9.1, 10.0, 10.1) and optimizations. Here are the results:
OS: Win 10, GPU: GTX 1070
Legend: OK = bakes well, ERROR = cuda errors while baking
"Weird dots" image from NVRTC 9.1 & ptxas 10.0:
And cycles_cubin_cc (which was used to compile cuda kernels shipped with buildbot) did
NVRTC 9.1 & ptxas 9.1 with optimization
. So it seems to me thatnvrtc 9.1
andptxas 9.1
and dont play nice together on some gpu's (butnvcc 9.1 & ptxas 9.1
works good). Latest NVIDIA driver jit compilesnvrtc 9.1 ptx
kernel just fine.Running
cuda_memcheck blender.exe
shows:And after further investigation it shows something like a stack corruption on a random thread is happening in
And all subsequent calls to CUDA api return an error. (cuCtxSynchronize, etc)
It seems to me it's an optimization codegen bug with nvrtc 9.1 (and possible in 10.0).
CUDA 10.1 resolves these issues (and we can use the latest msvc with it!).
Tested on this file (default cube + texture):
bake_bug_2.blend
@ixd, interesting tests. There is indeed an optimization codegen bug that we reported and that was fixed in 10.1.