Cycles HIP error with image textures on Linux and RDNA1 #97591
Reference: blender/blender#97591
System Information
Operating system: Arch Linux
Graphics card: AMD 5700XT
Blender Version
Broken: Blender 3.2 alpha
c486da0238
Worked: never
Short description of error
Trying to render a scene with HIP support enabled crashes Blender during the "Updating images" stage. Scenes without image textures (e.g. the default cube, but also elaborate scenes with complex procedural shaders) render perfectly fine. Tested with the open source ROCm stack from rocm-arch and the official binaries provided by AMD, on kernels 5.15.35-lts, 5.17.4-zen1 and 5.17.4-xanmod1. Also tested with the first HIP-enabled nightly build, with the same result. Because a user on the Blender forum reported success on Xorg, I tested on both GNOME/Xorg and GNOME/Wayland. Looking through the thread again, it seems all successful reports come from users with RDNA2 GPUs, while the only other failure was reported by another 5700XT owner. The last few lines from the console log:
And the crash log:
Exact steps for others to reproduce the error
Render anything with an image texture with HIP and GPU compute enabled. I used the BMW benchmark to create the logs.
Added subscriber: @wsippel
#100711 was marked as duplicate of this issue
#98900 was marked as duplicate of this issue
#98859 was marked as duplicate of this issue
Added subscribers: @JacquesLucke, @Jeroen-Bakker, @iss
@Jeroen-Bakker, @JacquesLucke Can you reproduce? According to the HW list you have access to this GPU.
Added subscriber: @Luciddream
Added subscriber: @tschipie
Closed as duplicate of #97997
Changed status from 'Duplicate' to: 'Confirmed'
Added subscribers: @BrianSavery, @Sayak-Biswas
Removed subscriber: @JacquesLucke
Changed title from "Address boundary error rendering with HIP on Linux, maybe RDNA1 specific" to "Cycles HIP error with image textures on Linux and RDNA1"
Added subscriber: @brecht
This report was merged into a bug report about Windows, where a driver update fixed the issue. However the Linux driver is quite different, and I have not seen confirmation yet that it's fixed on Linux.
Marking this as a high priority issue since we really should try to fix this for 3.2.
It happens on blender-3.2.0-beta+v32.84e55e3dc251-linux.x86_64-release on Ubuntu 20.04, ROCm 5.1.3 and a 5600XT (RDNA1).
If you need any specific tests, logs or information please let me know.
[edit]
It doesn't crash with all image textures: the splash screen from v2.81 (The Junk Shop) renders without a problem, despite having quite a lot of image textures.
Interesting. For me, just adding an image texture node to an active material (with no texture actually loaded), and connecting the color output of the texture node to the Principled BSDF color input, crashes Blender immediately if I use Cycles with GPU compute enabled as my viewport renderer.
Adding something else, a Voronoi for example, as color source works just fine, HIP-accelerated raytracing and all.
Can confirm, the Junk Shop scene works. I even triple checked with logging and radeontop to make sure it's really using HIP - it is. No issues with the hardware accelerated Cycles viewport either.
I was able to reproduce this issue with a 5700XT + Ubuntu 20.04 + ROCm 5.1.2 with the BMW and Classroom scenes. I'm looking into this.
As far as I can tell, the crash happens if a scene uses textures with a horizontal resolution that isn't a multiple of 128. Any random value and any value below 128 I've tested crashes, but 128, 256, 384, 512, 768, 1024, 1536, 1664, 2048 and 4096 all worked perfectly fine. The vertical resolution doesn't matter.
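The observation above can be turned into a quick check. This is only a minimal sketch in plain Python, assuming the multiple-of-128 rule reported here really is the trigger; the names `ALIGNMENT`, `width_is_aligned` and `next_aligned_width` are made up for illustration and are not part of Blender, Cycles or HIP.

```python
# Hypothetical helpers, based only on the widths reported in this thread.
# Assumption: a texture is "safe" when its horizontal resolution is a
# multiple of 128; the vertical resolution is ignored.
ALIGNMENT = 128


def width_is_aligned(width: int, alignment: int = ALIGNMENT) -> bool:
    """Return True if width is a positive multiple of alignment."""
    return width > 0 and width % alignment == 0


def next_aligned_width(width: int, alignment: int = ALIGNMENT) -> int:
    """Round width up to the nearest multiple of alignment."""
    if width <= 0:
        raise ValueError("width must be positive")
    # Ceiling division using integers only.
    return -(-width // alignment) * alignment


if __name__ == "__main__":
    for w in (64, 100, 128, 1000, 1536, 1920):
        print(w, width_is_aligned(w), next_aligned_width(w))
```

Running it over the widths of the image textures in a scene would show which ones fall outside the pattern, and what width they would need to be padded or rescaled to.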
I looked into adding a workaround for 3.2 that would rescale textures automatically, but it's turning out to be rather complicated and a risky change this close to the release. I think it's more likely we'll wait for a driver fix and release with a warning in the release notes, unless @Sayak-Biswas or @BrianSavery think this is going to take a long time to fix in the driver.
Added subscriber: @niobium93
Added subscriber: @Inko
Added subscribers: @TeryakiiSauce, @PratikPB2123, @Alaska
Added subscriber: @Takuro-Shoji
Added subscriber: @Caden-Mitchell
This issue doesn't seem to be RDNA1 specific after all: a Vega user on the forum reported the same problem. Junk Shop (which uses power-of-two textures) renders just fine, while other scenes crash in hipTexObjectCreate.
Update on this issue: there is a fix for this in the driver and it should be available in ROCm 5.3.0.
Added subscribers: @io7m, @ThomasDinges
I can confirm that the issue is indeed resolved with ROCm 5.3. I'd close the task, but I guess we should wait for confirmation from a Vega user?
Changed status from 'Confirmed' to: 'Resolved'
ROCm 5.3 is out now, and it was confirmed this is fixed.