@OrsoEric
Last active March 6, 2025 18:24
7900XTX Adrenalin 24 WSL2 Ubuntu22 Driver HIP5.7/6.2/6.3 A1111
https://rocm.docs.amd.com/projects/radeon/en/latest/docs/install/wsl/install-radeon.html
wsl --install
takes forever then stuck at 0%
wsl --list --online
NAME                            FRIENDLY NAME
SUSE-Linux-Enterprise-15-SP5    SUSE Linux Enterprise 15 SP5
SUSE-Linux-Enterprise-15-SP6    SUSE Linux Enterprise 15 SP6
Ubuntu                          Ubuntu
Ubuntu-24.04                    Ubuntu 24.04 LTS
kali-linux                      Kali Linux Rolling
openSUSE-Tumbleweed             openSUSE Tumbleweed
openSUSE-Leap-15.6              openSUSE Leap 15.6
Debian                          Debian GNU/Linux
Ubuntu-18.04                    Ubuntu 18.04 LTS
Ubuntu-20.04                    Ubuntu 20.04 LTS
Ubuntu-22.04                    Ubuntu 22.04 LTS
OracleLinux_7_9                 Oracle Linux 7.9
OracleLinux_8_7                 Oracle Linux 8.7
OracleLinux_9_1                 Oracle Linux 9.1
wsl --install -d Ubuntu-22.04
stuck at 90%
restart
install from the Windows Store instead
seems to work
user: soraka
pwd: soraka
sudo apt update
sudo apt upgrade
wget https://repo.radeon.com/amdgpu-install/6.2.3/ubuntu/jammy/amdgpu-install_6.2.60203-1_all.deb
sudo apt install ./amdgpu-install_6.2.60203-1_all.deb
Fetched 303 kB in 0s (618 kB/s)
Selecting previously unselected package amdgpu-install.
(Reading database ... 42578 files and directories currently installed.)
Preparing to unpack .../amdgpu-install_6.2.60203-1_all.deb ...
Unpacking amdgpu-install (6.2.60203-2044426.22.04) ...
Selecting previously unselected package dialog.
Preparing to unpack .../dialog_1.3-20211214-1_amd64.deb ...
Unpacking dialog (1.3-20211214-1) ...
Setting up dialog (1.3-20211214-1) ...
Setting up amdgpu-install (6.2.60203-2044426.22.04) ...
Processing triggers for man-db (2.10.2-1) ...
N: Download is performed unsandboxed as root as file '/home/sorak
sudo amdgpu-install --list-usecase
takes a while
rocminfo
WSL environment detected.
=====================
HSA System Attributes
=====================
Runtime Version: 1.1
Runtime Ext Version: 1.6
System Timestamp Freq.: 1000.000000MHz
Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model: LARGE
System Endianness: LITTLE
Mwaitx: DISABLED
DMAbuf Support: NO
==========
HSA Agents
==========
*******
Agent 1
*******
Name: CPU
Uuid: CPU-XX
Marketing Name: CPU
Vendor Name: CPU
Feature: None specified
Profile: FULL_PROFILE
Float Round Mode: NEAR
Max Queue Number: 0(0x0)
Queue Min Size: 0(0x0)
Queue Max Size: 0(0x0)
Queue Type: MULTI
Node: 0
Device Type: CPU
Cache Info:
Chip ID: 0(0x0)
Cacheline Size: 64(0x40)
Internal Node ID: 0
Compute Unit: 24
SIMDs per CU: 0
Shader Engines: 0
Shader Arrs. per Eng.: 0
Memory Properties:
Features: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED
Size: 16291340(0xf8960c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 2
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 16291340(0xf8960c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
ISA Info:
*******
Agent 2
*******
Name: gfx1100
Marketing Name: AMD Radeon RX 7900 XTX
Vendor Name: AMD
Feature: KERNEL_DISPATCH
Profile: BASE_PROFILE
Float Round Mode: NEAR
Max Queue Number: 16(0x10)
Queue Min Size: 4096(0x1000)
Queue Max Size: 131072(0x20000)
Queue Type: MULTI
Node: 1
Device Type: GPU
Cache Info:
L1: 32(0x20) KB
L2: 6144(0x1800) KB
L3: 98304(0x18000) KB
Chip ID: 29772(0x744c)
Cacheline Size: 64(0x40)
Max Clock Freq. (MHz): 2482
Internal Node ID: 1
Compute Unit: 96
SIMDs per CU: 2
Shader Engines: 6
Shader Arrs. per Eng.: 2
Coherent Host Access: FALSE
Memory Properties:
Features: KERNEL_DISPATCH
Fast F16 Operation: TRUE
Wavefront Size: 32(0x20)
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Max Waves Per CU: 32(0x20)
Max Work-item Per CU: 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 4294967295(0xffffffff)
y 4294967295(0xffffffff)
z 4294967295(0xffffffff)
Max fbarriers/Workgrp: 32
Packet Processor uCode:: 2280
SDMA engine uCode:: 21
IOMMU Support:: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 25079976(0x17eb0a8) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:2048KB
Alloc Alignment: 4KB
Accessible by all: FALSE
Pool 2
Segment: GROUP
Size: 64(0x40) KB
Allocatable: FALSE
Alloc Granule: 0KB
Alloc Recommended Granule:0KB
Alloc Alignment: 0KB
Accessible by all: FALSE
ISA Info:
ISA 1
Name: amdgcn-amd-amdhsa--gfx1100
Machine Models: HSA_MACHINE_MODEL_LARGE
Profiles: HSA_PROFILE_BASE
Default Rounding Mode: NEAR
Default Rounding Mode: NEAR
Fast f16: TRUE
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 4294967295(0xffffffff)
y 4294967295(0xffffffff)
z 4294967295(0xffffffff)
FBarrier Max Size: 32
*** Done ***
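The part of that output that matters later is the gfx target of the GPU agent (gfx1100 here). A minimal sketch for pulling it out of the wall of text; `gfx_targets` is a made-up helper name, not part of ROCm:

```shell
# Hypothetical helper: read rocminfo output on stdin and print the
# distinct gfx target names it mentions, one per line.
gfx_targets() {
  grep -oE 'gfx[0-9a-f]+' | sort -u
}
```

Usage would be `rocminfo | gfx_targets`. The ISA name line (amdgcn-amd-amdhsa--gfx1100) contains the same target, which is why the result is deduplicated.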
soraka@TowerOfBabel:~$ python3 --version
Python 3.10.12
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-AMD-GPUs/c6d75545e3c080addcf9494de9c6527c484aba03
soraka@TowerOfBabel:~$
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
Cloning into 'stable-diffusion-webui'...
remote: Enumerating objects: 34903, done.
remote: Counting objects: 100% (3/3), done.
remote: Compressing objects: 100% (3/3), done.
remote: Total 34903 (delta 0), reused 0 (delta 0), pack-reused 34900 (from 2)
Receiving objects: 100% (34903/34903), 35.47 MiB | 8.36 MiB/s, done.
Resolving deltas: 100% (24379/24379), done.
soraka@TowerOfBabel:~$ cd stable-diffusion-webui
sudo apt install python3.10-venv
python3 -m venv myenv
source myenv/bin/activate
python3 -m pip install --upgrade pip wheel
TORCH_COMMAND='pip install torch torchvision --extra-index-url https://download.pytorch.org/whl/rocm5.1.1' python3 launch.py --precision full --no-half
NO!!!! THIS IS NVIDIA!!!
delete all and restart
sudo rm -r stable-diffusion-webui
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
sudo chmod +x webui-user.sh
./webui-user.sh
NO!!!
./webui.sh
it does something with PyTorch, but it still looks like the CUDA version?
pip3 install -r requirements.txt
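To answer the "is this still the CUDA version?" question without guessing, one option is to look at the version string of the installed torch. This is only a sketch: `torch_build_kind` is a hypothetical helper, and it assumes the wheel carries the usual local version suffix (+rocm..., +cu..., +cpu), which may not hold for every build.

```shell
# Hypothetical helper: classify a torch version string by its local
# version suffix. Assumes suffixes like +rocm6.2.3, +cu121 or +cpu.
torch_build_kind() {
  case "$1" in
    *+rocm*)    echo rocm ;;
    *+cu[0-9]*) echo cuda ;;
    *+cpu*)     echo cpu ;;
    *)          echo unknown ;;
  esac
}
```

Inside the venv it could be called as `torch_build_kind "$(python3 -c 'import torch; print(torch.__version__)')"`. A more direct check is `python3 -c 'import torch; print(torch.version.hip)'`, which prints None on a build without HIP.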
__________________________________________________________________________________________________________________
skip everything until ROCm, because ROCm on WSL should already be working
sudo apt update
sudo apt install git python3-pip python3-venv python3-dev libstdc++-12-dev
sudo apt update
wget https://repo.radeon.com/amdgpu-instal...
sudo apt install ./amdgpu-install_5.7.50701-1_all.deb
sudo amdgpu-install --usecase=graphics,rocm
sudo usermod -aG video $USER
sudo usermod -aG render $USER
sudo reboot
After you have rebooted, inside terminal again
make a directory for stable diffusion
mkdir sd
If you get a permission denied then use
sudo mkdir sd
Skip this step if you did not have to use sudo
change owner to be yourself
sudo chown -R $USER:$USER sd/
change into new directory
cd sd
clone automatic1111
git clone https://github.com/AUTOMATIC1111/stab...
cd stable-diffusion-webui
setup python venv
python3 -m venv venv
source venv/bin/activate
install packages
pip3 install -r requirements.txt
uninstall cuda torch
pip3 uninstall torch torchvision
install torch ROCm
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.6
create launch script:
sudo nano launch.sh
sudo chmod +x launch.sh
inside the launch file paste the contents of this url:
https://github.com/jhodak/linux-confi...
to launch SD
bash launch.sh
optional parameters for your launch script:
--no-half --precision full --upcast-sampling --no-half-vae --medvram
If you don't need these extra parameters I would recommend not to turn them on. You may only need one or two of these parameters.
If you do not have an 11.0.0 graphics card, you may need to try:
HSA_OVERRIDE_GFX_VERSION=10.3.0
If you have an AMD integrated GPU, and you are having troubles, you may need to change:
HIP_VISIBLE_DEVICES=1
If you are having some memory troubles try changing garbage collection threshold in launch script to 0.9
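The "11.0.0" above is the same thing rocminfo reports as gfx1100. A rough sketch of the mapping, assuming the target name is the major version followed by one hex digit each for minor and stepping (`gfx_to_version` is a hypothetical helper, not a ROCm tool):

```shell
# Hypothetical helper: turn a gfx target name into the dotted form used by
# HSA_OVERRIDE_GFX_VERSION. Assumes the last two characters are hex digits
# for minor and stepping, and everything before them is the major version.
gfx_to_version() (
  t="${1#gfx}"                      # strip the "gfx" prefix
  step="${t#"${t%?}"}"              # last character
  rest="${t%?}"                     # everything but the last character
  minor="${rest#"${rest%?}"}"       # second-to-last character
  major="${rest%?}"                 # remaining leading digits
  printf '%s.%d.%d\n' "$major" "0x$minor" "0x$step"
)
```

With the card in these notes, `gfx_to_version gfx1100` gives 11.0.0, so no override should be needed; the 10.3.0 value corresponds to gfx1030.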
HIP version: 6.2.41134-65d174c3e
==hipconfig
HIP_PATH :/opt/rocm-6.2.3
ROCM_PATH :/opt/rocm-6.2.3
HIP_COMPILER :clang
HIP_PLATFORM :amd
HIP_RUNTIME :rocclr
CPP_CONFIG : -D__HIP_PLATFORM_HCC__= -D__HIP_PLATFORM_AMD__= -I/opt/rocm-6.2.3/include -I/include
==hip-clang
HIP_CLANG_PATH :/opt/rocm-6.2.3/lib/llvm/bin
AMD clang version 18.0.0git (https://github.com/RadeonOpenCompute/llvm-project roc-6.2.3 24355 77cf9ad00e298ed06e06aec0f81009510f545714)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/rocm-6.2.3/lib/llvm/bin
Configuration file: /opt/rocm-6.2.3/lib/llvm/bin/clang++.cfg
AMD LLVM version 18.0.0git
Optimized build.
Default target: x86_64-unknown-linux-gnu
Host CPU: alderlake
Registered Targets:
amdgcn - AMD GCN GPUs
r600 - AMD GPUs HD2XXX-HD6XXX
x86 - 32-bit X86: Pentium-Pro and above
x86-64 - 64-bit X86: EM64T and AMD64
hip-clang-cxxflags : -O3
hip-clang-ldflags : --driver-mode=g++ -O3 --hip-link
== Environment Variables
PATH =/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/snap/bin
== Linux Kernel
Hostname : TowerOfBabel
Linux TowerOfBabel 5.15.167.4-microsoft-standard-WSL2 #1 SMP Tue Nov 5 00:21:55 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 22.04.5 LTS
Release: 22.04
Codename: jammy
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/rocm5.2
maybe i need rocm 6.3
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/rocm6.3
pip install torch torchvision torchaudio --extra-index-url https://repo.radeon.com/rocm/manylinux/rocm-rel-6.3
https://repo.radeon.com/rocm/manylinux/rocm-rel-6.2.3/
this one actually has a listing:
Release notes
For information on available ROCm releases, please refer to the ROCm Release Notes
For information on available Radeon Software for Linux releases, please refer to Linux® Drivers for AMD Radeon™ and Radeon PRO™ Graphics.
Index of /rocm/manylinux/rocm-rel-6.2.3/
../
onnxruntime_rocm-1.18.0-cp310-cp310-linux_x86_6..> 28-Sep-2024 01:01 164303365
onnxruntime_training-1.18.0+rocm623.124-cp310-c..> 28-Sep-2024 02:40 172917965
pytorch_triton_rocm-2.3.0+rocm6.2.3.5a02332983-..> 28-Sep-2024 12:00 246086517
tensorflow_rocm-2.16.2-cp310-cp310-manylinux_2_..> 08-Oct-2024 16:12 514072535
torch-2.3.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl 28-Sep-2024 12:00 3910058413
torchvision-0.18.0+rocm6.2.3-cp310-cp310-linux_..> 28-Sep-2024 12:01 75033465
i can download the wheels and install them without downloading any tunme
wget https://repo.radeon.com/rocm/manylinux/rocm-rel-6.2.3/torch-2.3.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
wget https://repo.radeon.com/rocm/manylinux/rocm-rel-6.2.3/torchvision-0.18.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
pip install torch-2.3.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
pip install torchvision-0.18.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
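These wheels are all cp310 builds, which matches the Python 3.10.12 found earlier. A small sketch for reading the Python tag out of a wheel filename before downloading gigabytes; `wheel_python_tag` is a hypothetical helper, and it relies on the standard wheel naming where the Python tag is the third field from the end:

```shell
# Hypothetical helper: print the Python tag (e.g. cp310) of a wheel file.
# Wheel names end in -<python tag>-<abi tag>-<platform tag>.whl.
wheel_python_tag() {
  basename "$1" .whl | awk -F- '{ print $(NF-2) }'
}
```

The interpreter's own tag can be printed with `python3 -c 'import sys; print("cp%d%d" % sys.version_info[:2])'` and compared against the result.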
(venv) soraka@TowerOfBabel:~/stable-diffusion-webui$ pip install torch-2.3.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
Processing ./torch-2.3.0+rocm6.2.3-cp310-cp310-linux_x86_64.whl
Requirement already satisfied: filelock in ./venv/lib/python3.10/site-packages (from torch==2.3.0+rocm6.2.3) (3.17.0)
Requirement already satisfied: typing-extensions>=4.8.0 in ./venv/lib/python3.10/site-packages (from torch==2.3.0+rocm6.2.3) (4.12.2)
Collecting sympy<=1.12.1 (from torch==2.3.0+rocm6.2.3)
Using cached sympy-1.12.1-py3-none-any.whl.metadata (12 kB)
Requirement already satisfied: networkx in ./venv/lib/python3.10/site-packages (from torch==2.3.0+rocm6.2.3) (3.4.2)
Requirement already satisfied: jinja2 in ./venv/lib/python3.10/site-packages (from torch==2.3.0+rocm6.2.3) (3.1.5)
Requirement already satisfied: fsspec in ./venv/lib/python3.10/site-packages (from torch==2.3.0+rocm6.2.3) (2025.2.0)
INFO: pip is looking at multiple versions of torch to determine which version is compatible with other requirements. This could take a while.
ERROR: Could not find a version that satisfies the requirement pytorch-triton-rocm==2.3.0+rocm6.2.3.5a02332983 (from torch) (from versions: 0.0.1, 0.0.2, 2.0.0.dev20230209, 2.0.0.dev20230210, 2.0.0.dev20230211, 2.0.0.dev20230218, 2.0.0.dev20230227, 2.0.0.dev20230228, 2.0.0.dev20230307, 2.0.0.dev20230308, 2.0.0, 2.0.1, 2.0.2.dev20230401, 2.0.2.dev20230402, 2.0.2, 2.1.0.dev20230402, 2.1.0)
ERROR: No matching distribution found for pytorch-triton-rocm==2.3.0+rocm6.2.3.5a02332983
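The error says pip looked for pytorch-triton-rocm on the default index only, while the matching wheel sits in the same repo.radeon.com directory as torch. One thing that might work (untested here, so treat it as an assumption) is pointing pip at that flat directory with --find-links so it can resolve the pinned dependency from there. `rocm_pip_command` is a hypothetical helper that only prints the command:

```shell
# Hypothetical helper: print (not run) a pip command that also searches the
# flat wheel directory of a given ROCm release on repo.radeon.com.
# Assumption: pip can parse that directory listing via --find-links.
rocm_pip_command() (
  release="$1"    # ROCm release directory, e.g. 6.2.3
  shift           # remaining arguments are the requirement specifiers
  printf 'pip install --find-links https://repo.radeon.com/rocm/manylinux/rocm-rel-%s/ %s\n' "$release" "$*"
)
```

For the wheels above that would be `rocm_pip_command 6.2.3 torch==2.3.0+rocm6.2.3 torchvision==0.18.0+rocm6.2.3`; review the printed line, then run it inside the venv.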
It's obvious I can't accelerate A1111
https://vladmandic.github.io/sdnext-docs/Installation/
git clone https://github.com/vladmandic/sdnext
first I try with Ubuntu 22 and ROCm 6.2.3
next I'll try with Ubuntu 24 and ROCm 6.3
https://vladmandic.github.io/sdnext-docs/AMD-ROCm/
sudo usermod -a -G render,video $LOGNAME
https://vladmandic.github.io/sdnext-docs/CLI-Arguments/
./webui.sh --use-rocm
IT SEEMS TO WORK!!!
go to File Explorer and save a model in:
\\wsl.localhost\Ubuntu-22.04\
\\wsl.localhost\Ubuntu-22.04\home\soraka\sdnext\models\Stable-diffusion
it's vastly better, tried with DreamShaper 7 at 768px
one stage is fast enough at 2.5it/s
but the next stage takes tens of seconds
Some pieces, like the VAE, aren't accelerated
I should just destroy Ubuntu 22 and retry with Ubuntu 24
pip install torch torchvision torchaudio --extra-index-url https://repo.radeon.com/rocm/manylinux/rocm-rel-6.2.3/