Execution is stuck until termination when running on CUDA in WSL2
#538 opened on Jun 2, 2024
Description
Reproducing the behavior
The issue
Hi,
I am having issues running code with bend run-cu on CUDA inside the WSL2. There are not errors, but code is not executing either. Execution is frozen, similary to while (true) {}.
Code executes without any issues when using bend run or bend run-c
Compiling with bend gen-cu and nvcc has the same result.
I've tried both 12.4 and 12.5 and result is the same.
What I attempted
I tried different code examples from the repo, but result is always the same. Since bend allowed to compile code to cuda with gen-cu, I tried to find what is broken inside generated file (assuming bend run-cu will use the same code). This issue happened inside gnet_normalize function, where code could never exit the for loop. This break is never callen (rlen always has the same value)
CUDA verification
Just to rule CUDA out, I have successfully installed CUDA and can confirm it is recognised by compiling and running this code.
cuda-test.cu:
#include <cuda.h>
#include <stdio.h>
int main(int argc, char** argv) {
int driver_version = 0, runtime_version = 0;
cudaDriverGetVersion(&driver_version);
cudaRuntimeGetVersion(&runtime_version);
printf(
"Driver Version: %d\nRuntime Version: %d\n",
driver_version,
runtime_version
);
return 0;
}
output
~> nvcc cuda-test.cu -o cuda-test && ./cuda-test
Driver Version: 10010
Runtime Version: 12040
nvidia-smi
Calling from wsl
> nvidia-smi.exe
Sun Jun 2 16:32:08 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 552.44 Driver Version: 552.44 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3070 WDDM | 00000000:08:00.0 On | N/A |
| 49% 56C P3 41W / 220W | 6496MiB / 8192MiB | 36% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 3832 C+G ...oogle\Chrome\Application\chrome.exe N/A |
| 0 N/A N/A 8292 C+G C:\Windows\explorer.exe N/A |
| 0 N/A N/A 21004 C+G ...\cef\cef.win7x64\steamwebhelper.exe N/A |
| 0 N/A N/A 21416 C+G ...wekyb3d8bbwe\XboxGameBarWidgets.exe N/A |
| 0 N/A N/A 27344 C+G ...ience\NVIDIA GeForce Experience.exe N/A |
....
+-----------------------------------------------------------------------------------------+
System Settings
Example:
- HVM: 2.0.18
- Bend: 0.2.27
- OS: Ubuntu 20.04.6 LTS
- WSL: 2.1.5.0
- CPU: AMD Ryzen 9 5900X
- GPU: RTX 3070
- Cuda Version: 12.5, V12.5.40
Additional context
No response