gpu hotplug doesn't work #8149
Comments
That's a wlroots bug. It's weird that
Would you be able to get a stack trace?
How should I get a stack trace? A stack trace of what?
of the value being given to
Please give me instructions for getting a stack trace.
See the build instructions in the repo to build in debug mode, then start sway in a gdb session over SSH from another computer.
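A rough sketch of what that could look like, in case it helps. It assumes a sway checkout built with meson and ninja, and it attaches gdb to an already running sway from the SSH session rather than starting sway under gdb. The helper names build_debug and sway_backtrace are made up for illustration, and the function to break on is a placeholder you have to supply yourself.

```shell
# Build a debug binary; run this inside the sway source tree.
build_debug() {
    meson setup build/ --buildtype=debug &&
    ninja -C build/
}

# Attach gdb to a running sway, stop at the given function, print a backtrace.
# $1: PID of sway (for example from: pidof sway)
# $2: function to break on (placeholder, not taken from this thread)
# GDB can be overridden, which is handy for checking the command line.
sway_backtrace() {
    pid=$1
    func=$2
    "${GDB:-gdb}" -p "$pid" -batch \
        -ex "break $func" -ex "continue" -ex "bt full"
}
```

From the SSH session, something like sway_backtrace "$(pidof sway)" some_function would then wait until that function is hit, print the backtrace, and exit.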
Should I add debug flags to wlroots only?
Device and Software Info:
sway version 1.9
dgpu_add:
echo 1 | sudo tee /sys/bus/pci/rescan
dgpu_remove:
echo "0000:03:00.0" | sudo tee /sys/bus/pci/devices/0000:03:00.0/driver/unbind && echo 1 | sudo tee /sys/bus/pci/devices/0000:03:00.0/remove
udev_trigger:
sudo udevadm trigger --verbose --type=devices --action=remove --subsystem-match=drm --property-match="MINOR=0"
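For anyone who wants to repeat these steps, the three commands above can be wrapped in small shell functions. This is only a sketch: the PCI_ADDR, DRM_MINOR and DRY_RUN variables and the sysfs_write helper are additions for illustration, with defaults taken from the commands in this report. With DRY_RUN=1 the commands are printed instead of executed.

```shell
# PCI address of the dgpu; the default is the one used in this report.
PCI_ADDR="${PCI_ADDR:-0000:03:00.0}"

# Write a value to a sysfs file as root, or only print the command (DRY_RUN=1).
sysfs_write() {
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "echo $1 | sudo tee $2"
    else
        echo "$1" | sudo tee "$2" > /dev/null
    fi
}

# Rescan the PCI bus so that the removed dgpu is detected again.
dgpu_add() {
    sysfs_write 1 /sys/bus/pci/rescan
}

# Unbind the driver, then remove the device from the PCI bus.
dgpu_remove() {
    sysfs_write "$PCI_ADDR" "/sys/bus/pci/devices/$PCI_ADDR/driver/unbind" &&
    sysfs_write 1 "/sys/bus/pci/devices/$PCI_ADDR/remove"
}

# Send a synthetic "remove" uevent for the DRM node with the given minor.
udev_trigger() {
    set -- udevadm trigger --verbose --type=devices --action=remove \
        --subsystem-match=drm --property-match="MINOR=${DRM_MINOR:-0}"
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "sudo $*"
    else
        sudo "$@"
    fi
}
```

Running the functions once with DRY_RUN=1 shows exactly what would be executed before anything touches the device.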
Bug report:
lsof_before shows the output of sudo lsof /dev/dri/card* before udev_trigger. After udev_trigger, there seems to be one thread(?) still using the file /dev/dri/card0 (see lsof_after).
After dgpu_remove and dgpu_add, the dgpu comes back up with a different card number (card1), tries to initialize the DRM backend, and fails to initialize the EGL context. After removing and adding the dgpu again, it comes back with yet another card number (card3) and again fails to initialize the EGL context with this error:
[ERROR] [wlr] [EGL] command: eglQueryDeviceStringEXT, error: EGL_BAD_PARAMETER (0x300c), message: "eglQueryDeviceStringEXT"
[ERROR] [wlr] [EGL] command: eglQueryDeviceStringEXT, error: EGL_BAD_PARAMETER (0x300c), message: "eglQueryDeviceStringEXT"
amdgpu_device_initialize: amdgpu_get_auth (2) failed (-1)
amdgpu: amdgpu_device_initialize failed.
This happens both times.
Closing the sway session and then removing and adding the dgpu from a tty does reset the dgpu card number to 0 or 1.
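To keep track of which card number the dgpu ends up with after each remove/add cycle, a small helper along these lines can be used. It is a sketch that assumes the usual sysfs layout, where /sys/class/drm/cardN/device links to the PCI device directory. The function name list_drm_cards and its optional directory argument are made up for illustration.

```shell
# Print "cardN <PCI address>" for every DRM card node.
# $1: DRM class directory to scan (defaults to /sys/class/drm).
list_drm_cards() {
    drm_dir="${1:-/sys/class/drm}"
    for card in "$drm_dir"/card*; do
        name=$(basename "$card")
        # Skip connector entries such as card0-HDMI-A-1.
        case "$name" in *-*) continue ;; esac
        # Skip unmatched globs and nodes without a device link.
        [ -e "$card/device" ] || continue
        # The device link resolves to the PCI device directory.
        pci=$(basename "$(readlink -f "$card/device")")
        echo "$name $pci"
    done
}
```

Calling list_drm_cards before and after each cycle prints lines such as card1 0000:03:00.0, which makes the changing card number easy to follow.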
Starting a new sway session with the dgpu removed and then adding the dgpu does work. The dgpu then shows up in the output of sudo lsof /dev/dri/card*.
Log files: