-
Notifications
You must be signed in to change notification settings - Fork 10.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Are Vulkan and Kompute backend supported on AMD 7800 XT in Windows ? #5698
Comments
What model architecture are you trying to run, and with what kind of quantization? Because these backends don't have full support of all lama.cpp features. |
I just run benchmark.exe and I suppose that if this it's not working correctly then running any model will not work on that type of backend. |
it's support for model and quant is pretty limited. |
Vulkan should work the same regardless of your hardware vendor or operating system... The 7800XT is far from being an ancient card and should support pretty much all Vulkan features, especially on Windows. |
My assumption was if the benchmark.exe doesn't run then also the models will not load. |
I have a question related to the two types backends: Vulkan and Kompute.
I am running on Windows 11 latest version with latest version of AMD drivers on a AMD 7800 XT graphic card.
I have tried with different windows instalations and it is the same. Also tried different drivers version with clean installation and it's the same.
I just want to know if this card is supported on windows with Kompute and Vulkan or it's only for linux.
Thanks
If I run the Vulkan version (b2251) I receive this error:
main: build = 2251 (fd43d66)
main: built with MSVC 19.38.33135.0 for x64
Starting Test
Allocating Memory of size 800194560 bytes, 763 MB
ggml_vulkan: Found 1 Vulkan devices:
Vulkan0: AMD Radeon RX 7800 XT | uma: 0 | fp16: 1 | warp size: 64
Creating new tensors
------ Test 1 - Matrix Mult via F32 code
n_threads=1
m11: type = 0 ( f32) ne = 11008 x 4096 x 1, nb = ( 4, 44032, 180355072) - Sum of tensor m11 is 45088768.00
m2: type = 0 ( f32) ne = 11008 x 128 x 1, nb = ( 4, 44032, 5636096) - Sum of tensor m2 is 2818048.00
GGML_ASSERT: D:\a\llama.cpp\llama.cpp\ggml-vulkan.cpp:1767: false
If I run the Kompute version (b2251) I don't receive and error, but doesn't seems to use the graphic card based on the low points:
main: build = 2251 (fd43d66)
main: built with MSVC 19.38.33135.0 for x64
Starting Test
Allocating Memory of size 800194560 bytes, 763 MB
Creating new tensors
_------ Test 1 - Matrix Mult via F32 code
n_threads=1
m11: type = 0 ( f32) ne = 11008 x 4096 x 1, nb = ( 4, 44032, 180355072) - Sum of tensor m11 is 45088768.00
m2: type = 0 ( f32) ne = 11008 x 128 x 1, nb = ( 4, 44032, 5636096) - Sum of tensor m2 is 2818048.00
gf->nodes[0]: type = 0 ( f32) ne = 4096 x 128 x 1, nb = ( 4, 16384, 2097152) - Sum of tensor gf->nodes[0] is 11542724608.00
------ Test 2 - Matrix Mult via q4_1 code
n_threads=1
Matrix Multiplication of (11008,4096,1) x (11008,128,1) - about 11.54 gFLOPS
Iteration;NThreads; SizeX; SizeY; SizeZ; Required_FLOPS; Elapsed_u_Seconds; gigaFLOPS
Average 81.92_
The my Vulkan info first part (it was to big to add it all) info looks like this:
WARNING: [Loader Message] Code 0 : windows_read_data_files_in_registry: Registry lookup failed to get layer manifest files.
VULKANINFO
Vulkan Instance Version: 1.3.261
Instance Extensions: count = 13
Layers: count = 1
VK_LAYER_AMD_switchable_graphics (AMD switchable graphics layer) Vulkan version 1.3.277, layer version 1:
Layer Extensions: count = 0
Devices: count = 1
GPU id = 0 (AMD Radeon RX 7800 XT)
Layer-Device Extensions: count = 0
Presentable Surfaces:
GPU id : 0 (AMD Radeon RX 7800 XT):
Surface type = VK_KHR_win32_surface
Formats: count = 4
SurfaceFormat[0]:
format = FORMAT_R8G8B8A8_UNORM
colorSpace = COLOR_SPACE_SRGB_NONLINEAR_KHR
SurfaceFormat[1]:
format = FORMAT_B8G8R8A8_UNORM
colorSpace = COLOR_SPACE_SRGB_NONLINEAR_KHR
SurfaceFormat[2]:
format = FORMAT_R8G8B8A8_SRGB
colorSpace = COLOR_SPACE_SRGB_NONLINEAR_KHR
SurfaceFormat[3]:
format = FORMAT_B8G8R8A8_SRGB
colorSpace = COLOR_SPACE_SRGB_NONLINEAR_KHR
Present Modes: count = 3
PRESENT_MODE_IMMEDIATE_KHR
PRESENT_MODE_FIFO_KHR
PRESENT_MODE_FIFO_RELAXED_KHR
VkSurfaceCapabilitiesKHR:
-------------------------
minImageCount = 2
maxImageCount = 16
currentExtent:
width = 256
height = 256
minImageExtent:
width = 256
height = 256
maxImageExtent:
width = 256
height = 256
maxImageArrayLayers = 1
supportedTransforms: count = 1
SURFACE_TRANSFORM_IDENTITY_BIT_KHR
currentTransform = SURFACE_TRANSFORM_IDENTITY_BIT_KHR
supportedCompositeAlpha: count = 1
COMPOSITE_ALPHA_OPAQUE_BIT_KHR
supportedUsageFlags: count = 6
IMAGE_USAGE_TRANSFER_SRC_BIT
IMAGE_USAGE_TRANSFER_DST_BIT
IMAGE_USAGE_SAMPLED_BIT
IMAGE_USAGE_STORAGE_BIT
IMAGE_USAGE_COLOR_ATTACHMENT_BIT
IMAGE_USAGE_INPUT_ATTACHMENT_BIT
VkSurfaceCapabilitiesFullScreenExclusiveEXT:
--------------------------------------------
fullScreenExclusiveSupported = true
Device Properties and Extensions:
GPU0:
VkPhysicalDeviceProperties:
VkPhysicalDeviceLimits:
The text was updated successfully, but these errors were encountered: