Update Makefile for HIPBLAS #4

YellowRoseCx · 2023-07-12T23:06:27Z

This update

changes the DMMV_Y variable to the newer MMV_Y variable
exposes some extra options for the user to change such as "LLAMA_CUDA_KQUANTS_ITER" and "LLAMA_CUDA_KQUANTS_ITER"
sets MMV_Y variable to any DMMV_Y just in case for backwards compatibility
sets CUDA_FORCE_DMMV to 'true' after observing output and reports of garbled output
moves the ggml-cuda.o: CXXFLAGS into ifdef statements

Tested and working on latest build on a 6800xt

SlyEcho · 2023-07-13T08:10:53Z

Adding it to CXXFLAGS like that will define those for all of the code instead of just ggm-cuda.cu, the CUDA version uses a different variable so it is isolated, but what was the problem with target-specific variables?

SlyEcho · 2023-07-13T08:12:25Z

Makefile

+else
+    CXXFLAGS += -DGGML_CUDA_DMMV_X=32
+endif 
+ifeq ($(LLAMA_CUDA_FORCE_DMMV), true)


We should keep it the same as the CUDA version:

Suggested change

ifeq ($(LLAMA_CUDA_FORCE_DMMV), true)

ifdef LLAMA_CUDA_FORCE_DMMV

How would using ifdef work with a true/false boolean? The upstream makefile for CuBLAS doesn't look complete or correct

Well, it's usually defined to be anything to turn it on, not true or false.

YellowRoseCx · 2023-07-13T09:43:14Z

what was the problem with target-specific variables

What target-specific variables?

SlyEcho · 2023-07-13T09:58:06Z

It used to be:

LLAMA_CUDA_DMMV_X ?= 32
ggml-cuda.o: CXXFLAGS += -DGGML_CUDA_DMMV_X=$(LLAMA_CUDA_DMMV_X)

That means that when making the target ggml-cuda.o the CXXFLAGS will have these added definitions. But when making some other target, like common.o for example, these definitions will not be present.

The CUDA version uses this:

ifdef LLAMA_CUDA_DMMV_X
	NVCCFLAGS += -DGGML_CUDA_DMMV_X=$(LLAMA_CUDA_DMMV_X)
else
	NVCCFLAGS += -DGGML_CUDA_DMMV_X=32
endif # LLAMA_CUDA_DMMV_X

Well, it does the same thing but in 5 lines. But the difference here is that it uses a different Make variable NVCCFLAGS, which should only be used on the ggml-cuda.cu file as well.

So maybe we can use a different variable like HIPFLAGS or something?

SlyEcho · 2023-07-13T10:47:38Z

I think I got the variable changes working now except for the backwards compatibility with LLAMA_CUDA_DMMV_Y

Update Makefile for HIPBLAS

a141c68

YellowRoseCx mentioned this pull request Jul 12, 2023

ROCm Port ggerganov/llama.cpp#1087

Merged

SlyEcho reviewed Jul 13, 2023

View reviewed changes

SlyEcho closed this Jul 13, 2023

YellowRoseCx deleted the patch-2 branch April 10, 2024 15:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Makefile for HIPBLAS #4

Update Makefile for HIPBLAS #4

YellowRoseCx commented Jul 12, 2023

SlyEcho commented Jul 13, 2023

SlyEcho Jul 13, 2023

YellowRoseCx Jul 13, 2023

SlyEcho Jul 13, 2023

YellowRoseCx commented Jul 13, 2023

SlyEcho commented Jul 13, 2023

SlyEcho commented Jul 13, 2023

	ifeq ($(LLAMA_CUDA_FORCE_DMMV), true)
	ifdef LLAMA_CUDA_FORCE_DMMV

Update Makefile for HIPBLAS #4

Update Makefile for HIPBLAS #4

Conversation

YellowRoseCx commented Jul 12, 2023

SlyEcho commented Jul 13, 2023

SlyEcho Jul 13, 2023

Choose a reason for hiding this comment

YellowRoseCx Jul 13, 2023

Choose a reason for hiding this comment

SlyEcho Jul 13, 2023

Choose a reason for hiding this comment

YellowRoseCx commented Jul 13, 2023

SlyEcho commented Jul 13, 2023

SlyEcho commented Jul 13, 2023