sycl: Revert MUL_MAT_OP support changes #10385

Merged

Alcpz (Collaborator) commented Nov 18, 2024


Reverts the changes from #10257, which caused a performance regression by disabling more MUL_MAT operations than intended.

Be aware that these changes will cause the SYCL test-backend-ops to fail.

github-actions bot added the SYCL label (https://en.wikipedia.org/wiki/SYCL - GPU programming language) Nov 18, 2024
Alcpz (Collaborator, Author) commented Nov 18, 2024

@NeoZhangJianyu reverting here. There's still an issue with quantizations that use get_rows, but I think that is better addressed in a separate PR, since it was introduced in a different place.

@NeoZhangJianyu NeoZhangJianyu merged commit 557924f into ggerganov:master Nov 19, 2024
54 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Dec 20, 2024
2 participants