[GPU] Fp8 compute backports #2266
base: main
Conversation
@@ -195,7 +207,13 @@ struct matmul_pd_t : public primitive_desc_t {
                sc.group_dims_[0] == 1
                && K() % sc.group_dims_[1] == 0);
    } else {
        ok = ok && (mask == 0);
        ok = ok
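The hunk above tightens the scale-mask validation in `matmul_pd_t`: grouped scales are only accepted when the group spans K evenly, and ungrouped scales must use a common (per-tensor) mask. A minimal standalone sketch of that check follows; the function name and flattened parameters (`grouped`, `group_dim0`, `group_dim1`, `K`) are hypothetical stand-ins for the `sc.group_dims_` / `K()` internals in the real primitive descriptor.

```cpp
#include <cassert>

// Hypothetical standalone version of the grouped-scales check from the
// diff. grouped mirrors "has group dims"; group_dim0/group_dim1 mirror
// sc.group_dims_[0]/[1]; K is the reduction dimension of the matmul.
bool scales_mask_ok(bool grouped, int mask, int group_dim0,
        int group_dim1, int K) {
    if (grouped) {
        // Grouping is only supported along K, and K must split into
        // whole groups.
        return group_dim0 == 1 && K % group_dim1 == 0;
    }
    // Without groups, only a common (per-tensor) scale is allowed.
    return mask == 0;
}
```

For example, a K of 128 with a group size of 32 passes, while a group size that does not divide K is rejected.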
There should be a check here for fp8 versus classic quantization.
I don't think it is required. With these changes, gemm supports dst scales whenever dst is an integer or fp8 type, so there should be no difference in treatment. Attempts to use dst scales with a non-fp8 float dst will be filtered out in the attr checks.
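The filtering described above can be sketched as a small predicate: dst scales pass the attr check only when the dst type is an integer or fp8 type, so a non-fp8 float dst with dst scales is rejected up front. The enum and function names below are hypothetical simplifications, not the actual oneDNN internals.

```cpp
#include <cassert>

// Simplified stand-in for dnnl_data_type_t.
enum class data_type { f32, f16, bf16, s8, u8, s32, f8_e4m3, f8_e5m2 };

bool is_integer(data_type dt) {
    return dt == data_type::s8 || dt == data_type::u8
            || dt == data_type::s32;
}

bool is_fp8(data_type dt) {
    return dt == data_type::f8_e4m3 || dt == data_type::f8_e5m2;
}

// Hypothetical attr check: dst scales are accepted only for integer or
// fp8 dst, matching the behavior described in the review reply.
bool dst_scales_ok(data_type dst_dt, bool has_dst_scales) {
    if (!has_dst_scales) return true;
    return is_integer(dst_dt) || is_fp8(dst_dt);
}
```

Under this sketch, `f8_e4m3` and `s8` destinations may carry dst scales, while an `f32` destination with dst scales fails the check.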
Force-pushed from 8aa5c64 to accacad, then from accacad to 606af26.
Description
Backport of mixed fp8 support and additional scale support for compute primitives.
Checklist
General
Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?