doc: nvidia: update doc for prelu and shuffle
densamoilov committed Mar 20, 2023
1 parent a66cd50 commit eaab94f
Showing 1 changed file with 10 additions and 1 deletion.
11 changes: 10 additions & 1 deletion src/gpu/nvidia/README.md
@@ -279,7 +279,9 @@ backward propagation respectively.
### PReLU
The PReLU primitive (Leaky ReLU with a trainable alpha parameter) is implemented
using SYCL kernels. The primitive supports both forward and backward
-propagations for the data types f32, s32, bf16, f16, s8 and u8.
+propagations.
+* Forward pass supports `f32`, `f16`, `bf16`, `s8` and `u8`
+* Backward pass supports `f32`, `bf16`

### Reorder

@@ -341,6 +343,13 @@ changed to `CUDNN_SOFTMAX_LOG`.
The sum operation uses the reorder primitive to sum tensors, so the same
limitation as reorder applies here.

+### Shuffle
+
+The shuffle primitive is implemented using SYCL kernels.
+This primitive supports both forward and backward propagations.
+* Forward pass supports `f32`, `f16`, `bf16` and `s8`
+* Backward pass supports `f32`, `bf16`
+
### Other primitives

Rest primitives not listed above are not supported by Nvidia backend. This is
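For readers unfamiliar with the PReLU description in the first hunk ("Leaky ReLU with a trainable alpha parameter"), the following is a minimal host-side reference sketch of the forward and backward math. It is not the SYCL kernel from this repository and does not use the oneDNN API: the names `prelu_forward`, `prelu_backward` and `PreluGrads` are hypothetical, a single shared `alpha` is assumed instead of a weights tensor, and only `float` (`f32`) is shown.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical reference helpers for illustration; not part of oneDNN.

// Forward: dst[i] = src[i] if src[i] > 0, otherwise alpha * src[i].
std::vector<float> prelu_forward(const std::vector<float> &src, float alpha) {
    std::vector<float> dst(src.size());
    for (std::size_t i = 0; i < src.size(); ++i)
        dst[i] = src[i] > 0.0f ? src[i] : alpha * src[i];
    return dst;
}

// Gradients produced by the backward pass.
struct PreluGrads {
    std::vector<float> diff_src; // gradient with respect to the input
    float diff_alpha;            // gradient with respect to the shared alpha
};

// Backward: diff_src[i] = diff_dst[i] where src[i] > 0, otherwise
// alpha * diff_dst[i]; diff_alpha accumulates src[i] * diff_dst[i]
// over the non-positive inputs. Assumes src and diff_dst have equal size.
PreluGrads prelu_backward(const std::vector<float> &src,
                          const std::vector<float> &diff_dst, float alpha) {
    PreluGrads grads{std::vector<float>(src.size()), 0.0f};
    for (std::size_t i = 0; i < src.size(); ++i) {
        const bool positive = src[i] > 0.0f;
        grads.diff_src[i] = positive ? diff_dst[i] : alpha * diff_dst[i];
        if (!positive) grads.diff_alpha += src[i] * diff_dst[i];
    }
    return grads;
}
```

Because alpha is trainable, the backward pass has to produce a gradient for it as well as for the input.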
