Pull requests: microsoft/onnxruntime
[webgpu] Optimize matmulnbits with M > 1
ep:WebGPU
ort-web webgpu provider
#23102
opened Dec 13, 2024 by
qjia7
[WebNN] Fixes MLTensor caching across different contexts
#23100
opened Dec 13, 2024 by
egalli
[Bug Fix] Missing CustomOp SchemaRegister when generator EPContext ONNX model
#23091
opened Dec 12, 2024 by
mingyueliuh
Implement some missing element wise Add/Sub/Mul/Div/Neg operations for CPU and CUDA EPs
#23090
opened Dec 12, 2024 by
Zyrin
[WebNN EP] Automatically move input CPU tensors to ml-tensor
#23073
opened Dec 11, 2024 by
egalli
Improves 2d tiled matmulnbits by repeating A, loads N times for each B load
ep:WebGPU
ort-web webgpu provider
#23071
opened Dec 10, 2024 by
sushraja-msft
Implement pre-packed blobs serialization on disk and their memory mapping on load
#23069
opened Dec 10, 2024 by
yuslepukhin
Update python version metadata (remove 3.7, 3.8, 3.9; add 3.13).
#23067
opened Dec 10, 2024 by
tianleiwu
Upgrade Java version from react-native/android to Java 17
#23066
opened Dec 10, 2024 by
jchen351
Make static KV cache work.
ep:WebGPU
ort-web webgpu provider
#23061
opened Dec 10, 2024 by
satyajandhyala
Implement DepthToSpace uint8_t and Enable DropQDQNodesRules
#23029
opened Dec 5, 2024 by
yihonglyu
[DML] Don't save resources to be released later when the GPU is already done with them.
#22995
opened Dec 3, 2024 by
BTurkelson