-
Notifications
You must be signed in to change notification settings - Fork 4.2k
Issues: microsoft/DeepSpeed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How can DeepSpeed be configured to prevent the merging of parameter groups
#6878
opened Dec 16, 2024 by
CLL112
[BUG] Cannot use --hostfile to start multi-node training in Docker.
bug
Something isn't working
training
#6875
opened Dec 16, 2024 by
Ind1x1
[BUG] Setting different learning rates for different optimizer_grouped_parameters is ineffective for zero3
bug
Something isn't working
compression
#6873
opened Dec 15, 2024 by
CLL112
Windows wheel build error - Tried everything with all requirements you have
build
Improvements to the build and testing systems.
windows
#6871
opened Dec 14, 2024 by
FurkanGozukara
[BUG] Invalidate trace cache @ step 10: expected module 11, but got module 19
bug
Something isn't working
training
#6870
opened Dec 14, 2024 by
yafuly
[BUG] Mismatch of model parameters when using Sequence Parallel
bug
Something isn't working
training
#6868
opened Dec 13, 2024 by
chetwin-character
Unable to Install DeepSpeed on Windows using pip
windows
#6865
opened Dec 13, 2024 by
H4CK3R-5M4CK3R
[BUG]When fine-tuning an LLM, the following error occurs after training for some time: self.optimizer.param_groups[param_group_id]['params'] = [] IndexError: list index out of range
bug
Something isn't working
training
#6857
opened Dec 12, 2024 by
tdtgi
[BUG] Unable to Use Something isn't working
compression
quantization_setting
for Customizing MoQ in DeepSpeed Inference
bug
#6853
opened Dec 11, 2024 by
cyx96
[QUESTIONS]:Some questions about running Domino
enhancement
New feature or request
#6851
opened Dec 11, 2024 by
yingtongxiong
Opinion on Refactoring Ulysses
enhancement
New feature or request
#6843
opened Dec 9, 2024 by
Eugene29
[BUG] inference ops unit tests are failing
bug
Something isn't working
inference
#6839
opened Dec 9, 2024 by
oelayan7
[REQUEST] domino integration to nanotron
enhancement
New feature or request
#6835
opened Dec 7, 2024 by
NouamaneTazi
zero-3 cpuadam is so slow
enhancement
New feature or request
#6834
opened Dec 7, 2024 by
SeunghyunSEO
[BUG] offload optmizer states in zero3
bug
Something isn't working
training
#6833
opened Dec 7, 2024 by
Hanqer
[BUG] using deepspeed slower inference time
bug
Something isn't working
inference
#6818
opened Dec 4, 2024 by
williamlin0518
[BUG] DeepSpeed accuracy issue for torch.compile if activation checkpoint function not compiler disabled
bug
Something isn't working
training
#6811
opened Dec 1, 2024 by
NirSonnenschein
[BUG] Enabling drop_tokens in MoE layer causes inference to hang
bug
Something isn't working
inference
#6809
opened Nov 29, 2024 by
Shamauk
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.