(Do not merge) (CPU) aggregation of few recent fixes/optimizations #3920
Signed-off-by: Wang, Yi A <[email protected]>
* fix model partition load cpu mem increase
* fix format
* fix format
This looks to have been merged already, so can we close this PR?
* support baichuan model
* support baichuan without changing model script
Hi @loadams, this PR has some new changes that we are working on merging into master, and I have updated the PR description. Can you help reopen this PR in draft mode? Thanks! From time to time we get AutoTP support requests for new models, or bug reports, so we sometimes need to submit new PRs to DeepSpeed to support them, and we add those changes to this PR for early customer access before they go to master. Hope this helps.
Apologies, yes, happy to re-open.
* enable starcode autotp
* add get_n_embd
Closing, as most of the pending PRs in this list are merged. We will create an issue to track the currently open PRs.
This PR is an aggregation of a few recent fixes in order to support customers. It contains the following PRs along with some other minor fixes:
In addition, we are tracking these PRs (not in this PR branch, but we hope they will be merged):