
add multiple experts per moe layer #4291

Conversation

jonhilgart22

Add the ability to use multiple types of networks per MoE layer instead of only one network.

awan-10 self-assigned this Sep 8, 2023

awan-10 (Contributor) commented Sep 8, 2023

@jonhilgart22 - thanks for the PR! I do not fully understand the need for this PR without looking at the client-side code. Do you have any example code to explain the usage of this new feature?

jonhilgart22 (Author)

> @jonhilgart22 - thanks for the PR! I do not fully understand the need for this PR without looking at the client-side code. Do you have any example code to explain the usage of this new feature?

My intention is to train an MoE model with existing fine-tuned models as the experts. So, instead of mapping many experts to a single network type (e.g., 10 experts that are all MLP layers), you could train differing experts within a layer (T5 as one expert and GPT-2 as another).
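
For illustration, here is a minimal sketch of the intended usage, assuming this PR lets `deepspeed.moe.layer.MoE` accept a list of expert modules in place of a single module. The list-valued `expert` argument is the proposed extension (not the current API), and the two expert classes are hypothetical stand-ins for fine-tuned models:

```python
import torch.nn as nn
from deepspeed.moe.layer import MoE

class MLPExpert(nn.Module):
    """Stand-in for one fine-tuned expert (e.g., a T5 block)."""
    def __init__(self, hidden_size):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(hidden_size, 4 * hidden_size),
            nn.GELU(),
            nn.Linear(4 * hidden_size, hidden_size),
        )

    def forward(self, x):
        return self.net(x)

class AttentionExpert(nn.Module):
    """Stand-in for a structurally different expert (e.g., a GPT-2 block)."""
    def __init__(self, hidden_size):
        super().__init__()
        self.attn = nn.MultiheadAttention(hidden_size, num_heads=4, batch_first=True)

    def forward(self, x):
        out, _ = self.attn(x, x, x)
        return out

hidden_size = 512
# Proposed usage: a list of heterogeneous experts rather than one module
# that DeepSpeed replicates num_experts times.
moe_layer = MoE(
    hidden_size=hidden_size,
    expert=[MLPExpert(hidden_size), AttentionExpert(hidden_size)],
    num_experts=2,
    k=1,
)
```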

awan-10 (Contributor) commented Sep 11, 2023

@jonhilgart22 - sounds good. Can you please either modify the existing MoE unit tests or add a new one, so we know that this does not introduce any issues for the standard MoE API?
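
For example, such a test might look like the following. This is a rough sketch only: it assumes the list-valued `expert` argument proposed in this PR, and the real suite would run it under DeepSpeed's distributed unit-test harness rather than as a bare single-process test:

```python
import torch
from deepspeed.moe.layer import MoE

def test_moe_with_heterogeneous_experts():
    hidden = 64
    # Two structurally different experts (the list form is this PR's proposal).
    experts = [
        torch.nn.Linear(hidden, hidden),
        torch.nn.Sequential(torch.nn.Linear(hidden, hidden), torch.nn.ReLU()),
    ]
    moe = MoE(hidden_size=hidden, expert=experts, num_experts=2, k=1)
    x = torch.randn(4, 16, hidden)
    # DeepSpeed's MoE forward returns (output, aux_loss, exp_counts).
    out, _, _ = moe(x)
    # The layer should preserve shape regardless of expert type.
    assert out.shape == x.shape
```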

awan-10 (Contributor) commented Sep 12, 2023

@jonhilgart22 - please see the failing MoE unit tests. I think that is an indication that something is broken in the new changes.

jonhilgart22 (Author)

Closing, as this is not exactly what I'm looking for. BTM (Branch-Train-Merge) is closer.
