Added GODEL support with T5 model type #376

Emulator000 · 2023-05-11T01:54:00Z

This PR adds GODEL support as mentioned in #324 issue.

Notes
I just tested with a local download copy of the GODEL base model and it works except if I change this line:

Line 918 in 9fd7983

    
           let generated_response = &generated_sequence[input_length - removed_padding.0..];

Unfortunately it panics with:

thread 'main' panicked at 'range start index 6 out of range for slice of length 5', /home/dario/projects/rust-bert/src/pipelines/conversation.rs:940:43

If instead for testing purposes I put something like:

let generated_response = generated_sequence.as_slice();

It works greatly!

Could someone understand why we have this different behavior between GPT2 and T5 models? Is like we don't have the input sequence in the generated sequence so we don't have to remove it with padding.

Emulator000 · 2023-05-11T01:54:52Z

Also, we should upload the Rust ot model into Huggingface's space in order to work!

- Skip truncation of prompt for encoder-decoder models - Add right padding logic for encoder-decoder models

guillaume-be · 2023-05-14T08:12:35Z

Thank you @Emulator000 for identifying the issue with T5 models and submitting this PR. I have opened a small PR to address the issue without commenting out the input prompt filtering that is needed for decoder models. The suggested changes should work for both encoder-decoders (such as T5) and decoders (such as DialoGPT) - please have a look and merge if this is fine - this should update this PR at the same time.

It would be good to have the rust-version of the GODEL models pushed to the Hugging Face model hub before this gets merged since they are registered in the library. Can you please submit a PR to add the rust weights to the upstream repositories?

When this is completed it would also be great to have a test with the smallest GODEL model, please consider including one as part of the T5 test suite.

Thanks!

Minor GODEL fixes

Emulator000 · 2023-05-14T11:09:18Z

please have a look and merge if this is fine - this should update this PR at the same time.

Definitely good and it works, I just tested it!

It would be good to have the rust-version of the GODEL models pushed to the Hugging Face model hub before this gets merged since they are registered in the library. Can you please submit a PR to add the rust weights to the upstream repositories?

I've already converted it from Microsoft's original weights but how can I push into their upstream repository? Where can I open the PR?

When this is completed it would also be great to have a test with the smallest GODEL model, please consider including one as part of the T5 test suite.

Sure, I'll add tests for this right now 😄

Emulator000 · 2023-05-14T11:34:23Z

Nevermind for the upstream, already opened two PRs for the GODEL's base and large models:

Emulator000 · 2023-05-14T13:16:18Z

@guillaume-be tests added!

The only weird thing that I see here is the output from GODEL (you can see in the tests), for some strange reason, every generated texts starts with a blank space and I don't understand why, is something related to the padding?

Emulator000 · 2023-05-16T17:20:22Z

@guillaume-be should we have to ping Microsoft someway in order to get PRs merged?

guillaume-be · 2023-05-19T17:23:51Z

@Emulator000 yes it seems notifications sometimes don't get through in the model hub. What has worked well in the past is leaving an issue on the model repo (in this case https://github.com/microsoft/GODEL/issues) and hope to be able to reach the Hugging Face repository organization owners.

Emulator000 · 2023-05-20T15:06:14Z

Just opened an issue here 💪🏻

Added GODEL support

7e00a22

Added other missing resources

a3f484e

Emulator000 force-pushed the godel branch from 811b35e to a3f484e Compare May 11, 2023 07:29

guillaume-be added 2 commits May 14, 2023 08:47

Merge branch 'master' into godel

f27b319

- Remove debugging print statement

564ae85

- Skip truncation of prompt for encoder-decoder models - Add right padding logic for encoder-decoder models

guillaume-be mentioned this pull request May 14, 2023

Minor GODEL fixes Emulator000/rust-bert#1

Merged

Merge pull request #1 from guillaume-be/godel-patch

3ba15e5

Minor GODEL fixes

Emulator000 added 2 commits May 14, 2023 14:13

Updated error message

971d401

Added tests for GODEL T5 model

aca49b7

Merge branch 'master' into godel

0c0fcc4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added GODEL support with T5 model type #376

Added GODEL support with T5 model type #376

Emulator000 commented May 11, 2023 •

edited

Loading

Emulator000 commented May 11, 2023 •

edited

Loading

guillaume-be commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 16, 2023

guillaume-be commented May 19, 2023

Emulator000 commented May 20, 2023

Added GODEL support with T5 model type #376

Are you sure you want to change the base?

Added GODEL support with T5 model type #376

Conversation

Emulator000 commented May 11, 2023 • edited Loading

Emulator000 commented May 11, 2023 • edited Loading

guillaume-be commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 14, 2023

Emulator000 commented May 16, 2023

guillaume-be commented May 19, 2023

Emulator000 commented May 20, 2023

Emulator000 commented May 11, 2023 •

edited

Loading

Emulator000 commented May 11, 2023 •

edited

Loading