Can I train OpenFlamingo without LIAON dataset? #215

ElegantLin · 2023-07-04T17:07:05Z

Thanks for your great job. I wonder whether we can train open flamingo with MMC4 dataset only and I wonder why the loss from MMC4 dataset could be nan.

Thanks for your explanation.

The text was updated successfully, but these errors were encountered:

i-gao · 2023-07-06T13:07:08Z

Hi, thanks for your question! The code is not currently configured like this, but it wouldn't be hard to implement (similar to #145). If you'd like to contribute a PR, this would make a great first issue!

Regarding nan losses: great question. This bit of code originally sought to catch cases where the mmc4 sequence looks like "text text ". In this case, all labels are masked to -100, since there are no text tokens after image tokens. We later updated data.py upstream to prevent these sequences from being sampled, so that issue is resolved. There may still be nan cases from training at larger scales than 9B. We have not worked with those scales yet to observe them.

ElegantLin · 2023-07-08T03:33:07Z

Thanks for your kind reply. I am planning to contribute a PR to make this project more complete :). I will close this issue after I finish the PR.

YerongLi · 2023-07-18T10:13:23Z

Good point, we need to train on smaller dataset. Wish we can get an example workflow.

anas-awadalla linked a pull request Sep 19, 2023 that will close this issue

Major refactor to support new architectures #261

Draft

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can I train OpenFlamingo without LIAON dataset? #215

Can I train OpenFlamingo without LIAON dataset? #215

ElegantLin commented Jul 4, 2023 •

edited

Loading

i-gao commented Jul 6, 2023

ElegantLin commented Jul 8, 2023 •

edited

Loading

YerongLi commented Jul 18, 2023 •

edited

Loading

Can I train OpenFlamingo without LIAON dataset? #215

Can I train OpenFlamingo without LIAON dataset? #215

Comments

ElegantLin commented Jul 4, 2023 • edited Loading

i-gao commented Jul 6, 2023

ElegantLin commented Jul 8, 2023 • edited Loading

YerongLi commented Jul 18, 2023 • edited Loading

ElegantLin commented Jul 4, 2023 •

edited

Loading

ElegantLin commented Jul 8, 2023 •

edited

Loading

YerongLi commented Jul 18, 2023 •

edited

Loading