Clarification of model fold data splits #11

adamyhe · 2023-11-11T16:23:52Z

For model f0, sequences labeled fold0 form the test set and fold1 form validation.
For model f1, sequences labeled fold1 form the test set and fold2 form validation. Etc

Originally posted by @davek44 in #1 (comment)

Hi, I just wanted to clarify the exact splits that were used for each of the model folds. My reading is that the test/val/train splits for each of the models is set up as:

f0: test=fold0, val=fold1, train=rest
f1: test=fold1, val=fold2, train=rest
f2: test=fold2, val=fold3, train=rest
f3: test=fold3, val=fold4, train=rest

Thanks!

davek44 · 2023-11-13T21:09:06Z

Yes, this is correct. Here's the code segment that performs that https://github.com/calico/basenji/blob/master/bin/basenji_train_folds.py#L397

adamyhe · 2023-11-13T21:10:20Z

Awesome. Thanks!

davek44 closed this as completed Nov 13, 2023

BDEvan5 mentioned this issue Dec 6, 2024

What training and test folds were used for models in paper #33

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification of model fold data splits #11

Clarification of model fold data splits #11

adamyhe commented Nov 11, 2023 •

edited

Loading

davek44 commented Nov 13, 2023

adamyhe commented Nov 13, 2023

Clarification of model fold data splits #11

Clarification of model fold data splits #11

Comments

adamyhe commented Nov 11, 2023 • edited Loading

davek44 commented Nov 13, 2023

adamyhe commented Nov 13, 2023

adamyhe commented Nov 11, 2023 •

edited

Loading