-
@ninedesu This is an excellent question, and yes, we plan to build a community where people can contribute data for fine-tuning. At the moment, we are gathering all our internal and external datasets (e.g. https://huggingface.co/datasets/ds4sd/DocLayNet) and preparing them so that we can share them on Hugging Face. As for OCR, we still have some work to do there and currently rely on third-party OCR.
-
I'd like to know whether we can use our own dataset to fine-tune the OCR.