-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Original_caption #11
Comments
Hi @fbj2333 , Thanks for your interest on our work. A quick answer is: For LAION images, GPT-4V only uses image to generate caption, instruction and answer. The original LAION caption is NOT used. It seems that we did not mention "the original caption" in section 3.2.1. Best, |
I see, in Section 3.2.1, you mention additional caption, and i misunderstood it as original caption. By the way, the Chinese edition's allava has about thousands of wrong data. For example: |
Hi @fbj2333 , Thanks for reporting the issue. We have temporarily made the HF repo private and will update the data soon. Best, |
In your paper‘s section 3.2.1, you mention that utilizing original caption can enhance the model's answer. But in your prompt i haven't found any content about the original caption. So GPT4V only uses images to produce data, including caption and instruction? I'd like to know if the original caption used in your data synthesis or model inference procedure.
The text was updated successfully, but these errors were encountered: