Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Assertion Failed during training #45

Open
zealot52099 opened this issue Sep 24, 2024 · 2 comments
Open

Assertion Failed during training #45

zealot52099 opened this issue Sep 24, 2024 · 2 comments

Comments

@zealot52099
Copy link

Hi there ! When I try to train the model, the following assertion failed:

assert (sum([(cur == AUDIO_TOKEN_INDEX).sum() for cur in input_ids]) + sum([(AUDIO_TOKEN_INDEX not in cur) for cur in input_ids]) == audio_features["inputs_embeds"].shape[0]

I checked the value of sums are 8 and 0 respectively if image is included in my data, otherwise 0 and 8, and the value of audio_features["inputs_embeds"].shape[0] is 40.

My json file of dataset is like this:

{
"set": "sharegpt4",
"conversations": [
{
"from": "human",
"value": "\n\n 请尽量准确地转录所有内容,并在不确定发音时提供可能的替代选项。开始转录:"
},
{
"from": "gpt",
"value": "也成为地方政府的眼中钉"
}
],
"image": "/workspace/frame_1.jpg",
"audio": [
"/dataset/audio_1.wav"
]
}

Is there something I did wrong? Thanks!

@linhaojia13
Copy link
Collaborator

Hi @zealot52099 , you should add <audio> in the conversation.

@MonolithFoundation
Copy link

@zealot52099 Hi, why do u have image and audio data, does the audio has any relationship to the image

I thought VITA didn't have audio and image data at the same time

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants