You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi there ! When I try to train the model, the following assertion failed:
assert (sum([(cur == AUDIO_TOKEN_INDEX).sum() for cur in input_ids]) + sum([(AUDIO_TOKEN_INDEX not in cur) for cur in input_ids]) == audio_features["inputs_embeds"].shape[0]
I checked the value of sums are 8 and 0 respectively if image is included in my data, otherwise 0 and 8, and the value of audio_features["inputs_embeds"].shape[0] is 40.
Hi there ! When I try to train the model, the following assertion failed:
assert (sum([(cur == AUDIO_TOKEN_INDEX).sum() for cur in input_ids]) + sum([(AUDIO_TOKEN_INDEX not in cur) for cur in input_ids]) == audio_features["inputs_embeds"].shape[0]
I checked the value of sums are 8 and 0 respectively if image is included in my data, otherwise 0 and 8, and the value of audio_features["inputs_embeds"].shape[0] is 40.
My json file of dataset is like this:
{
"set": "sharegpt4",
"conversations": [
{
"from": "human",
"value": "\n\n 请尽量准确地转录所有内容,并在不确定发音时提供可能的替代选项。开始转录:"
},
{
"from": "gpt",
"value": "也成为地方政府的眼中钉"
}
],
"image": "/workspace/frame_1.jpg",
"audio": [
"/dataset/audio_1.wav"
]
}
Is there something I did wrong? Thanks!
The text was updated successfully, but these errors were encountered: