You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi guys. I converted a RobertaTokenizer to an ONNX format, and now I need to use it from C++ code. This model takes a string tensor as input and is supposed to return an int64 tensor of tokens, attention mask and offset mapping (accoriding to its inputs/outputs inspection). But the model spits out [0, 2] for every input. I tried to use it from the C# code, and everything worked as expected, so it's more likely I'm doing something wrong here. I'm also not quite sure about the correctness of memory ownership. Logs from the debug mode don't look suspicious. Here's what I'm doing:
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi guys. I converted a RobertaTokenizer to an ONNX format, and now I need to use it from C++ code. This model takes a string tensor as input and is supposed to return an int64 tensor of tokens, attention mask and offset mapping (accoriding to its inputs/outputs inspection). But the model spits out [0, 2] for every input. I tried to use it from the C# code, and everything worked as expected, so it's more likely I'm doing something wrong here. I'm also not quite sure about the correctness of memory ownership. Logs from the debug mode don't look suspicious. Here's what I'm doing:
Any help is appreciated.
Just in case, here's the tokenizer.
BPEtokenizer.zip
Beta Was this translation helpful? Give feedback.
All reactions