Skip to content

Commit

Permalink
fix: Change w2v2 word delimiter token to space
Browse files Browse the repository at this point in the history
  • Loading branch information
Søren Winkel Holm committed Sep 20, 2023
1 parent 49c8063 commit c329a88
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/coral_models/wav2vec2.py
Original file line number Diff line number Diff line change
Expand Up @@ -115,7 +115,7 @@ def load_processor(self) -> Wav2Vec2Processor:
pad_token="<pad>",
bos_token="<s>",
eos_token="</s>",
word_delimiter_token="|",
word_delimiter_token=" ",
)

# Set the `model_max_length` attribute of the tokenizer, if it hasn't been set,
Expand Down

0 comments on commit c329a88

Please sign in to comment.