Warn about structured generation without a prompt for Llama tokenizers #1321
Labels: correctness, documentation, structured generation, tokenization
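Llama (and possibly all SentencePiece?) tokenizers do something special with spaces and initial tokens: a token that normally carries a leading space, such as " https", is decoded without that space when it is the first token in a sequence.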
The decoding used by our structured generation doesn't have this no-space-for-the-first-token consideration, so we need to warn people that structured generation won't allow tokens like 2045 (because it strictly interprets them as " https") when/if generation starts without a prompt. I'm not sure why one would start generation without a prompt, but it's worth mentioning.
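
To make the mismatch concrete, here is a minimal, self-contained sketch. It does not use the real Llama tokenizer or our actual decoding code: the vocabulary is a toy one (only the id 2045 for " https" comes from this issue, the other ids are made up), and the function names are hypothetical. It assumes that sequence-level decoding drops the leading space of the first token, while per-token decoding always keeps it.

```python
SPACE_MARKER = "\u2581"  # the "▁" piece prefix used by SentencePiece-style vocabularies

# Toy vocabulary for illustration only; ids 1 and 2 are invented.
TOY_VOCAB = {2045: "\u2581https", 1: "https", 2: "://"}


def strict_decode(token_id):
    """Decode a single token in isolation: the marker always becomes a space."""
    return TOY_VOCAB[token_id].replace(SPACE_MARKER, " ")


def sequence_decode(token_ids):
    """Decode a whole sequence, dropping the leading space of the first token."""
    text = "".join(strict_decode(t) for t in token_ids)
    return text[1:] if text.startswith(" ") else text


def allowed_first_tokens(target, decode_one):
    """Return the ids whose decoded text is a prefix of `target`."""
    return sorted(t for t in TOY_VOCAB if target.startswith(decode_one(t)))


target = "https://"

# Per-token (strict) decoding sees 2045 as " https", so it is masked out.
strict_ok = allowed_first_tokens(target, strict_decode)

# First-token-aware decoding sees 2045 as "https", so it is allowed.
lenient_ok = allowed_first_tokens(target, lambda t: sequence_decode([t]))

print(strict_ok)
print(lenient_ok)
```

With no prompt, the strict view leaves only the space-free token as a valid first token, even though a model might well prefer 2045 there. With a prompt, 2045 is no longer the first token in the sequence, so the two views agree and the problem doesn't show up.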