Gibberish text generation after converting to Huggingface. #712
Comments
Hey! Looking into this to see if it reproduces on my end!
Oh, @kanwatchara-k, would you be willing to share the exact command you ran for https://github.com/EleutherAI/gpt-neox/pull/701/files#diff-fff7e2d700e82c3e6027c575c1cd96830ba839ff44fa6b82abf2cb21b029d55c? There’s a chance that your discrepancy is due to my current script not accepting multiple config files like the ones you used for training.
@haileyschoelkopf Of course. Though I did make some changes to the code (the modified version is here). Specifically, I hard-coded the vocab file path and the tokenizer type, and I combined the two config files in the code (with their paths also hard-coded). With the modified code, I just ran the command. Thanks!
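For reference, here is a rough sketch of that kind of modification to the PR #701 conversion script. The actual command and paths are not shown in the thread, so the file names and keys below are placeholders, not the author's real changes:

```python
# Rough sketch (not the author's actual script changes): merge two GPT-NeoX
# YAML config files into one dict and hard-code the tokenizer settings,
# roughly as described above. Paths and keys are placeholders.
import yaml

config_paths = ["local_setup.yml", "my_model.yml"]  # hypothetical config pair
neox_config = {}
for path in config_paths:
    with open(path) as f:
        # later files override earlier ones on duplicate keys
        neox_config.update(yaml.safe_load(f))

# Hard-coded values standing in for what the conversion script would
# otherwise read from the command line / config:
neox_config["vocab-file"] = "/path/to/tokenizer.json"
neox_config["tokenizer-type"] = "HFTokenizer"
```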
Thank you!! I’ll try this to convert your checkpoint as soon as I can, hopefully later today or early tomorrow!
Still working on finding the possible issue here--I'll keep you posted!
@kanwatchara-k so sorry for the delay on my end. I had a model with the same problem, and what fixed the issue for me was:
The issue here was that your model and mine, which wouldn't convert properly, use a layer setup different from GPT-NeoX-20B's, which is controlled by
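(The option name is cut off in the comment above, so the following is an assumption rather than the author's confirmed fix: the layer arrangement that distinguishes GPT-NeoX-20B from many other NeoX-trained models is the parallel, GPT-J-style residual, which the NeoX config exposes as a `gpt_j_residual`-style option and the Hugging Face `GPTNeoXConfig` as `use_parallel_residual`. A minimal sketch of carrying that flag over during conversion, under those assumptions:)

```python
# Minimal sketch, NOT the author's confirmed fix: the comment above is cut
# off, so the exact option is an assumption. GPT-NeoX-20B uses a parallel
# (GPT-J-style) residual; a model trained without it needs the Hugging Face
# config to say so, otherwise generation comes out as gibberish.
import yaml
from transformers import GPTNeoXConfig

with open("my_model.yml") as f:  # hypothetical merged NeoX training config
    neox_cfg = yaml.safe_load(f)

hf_config = GPTNeoXConfig(
    hidden_size=neox_cfg["hidden-size"],
    num_hidden_layers=neox_cfg["num-layers"],
    num_attention_heads=neox_cfg["num-attention-heads"],
    # Assumed mapping: NeoX's gpt-j-residual option (key style may be hyphen
    # or underscore depending on the config file) -> HF use_parallel_residual.
    use_parallel_residual=neox_cfg.get("gpt-j-residual", False),
)
```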
@haileyschoelkopf Thank you so much! It works properly now!
Original issue description:
Hi,
I am having trouble converting my checkpoints to Hugging Face format. The model works fine when run with DeepSpeed + Megatron (example).
However, the output becomes gibberish once the model is converted to Hugging Face format (example).
I have tried multiple conversion scripts so far (e.g., this and this) without success.
All the related files (weights, config, and tokenizer) are in my Google Drive.
Any help is greatly appreciated!
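For anyone hitting the same symptom, a quick way to reproduce the check described above (the directory name is a placeholder for wherever the converted weights live):

```python
# Quick sanity check on a converted checkpoint: load it with transformers and
# generate a short greedy continuation. "converted-neox-hf" is a placeholder
# for the conversion script's output directory.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_dir = "converted-neox-hf"
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForCausalLM.from_pretrained(model_dir)

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
# A correct conversion continues coherently; a broken one prints repetitive
# or nonsensical tokens, as described above.
```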