
Custom Finetuned Models? #32

Open
mallorbc opened this issue Mar 28, 2023 · 6 comments

Comments

@mallorbc
Contributor

Looking at the code, it appears that when loading a model, the code downloads preprocessed models that were uploaded to Hugging Face and then checks their SHA-256 hashes to make sure they match. The code does not currently seem to allow loading a model from a local path.
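(For reference, the verify step presumably looks something like the sketch below; this is not the repository's actual code, and the function names are mine.)

import hashlib

def sha256_of(path, chunk_size=1 << 20):
    # Stream the file in 1 MiB chunks so large model files don't fill memory.
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def verify_model(path, expected_sha256):
    # Refuse to load a downloaded model whose digest doesn't match the pinned one.
    actual = sha256_of(path)
    if actual != expected_sha256:
        raise ValueError(f"SHA-256 mismatch for {path}: {actual} != {expected_sha256}")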

I have converted GPT-J myself into the GGML format (which I am almost certain this project is built on; correct me if I'm wrong).

I am interested in fine-tuning a model, converting it to GGML, quantizing it to 4 bits, and then using the model through an API.

If this repo supported custom fine-tuned models, it would be great for this use case.
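(To illustrate the use case, here is a minimal sketch of such an API, assuming the AutoInference interface shown later in this thread; the model name and route are placeholders of mine.)

# Hedged sketch: wrap generation in a small HTTP API with Flask.
from flask import Flask, request, jsonify
from interface import AutoInference as AI

app = Flask(__name__)
ai = AI('myUserName/myModel')  # placeholder: your custom converted model

@app.route('/generate', methods=['POST'])
def generate():
    prompt = request.get_json()['prompt']
    out = ai.generate(prompt, num_tokens_to_generate=500)
    return jsonify({'text': out['token_str']})

if __name__ == '__main__':
    app.run(port=8000)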

@Ayushk4
Member

Ayushk4 commented Mar 28, 2023

Hi,
You should be able to convert your custom GPT-J-based models.

If you want to add new models, then follow these steps:

After these steps, you should be able to load the model via Python.

@mallorbc
Contributor Author

I do not want to upload the model to Hugging Face, since the models are private (although perhaps one can host private models on Hugging Face?).

However, if the method you described works, I believe I could add support for loading models locally. I will likely explore adding this feature.

Thanks for your insight.

@Ayushk4
Member

Ayushk4 commented Mar 28, 2023

Yes. If you don't want to upload, a hack would be to put the int4_fixed_zero file at ~/.cformers/models/myUserName/myModel/int4_fixed_zero.
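For example, staging a locally quantized file could look like this (src is a placeholder for wherever your converted weights live):

import os
import shutil

src = "path/to/your/int4_fixed_zero"  # placeholder: your locally quantized weights
dst_dir = os.path.expanduser("~/.cformers/models/myUserName/myModel")
os.makedirs(dst_dir, exist_ok=True)  # create the cache directory if needed
shutil.copy(src, os.path.join(dst_dir, "int4_fixed_zero"))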

Then you can use it like this:

from interface import AutoInference as AI

ai = AI('myUserName/myModel')  # resolves to ~/.cformers/models/myUserName/myModel
x = ai.generate('Some Prompt', num_tokens_to_generate=500)
print(x['token_str'])  # the generated text

For example, myUserName/myModel could be EleutherAI/gpt-j-6B for the GPT-J model.

@sann3

sann3 commented Apr 1, 2023

Does the same work for the hivemind/gpt-j-6B-8bit model?

@Ayushk4
Member

Ayushk4 commented Apr 1, 2023

You could load it at 4-bit. For now, 8-bit isn't supported.

@mallorbc
Contributor Author

mallorbc commented Apr 7, 2023

My PR #38 should close this issue. I will close it when merged.
