How to run?
#1136
Replies: 1 comment 5 replies
-
Prebuilt windows binaries are still available from the releases section. If you place your model in the same folder as the executables your current script should still work. You may have to reconvert and quantize the model to get it to work with the newest version or use the convert.py script from this repo to convert the current model to the new format. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I used to use Llama.cpp 2 weeks ago. I knew how to run it back when it has a file named "Main" and I used a batfile which included the following.
title llama.cpp
:start
main -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 --ignore-eos --repeat_penalty 1.2 --instruct -m ggml-model-q4_1.bin
pause
goto start
This does not work now. I imagine it's outdated.
I put my models in the models folder. Now I need to know how to actually use this program.
Thanks.
I learned more.
I read into it. I have to like download visual studios, use Cmake etc. Quantize models? The old llama.cpp I downloaded just worked. Oh well I will slowly learn to how operate this nightmare of a software. lol I also have to download weights like that?
Beta Was this translation helpful? Give feedback.
All reactions