`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

ochafik · 2024-06-06T23:31:00Z

This renames binaries to have a consistent llama- prefix.

main → llama-cli
any-other-example → llama-any-other-example, e.g:
- llama-server,
- llama-baby-llama,
- llama-llava-cli,
- llama-simple
- ...
(rpc-server = only exception, name unchanged - see below)

Goal is to make any scripts / docs work across all the ways to install (cmake, brew, rpm, nix...).

For RPM-based distros, the cuda & clblast variants' prefix is llama-cuda- / llama-clblast-.

The Homebrew formula will be updated separately (llama → llama-cli).

Note: can install a local build w/:

cmake -B build -DLLAMA_CURL=1 && cmake --build build -j && cmake --install build
# Uninstall: ( cd build && xargs rm < install_manifest.txt )

See discussion below about naming options considered.

…/ homebrew

gitignore llama-server

fix examples/main ref

server

ngxson · 2024-07-02T22:14:19Z

Personally I don't think there are any bad things about communications in this PR.

Llama.cpp has high velocity development, much like any other open-source project out there, so breaking changes are expected. We already had dedicated section on README to note about this and other breaking changes.

because I don't "make clean." I don't know what that is, nor is it mentioned in the readme

If this doesn't change, you will drive away users.

That's the reason why you shouldn't get mad. Llama.cpp is dev-oriented so will not as friendly as friendly GUI apps out there like jan or lmstudio. We assume that people using llama.cpp at least have basic knowledge about compiling. If you don't know something, then now you have a reason to ask & to learn about that, rather than making complaints.

Finally, I think we should give some credits to devs who dedicated their time to work on big changes like this one. It's not just taking energy to write the code, but also to communicate in a professional & friendly way.

Llama.cpp just updated their program names, I've updated the article to use the new name. quantize -> llama-quantize main -> llama-cli simple -> llama-simple [Check out the PR](ggerganov/llama.cpp#7809)

oldgithubman · 2024-07-03T05:05:10Z

Personally I don't think there are any bad things about communications in this PR.

Then why are there so many reports like mine? Not necessarily official reports, mind you (although I think I saw a couple official issues). Check reddit, for example. It's not just me. Also, @crashr predicted the problem early on. He was absolutely right and should have been taken more seriously.

Llama.cpp has high velocity development, much like any other open-source project out there

I don't see the correlation with speed and open-source. There are a full spectrum of development methodologies in both closed and open source. Fast and slow.

so breaking changes are expected

This should not be an excuse to disregard user experience. In this case (and others - again, a pattern here), the change was unnecessary and broke the user experience in a way that cost a lot of people a lot of resources. One person on reddit has been struggling for the past month. The breaking change itself wasn't the problem, perse. The problem was that the impact of the change could have been easily mitigated with the slightest effort. It's the same attitude I've been calling out for a while now. Too many devs simply care too little about the users, and I think you're illustrating my point.

We already had dedicated section on README to note about this and other breaking changes.

Which demonstrably wasn't good enough. The readme is notoriously unreliable.

because I don't "make clean." I don't know what that is, nor is it mentioned in the readme

If this doesn't change, you will drive away users.

That's the reason why you shouldn't get mad. Llama.cpp is dev-oriented so will not as friendly as friendly GUI apps out there like jan or lmstudio.

The readme details what you need to do to use llama.cpp, including "make." If the users are expected to do something different to not get screwed, it would make sense for the readme to reflect that. If you think all your users are supposed to know "make clean," then why do you tell them how to use "make?" This is obviously confusing and inconsistent. "Llama.cpp is dev-oriented so will not as friendly" is an excuse and a bad one at that. I'm not "mad" because you don't have a GUI. Maybe the readme should say in bold letters across the top, "If you don't know what 'make clean' is, f off."

We assume that people using llama.cpp at least have basic knowledge about compiling.

If that was true, you wouldn't tell people the basics of compilation in the readme. Also, the implication that I don't have basic knowledge about compiling is presumptuous and without merit. I compiled my first linux kernel over 20 years ago and have compiled thousands of programs since.

If you don't know something, then now you have a reason to ask & to learn about that, rather than making complaints.

You'll note I did ask about it and no one has responded. Also, I don't see how that precludes me from providing feedback. This is yet another case of someone criticizing me without even bothering to read my posts. You're telling me to do the very thing I've already done. You'll forgive me for not taking you very seriously moving forward, considering the smell of bad faith and elitism.

Finally, I think we should give some credits to devs who dedicated their time to work on big changes like this one.

Speak for yourself. I'm not in the habit of crediting moves that cause the community and me so much trouble for no benefit. Also, I guess I fail to see how renaming a bunch of binaries and breaking everyone's workflows (including organizations like huggingface) is a good thing worthy of praise. But I'm just a user, so what could I possibly know, right?

It's not just taking energy to write the code, but also to communicate in a professional & friendly way.

You're out of touch.

All that said, I do want to thank the devs for all the good work they do, as I often do. But for things to improve, criticism is required (tough love). I understand that's surprising to a lot of people these days.

Edit - I especially want to thank @HanClinto . Most devs could learn a lot from him. MVP

crashr · 2024-07-03T09:44:02Z

@oldmanjk You're not entirely wrong. I recognized the problem because I saw the old binary name in a recent screenshot on Reddit and wondered how that could happen. I generally welcome the change to the binary names, but I also don't see any way of preventing someone from falling foul of this. If, for example, the old binaries had been deleted via makefile, some pipeline somewhere would certainly have broken a productive system. Ok, at least one would have noticed it immediately. Ultimately, one should read the release notes before making any changes like compiling a new binary. Yes, very few of us do. But we should.

HanClinto · 2024-07-03T14:21:03Z

@crashr

I recognized the problem because I saw the old binary name in a recent screenshot on Reddit and wondered how that could happen. I generally welcome the change to the binary names, but I also don't see any way of preventing someone from falling foul of this.

Yeah, this is a bit concerning to me, and I'm trying to think of what more we can do to mitigate this fallout.

I've put some suggestions here, but I'm currently leaning towards:

The second option is to add a minimalistic program that builds to the old binary names, but simply outputs a friendly error message that says that the binary names have changed, and to please update scripts and routines to use the new filenames.

There is more discussion in that PR, but I like the idea of overwriting the old binaries (at least for things like main, the server, and maybe quantization commands) so that they don't linger around, and provide people a nice breakpoint so that errors don't continue silently.

@oldmanjk

I especially want to thank @HanClinto .

Thanks for the shoutout, though it takes all kinds to move a project of this size forward, and I'm just a very small part. Nobody who works on this project takes breaking changes like this casually, and -- despite how it may appear from another vantage -- I hope you trust me when I say that nothing in this migration change was done recklessly or without consideration (the pages and pages of discussion at the beginning of the PR should illustrate this). I hope you'll give us all credit for that -- it's messy, difficult, and there are a lot of tradeoffs to balance, and sometimes we adapt on the fly (which is what we're doing here).

This name change should tide us over for a while, and hopefully we won't need another one until we move to the supercommand (and that one should be more backwards-compatible).

oldgithubman · 2024-07-03T17:32:12Z

You're not entirely wrong.

What am I wrong about?

I also don't see any way of preventing someone from falling foul of this.

There are many ways. @HanClinto and I are discussing a few in another thread.

Ok, at least one would have noticed it immediately.

Exactly. Although there are better ways.

Ultimately, one should read the release notes before making any changes like compiling a new binary.

I usually do when I'm installing releases. That's not what's happening here though. We're not compiling releases.

…st to help people notice the binary name change from ggerganov#7809 and migrate to the new filenames.

crashr · 2024-07-03T18:30:21Z

What am I wrong about?

The communication thing. What else could have be done but write it in big letters at the top?

HanClinto · 2024-07-03T18:38:44Z

What else could have be done but write it in big letters at the top?

Well, there are a few things, though some are most clear in hindsight.

One suggestion is implemented in #8283 -- please give it a look-see! :)

oldgithubman · 2024-07-03T18:56:42Z

Thanks for the shoutout

You're welcome. You deserved it.

I'm just a very small part.

Disagree. I'm going to assume you're just being humble, because based on what I've seen so far, you're intelligent enough to understand your true value.

Nobody who works on this project takes breaking changes like this casually

I understand what you're trying to do here, but strong disagree.

I hope you trust me

I do when you're not lying. I don't think you believe what you're saying. Again, I understand what you're trying to do, but it's misguided.

when I say that nothing in this migration change was done recklessly or without consideration

This migration change was absolutely done recklessly. There was consideration, but it wasn't taken seriously.

(the pages and pages of discussion at the beginning of the PR should illustrate this)

Disagree. They illustrate what I'm saying. Also, and more importantly, the results illustrate what I'm saying.

I hope you'll give us all credit for that

I give credit where credit is due. Like how we used to back in the day (as a society). The "everyone gets a trophy" mentality is more destructive than most people realize.

it's messy

When you make it that way. It certainly didn't have to be.

difficult

I understand software development is difficult, but I disagree that what was done here was difficult. Changing the names of binaries is not difficult. Changing the names of binaries without screwing the userbase is a little more difficult, but still not difficult. The review-complexity label on your new PR being "low" is evidence towards this point. You being able to instantly come up with three perfectly good solutions off the top of your head is evidence towards this point.

and there are a lot of tradeoffs to balance

The scale needs new batteries.

and sometimes we adapt on the fly (which is what we're doing here).

I lit a fire and you responded like an MVP with the best attitude here. That's what happened. Credit where credit is due. Just take the compliment. Also credit to @crashr for his excellent warning, even though it wasn't taken seriously.

This name change should tide us over for a while, and hopefully we won't need another one until we move to the supercommand (and that one should be more backwards-compatible).

Let's prepare for that

HanClinto · 2024-07-03T18:58:48Z

You're not entirely wrong.

What am I wrong about?

The personal deprecations and insinuations are getting tiresome. I'm glad that you appreciate my approach to customer service on bug reports, but aspersions cast at the other volunteers on this project (even if it's only by way of complimenting me to the isolation of others) is not helping me or anyone else. I'm a volunteer here, as is most everyone else, and we all need to strive to be professional and gracious.

A bit of grace and professional courtesy go a long way here -- it's a messy situation, and berating other devs with blame-and-shame is not going to help me do anything other than walk away for a while.

Thank you (and others) for the issue reports, but please recognize that continuing in this vein will wear me (and others) thin.

HanClinto · 2024-07-03T19:08:41Z

Nice censorship. You've lost my respect, "repo owner"

It's getting personal, off-topic, and starting to go in circles. If you want to continue this argument, there are more appropriate venues.

Thank you.

oldgithubman · 2024-07-03T19:13:45Z

You're not entirely wrong.

What am I wrong about?

The personal deprecations and insinuations are getting tiresome. I'm glad that you appreciate my approach to customer service on bug reports, but aspersions cast at the other volunteers on this project (even if it's only by way of complimenting me to the isolation of others) is not helping me or anyone else. I'm a volunteer here, as is most everyone else, and we all need to strive to be professional and gracious.

A bit of grace and professional courtesy go a long way here -- it's a messy situation, and berating other devs with blame-and-shame is not going to help me do anything other than walk away for a while.

Thank you (and others) for the issue reports, but please recognize that continuing in this vein will wear me (and others) thin.

This post is so off-base I'm not even going to bother anymore. My work here is done (for now)

HanClinto · 2024-07-03T19:15:41Z

?That post might have been a mistake. Was I censored? I can't even remember. I edited before you responded

It was a repeat of a question aimed at Crashr. It's cluttering up this PR which is (currently) linked as the primary source for information about the name migration. Continuing in circles with personal arguments will make it more difficult for other users to sift through and find relevant information about the change in the future.

Because of the prominence of this PR, anything we can do to keep this thread focused and un-cluttered is good. If you'd like to continue, maybe moving things to a less prominent location would be best (perhaps #8283?).

Thank you.

* Update llama.cpp.rst Llama.cpp just updated their program names, I've updated the article to use the new name. quantize -> llama-quantize main -> llama-cli simple -> llama-simple [Check out the PR](ggerganov/llama.cpp#7809) * Updated llama.cpp.rst ( removed -cml ) * Update llama.cpp.rst --------- Co-authored-by: Ren Xuancheng <[email protected]>

* Adding a simple program to provide a deprecation warning that can exist to help people notice the binary name change from #7809 and migrate to the new filenames. * Build legacy replacement binaries only if they already exist. Check for their existence every time so that they are not ignored.

…rganov#8283) * Adding a simple program to provide a deprecation warning that can exist to help people notice the binary name change from ggerganov#7809 and migrate to the new filenames. * Build legacy replacement binaries only if they already exist. Check for their existence every time so that they are not ignored.

hgftrdw45ud67is8o89 · 2024-07-16T03:07:44Z

personally i don't think this change is needed? Because i thought the old scripts or docs is already stable? But definitely good to make it neat.

But,If its going to be renamed.
We might as well use less confusing name it would be better to just prepend "llamacpp" instead of "llama".
as llama commonly refers to the "llama" arch and not "llamacpp". It have the equal ambiguous level of "main" or "run" or "llm" or "mistral".

Did no one noticed this problem?

HanClinto · 2024-07-16T17:19:15Z

personally i don't think this change is needed? Because i thought the old scripts or docs is already stable? But definitely good to make it neat.

But,If its going to be renamed. We might as well use less confusing name it would be better to just prepend "llamacpp" instead of "llama". as llama commonly refers to the "llama" arch and not "llamacpp". It have the equal ambiguous level of "main" or "run" or "llm" or "mistral".

Did no one noticed this problem?

llamacpp was suggested on June 8th, and discussed further on June 9th. Final decision was made on June 10th.

I agree with you that llamacpp feels better (at least to me), but I'm still quite content with llama and I'm in support. Moving away from the main name is a good change.

oldgithubman · 2024-07-17T01:50:59Z

personally i don't think this change is needed?

Agreed. I can see some small benefit, but it certainly wasn't worth the damage done. Had it been done in a manner that didn't break anything (and yes, this was very possible), I would have supported it.

Because i thought the old scripts or docs is already stable?

Well, stable might be a stretch, but not breaking things is obviously more-stable.

But definitely good to make it neat.

If you do it without breaking things.

But,If its going to be renamed. We might as well use less confusing name it would be better to just prepend "llamacpp" instead of "llama".

Agreed, but IMO, not prepending everything with the same thing is even better. Even better is to not fix what isn't broken, so I think it's best to just leave it alone now.

as llama commonly refers to the "llama" arch and not "llamacpp". It have the equal ambiguous level of "main" or "run" or "llm" or "mistral".

Agreed. I don't think the usage of "llama" really makes sense at all anymore for this project, but again, better to leave it now.

Did no one noticed this problem?

It never made sense to me, but I assumed it was used for a good reason

hgftrdw45ud67is8o89 · 2024-07-17T14:00:14Z

Thanks @HanClinto for giving me reference. My bad for not noticing /reading the full convo.
Its fine to keep it as it is. I just naively thought that no one suggested that idea or noticed the flaw.

In this PR, I propose some changes: - Update binary name to `llama-cli` (for more details, see this PR: ggerganov/llama.cpp#7809 and this [homebrew formula](https://github.com/Homebrew/homebrew-core/blob/03cf5d39d8bf27dfabfc90d62c9a3fe19205dc2a/Formula/l/llama.cpp.rb)) - Add method to download llama.cpp via pre-built release - Split snippet into 3 sections: `title`, `setup` and `command` - Use `--conversation` mode to start llama.cpp in chat mode (chat template is now supported, ref: ggerganov/llama.cpp#8068) --- Proposal for the UI: (Note: Maybe the 3 sections title - setup - command can be more separated visually) ![image](https://github.com/huggingface/huggingface.js/assets/7702203/2bd302f0-88b1-4057-9cd3-3cf4536aae50)

ochafik added 6 commits June 6, 2024 15:28

main/server: rename to llama / llama-server for consistency w…

8498429

…/ homebrew

server: update refs -> llama-server

f298cc6

gitignore llama-server

server: simplify nix package

f5f19a2

main: update refs -> llama

8b7c734

fix examples/main ref

main/server: fix targets

9a03341

update more names

8695bae

Update build.yml

a0a7f2b

Merge remote-tracking branch 'origin/master' into bins

fbd8313

ggerganov reviewed Jun 7, 2024

View reviewed changes

server Outdated Show resolved Hide resolved

ochafik added 3 commits June 7, 2024 09:40

rm accidentally checked in bins

99df4cc

update straggling refs

7fbe600

Update .gitignore

af8f016

NoumaanAhamed mentioned this pull request Jul 3, 2024

Update llama.cpp.rst QwenLM/Qwen2.5#739

Merged

HanClinto added a commit to HanClinto/llama.cpp that referenced this pull request Jul 3, 2024

Adding a simple program to provide a deprecation warning that can exi…

8445732

…st to help people notice the binary name change from ggerganov#7809 and migrate to the new filenames.

HanClinto mentioned this pull request Jul 3, 2024

Deprecation warning to assist with migration to new binary names #8283

Merged

4 tasks

Repository owner deleted a comment from oldgithubman Jul 3, 2024

arrizalamin mentioned this pull request Jul 23, 2024

Update llama.cpp tangledgroup/llama-cpp-wasm#13

Open

HanClinto mentioned this pull request Jul 23, 2024

Issue: HuggingFace Documentation Refers to Outdated Binaries #8659

Closed

fedric95 mentioned this pull request Aug 6, 2024

Feature Request: Proper Llama 3.1 Support in llama.cpp #8650

Closed

4 tasks

maxim-saplin mentioned this pull request Aug 25, 2024

OpenAI compatible chat completions endpoint microsoft/T-MAC#28

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

ochafik commented Jun 6, 2024 •

edited

Loading

ngxson commented Jul 2, 2024

oldgithubman commented Jul 3, 2024 •

edited

Loading

crashr commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

crashr commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

HanClinto commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

HanClinto commented Jul 3, 2024 •

edited

Loading

hgftrdw45ud67is8o89 commented Jul 16, 2024

HanClinto commented Jul 16, 2024

oldgithubman commented Jul 17, 2024

hgftrdw45ud67is8o89 commented Jul 17, 2024

build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

Conversation

ochafik commented Jun 6, 2024 • edited Loading

ngxson commented Jul 2, 2024

oldgithubman commented Jul 3, 2024 • edited Loading

crashr commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

crashr commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

HanClinto commented Jul 3, 2024

HanClinto commented Jul 3, 2024

oldgithubman commented Jul 3, 2024

HanClinto commented Jul 3, 2024 • edited Loading

hgftrdw45ud67is8o89 commented Jul 16, 2024

HanClinto commented Jul 16, 2024

oldgithubman commented Jul 17, 2024

hgftrdw45ud67is8o89 commented Jul 17, 2024

`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... #7809

ochafik commented Jun 6, 2024 •

edited

Loading

oldgithubman commented Jul 3, 2024 •

edited

Loading

HanClinto commented Jul 3, 2024 •

edited

Loading