Add Llama #199
@jonatanklosko Seems there's something with the tokenizer we don't support?
@seanmor5 perhaps we need to bump rust tokenizers in
@philss I tried bumping rust tokenizers to 0.13.3, but I got:
Are they missing feature flags in tokenizers? Adding an explicit dependency on onig didn't help either, but I don't know what I'm doing :)
@jonatanklosko Not sure, I got the same thing
@jonatanklosko I think the problem is that they are declaring But they are "requiring" it in the code as if it were always available.
tokenizers = { version = "0.13.3", default-features = false, features = ["default"]}
After that, run
@jonatanklosko another fix is to use the
tokenizers = { version = "0.13.3", default-features = false, features = ["unstable_wasm"]}
This one just needs a new tokenizers release now!
@seanmor5 I'm working on that :)
@seanmor5 @jonatanklosko version
@philss all passing, thanks
Co-authored-by: Jonatan Kłosko <[email protected]>
A thought on usability: how do commercial users easily discover that Llama (and its finetunes) are GPL v3, which is usually not license-compatible, compared to the Apache 2.0 license one usually expects in Elixir OSS?
Fair point, though I'm not sure there's anything specific we should do. A commercial user should always check the Hugging Face repository for the license and other details. The Elixir codebase is one thing, but fetching a model from Hugging Face is clearly downloading external files and should be treated as such. Technically, if someone trained Llama from scratch using HF transformers, I believe it could be distributed under a more permissive license.
Adds the popular Llama model. The model builds and the parameter map is correct, though the test is not passing. I'll look deeper into why the values differ tomorrow.