Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: split gguf files support #214

Merged
merged 24 commits into from
May 8, 2024
Merged

feat: split gguf files support #214

merged 24 commits into from
May 8, 2024

Conversation

giladgd
Copy link
Contributor

@giladgd giladgd commented May 6, 2024

Description of change

  • feat: split gguf files support
  • feat: pull command
  • feat: stopOnAbortSignal and customStopTriggers on LlamaChat and LlamaChatSession
  • feat: checkTensors parameter on loadModel
  • feat: improve Electron support
  • fix: more efficient max context size finding algorithm
  • fix: make embedding-only models work correctly
  • fix: perform context shift on the correct token index on generation
  • fix: make context loading work for all models on Electron
  • refactor: simplify LlamaText implementation
  • docs: update Vitepress, add code type information on hover
  • docs: update simple usage
  • docs: proper docs for LlamaText

The context loading issue on Electron was fixed via ggerganov/llama.cpp#7078

Fixes #211
Fixes #212

Pull-Request Checklist

  • Code is up-to-date with the master branch
  • npm run format to apply eslint formatting
  • npm run test passes with this change
  • This pull request links relevant issues as Fixes #0000
  • There are new or updated unit tests validating the change
  • Documentation has been updated to reflect this change
  • The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)

@giladgd giladgd requested a review from ido-pluto May 6, 2024 23:39
@giladgd giladgd self-assigned this May 6, 2024
Copy link
Contributor

@ido-pluto ido-pluto left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@giladgd giladgd merged commit 453c162 into beta May 8, 2024
10 of 11 checks passed
@giladgd giladgd deleted the gilad/splitGgufFilesSupport branch May 8, 2024 23:03
Copy link

github-actions bot commented May 9, 2024

🎉 This PR is included in version 3.0.0-beta.18 🎉

The release is available on:

Your semantic-release bot 📦🚀

@giladgd giladgd mentioned this pull request May 9, 2024
17 tasks
Copy link

github-actions bot commented Sep 24, 2024

🎉 This PR is included in version 3.0.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

Successfully merging this pull request may close these issues.

2 participants