Issues: google/gemma.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
TODO (**Optimize, potentially using new VQSort PartialSort**)
#121
opened Mar 29, 2024 by
enum-class
Add Self-Extend to the gemma.cpp
enhancement
New feature or request
#60
opened Feb 28, 2024 by
namtranase
[Suggestions] Low effort OpenMP, OpenACC, CBLAS compatible CPU & GPU acceleration + other improvements
enhancement
New feature or request
#28
opened Feb 24, 2024 by
trholding
[Feature request] Add quantization methods
enhancement
New feature or request
#17
opened Feb 23, 2024 by
namtranase
Generate compressed weights file from finetune
enhancement
New feature or request
#11
opened Feb 22, 2024 by
sanjay920
[Feature request] Add simple HTTP API server like in llama.cpp with api like OpenAI
enhancement
New feature or request
#1
opened Feb 21, 2024 by
pythops
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.

