Skip to content

Issues: ggerganov/llama.cpp

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or ⇧ + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Support BitNet b1.58 ternary models enhancement New feature or request Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
#5761 opened Feb 28, 2024 by igorbarshteyn
Support CoreML like whisper.cpp? help wanted Extra attention is needed macos Issues specific to macOS performance Speed related topics
#1714 opened Jun 6, 2023 by realcarlos
Phi 3 medium/small support enhancement New feature or request
#7439 opened May 21, 2024 by bartowski1182
Bug: moondream2 inference not correct (severe quality degradation compared to reference) bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8037 opened Jun 20, 2024 by cmp-nct
Bug: Quantizing Llama 3.1 70B to Q4_K_S with imatrix gives NaN bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8661 opened Jul 23, 2024 by bartowski1182
Server: Add prompt processing progress endpoint? enhancement New feature or request help wanted Extra attention is needed server/webui
#6586 opened Apr 10, 2024 by stduhpf
ci : add Apple silicon (M1) macOS runners good first issue Good for newcomers testing Everything test related
#3469 opened Oct 4, 2023 by ggerganov
ggml : add GPU support for Mamba models enhancement New feature or request help wanted Extra attention is needed Nvidia GPU Issues specific to Nvidia GPUs
#6758 opened Apr 19, 2024 by ggerganov
server: Bring back multimodal support enhancement New feature or request llava LLaVa and multimodal server
#8010 opened Jun 19, 2024 by ngxson
Bug: QWEN2 quantization GGML_ASSERT bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#7805 opened Jun 6, 2024 by bartowski1182
Bug: abort on Android (pixel 8 pro) android Issues specific to Android bug Something isn't working high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8109 opened Jun 25, 2024 by nivibilla
Bug: b3383 breaks Llama 3.1 bug-unconfirmed critical severity Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#8671 opened Jul 24, 2024 by Azirine
Bug: cant finetune bug-unconfirmed critical severity Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#7643 opened May 30, 2024 by cabfile
feature request - disabling tokenizer in conversion / inference enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed
#1765 opened Jun 8, 2023 by genenwoochoi
ProTip! Adding no:label will show everything without a label.