ggml-org / llama.cpp (Public)
91.2k stars · 14.1k forks · Issues: 335 · Pull requests: 617 · Projects: 10
Pinned issues
- changelog : libllama API · #9289 · ggerganov, opened on Sep 3, 2024 · 12
- changelog : llama-server REST API · #9291 · ggerganov, opened on Sep 3, 2024 · 18
- tutorials : list for llama.cpp · #13523 · ggerganov, opened on May 14, 2025 · 16
Issues · search: state:open label:"generation quality"
Search results (Open / Closed)
quantize : configurable neutral imatrix prior
Labels: examples · generation quality (Quality of model output) · need feedback (Testing and feedback with results are needed) · research 🔬
Status: Draft (not ready).
#15060 in ggml-org/llama.cpp · compilade, opened on Aug 3, 2025
ggml-quants : weighted rounding algorithms with cumulative search
Labels: generation quality (Quality of model output) · ggml (changes relating to the ggml tensor library for machine learning) · Less than 4 bits (Efforts related to viable quantized models using <4 bits) · research 🔬 · Review Complexity: Medium (Generally requires more time to grok, but manageable at beginner to medium expertise level) · Tensor Encoding Scheme (https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes)
Status: Draft (not ready).
#12557 in ggml-org/llama.cpp · compilade, opened on Mar 25, 2025
Smooth Sampling / Quadratic Sampling support
Labels: generation quality (Quality of model output) · performance (Speed related topics) · Review Complexity: High (Generally requires in-depth knowledge of LLMs or GPUs)
Status: Open (in progress).
#6445 in ggml-org/llama.cpp · kalomaze, opened on Apr 2, 2024
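The idea behind quadratic ("smooth") sampling is to reshape the logits with a quadratic curve centered on the maximum logit, so probability mass concentrates on the strongest candidates without a hard cutoff. The sketch below is a generic illustration of such a transform, not necessarily the exact formula from the proposal; the function names and the `smoothing_factor` parameter are illustrative:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def quadratic_smoothing(logits, smoothing_factor=0.5):
    # Pull each logit down by the squared distance from the maximum.
    # Larger smoothing_factor sharpens the distribution around the
    # top candidate; 0.0 leaves the logits unchanged.
    m = max(logits)
    return [x - smoothing_factor * (m - x) ** 2 for x in logits]

logits = [3.0, 2.0, 1.0, 0.0]
sharpened = quadratic_smoothing(logits, smoothing_factor=0.5)
```

Because the transform is monotone for logits at or below the maximum, the ranking of candidates is preserved while the top token's softmax probability rises.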
P-Step Truncation Sampling
Labels: generation quality (Quality of model output) · need feedback (Testing and feedback with results are needed) · refactoring · Review Complexity: High (Generally requires in-depth knowledge of LLMs or GPUs)
Status: Open (in progress).
#5675 in ggml-org/llama.cpp · p-e-w, opened on Feb 23, 2024
[RFC] common, server : add top-a sampler
Labels: enhancement (New feature or request) · generation quality (Quality of model output) · Review Complexity: High (Generally requires in-depth knowledge of LLMs or GPUs)
Status: Open (in progress).
#5612 in ggml-org/llama.cpp · Artefact2, opened on Feb 20, 2024
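Top-a sampling drops every token whose probability falls below a · (p_max)², so the cutoff adapts to how peaked the distribution is: a confident distribution prunes aggressively, while a flat one barely prunes at all. A minimal sketch (function and parameter names are illustrative, not the RFC's API):

```python
def top_a_filter(probs, a=0.3):
    # Keep tokens with probability >= a * max(probs)**2,
    # zero out the rest, and renormalize the survivors.
    p_max = max(probs)
    threshold = a * p_max ** 2
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]

dist = [0.70, 0.15, 0.10, 0.04, 0.01]   # peaked: threshold = 0.3 * 0.49 = 0.147
filtered = top_a_filter(dist, a=0.3)    # only the first two tokens survive
```

Note the quadratic dependence on p_max: halving the top probability quarters the threshold, which is what makes the filter gentle on flat distributions.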
Penalty threshold: A mechanism for improving repetition penalties
Labels: enhancement (New feature or request) · generation quality (Quality of model output) · Review Complexity: Medium (Generally requires more time to grok, but manageable at beginner to medium expertise level)
Status: Open (in progress).
#5561 in ggml-org/llama.cpp · p-e-w, opened on Feb 18, 2024
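For context, the baseline such proposals refine is the classic CTRL-style repetition penalty: a repeated token's logit is divided by the penalty when positive and multiplied by it when negative, so the logit always moves toward less likely. A minimal sketch (names are illustrative):

```python
def apply_repetition_penalty(logits, recent_tokens, penalty=1.3):
    # Penalize every token id seen in the recent context window.
    # Dividing a positive logit and multiplying a negative one both
    # lower the token's probability; penalty == 1.0 is a no-op.
    out = list(logits)
    for t in set(recent_tokens):
        out[t] = out[t] / penalty if out[t] > 0 else out[t] * penalty
    return out

logits = [2.0, -1.0, 0.5]
penalized = apply_repetition_penalty(logits, recent_tokens=[0, 1], penalty=2.0)
```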
llama : combined beam search + grammar sampling strategy
Labels: generation quality (Quality of model output) · good first issue (Good for newcomers) · research 🔬 · roadmap (Part of a roadmap project)
Status: Open.
#2923 in ggml-org/llama.cpp · ggerganov, opened on Aug 31, 2023
llama : tool for evaluating quantization results per layer
Labels: enhancement (New feature or request) · generation quality (Quality of model output) · roadmap (Part of a roadmap project)
Status: Open.
#2783 in ggml-org/llama.cpp · ggerganov, opened on Aug 25, 2023
Implementation of a sequence repetition penalty sampler
Labels: enhancement (New feature or request) · generation quality (Quality of model output) · need feedback (Testing and feedback with results are needed)
Status: Draft (not ready).
#2593 in ggml-org/llama.cpp · KerfuffleV2, opened on Aug 12, 2023
Study how LM Evaluation Harness works and try to implement it
Labels: enhancement (New feature or request) · generation quality (Quality of model output) · help wanted (Needs help from the community) · high priority (Very important issue) · research 🔬
Status: Open.
#231 in ggml-org/llama.cpp · ggerganov, opened on Mar 17, 2023
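Harness-style evaluations ultimately reduce to scoring per-token log-likelihoods; the headline metric, perplexity, is the exponential of the mean negative log-likelihood over a sequence. A minimal sketch with stand-in log-probabilities (no model attached):

```python
import math

def perplexity(token_logprobs):
    # exp of the mean negative log-likelihood over the sequence;
    # lower is better, and a uniform 1/N guess scores exactly N.
    nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(nll)

# A model assigning probability 0.25 to every token scores perplexity 4.
stand_in = [math.log(0.25)] * 10
```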