Skip to content

DEV Community

# gguf

👋 Sign in for the ability to sort posts by relevant, latest, or top.

May 13

GGUF Quantization Explained: Q4_K_M vs Q5_K_M vs Q8 — Which to Pick (2026)

#llamacpp #gguf #quantization #localai

4 min read

Rost

Apr 27

Llama-Server Router Mode - Dynamic Model Switching Without Restarts

#cheatsheet #gguf #ai #llm

9 min read

Rost

Mar 12

llama.cpp Quickstart with CLI and Server

#cheatsheet #gguf #ai #llm

10 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.