⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
chatbot
stable-diffusion
large-language-model
chatpdf
llm-inference
smoothquant
4-bits
speculative-decoding
llm-cpu
streamingllm
attention-sink
intel-optimized-llamacpp
neural-chat
-
Updated
Nov 25, 2023 - C++

