The Wayback Machine - https://web.archive.org/web/20230730064947/https://github.com/topics/llm-inference
llm-inference

Here are 53 public repositories matching this topic...

A toolkit that extends the Hugging Face transformers APIs for Transformer-based models and improves the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve inference efficiency on Intel platforms.

  • Updated Jul 28, 2023
  • C++