The Wayback Machine - https://web.archive.org/web/20240106131821/https://github.com/FasterDecoding
Skip to content
@FasterDecoding

FasterDecoding

Think deeper, decode faster

Pinned

  1. Medusa Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook 1.3k 73

Repositories

Showing 2 of 2 repositories
  • Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook 1,302 Apache-2.0 73 16 1 Updated Dec 23, 2023
  • REST Public

    REST: Retrieval-Based Speculative Decoding

    C 93 Apache-2.0 4 1 0 Updated Nov 17, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…