Skip to content
View jiahe7ay's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report jiahe7ay

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. MINI_LLM MINI_LLM Public

    This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

    Python 503 74

  2. MiniCharacterLLM MiniCharacterLLM Public

    这是一个一键让小参数大模型进行角色扮演的项目,从数据构成和训练都包含在这项目中

    Python 25 2

  3. Chinese-miniMamba Chinese-miniMamba Public

    This is a project on training a large language model on Chinese corpora using the Mamba architecture. Its aim is to explore the potential capabilities of the Mamba architecture on Chinese corpora.

  4. infini-mini-transformer infini-mini-transformer Public

    This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.

    Python 59 3