Skip to content
View shjwudp's full-sized avatar
  • Beijing, China

Organizations

@BaguaSys

Block or report shjwudp

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. c4-dataset-script c4-dataset-script Public

    Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

    Python 136 18

  2. megabyte megabyte Public

    A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture has the excellent features of tokenization-free and sub-quadratic attention. The paper link: https://arxiv.org/abs/23…

    Python 11 4

  3. BaguaSys/bagua BaguaSys/bagua Public

    Bagua Speeds up PyTorch

    Python 882 81

  4. BaguaSys/bagua-net BaguaSys/bagua-net Public archive

    High performance NCCL plugin for Bagua.

    Rust 15 4

  5. shu shu Public

    中文书籍收录整理, Collection of Chinese Books

    Python 215 47

  6. blueprint-trainer blueprint-trainer Public

    Scaffolding for sequence model training research.

    Python 1