The Wayback Machine - https://web.archive.org/web/20250307173248/https://github.com/jackfsuia
Skip to content
View jackfsuia's full-sized avatar

Block or report jackfsuia

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. nanoRLHF nanoRLHF Public

    RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

    Python 46 8

  2. vecparser vecparser Public

    Vectorize the Matlab/CVX for-loops as much as possible.

    Python 5 1

  3. bert-chunker bert-chunker Public

    bert-chunker: efficient and trained chunking for unstructured documents. 训练Bert做文档分段.

    Python 3 1

  4. Proactive-Sales-Agent Proactive-Sales-Agent Public

    Proactive LLM sales agent server. 主动销售的大模型销售Agent服务器。

    Python 6 1

  5. cvx-coder cvx-coder Public

    LLM good at answering questions and coding about Matlab CVX. 微调大模型写CVX代�?�,回答CVX问题。

    1