Skip to content
View dingn42's full-sized avatar
:electron:
:electron:
  • Tsinghua University
  • Earth

Organizations

@thunlp @TsinghuaC3I @PRIME-RL

Block or report dingn42

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dingn42/README.md

Hi

Pinned Loading

  1. PRIME-RL/SimpleVLA-RL PRIME-RL/SimpleVLA-RL Public

    [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 1.7k 106

  2. PRIME-RL/PRIME PRIME-RL/PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.9k 112

  3. PRIME-RL/TTRL PRIME-RL/TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 1.1k 83

  4. thunlp/UltraChat thunlp/UltraChat Public

    Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

    Python 2.8k 138

  5. OpenBMB/MiniCPM OpenBMB/MiniCPM Public

    MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

    Jupyter Notebook 8.9k 580

  6. thunlp/OpenPrompt thunlp/OpenPrompt Public

    An Open-Source Framework for Prompt-Learning.

    Python 4.9k 483