Skip to content
View xihuai18's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report xihuai18

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
xihuai18/README.md

Hi Here!

I am now focusing on

  • Large Language Models, including
    • Reinforcement Learning for Reasoning and Agency of LLMs;
    • Human-LLMs collaboration in Decision-making Tasks.
  • Reinforcement Learning
  • Multi-agent System, especially
    • Efficiency of Cooperative Multi-agent Reinforcement Learning;
    • Zero-shot Generalization Ability in Cooperative Multi-agent Systems.

Pinned Loading

  1. apexrl/AORPO apexrl/AORPO Public

    Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.

    Python 23 2

  2. A2PO-ICLR2023 A2PO-ICLR2023 Public

    Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

    Python 32

  3. sjtu-marl/ZSC-Eval sjtu-marl/ZSC-Eval Public

    This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. Pre-trained Agent Zoo: https://huggingface.co/Leoxxxxh/ZSC-Ev…

    JavaScript 56 14

  4. sjtu-marl/DPT-Agent sjtu-marl/DPT-Agent Public

    This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collaboration."

    Python 59 8