xihuai18

Follow

💭

I may be slow to respond.

Xihuai Wang xihuai18

💭

I may be slow to respond.

Follow

😭Youth is paid.

86 followers · 64 following

Shanghai Jiao Tong University
Shanghai, China
https://xihuai18.github.io/

Achievements

Achievements

Highlights

Pro

xihuai18/README.md

Hi Here!

I am now focusing on

Large Language Models, including
- Reinforcement Learning for Reasoning and Agency of LLMs;
- Human-LLMs collaboration in Decision-making Tasks.
Reinforcement Learning
Multi-agent System, especially
- Efficiency of Cooperative Multi-agent Reinforcement Learning;
- Zero-shot Generalization Ability in Cooperative Multi-agent Systems.

Pinned Loading

apexrl/AORPO apexrl/AORPO Public

Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.

Python 23 2
A2PO-ICLR2023 A2PO-ICLR2023 Public

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Python 32
sjtu-marl/ZSC-Eval sjtu-marl/ZSC-Eval Public

This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. Pre-trained Agent Zoo: https://huggingface.co/Leoxxxxh/ZSC-Ev…

JavaScript 56 14
sjtu-marl/DPT-Agent sjtu-marl/DPT-Agent Public

This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collaboration."

Python 59 8