XingruiWang

Xingrui Wang XingruiWang

CS PhD @ Johns Hopkins University Multimodal & Spatial reasoning

31 followers · 19 following

Johns Hopkins University
Baltimore, MD
https://xingruiwang.github.io/
@XingruiWang

XingruiWang/README.md

Hi there, this is Xingrui. 👋

📫 How to reach me: XingrWang@gmail.com
😄 Main interest: AI, Computer Vision, Machine Learning ...
👯 More about me: https://xingruiwang.github.io/

Pinned

Spatial457 Public

[CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning.

Python 20 3

open-compass/VLMEvalKit Public

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4.1k 699

KeyVID Public

Official code for the paper "KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation".

Python 6

XModBench Public

XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Python 5

DynSuperCLEVR Public

A video question answering dataset that focuses on the dynamic properties of objects (velocity, acceleration) and their collisions within 4D scenes.

Python 20

3D-Aware-VQA Public

Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"

Jupyter Notebook 21