Skip to content
View XingruiWang's full-sized avatar
:shipit:
:shipit:

Highlights

  • Pro

Block or report XingruiWang

Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
XingruiWang/README.md

Pinned Loading

  1. Spatial457 Spatial457 Public

    [CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning.

    Python 20 3

  2. open-compass/VLMEvalKit open-compass/VLMEvalKit Public

    Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

    Python 4.1k 699

  3. KeyVID KeyVID Public

    Offical code of paper KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation.

    Python 6

  4. XModBench XModBench Public

    XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

    Python 5

  5. DynSuperCLEVR DynSuperCLEVR Public

    A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions within 4D scenes.

    Python 20

  6. 3D-Aware-VQA 3D-Aware-VQA Public

    Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"

    Jupyter Notebook 21