A curated list of action recognition and related area resources
Updated Jan 18, 2022
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
An open-source toolbox for action understanding based on PyTorch
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB- and skeleton-based action recognition models, and practical applications for video tagging and sports action detection.
Temporal Segment Networks (TSN) in PyTorch
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Temporal action detection with SSN
Awesome Grounding: a curated list of research papers in visual grounding
A collection of recent video understanding datasets, under construction!
Temporal Segments LSTM and Temporal-Inception for Activity Recognition
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)
Source code for "Taming Visually Guided Sound Generation" (Oral at BMVC 2021)
Dataset, code, and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction" and for the ECCV'20 SimAug paper.
ActionVLAD for video action classification (CVPR 2017)
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral
Tools for movie and video research
The 2nd-place solution to the YouTube-8M Video Understanding Challenge by Team Monkeytyping (based on TensorFlow)