bug
Something isn't working
help wanted
Extra attention is needed
good first issue
Good for newcomers
triaged
A team member looked at the bug, acknowledged and triaged it. Expect a reply soon.
#
captioning
Here are 46 public repositories matching this topic...
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
-
Updated
Oct 18, 2019 - Python
Fully-Convolutional Point Networks for Large-Scale Point Clouds
deep-neural-networks
computer-vision
deep-learning
point-cloud
point-clouds
semantic-segmentation
meshes
3d
captioning
-
Updated
Mar 22, 2019 - Python
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
python
video
transformers
python3
pytorch
lstm
question-answering
attention
convolutional-neural-networks
causality
multi-modal
reasoning
captioning
dense-captioning
common-sense
captioning-videos
self-attention
resnets
videoqa
distilling-the-knowledge
-
Updated
May 3, 2021 - Python
A Base Tensorflow Project for Medical Report Generation
-
Updated
Jun 16, 2019 - Python
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
nlp
video
vision
srl
captioning
captioning-videos
vision-and-language
grounding
video-language
event-relations
semantic-roles
-
Updated
Aug 17, 2021 - Python
Python code for handling the Clotho dataset.
audio
natural-language-processing
deep-learning
audio-signal-processing
captioning
audio-captioning
clotho-dataset
-
Updated
Nov 24, 2020 - Python
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment
pytorch
video-processing
lstm
representation-learning
action-recognition
video-understanding
c3d
video-captioning
captioning
fine-grained-classification
multitask-learning
dilated-convolution
action-quality-assessment
mtl-aqa
fine-grained-action-recognition
dilated-c3d
-
Updated
Jul 28, 2021 - Python
A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
-
Updated
Nov 8, 2020 - Python
Audio captioning baseline system for DCASE 2020 challenge.
machine-learning
deep-neural-networks
deep-learning
signal-processing
audio-signal-processing
captioning
dcase
machine-listening
audio-captioning
dcase2020
-
Updated
Jun 7, 2021 - Python
My notes on some Deep Learning papers
-
Updated
Dec 8, 2018 - HTML
-
Updated
Nov 15, 2017 - Jupyter Notebook
Toolkit for supporting the EBU-TT Live specification
-
Updated
Feb 11, 2022 - Python
A Tennis dataset and models for event detection & commentary generation
machine-learning
video
computer-vision
mxnet
dataset
tennis
gluon
sportsanalytics
fine-grained
captioning
eventdetection
-
Updated
Aug 17, 2020 - Python
S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow
-
Updated
Apr 26, 2018 - Python
Official python implementation of R3-Transformer
-
Updated
Nov 30, 2020 - Python
CaMEL: Mean Teacher Learning for Image Captioning. arXiv 2022.
-
Updated
Mar 2, 2022 - Python
SimpleSubtitleEditor for Blender
-
Updated
Jan 4, 2018 - Python
Some papers about *diverse* image (a few videos) captioning
-
Updated
Nov 23, 2021
Smart-I is an android application aimed at helping the visually impaired using artificial intelligence and cloud computing.
visualization
android
cloud
deep-neural-networks
deep-learning
android-application
captions
caption
cloud-computing
image-recognition
android-app
captioning-images
andorid
captioning
-
Updated
Nov 17, 2019 - Python
Sample app to display live captioning to a WebRTC video session with the Deepgram API.
-
Updated
Nov 22, 2021 - JavaScript
[CVPR2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
-
Updated
Mar 8, 2022
Indonesian Image Captioning using Attention-based Semantic Compositional Networks
-
Updated
Jul 31, 2019 - Jupyter Notebook
Python program to generate memes.
-
Updated
Mar 12, 2022 - Jupyter Notebook
Online professional courses that are captioned and/or subtitled
accessibility
captions
subtitles
courses
airtable
online-course
subtitling
captioning
captioning-videos
-
Updated
Mar 2, 2019
-
Updated
Oct 19, 2020
Sample app demonstrating adding live captions to Twilio Video rooms
-
Updated
Oct 19, 2021 - JavaScript
Tools for the evaluation of audio captioning.
-
Updated
May 23, 2020 - Jupyter Notebook
A public repository with key information about the EBU Timed Text (EBU-TT) format.
-
Updated
Feb 24, 2021
Automatically describing the content of an image in Persian
natural-language-processing
computer-vision
deep-learning
tensorflow
image-processing
image-captioning
deeplearning
captioning-images
captioning
-
Updated
Feb 1, 2022 - Jupyter Notebook
Improve this page
Add a description, image, and links to the captioning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the captioning topic, visit your repo's landing page and select "manage topics."


File "/home/ubuntu/vqa/GMN/mmf/mmf/datasets/builders/visual_genome/dataset.py", line 44, in init
scene_graph_file = self._get_absolute_path(scene_graph_file)
AttributeError: 'VisualGenomeDataset' object has no attribute '_get_absolute_path'
Command that i run in shell
CUDA_VISIBLE_DEVICES="0" mmf_run config=projects/gmn/configs/visual_genome/defaults.yaml model=gm