#
multi-modality
Here are 13 public repositories matching this topic...
Embed images and sentences into fixed-length vectors with CLIP
deep-learning
pytorch
openai
bert
onnx
cross-modality
multi-modality
sentence-encoding
bert-as-service
cross-modal-retrieval
neural-search
clip-model
clip-as-service
-
Updated
Jun 7, 2022 - Python
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
deep-learning
transformers
artificial-intelligence
siren
text-to-image
multi-modality
implicit-neural-representation
-
Updated
Mar 13, 2022 - Python
[ICCV2019] Robust Multi-Modality Multi-Object Tracking
-
Updated
Dec 7, 2019 - Python
This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.
sign-language-recognition-system
sign-language-recognition
multi-modality
cvpr2021
skeleton-features
-
Updated
May 11, 2022 - Python
Unifying Voxel-based Representation with Transformer for 3D Object Detection
-
Updated
Jun 6, 2022 - Python
This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task
visualization
pytorch
transformer
attention
official
multi-modal
clevr
visual-question-answering
vision-and-language
dynamic-network
multi-modality
multi-modal-learning
multi-scale-features
vqav2
iccv2021
local-and-global
-
Updated
Oct 11, 2021 - Python
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
-
Updated
Oct 8, 2021 - Python
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
natural-language-processing
video
computer-vision
transformers
video-captioning
multi-modality
contrastive-learning
video-text-retrieval
-
Updated
Feb 7, 2022 - Python
An official PyTorch implementation of the CRIS paper
-
Updated
Jun 7, 2022 - Python
[IJHCS] An assistant prototype for breast cancer diagnosis prepared with a multimodality strategy. The work was published in the International Journal of Human-Computer Studies.
machine-learning
deep-learning
cancer
deep-reinforcement-learning
artificial-intelligence
assistant
medical-imaging
neural-networks
cancer-imaging-research
multi-modality
-
Updated
May 26, 2022 - JavaScript
Final project for the course LT2318 Artificial Intelligence: Cognitive Systems. The project concerns multimodal hate speech detection in memes.
-
Updated
May 5, 2021 - TeX
Improve this page
Add a description, image, and links to the multi-modality topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the multi-modality topic, visit your repo's landing page and select "manage topics."


Problem: It is pretty challenging to find resource material and valuable articles, videos and such, and we spend a lot of time searching and finding the appropriate resource for us.
Proposed solution: Faceted search can come a long way when looking for a quick way to find a solution designed for our needs. Ratings on the resource can help us select the best solution based on our search