Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
This repository contains code for YOLO v3 Object detection, and is capable of fast object detection. Input can be given through images, videos and webcam input feed.
A deep learning based application which is entitled to help the visually impaired people. The application automatically generates the textual description of what's happening in front of the camera and conveys it to person through audio. It is capable of recognising faces and tell user whether a known person is standing in front of him or not.
A system to process visual input on timed frames to produce sensible audio aid in accordance with human information processing limits, using image captioning, semantic text comparison and text-to-speech modules.
MS-COCO-ES is a dataset created from the original MS-COCO dataset. This project aims to provide a small subset of the original image captions translated into Spanish by humans annotators. This subset is composed by 20,000 captions of 4,000 images.