# dataset-generation
Here are 245 public repositories matching this topic...
A plugin for GTAV that transforms it into a vision-based self-driving car research environment.
Updated Jan 14, 2020 - C++
nlp
text-classification
chatbot
nlu
dataset
named-entity-recognition
chatbots
dataset-generation
nlg
chatito
Updated Jan 26, 2021 - TypeScript
NFStream: a Flexible Network Data Analysis Framework.
python
data-science
machine-learning
data-mining
netflow
pcap
packet-analyser
traffic-analysis
artificial-intelligence
cybersecurity
network-monitoring
data-analysis
dataset-generation
network-analysis
packet-capture
ndpi
network-security
deep-packet-inspection
traffic-classification
Updated Jun 10, 2021 - Python
Convert face dataset to masked dataset
Updated Jun 8, 2021 - Python
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
encoding
finance
data-structures
decomposition
model-checking
similarity-measures
dataset-generation
distance-measures
synthesizers
similarity-score
testing-framework
synthetic-data
predictive-maintenance
synthetic-dataset-generation
distance-calculations
dataset-similarity
transformation-recipes
data-transformations
Updated Nov 12, 2020 - Jupyter Notebook
Prepare a VOC dataset for ultralytics/yolov3 and yolov5
Updated Jun 16, 2020 - Python
Image Aesthetics Toolkit - includes Fisher Vector implementation, AVA (Image Aesthetic Visual Analysis) dataset and fast multi-threaded downloader
image
image-processing
live
aesthetics
dataset
ava
dataset-creation
aesthetic
datasets
dataset-generation
image-aesthetic-visual-analysis
fisher-vectors
Updated Oct 28, 2019 - Python
Tools for ASR Corpus Generation from Online Video
Updated Feb 10, 2019 - Python
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
privacy
timeseries
time-series
generative-adversarial-network
gan
datasets
dataset-generation
gans
fidelity
synthetic-data
doppelganger
synthetic-dataset-generation
synthetic-data-generator
synthetic-data-generation
Updated May 26, 2021 - Python
Download the CelebA-HQ dataset easily! Create it with Docker or download it from Google Drive.
docker
docker-image
dataset
docker-image-available
dataset-generation
celeba
celeba-dataset
celeba-hq-dataset
celeba-hq
Updated Aug 22, 2019 - Python
Trainable categorization tool
Updated Sep 8, 2017 - Python
Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.
deep-learning
tensorflow
python3
dataset-generation
character-recognition
character-generator
ocr-recognition
Updated Aug 30, 2018 - Python
Gazebo plugins for applying domain randomization
Updated Oct 17, 2018 - C++
TRScraper is an application developed for use in natural language processing work; it enables text mining on large platforms that host Turkish-language content.
nlp
text-mining
scraper
turkish
selenium
web-scraper
dataset
datasets
dataset-generation
turkce
selenium-python
turkce-kaynak
dogal-dil-isleme
turkish-nlp
Updated Feb 16, 2021 - Python
Buckle up, adventure in the styleGAN2-ada-pytorch network latent space awaits
anime
projection
dataset-generation
latent-space
colab-notebook
stylegan-model
stylegan2
stylegan2-ada
latent-space-interpolation
stylegan2-ada-pytorch
Updated Apr 6, 2021 - Jupyter Notebook
Compiles a JSON dataset from public sources, with properties that aid in the detection and mitigation of over 1000 variants of ransomware.
security
json
security-audit
detection
spreadsheet
ransomware
security-vulnerability
security-hardening
dataset-generation
excel-to-json
mitigation
ransomware-prevention
json-dataset
ransomware-resources
ransomware-summary
prevention
wannacry
Updated Nov 1, 2019 - Python
Download YouTube video description and video comments without using the YouTube API.
Updated Mar 31, 2021 - Python
Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.
Updated May 24, 2020 - Jupyter Notebook
The Synbols dataset generator
Updated Apr 30, 2021 - Python
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
computer-vision
deep-learning
dataset-generation
dialogue-generation
visual-dialog
vision-and-language
Updated Feb 18, 2020 - Python
audio
machine-learning
youtube
video
voice
youtube-dl
ontology
audio-files
dataset
speech-recognition
datasets
dataset-generation
pafy
audioset
voice-computing
Updated Apr 14, 2020 - Python
A Python module to generate large-scale music datasets using both the Spotify and MusixMatch APIs.
data-science
natural-language-processing
spotify-api
dataset-generation
musixmatch-api
audio-processing
Updated Aug 28, 2020 - Python
Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.
python
social-media
csv
tweets
neural-network
dataset
named-entity-recognition
stats
dataset-generation
preprocessing
decision-trees
ner
research-paper
crfsuite
nlp-machine-learning
lstm-neural-networks
f1-score
acl-news2018
hindi-english
ner-tags
Updated Sep 25, 2020 - Python
Convert the Open Images v4 dataset to Pascal VOC XML format. Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes. https://github.com/openimages/dataset
Updated Apr 2, 2020 - Python
Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
dataset
dataset-generation
affective-computing
graph-convolutional-networks
emotion-detection
emotion-recognition
variational-autoencoder
gait
conditional-vae
gait-analysis
gait-recognition
graph-convolutional-neural-networks
conditional-variational-autoencoder
spatial-temporal-action-detection
emotion-perception
Updated Jun 8, 2021 - Python
Procedural 3D data generation pipeline for architecture
Updated Jun 4, 2021 - Python
Updated Apr 30, 2021 - Python
Open issue: Update wiki
pepper-jk commented Feb 18, 2019
This is a quickly written to-do list; feel free to add to it.
- fix main page links
- update attack arguments
- DDoS
- Joomla target.host (URL?)
- SMBScan target.count
- SQLi target.host (URL?)
- MembersMgmtCommAttack
- inject.ip
- ip.reuse.external
- ip.reuse.local
- ip.reuse.total
- [x] add attack arguments


We need description, citation, license, and version meta info to be added to the dataset.
Is your feature request related to a problem?
Some datasets need this info inside them for legal reasons.
If your feature will improve HUB
Easy to implement, won't hurt for sure.
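As a rough illustration of the request above, the four fields could be carried in a small record that serializes alongside the dataset. This is only a sketch: `DatasetMeta`, `meta_to_json`, and `meta_from_json` are hypothetical names invented here, not part of Hub's API, and the example values are placeholders.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class DatasetMeta:
    """Hypothetical container for the four requested metadata fields."""
    description: str = ""
    citation: str = ""
    license: str = ""
    version: str = ""


def meta_to_json(meta: DatasetMeta) -> str:
    # Serialize with sorted keys so the output is stable across runs.
    return json.dumps(asdict(meta), sort_keys=True)


def meta_from_json(text: str) -> DatasetMeta:
    # Rebuild the record from its JSON form; unknown keys raise TypeError.
    return DatasetMeta(**json.loads(text))


# Placeholder values, for illustration only.
meta = DatasetMeta(
    description="Example image dataset",
    citation="(citation placeholder)",
    license="MIT",
    version="1.0.0",
)
restored = meta_from_json(meta_to_json(meta))
```

Keeping the fields in one flat record means they can be stored next to whatever metadata the dataset already has, without changing how the data itself is laid out.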
Description of the possible solution
Currently, we have all metadata store