zzw922cn / Automatic_Speech_Recognition

“Data science” is about as broad a term as they come. It may be easiest to describe it by listing its more concrete components: data exploration and analysis. Included here: Pandas, NumPy, SciPy, and a helping hand from Python's standard library.

python data-science numpy jupyter-notebook data-visualization plotting data-preprocessing panda data-processing data-preparation knn

Updated Apr 15, 2020
Jupyter Notebook

ammsa / DTCleaner

DTCleaner: data cleaning using multi-target decision trees.

data-science data-mining data-wrangling data-preprocessing data-cleaning data-quality

Updated Jun 21, 2016
Java

repetere / modelscript

Data Science and Machine learning in JavaScript

javascript data-science machine-learning data-mining data-preprocessing

Updated Dec 2, 2019
JavaScript

maet3608 / nuts-ml

Flow-based data pre-processing for deep learning

data-science deep-learning deep-learning-library data-preprocessing deep-learning-framework

Updated Nov 15, 2019
Python

KwokHing / YandexCatBoost-Python-Demo

Demo of the capabilities of the Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing, and model tuning are performed on the dataset.

visualization python seaborn feature-selection data-preprocessing python27 gradient-boosting-classifier gradient-boosting pearson-correlation one-hot-encode catboost variance-analysis yandex-catboost

Updated Dec 5, 2019
Jupyter Notebook

ELToulemonde / dataPreparation

Data preparation for data science projects.

data-science r variable-selection speed data-preprocessing data-preparation date-conversion variable-elimination

Updated Jul 10, 2020
R

ixlan / machine-learning-data-pipeline

Pipeline module for parallel, real-time data processing, for both development and production use of machine learning models.

python data-science machine-learning natural-language-processing deep-learning algorithms parallel data-preprocessing data-processing computing data-preparation data-pipeline

Updated Nov 13, 2019

ksbg / sparklanes

A lightweight data processing framework for Apache Spark

data-science spark pipeline etl pyspark data-preprocessing preprocessing data-processing

Updated Mar 30, 2020
Python

suraj-maniyar / Stock-Trading-Using-Machine-Learning

A comprehensive approach for stock trading implemented using Neural Network and Reinforcement Learning separately.

reinforcement-learning neural-network pca-analysis data-preprocessing

Updated Jun 25, 2018
Python

mdkearns / automated-data-preprocessing

A command-line utility program for automating the trivial, frequently occurring data preparation tasks: missing value interpolation, outlier removal, and encoding categorical variables.

python data-science machine-learning automation interpolation pandas data-engineering imputation argparse outlier-detection command-line-tool data-preprocessing data-processing outlier-removal one-hot-encode

Updated Jan 4, 2019
Python

Ashwin-op / Machine-Learning-Series

Datasets and code for the ML Series

python machine-learning machine-learning-algorithms data-preprocessing introduction blog-series

Updated Apr 14, 2020
Python

elbaulp / DPASF

Final project for my MSc in Data Science: a library of Data Pre-processing Algorithms for Streaming in Flink (DPASF).

scala machine-learning-algorithms feature-selection feature-extraction apache-flink data-preprocessing flink discretization flink-ml

Updated Jul 1, 2019
Scala

orbxball / timit-preprocessor

Extract MFCC vectors and phones from the TIMIT dataset

deep-learning phone speech-recognition data-preprocessing mfcc timit-dataset timit

Updated Mar 20, 2018
Shell

priyanshu1210 / lung-cancer-detection

This is a project based on Data Science Bowl 2017. I did my best to propose a solution to the problem, but I am still new to deep learning, so my solution is not optimal. It can definitely be improved with some fine-tuning and better resources.

data-science machine-learning deep-learning neural-network tensorflow lung-cancer-detection convolutional-neural-networks data-preprocessing data-science-bowl-2017

Updated Sep 16, 2018
Jupyter Notebook

font-bakers / knead

A command line tool for preprocessing, manipulating and serializing font files for deep learning applications.

fonts deep-learning command-line-tool data-preprocessing typefaces

Updated Jul 2, 2019
Python

Pooja-Bhojwani / linked-eed

The aim is to build a job recommender system that takes your skills from LinkedIn and job listings from Indeed and returns the best available jobs for your skills.

python text-mining data-mining data-preprocessing jaccard-similarity social-network-backend job-recommendation skill-algorithm

Updated Oct 3, 2017
Python

sourcecode369 / ml-algorithms-on-scikit-and-keras

Implementation scripts of machine learning algorithms in Scikit-learn and Keras for complete novices.

machine-learning natural-language-processing reinforcement-learning deep-learning clustering numpy scikit-learn keras regression pandas xgboost classification dimensionality-reduction matplotlib t-sne data-preprocessing grid-search association-rule-learning kfold-cross-validation multiarm-bandit

Updated Jul 22, 2018
Jupyter Notebook

agrawal-priyank / Web-Scraper-Sentiment-Analysis-TripAdvisor

Academic project for the Advances in Data Science and Architecture course

data-science text-mining r web-scraping trip-advisor data-preprocessing data-cleaning sentimental-analysis nlp-machine-learning hotel-review-sentiments

Updated Jan 20, 2018
R

Gowtham1729 / Android-App-Malware-Detector

A Deep Learning Model for detecting Malware Applications

python machine-learning deep-neural-networks deep-learning keras android-app data-preprocessing malware-detection

Updated Aug 8, 2018
Python

halil / sau-ml

SAU Machine Learning Training Materials

machine-learning natural-language-processing deep-learning clustering regression artificial-intelligence classification data-preprocessing association-rules data-preparation

Updated Mar 29, 2018
Python

data-preprocessing

Here are 259 public repositories matching this topic...

machinelearnjs / machinelearnjs

msamogh / nonechucks

harunshimanto / 100-Days-Of-ML-Code

TensorMSA / tensormsa

HasnainRaz / SemSegPipeline

dansuh17 / segan-pytorch

thepanacealab / SMMT

danielhanchen / sciblox

nursnaaz / 25DaysInMachineLearning

soumyadip007 / Data-Science-Using-Python-University-Course-Module
