Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Jupyter Notebook
Updated Sep 6, 2017
Scala
Updated Aug 13, 2015
dllib is a distributed deep learning library running on Apache Spark
CSS
Updated Oct 26, 2017
Spark-Transformers: Library for exporting Apache Spark MLlib models to use them in any Java application with no ot…
Java
Updated Dec 15, 2017
Visualizes the Random Forest debug string from Spark MLlib using D3.js
Python
Updated Jan 11, 2018
Feature engineering toolkit for Spark MLlib.
Scala
Updated Apr 1, 2017
Kaggle machine learning with Spark
Python
Updated May 31, 2017
pyspark
Python
Updated Jun 20, 2017
Slides, code and more for my class: Data Analytics and Machine Learning on Big Data
Jupyter Notebook
Updated Dec 21, 2017
Data Analytics using Spark
Jupyter Notebook
Updated May 17, 2017
Basics of Big Data and Machine Learning using Apache Spark and Scala
Scala
Updated Mar 15, 2017
Implementation of a music recommender using Spark MLlib
Jupyter Notebook
Updated Oct 24, 2017
ML Spark 2.X for Training
Scala
Updated Nov 8, 2017
Prediction of Customer Churn using Spark Mllib
Jupyter Notebook
Updated Sep 10, 2017
Example from Spark MLlib (in Python)
Python
Updated Dec 7, 2017
Learn Spark with Python
Python
Updated Jan 9, 2018
🤖 SNS+iMessager for DeepDetect MicroServices' demonstration.
Objective-C
Updated Feb 1, 2018
Scala
Updated Jan 29, 2018
R
Updated Jul 3, 2017
Spark High Order SVD
Scala
Updated Feb 21, 2018
Tokenizer Analytics Plugin: A transform plugin to split data based on a pattern
Java
Updated Nov 20, 2017
Tutorial on how to use the Python API for Spark dataframes.
HTML
Updated Feb 8, 2018
Recommends movies based on your interests
Scala
Updated Jul 5, 2016
My experiments with various ML tools & methods in R, Python & Spark
Jupyter Notebook
Updated Jan 5, 2018
Clojure Sparkling & statistics, machine learning, kungfu
Clojure
Updated Sep 18, 2017
Movie recommendation Engine using Spark MLlib & Scala
Scala
Updated Jan 4, 2018
This repository contains Spark, MLlib, PySpark and Dataframes projects
Jupyter Notebook
Updated Oct 22, 2017
Small utility that reduces naive decision tree models produced by MLlib
Scala
Updated Dec 31, 2017
Collaborative-filtering (CF) based music recommendation using Spark MLlib
Jupyter Notebook
Updated May 30, 2017
Spark with Scala, including RDDs, transformations, actions, HDFS, Spark SQL, DataFrames and MLlib
XSLT
Updated Feb 8, 2018