Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Java
Updated Feb 27, 2019
Hadoop, Docker, Kafka, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, D…
Python
Updated Apr 24, 2019
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Python
Updated Jan 22, 2019
HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, Hortonworks, Cloudera, MapR, My…
Shell
Updated Mar 19, 2019
CDH安装手册
Updated Apr 15, 2019
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
Shell
Updated Apr 30, 2019
Modeling Lifecycle with ACME Occupancy Detection and Cloudera
Scala
Updated Sep 6, 2017
Perl Utility Library for my other repos
Perl
Updated Mar 23, 2019
Homebrew Formulas for cloudera tools
Ruby
Updated Sep 14, 2018
A small Command Line tool to create an Kudu table from an Avro schema or from SQL script
Scala
Updated Jul 4, 2017
Java
Updated Oct 31, 2017
Create Greenplum docker files
Python
Updated Dec 26, 2018
CDH compliant Apache Phoenix
CDH5.15.1 离线安装脚本
Shell
Updated Apr 26, 2019
Shell
Updated Apr 25, 2019
Apache Zeppelin parcel and CSD for Cloudera Manager
Shell
Updated May 18, 2018
Example Apache NiFi to CDSW
Python
Updated Feb 6, 2019
Simple example applying Keras, TensorFlow to Nostradamus's prophecies with Cloudera Data Science Workbench
Python
Updated Oct 26, 2017
Connect c/c++ application to HDFS managed by Cloudera/CDH
C
Updated Jan 11, 2019
Parcels repository for Apache Airflow
Dockerfile
Updated Jan 24, 2019
Complete Learning repo for Data and Devops Engineers.
Java
Updated Mar 22, 2019
Apache Airflow parcel and CSD for Cloudera Manager
Shell
Updated Jan 4, 2019
Easily connect to multiple Hadoop clusters
Java
Updated Apr 26, 2019
A Vagrant setup to run a virtual Cloudera cluster
Puppet
Updated Jul 12, 2016
Using Apache MXNet GluonCV YOLO with Apache NiFi and Cloudera Data Science Workbench
Python
Updated Feb 9, 2019
cdh with spark 2.2
Shell
Updated Jul 9, 2018
This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the fo…
Jupyter Notebook
Updated May 4, 2018
Settings for using Jupyter hub/notebook with CDH and Spark 2.3
Updated Sep 19, 2018
phData Retirement Age Hadoop row based data lifecycle management
Scala
Updated Sep 19, 2018
cloudera manager automation scripts devops hadoop
Python
Updated Aug 2, 2017