The Wayback Machine - https://web.archive.org/web/20190730130402/https://github.com/topics/big-data
Skip to content
#

big-data

💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
Python Updated Jul 30, 2019
The official home of the Presto distributed SQL query engine for big data
Java Updated Jul 30, 2019
ClickHouse is a free analytic DBMS for big data.

Good first issues

See all
C++ Updated Jul 30, 2019
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, reg…
C++ Updated Jul 30, 2019
Apache CouchDB
Erlang Updated Jul 30, 2019
The most widely used Python to C compiler
Python Updated Jul 30, 2019
Moloch is an open source, large scale, full packet capturing, indexing, and database system.

Good first issues

See all
JavaScript Updated Jul 29, 2019
Vespa is an engine for low-latency computation over large data sets.
Java Updated Jul 30, 2019
Loading…
You can’t perform that action at this time.