Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upPinned repositories
Repositories
-
ratatool
A tool for data sampling, data generation, and data diffing
-
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
-
missinglink
Build time tool for detecting link problems in java projects
-
JniHelpers
Tools for writing great JNI code
-
NFPlayerJS
A JavaScript/TypeScript audio engine for the Web and Server capable of multitrack time stretching, pitch shifting, declarative effects, faster than realtime processing, and more!
-
styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
-
klio
Smarter data pipelines for audio.
-
SPTDataLoader
The HTTP library used by the Spotify iOS client
-
-
web-scripts
A collection of base configs and CLI wrappers used to speed up development @ Spotify.
-
-
-
featran
A Scala feature transformation library for data science and machine learning
-
completable-futures
Utilities for working with futures in Java 8
-
-
-
magnolify
A collection of Magnolia add-on modules
-
big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
-
-
-
dbeam
DBeam exports SQL tables into Avro files using JDBC and Apache Beam
-
zoltar
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
-
bucket-runner
Run a command in parallel, distributing the input files to each.
-
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-

