Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
hudi
Upserts, Deletes And Incremental Processing on Big Data.
pulsar
Apache Pulsar - distributed pub-sub messaging system
maven-source-plugin
Apache Maven Source Plugin
hadoop-ozone
Scalable, redundant, and distributed object store for Apache Hadoop
skywalking
APM, Application Performance Monitoring System
beam
Apache Beam is a unified programming model for Batch and Streaming
groovy
Apache Groovy: A powerful multi-faceted programming language for the JVM platform
submarine
Submarine is Cloud Native Machine Learning Platform.
directory-fortress-core
Mirror of Apache Directory Fortress Core

