apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
A minimal, idiomatic Scala interface for HTTP
Chisel 3: A Modern Hardware Design Language
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
♞ lichess.org: the forever free, adless and open source chess server ♞
Rocket Chip Generator
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in the Scala language
Lightweight, modular, and extensible library for functional programming.
Apache Spark Connector for Azure Cosmos DB
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Open-source high-performance RISC-V processor
CMAK is a tool for managing Apache Kafka clusters
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
A Scala API for Apache Beam and Google Cloud Dataflow.
The pure asynchronous runtime for Scala
The Daml smart contract language
Migration tools for TiKV, e.g. online bulk load.
The Scala 3 compiler, also known as Dotty.
Modern Load Testing as Code
Kaitai Struct: compiler to translate .ksy => .cpp / .cs / .dot / .java / .js / .php / .pm / .py / .rb
A STAC/OGC API Features Web Service
Scala Language Integrated Connection Kit. Slick is a modern database query and access library for Scala
Shenzhen Metro big data passenger flow analysis system
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.