OpenXiangShan / XiangShan
Open-source high-performance RISC-V processor
See what the GitHub community is most excited about today.
Apache Spark - A unified analytics engine for large-scale data processing
♞ lichess.org: the forever free, adless and open source chess server ♞
Simple and Distributed Machine Learning
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
The Scala 3 compiler, also known as Dotty.
A Scala library for writing HTTP apps.
Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments
State of the Art Natural Language Processing
Feathr – An Enterprise-Grade, High Performance Feature Store
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Open-source code analysis platform for C/C++/Java/Binary/JavaScript/Python/Kotlin based on code property graphs
Apache OpenWhisk is an open source serverless cloud platform
Shenzhen Metro big-data passenger flow analysis system
Spark: The Definitive Guide's Code Repository
Ergo protocol description & reference client implementation
ZIO — A type-safe, composable library for async and concurrent programming in Scala
A low-code Machine Learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly Learn-to-Rank engine
A fault tolerant, protocol-agnostic RPC system
A Spark plugin for reading Excel files via Apache POI
DataStax Spark Cassandra Connector