Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
C++
Updated May 1, 2019
Open MPI main development repository
Distributed dataflows with functional list operations for data processing with C++14
C++
Updated Oct 19, 2018
OpenHPC Integration, Packaging, and Test Repo
C
Updated Apr 30, 2019
Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modul…
LibTomMath is a free, open source, portable number-theoretic multiple-precision integer library written entirely in C.
C
Updated Apr 28, 2019
Official git repository of Elmer FEM software
Fortran
Updated Apr 29, 2019
C++ library for solving large sparse linear systems with algebraic multigrid method
C++
Updated Mar 13, 2019
Python bindings for MPI
Python
Updated Apr 27, 2019
C
Updated May 1, 2019
Provision, execute, and monitor batch and HPC container workloads on Azure Batch
Python
Updated Apr 16, 2019
Become a supercomputer programmer in one week!
HTML
Updated Feb 12, 2019
🔬 Assemble large genomes using short reads
C++
Updated Apr 12, 2019
This is a set of simple programs that can be used to explore the features of a parallel platform.
C
Updated Apr 28, 2019
TomsFastMath is a fast public domain, open source, large integer arithmetic library written in portable ISO C.
C
Updated Jun 22, 2018
DMTCP: Distributed MultiThreaded CheckPointing
🚀 Fast prime counting function implementation
C++
Updated Apr 17, 2019
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
C++
Updated Apr 25, 2019
A fast simulator and library dedicated to channel coding.
C++
Updated Apr 24, 2019
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing Java librar…
Java
Updated Jun 14, 2017
Machine Learning Toolkit for Extreme Scale (MaTEx)
C
Updated Aug 12, 2018
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Dat…
QUDA is a library for performing calculations in lattice QCD on GPUs.
C++
Updated May 1, 2019
RAxML Next Generation: faster, easier-to-use and more flexible
C++
Updated Apr 30, 2019
Parallel finite element unstructured meshes
Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication pattern…
C++
Updated Apr 12, 2019
MPI Cluster Automation Solution using Docker, based on Alpine Linux with MPICH (see IEEE paper)
Shell
Updated Mar 17, 2019
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA, XLA, MPI, MKL
Updated Mar 31, 2019
A collection of code examples as well as presentations for training purposes
Jupyter Notebook
Updated Apr 30, 2019
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computatio…
Python
Updated Nov 11, 2016