avx-instructions

Here are 12 public repositories matching this topic...

minio / sha256-simd

Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.

golang arm assembly intel avx plan9 avx512 avx-instructions

Updated Jun 17, 2021
Go

google / highway

Performance-portable, length-agnostic SIMD with runtime dispatch

neon wasm avx simd intrinsics avx2 simd-programming avx512 simd-parallelism simd-instructions simd-library sse42 avx-instructions simd-intrinsics avx-512

Updated May 20, 2022
C++

tgrysztar / fasmg

flat assembler g - adaptable assembly engine

php assembly x86-64 instructions assembler wasm mach-o x86 opcodes macro pe-format avx-instructions hex-format executable-formats binary-format elf-format fasmg

Updated Mar 24, 2022
Assembly

jeffhammond / vpu-count

Information about AVX-512 support on recent Intel processors

xeon floating-point avx512 avx-instructions fused-multiply-add intel-processor

Updated Apr 10, 2022
C

OscarTHZhang / cnn-optimization

Convolutional Neural Network Optimization using Intel AVX and OpenMP

c openmp parallelism high-performance-computing avx-instructions cnn-architecture

Updated Jun 2, 2020
C

Sooryakiran / Convolution-with-AVX

Implementation of the 2D convolution operation for neural networks using 256-bit AVX instructions on Intel x86 (i386) and x86-64 (amd64). All three dataflow methods (input stationary, weight stationary, and output stationary) are implemented. The forward pass of the AlexNet architecture is built on top of it.

neural-network x86 alexnet convolutional-neural-networks avx-instructions