# avx2
Here are 141 public repositories matching this topic...
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
syntax-highlighting
visual-studio
assembly
x86-64
assembler
avx2
nasm
assembly-language-programming
code-completion
avx512
visual-studio-extension
disassembly
masm
Updated Jan 28, 2021 - C#
oneAPI Deep Neural Network Library (oneDNN)
library
performance
deep-neural-networks
deep-learning
cpp
processor
opencl
x64
x86-64
openmp
avx2
amx
sse41
tbb
aarch64
avx512
intel-openmp-runtime
bfloat16
oneapi
onednn
dpcpp
xe-architecture
Updated Apr 10, 2021 - C++
C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (AltiVec) and VSX (Power7), and NEON for ARM.
c-plus-plus
machine-learning
arm
neural-network
neon
image-processing
avx
sse
simd
avx2
sse2
sse41
avx512
powerpc
altivec
vsx
ssse3
simd-library
haar-cascade
lbp
Updated Apr 9, 2021 - C++
mr-c commented Dec 30, 2020
Examples from simd-everywhere/simde#685
SIMDE_FLOAT32_C(-2147483650.0)
SIMDE_FLOAT32_C( 2147483649.0)
SIMDE_FLOAT64_C(-2147483650.0)
SIMDE_FLOAT64_C( 2147483649.0)
The procedure is:
- Find tests that use SIMDE_FLOAT32_C or SIMDE_FLOAT64_C in their test_vec
- Copy-n-paste an entry in the test_vec, modifying the inputs using the overflow valu
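To see why these particular constants make interesting overflow inputs, the sketch below rounds them to single precision and compares them against the 32-bit signed integer range. It assumes that `SIMDE_FLOAT32_C` yields an IEEE 754 binary32 constant and that the tests in question convert to 32-bit signed integers; neither is stated in the excerpt above. The helper names `to_float32` and `fits_int32` are hypothetical, and Python is used only for illustration.

```python
import struct

# Assumed target range: 32-bit signed integers.
INT32_MIN = -2**31
INT32_MAX = 2**31 - 1


def to_float32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def fits_int32(x: float) -> bool:
    """Return True if x lies inside the 32-bit signed integer range."""
    return INT32_MIN <= x <= INT32_MAX


# The two magnitudes quoted in the issue excerpt.
candidates = [-2147483650.0, 2147483649.0]

for value in candidates:
    as_f32 = to_float32(value)
    print(
        f"{value!r}: as float64 in range={fits_int32(value)}, "
        f"as float32 -> {as_f32!r}, in range={fits_int32(as_f32)}"
    )
```

Under these assumptions, both double-precision constants fall just outside the range. In single precision the spacing between representable values near 2^31 is larger than 2, so the negative constant rounds to -2147483648.0, which equals `INT32_MIN`, while the positive one rounds to 2147483648.0, one past `INT32_MAX`.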
Open
AVX-512BW functions
SIMD Vector Classes for C++
c-plus-plus
cpp
portable
neon
cpp14
parallel
parallel-computing
avx
sse
cpp11
simd
cpp17
avx2
simd-programming
vectorization
avx512
simd-instructions
simd-vector
data-parallel
Updated Mar 5, 2021 - C++
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Updated Jul 15, 2019
Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Updated Mar 25, 2021 - Go
C++ SIMD Noise Library
neon
simplex
fractal
sse
simd
noise
cellular
avx2
perlin
perlin-noise
white-noise
noise-library
noise-3d
fastnoise-simd
simplex-noise
fastnoise
Updated Mar 20, 2021 - C++
Fastest Integer Compression
encoding
library
compression
time-series
simd
compressor
integer-compression
intersection
inverted-index
avx2
sse2
Updated Jan 16, 2021 - C
Performance-portable, length-agnostic SIMD with runtime dispatch
Updated Apr 8, 2021 - C++
TurboRLE-Fastest Run Length Encoding
Updated Feb 25, 2020 - C
Storage for my snippets, toy programs, etc.
Updated Feb 18, 2021 - C++
Examples of C# code compiled to GPU by hybridizer
visual-studio
compiler
dotnet
gpu
optimization
parallel
cuda
avx
avx2
vectorization
avx512
hybridizer-essentials
Updated Sep 5, 2019 - C#
Node.js implementation of HighwayHash, Google's fast and strong hash function
Updated Jan 25, 2021 - JavaScript
Open Source Architecture Code Analyzer
python
hpc
latency
assembly
avx
x86
throughput
avx2
performance-analysis
avx512
out-of-order
critical-path
port-mapping
performance-modeling
arm64v8
sve
in-core
loop-carried-dependency
Updated Apr 9, 2021 - Assembly
guillaumekln commented Jul 29, 2020
We should prefix CMake build options with "CT2_", e.g. CT2_WITH_MKL instead of WITH_MKL. This is a good practice to avoid possible conflicts with other projects.
Agenium Scale vectorization library for CPUs and GPUs
hpc
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
simd-library
sse42
rocm
cpp20
sve
neon128
cpp20-library
vectorization-library
Updated Apr 11, 2021 - Python
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Updated Feb 12, 2020
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Updated Mar 4, 2020 - C++
python
c
openmp
avx
simd
cosmology
astrophysics
galaxies
large-scale-structure
pair-counting
intrinsics
avx2
avx512
sse42
correlation-functions
Updated Feb 24, 2021 - C


(I thought that this was already a tracked issue, but I cannot find it.)
See https://github.com/simdjson/simdjson/blob/c5def8f7060a7e9519c7f38e2c4ddf2dc784d6a8/doc/basics.md#json-pointer
If the pointer points at an array or an object, you'd like to be able to do...