The Wayback Machine - https://web.archive.org/web/20220324225337/https://github.com/topics/sse2
Here are
43 public repositories
matching this topic...
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
good first issue
This is perfect if you're new to the project and looking to help.
A simple and fast linear algebra library for games and graphics
Updated
Mar 22, 2022
Rust
Fastest Integer Compression
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Expressive Vector Engine - SIMD in C++ Goes Brrrr
TurboRLE-Fastest Run Length Encoding
Agenium Scale vectorization library for CPUs and GPUs
Unicode routines (UTF8, UTF16): billions of characters per second.
SIMD macro assembler unified for ARM, MIPS, PPC and x86
General purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).
Optimized Recursive Bilateral Filter
Fastest Histogram Construction
x64 Assembly Demo Framework
Updated
Apr 17, 2021
Assembly
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Simple example for embedding SSE2 assembly in Cython projects
Updated
May 2, 2017
Python
A simple demo shows how to use the SIMD,Single Instruction Multiple Data, to optimize and accelerate the FFT algorithm.
A multi-arch library implementing the Argon2 password hashing algorithm.
Fast BSON to JSON string transcoder
A header only ready to include mirror of the HIIR library by Laurent De Soras, an oversampling and Hilbert transform library in C++, with additional support for double precision on ARM AArch64 using Neon.
Software implementation of ARM and x86 SIMD intrinsics
Operator overloading for vector matrix operation using Intel SIMD SSE/SSE2/SSE3 instructions written in Free Pascal
Updated
Jul 15, 2019
Pascal
A fresh (experimental) look at Scilab 6.x
Updated
Feb 17, 2022
Scilab
CMS(RFC.5652 & PKCS#7)対応 暗号/復号ユーティリティ
library for optimized rgb to/from yuv convertions.
Implementation of string equality based on SSE2 intrinsics
NxN Matrix Multiplication using SIMD with Intrinsics (MMX, SSE, SSE2, AVX, etc.) and FPU as inline ASM in C
Updated
May 4, 2019
Assembly
testing an SIMD api from VecCore VecGeom, using backends of UMESIMD, VC for Avx Avx2,AVX512, SSE, SSE2
Improve this page
Add a description, image, and links to the
sse2
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
sse2
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.
Hi,
thank you very much for this cool library.
I noticed that
simde_mm512_load_psis missing whilesimde_mm512_loadu_psis implemented. However, both seem to exist for Intel intrinsics (see screenshot). Maybe I oversaw one or maybe they do the same anyways?Thanks
Lukas
![Screenshot from 2021-10-25 13-28-29](https://user-images.githubusercontent.com/14993470/138687358-aa6b531e-bdc