The Wayback Machine - https://web.archive.org/web/20200715115602/https://github.com/topics/avx512
Skip to content
#

avx512

Here are 65 public repositories matching this topic...

CSharperMantle
CSharperMantle commented Apr 7, 2020

I am sorry but I don't know how to put it into an Issue format, so I have to explain it plainly.

Dear contributors,

It seems that every source file in Vc library has the following copyright notice:

/*  This file is part of the Vc library. {{{
Copyright © 2009-2015 XXX <xxx@example.com>
Redistribution and use in source and binary forms, with or without
modification, are permitte
HadrienG2
HadrienG2 commented May 1, 2018

At least on x86, the fastest intrinsics for shuffling the contents of a vector or blending data from two vectors take an immediate operand, which must be a compile-time constant. So there would be a use case for a compile-time version of xsimd::select(), as it could use these faster instructions.

An example of prior art for this is the shuffle() instruction family in bSIMD:

nemequ
nemequ commented Feb 21, 2020

We should add WASM simd128 implementations of as many SSE/SSE2/etc. functions as possible.

Some of the functions won't see much, if any, improvements since we already have GCC-style vector extension and OpenMP SIMD support. The real benefit will be for the functions that can't use GCC-style vectors. For example, saturated operations, min/max, etc. And of course there will be a lot of cases w

sleef
gnzlbg
gnzlbg commented Apr 17, 2019

Raising scalars and vectors to integer powers is very common (e.g. llvm.powi), so I think the following APIs would be very useful:

  • double Sleef_ipow_u10(double x, int32_t y);
  • __m128d Sleef_ipowd2_u10(__m128d x, int32_t y);
  • __m256d Sleef_ipowd4_u10(__m256d x, int32_t y);
  • __m512d Sleef_ipowd8_u10(__m512d x, int32_t y);
  • floa
sxleixer
sxleixer commented Apr 26, 2020

Hi there,
I am an experienced C++ programmer but I'm completely lost when it comes to SIMD operations. Currently I'm trying your library for over a week and I still cannot figure out, how to get it to be more performant than the straight forward way.

In my particular case, I am trying to create a SAXPY operation according to BLAS standard using SIMD operations. My vectors are huge and still th

DenisYaroshevskiy
DenisYaroshevskiy commented Jan 23, 2020

Hi

So, following the instructions, I ran cmake

 cmake .. -DSIMD=AVX2 -DDEV=1 -DBOOST_ROOT=/**/boost_1_72_0 -GNinja

It says:

CMake Warning:
  Manually-specified variables were not used by the project:

    BOOST_ROOT
    DEV

Then: ninja -j1 update - unknown target

However, running:

ninja -j 4 tests
ctest

Has worked, apparently successfully: `10

bug

Improve this page

Add a description, image, and links to the avx512 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the avx512 topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.