Skip to main content
Advice
0 votes
1 replies
83 views

GCC documentation is rather clear that the only GPUs supported for OpenMP offload are AMD and Nvidia. LLVM documentation states Clang supports OpenMP target offloading to several different ...
Paul Jurczak's user avatar
  • 8,742
0 votes
0 answers
83 views

I have project in both C and Fortran built with CMake. This project requires standard stuff like BLAS, Lapack and OpenMP. This is the snippet that I use to find the packages and it works great with ...
Ivanodivano's user avatar
Advice
2 votes
3 replies
72 views

This is an odd OpenMP question triggered by a possible compiler bug (?) we are chasing down. For a long time, we've had code that had: !$omp masked ... !$omp end masked that didn't have an explicit ...
Fortran's user avatar
  • 525
Advice
0 votes
2 replies
97 views

I have a question on what is technically allowed by the omp standard when it comes to relinquish resources (using omp_pause_resource) with forking. I understand that in section 12.12, pause_resource.1....
Haiyang He's user avatar
Advice
0 votes
0 replies
23 views

For the flang implementation of OpenMP, we are discussing the following program. At issue is whether e.g. ii, iii, and iiii should be consider private or lastprivate, and if the former, whether the ...
rascal2210's user avatar
Advice
0 votes
2 replies
71 views

I'm seeking opinions on what appears to be an inconsistency between the OpenMP 5.2 specification and an official example regarding the declare simd directive with the linear(p: ref) clause. 1. The ...
Moriyuki Saito's user avatar
2 votes
2 answers
114 views

When conducting scaling experiments on a code accelerated with OpenMP, is it standard practice to set OMP_DYNAMIC to false so that the number of threads spawned per parallel region is deterministic? I ...
Jared's user avatar
  • 790
Best practices
0 votes
3 replies
104 views

I've implemented a prime sieve where segments are passive containers rather than active search spaces. Instead of walking through each segment to mark multiples, a function (ricerca_ciclo) computes ...
claudio G's user avatar
Advice
2 votes
6 replies
92 views

So I have an algorithm that gets a set of objects and computes a next set of objects. The code below is a toy version of the real code for brevity. The next snip illustrates the theme of the program ...
emacs drives me nuts's user avatar
1 vote
0 answers
111 views

Here is a minimal working c++ example to show what I mean: #include <sched.h> #include <iostream> #include <sstream> #include <omp.h> int main(int argc, char** argv) { #pragma ...
amdnim's user avatar
  • 71
1 vote
2 answers
225 views

OpenMP uses its own scheme for numbering threads staring from 0 for the master thread. A master thread is the thread that initiated a parallel region, which is the main program thread in the following ...
Daniel Langr's user avatar
  • 24.4k
2 votes
1 answer
167 views

Consider the following code: #pragma omp parallel for (int run = 0; run < 10; run++) { std::vector<int> out; #pragma omp for for (int i = 0; i < 1'000'000; i++) { ... } } ...
F.X.'s user avatar
  • 7,515
Best practices
0 votes
3 replies
108 views

I'm struggling to finalise the design of my C++17 library. One of the primary goals is to use runtime polymorphism to allow users to extend or rewrite default features of the library for their own use ...
josh_eime's user avatar
  • 308
2 votes
1 answer
154 views

I'm one of the developers of the Lumen code: https://www.lumen-code.org/. That is computational code for condensed matter physics simulations. We are replacing FORALL with DO CONCURRENT, since FORALL ...
attacc's user avatar
  • 21
0 votes
1 answer
103 views

the code below crashes with terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >' Aborted ...
user1407220's user avatar

15 30 50 per page
1
2 3 4 5
449