gpgpu
Here are 392 public repositories matching this topic...
Current implementation of join can be improved by performing the operation in a single call to the backend kernel instead of multiple calls.
This is a fairly easy kernel and may be a good issue for someone getting to know CUDA/ArrayFire internals. Ping me if you want additional info.
Updated Dec 12, 2021 - C++
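As a rough illustration of the "single call" idea in the join issue above, here is a minimal host-side sketch in C++. It is not ArrayFire code: the function name join_single_pass and the restriction to 1-D float inputs are assumptions made for the example. The start offset of every input is computed up front, and then a single pass over the output looks up, for each output index, which input owns it. On a GPU, each loop iteration would correspond to one thread of a single kernel launch.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of ArrayFire): joins 1-D inputs in one pass.
// Each iteration of the loop over `out` plays the role of one GPU thread.
std::vector<float> join_single_pass(const std::vector<std::vector<float>>& inputs) {
    // offsets[i] is where inputs[i] starts in the output;
    // offsets.back() is the total output size.
    std::vector<std::size_t> offsets(inputs.size() + 1, 0);
    for (std::size_t i = 0; i < inputs.size(); ++i)
        offsets[i + 1] = offsets[i] + inputs[i].size();

    std::vector<float> out(offsets.back());
    for (std::size_t gid = 0; gid < out.size(); ++gid) {
        // Locate the input that owns output index gid:
        // the last entry in offsets that is <= gid.
        auto it = std::upper_bound(offsets.begin(), offsets.end(), gid);
        std::size_t src = static_cast<std::size_t>(it - offsets.begin()) - 1;
        out[gid] = inputs[src][gid - offsets[src]];
    }
    return out;
}
```

The binary search keeps the per-element lookup cheap when many inputs are joined. A real kernel would also have to handle multi-dimensional arrays and the join dimension, which this sketch leaves out.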
In order to test manually altered IR, it would be nice to have a --skip-compilation flag for futhark test, just like we do for futhark bench.
Updated Apr 1, 2021 - Rust
Updated Jul 30, 2018
Updated Dec 21, 2021 - C#
Updated Dec 8, 2021 - C++
Updated Dec 22, 2021 - Objective-C
Updated Oct 23, 2021 - Clojure
Updated Dec 2, 2021 - Nim
Updated Nov 14, 2021 - C++
Open issue to openly discuss potential ideas or improvements, whether on documentation, interfaces, examples, bug fixes, etc.
Updated Jul 26, 2021 - C++
Updated Nov 19, 2021 - C
Updated Dec 21, 2021 - Rust
Updated Aug 17, 2016 - JavaScript
Just an FYI whilst I was trawling through the ROCm GitHub page:
https://rocmdocs.amd.com/en/latest/Programming_Guides/Programming-Guides.html#
Updated Dec 15, 2021 - C++
Updated Oct 28, 2021 - C++
Updated Dec 24, 2021 - C++


Somehow some of these names start with fmt_ocl_ while most start with fmt_opencl_. Is this intentional? It causes the fmt_ocl_ ones to be listed/tested/benchmarked first. Then there's fmt_opencl_1otus5 with a 1 (one) in there. So the first OpenCL formats become: