The Wayback Machine - https://web.archive.org/web/20200526142632/https://github.com/topics/gpu-programming

gpu-programming

Here are 124 public repositories matching this topic...

yuanming-hu
yuanming-hu commented May 2, 2020

(Found this while reviewing #907)

Currently Expr::operator=(const Expr &o) can have two meanings.

https://github.com/taichi-dev/taichi/blob/f5373b1395a66c78506de0fd3c172dffe0d444d0/taichi/ir/expr.cpp#L54-L69

Let's say we have

Expr a, b;
a = b;
  1. If inside a kernel definition, this creates a FrontendAssignStmt in the AST
  2. Otherwise, this lets a hold the express
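The two meanings described above can be illustrated with a toy model. This is only a sketch under stated assumptions: Node, AssignRecord, g_inside_kernel, g_ast and run_demo are hypothetical names invented for illustration and are not taken from the Taichi code base; the real logic lives in the expr.cpp lines linked above.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins; none of these names come from Taichi.
struct Node {
    std::string name;
};

// Stand-in for a recorded "dst = src" statement in an AST under construction.
struct AssignRecord {
    std::string dst, src;
};

static bool g_inside_kernel = false;     // assumed "currently defining a kernel" flag
static std::vector<AssignRecord> g_ast;  // assumed statement list of that kernel

struct Expr {
    std::shared_ptr<Node> node;

    Expr() = default;
    explicit Expr(std::string name)
        : node(std::make_shared<Node>(Node{std::move(name)})) {}
    Expr(const Expr &) = default;

    Expr &operator=(const Expr &o) {
        if (g_inside_kernel) {
            // Meaning 1: record an assignment statement; the handle is unchanged.
            assert(node && o.node);
            g_ast.push_back({node->name, o.node->name});
        } else {
            // Meaning 2: rebind this handle so it refers to the other expression.
            node = o.node;
        }
        return *this;
    }
};

struct DemoResult {
    bool rebound_outside;  // outside a kernel: a shares b's node afterwards
    bool rebound_inside;   // inside a kernel: c still has its own node
    std::size_t recorded;  // statements recorded while inside the kernel
};

// Runs the same kind of statement in both contexts and reports what happened.
DemoResult run_demo() {
    g_ast.clear();

    Expr a("a"), b("b");
    g_inside_kernel = false;
    a = b;
    const bool outside = (a.node == b.node);

    Expr c("c"), d("d");
    g_inside_kernel = true;
    c = d;
    g_inside_kernel = false;
    const bool inside = (c.node == d.node);

    return {outside, inside, g_ast.size()};
}
```

In this sketch the same source line behaves differently depending only on hidden global state, which is the ambiguity the issue points at.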

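A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. It covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.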

  • Updated Apr 12, 2020
Wimmerer
Wimmerer commented May 7, 2019

Is your feature request related to a problem? Please describe.
This may just be a matter of looking in all the wrong places, but documentation of CuSparse arrays, and their support in packages like Flux, are sorely needed. They can be as minimal as possible, but if you’re not familiar with Nvidia’s libraries (I am not, so bear with me) it can be hard to even discover Cusparse.jl.

In the ne

emrdig
emrdig commented Jul 26, 2018

It seems that there is a bug with the call to gpufit within Matlab when including the user_info parameter. Using the included linear_1d model (which utilizes the user_info parameter), I created a simple program in Matlab to model the equation y=x from x=0 to x=10 and called gpufit on the data. This should return the parameters 0 and 1, but results in 4.3467 and 0.8711 instead.
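The expected result quoted above can be cross-checked on the CPU with closed-form ordinary least squares. This is a sketch, not Gpufit's fitting algorithm or API: the names Line, fit_line and fit_identity_example are hypothetical, the 11 integer sample points x = 0..10 are an assumption about how the data was sampled, and the (offset, slope) ordering is assumed from the expected parameters 0 and 1.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Parameters of y = offset + slope * x (ordering assumed, see lead-in).
struct Line {
    double offset;
    double slope;
};

// Closed-form ordinary least squares for a straight line.
Line fit_line(const std::vector<double> &x, const std::vector<double> &y) {
    assert(x.size() == y.size() && x.size() >= 2);
    const double n = static_cast<double>(x.size());
    double sx = 0.0, sy = 0.0, sxx = 0.0, sxy = 0.0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        sx += x[i];
        sy += y[i];
        sxx += x[i] * x[i];
        sxy += x[i] * y[i];
    }
    const double denom = n * sxx - sx * sx;
    assert(denom != 0.0);  // all x identical would make the fit undefined
    const double slope = (n * sxy - sx * sy) / denom;
    const double offset = (sy - slope * sx) / n;
    return {offset, slope};
}

// Builds y = x on the assumed grid x = 0, 1, ..., 10 and fits it.
Line fit_identity_example() {
    std::vector<double> x, y;
    for (int i = 0; i <= 10; ++i) {
        x.push_back(static_cast<double>(i));
        y.push_back(static_cast<double>(i));
    }
    return fit_line(x, y);
}
```

For noise-free data on y = x, any correct linear fit should recover an offset of 0 and a slope of 1, so values such as 4.3467 and 0.8711 suggest the model did not see the intended x coordinates.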

Additionally, if

eyalroz
eyalroz commented Feb 21, 2020

The following PTX instructions don't have wrapper functions (nor builtins:: templated functions where relevant). Add them!

  • lop3 - Logical operation on 3 operands using an immediate 3-parameter lookup table.
  • prefetching instructions?
  • cvt.pack
  • fns - find n'th bit set
  • Sub-32-bit dot product with accumulation: dp4a, dp2a for bytes and halfword, respecti
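As a host-side reference for what such wrappers would compute, here is a sketch of the semantics of two of the listed instructions in plain C++. The function names lop3_ref and dp4a_ref are hypothetical and are not part of the library discussed in the issue; the dp4a version shown is the signed-by-signed variant only, which is an assumption about which variant is of interest.

```cpp
#include <cassert>
#include <cstdint>

// lop3: for every bit position, the three input bits form a 3-bit index
// (a is the most significant bit) into the 8-bit immediate lookup table.
std::uint32_t lop3_ref(std::uint32_t a, std::uint32_t b, std::uint32_t c,
                       std::uint8_t lut) {
    std::uint32_t result = 0;
    for (int bit = 0; bit < 32; ++bit) {
        const std::uint32_t idx = (((a >> bit) & 1u) << 2) |
                                  (((b >> bit) & 1u) << 1) |
                                  ((c >> bit) & 1u);
        result |= ((static_cast<std::uint32_t>(lut) >> idx) & 1u) << bit;
    }
    return result;
}

// The table for a function f(a, b, c) is obtained by applying f to the
// constants 0xF0, 0xCC and 0xAA, as described in the PTX ISA documentation.
constexpr std::uint8_t lut_and3 = 0xF0 & 0xCC & 0xAA;  // 0x80
constexpr std::uint8_t lut_xor3 = 0xF0 ^ 0xCC ^ 0xAA;  // 0x96

// dp4a (signed variant): treat a and b as four packed signed bytes,
// multiply them pairwise, and add the four products to the accumulator c.
std::int32_t dp4a_ref(std::uint32_t a, std::uint32_t b, std::int32_t c) {
    std::int32_t acc = c;
    for (int i = 0; i < 4; ++i) {
        const auto ai = static_cast<std::int8_t>((a >> (8 * i)) & 0xFFu);
        const auto bi = static_cast<std::int8_t>((b >> (8 * i)) & 0xFFu);
        acc += static_cast<std::int32_t>(ai) * static_cast<std::int32_t>(bi);
    }
    return acc;
}

// Small self-check comparing the references against direct expressions.
void self_check() {
    const std::uint32_t a = 0xDEADBEEFu, b = 0x12345678u, c = 0x0F0F0F0Fu;
    assert(lop3_ref(a, b, c, lut_and3) == (a & b & c));
    assert(lop3_ref(a, b, c, lut_xor3) == (a ^ b ^ c));
    assert(dp4a_ref(0x04030201u, 0x01010101u, 10) == 20);
}
```

Reference functions like these can serve as an oracle when testing device-side wrappers against host-computed results.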

Python implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions.

  • Updated Dec 12, 2018
  • Python
