How overloaded functions are implemented on gpu? I.e. how can I set number of threads and thread blocks when I call. GpuArray?

Question

0 votes

I learned about using *.cu files and compile them to get *.ptx files, but I'm concernead about built-in gpu supported functions. If I used gpuArray to transfer a variable to Gpu, will any further operations. (s.a multiplication) performed on that variable be done on Gpu? In that case how can I know/set number of thread blocks and threads in each kernel?

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Edric Ellis on 11 Nov 2013

Edited: Edric Ellis on 11 Nov 2013

1 vote

All operations on gpuArray data take place on the GPU. For built-in things like matrix multiplication, the allocation of blocks and threads is done automatically, and you have no control over it. In contrast, when using CUDAKernel to operate on gpuArray data, you must explicitly choose the number of threads and blocks to use.

2 Comments
Show None Hide None

Hanan Hassan on 11 Nov 2013

Thanks a lot Edric, I just wanted to make sure that I have no control on blocks and threads when using gpuarray variables, but do you think is it better to use the cuda enabled matlab functions or to rewrite my own kernel to get better performance and optimization?

Edric Ellis on 11 Nov 2013

The built-in gpuArray algorithms should perform well in most circumstances. After that, arrayfun and bsxfun will also perform well on the GPU. You can use CUDAKernel if necessary to handle situations where you still want more performance.

Sign in to comment.

How overloaded functions are implemented on gpu? I.e. how can I set number of threads and thread blocks when I call. GpuArray?

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

2 Comments
Show None Hide None

More Answers (0)

Categories

Tags

Community Treasure Hunt

How overloaded functions are implemented on gpu? I.e. how can I set number of threads and thread blocks when I call. GpuArray?

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

2 Comments Show None Hide None

More Answers (0)

Categories

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

2 Comments
Show None Hide None