Joss Knight
MathWorks
Although I cannot be contacted directly, if you would like to ask me a question all you have to do is mention "GPU" somewhere in your MATLAB Answers question.
Statistics
0 Questions
557 Answers
RANK
90
of 288 904
REPUTATION
1 600
CONTRIBUTIONS
0 Questions
557 Answers
ANSWER ACCEPTANCE
0.00%
VOTES RECEIVED
293
RANK
of 19 495
REPUTATION
N/A
AVERAGE RATING
0.00
CONTRIBUTIONS
0 Files
DOWNLOADS
0
ALL TIME DOWNLOADS
0
RANK
of 143 101
CONTRIBUTIONS
0 Problems
0 Solutions
SCORE
0
NUMBER OF BADGES
0
CONTRIBUTIONS
0 Posts
CONTRIBUTIONS
0 Public Channels
AVERAGE RATING
CONTRIBUTIONS
0 Highlights
AVERAGE NO. OF LIKES
Content Feed
Parallel Computing in C when using Matlab Coder (parpool and gpuArray)
GPU Coder will generate CUDA code for you. It can even automatically vectorize |for| loops. For a multithreaded parallel for loo...
3 månader ago | 1
Can parfor run a series of GPU programs simultaneously?
It looks like you just have a bug in your CUDAKernel implementation, probably accessing unallocated memory. This is putting the ...
3 månader ago | 0
error of GPU, net = trainNetwork(datastore, lgraph, options);
You are using your display GPU for computation and it does not have enough capacity. Try disabling all hardware acceleration for...
3 månader ago | 1
| accepted
How to train a sequence to classification network on GPU
This performance discrepancy is normal. Small sequence networks often cannot benefit from GPU parallelism, especially if they us...
3 månader ago | 0
gpuArray large sparse arrays. Error codes: "CUSPARSE_INTERNAL_ERROR" / "UNKNOWN_ERROR"
Hi Joseph. It's hard to be definitive. There were some problems with cusparse and also Windows drivers when supporting the newes...
3 månader ago | 0
NVIDIA A2 performance in Matlab R2023b is lousy
The A2's spec says its double precision performance is 140 GFLOPS vs the V100's 7 TFLOPS, so this is pretty much expected. Even ...
6 månader ago | 3
| accepted
Is there a utility like nvidia-smi within matlab to determine which gpus are in current use?
gpuDeviceTable is the utility for listing your devices and their properties. You can also see all your devices in the Parallel m...
6 månader ago | 0
Optimizing distance calculation between vectors and pixels
I feel like I haven't fully understood what you're after here, but |pdist2| is the function you're supposed to use to compute di...
6 månader ago | 0
The value of 'ValidationData' is invalid. Duplicate table variable name: 'input'. error during neural network training
augmentedImageDatastore returns a table so cannot be trivially combined. You should first transform it to convert it into a cell...
7 månader ago | 0
| accepted
How to apply a 2D matrix input to a trainNetwork?
Consult the documentation here. Typically sequence data is passed in as a cell array. In each cell you would be passing one seq...
7 månader ago | 0
I would like to train a smallish network using cpu cores in parallel rather than gpu as they are slower.
setenv CUDA_VISIBLE_DEVICES -1 when you first start MATLAB, assuming you are running everything locally. However, as a general...
7 månader ago | 0
Faster three dimensional higher order interpolation?
Thanks for the request, it will help us prioritise future work. In the meantime, it is possible to write your own interpolation ...
7 månader ago | 0
How to solve "Unexpected error calling cuDNN: CUDNN_STATUS_EXECUTION_FAILED." error?
The Ada GPU architecture is not supported in R2018b. You need to upgrade MATLAB. You should have received a warning about this w...
8 månader ago | 0
| accepted
Will self-written exe application run on GPU on other PC?
Yes, if your other PC has a supported GPU your application will run on it. Isn't MATLAB great?!
8 månader ago | 0
| accepted
Error at linking stage using mexcuda
Relocatable device code needs to be linked by nvcc using -dlink before it can be linked to host code using the C linker, so you'...
8 månader ago | 0
| accepted
OCR returns slightly different results on different machines
This is expected for any highly optimized code like this. Even for two Intel machines, the core count will affect how operations...
9 månader ago | 1
Will the MATLAB Answers community diminish/obsolete with the rise of AI-based chatbots?
You look like you are asking a question about how AI-assisted automation will change MATLAB Answers in the coming years and pote...
9 månader ago | 3
The matlab mexw64 file generated by mexcuda cannot be executed in the standalone app generated by matlab('parallel.gpu.GPUDeviceManager.selected' cannot be detected))
MATLAB Compiler's dependency analyzer cannot detect your dependency on PCT. Either add the product manually or call something ex...
9 månader ago | 0
Unable to perform assignment because the indices on the left side are not compatible with the size of the right side.
If you click on the line number in the editor next to where you create your function layer, you can put a breakpoint at the entr...
9 månader ago | 0
| accepted
Using a "CUDAKernel" type object within a parfor loop
A CUDAKernel object cannot be serialized, as you've found, so you will need to construct it separately on each worker. However, ...
9 månader ago | 2
| accepted
Performance drop on mobile RTX4080
The 4080 is a good 10x slower than the V100 in double precision so this doesn't surprise me - it is designed for workstation gra...
10 månader ago | 0
Using experiment manager on single GPU
It does depend on the balance of CPU and GPU work in your experiment, but as a general rule parallel execution will gain you not...
10 månader ago | 0
Memory issue with texture in mexCUDA compiled code
It looks like the syntax for your function |mxArrayToTexture_3D_float4| is incorrect. You are passing the pointer |cuArray| by v...
10 månader ago | 0
host compiler failed but others all passed with 'coder.checkGpuInstall('full')'
MSVC 2022 was not supported by the NVIDIA CUDA compiler in R2022b. You either need to install MSVC 2019 (or 2017) or upgrade MAT...
10 månader ago | 0
cannot set gpu option in MBPO for catpole example
I do not see any |UseDevice| property in the <https://uk.mathworks.com/help/reinforcement-learning/ref/rl.option.rloptimizeropti...
10 månader ago | 1
How to perform Eigenvalue Decomposition e.g, eig() on multiple GPUs?
eigendecomposition is a highly serial algorithm so that's why simple multi-process solutions aren't easy to find and why the GPU...
11 månader ago | 0
Problem: Image segmentation of forest area using CNN and MATLAB's BLOCKPROC function.
In the documentation for semanticseg it says that the output is a categorical array. Converting to uint8 would normally work, b...
11 månader ago | 0
Summing array elements seems to be slow on GPU
These are my results that I got on my (somewhat old) GeForce GTX 1080 Ti: CPU time: 16.1288 GPU time: 0.96266 If I change the...
11 månader ago | 0
| accepted
Summing array elements seems to be slow on GPU
Why are you recomputing H and HU inside the loop? They do not change. If you remove the sum, because the results are never used ...
11 månader ago | 1
Examples of GPU do not work
This isn't a demo it's a blog from 11 years ago, and unfortunately it's using syntax that was removed from MATLAB 9 years ago. I...
11 månader ago | 1