GPU Coder vs. ONNXRuntime, is there a difference in inference speed?

Question

0 votes

Since I can export from Matlab to ONNX format, why can't I just import my model into TensorRT etc.? Will I get significant speed increases or is the benefit of GPU Coder more about being able to compile all my other Matlab code into optimized Cuda?

Thanks in advance.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Joss Knight on 2 Apr 2021

0 votes

You can compile your network for TensorRT using GPU Coder if that's your intended target, no need to go through ONNX.

I don't believe MathWorks have any published benchmarks against ONNX runtime specifically. GPU Coder on the whole outperforms other frameworks, although it does depend on the network.

2 Comments
Show None Hide None

Matti Kaupenjohann on 7 Jan 2022

Could you show/link the benchmark which includes the performance of gpucoder against other frameworks (which one?).

Joss Knight on 7 Jan 2022

Edited: Joss Knight on 7 Jan 2022

We don't publish the competitive benchmarks, you'll have to make a request through your sales agent. we can provide some numbers for MATLAB.

Sign in to comment.

GPU Coder vs. ONNXRuntime, is there a difference in inference speed?

0 Comments
Show -2 older comments Hide -2 older comments

Answers (1)

2 Comments
Show None Hide None

Categories

Products

Release

Tags

Community Treasure Hunt

GPU Coder vs. ONNXRuntime, is there a difference in inference speed?

0 Comments Show -2 older comments Hide -2 older comments

Answers (1)

2 Comments Show None Hide None

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

2 Comments
Show None Hide None