gpucoder.profile
(Not recommended) Create an execution profile report for generated CUDA code
Since R2018b
gpucoder.profile
is not recommended. Use gpuPerformanceAnalyzer
instead. For more information, see Compatibility Considerations.
Description
gpucoder.profile(
generates an execution profiling report of the CUDA code generated for the design file
func_name
,codegen_inputs
)func_name
. The codegen_inputs
argument specifies the
inputs to the design file. You must install the Embedded Coder® product to generate the profiling report.
Note
The profiling workflow depends on profiling tools from NVIDIA®. From CUDA® Toolkit v10.1 onwards, NVIDIA restricts access to performance counters to admin users. To enable GPU performance counters for all user accounts, see the instructions in Permission issue with Performance Counters (NVIDIA).
Note
The profiling tools from NVIDIA might not support legacy GPU hardware such as the Kepler family of devices. For information on supported GPU devices, see the NVIDIA documentation.
gpucoder.profile(___,
generates an execution profiling report with one or more profiling options specified as a
name-value pair argument.Name,Value
)
Examples
Input Arguments
Version History
Introduced in R2018bSee Also
Apps
Functions
Objects
Topics
- GPU Programming Paradigm
- GPU Performance Analyzer
- Analyze Performance of the Generated CUDA Code
- Code Generation by Using the GPU Coder App
- Code Generation Using the Command Line Interface
- Code Generation for Deep Learning Networks by Using cuDNN
- Code Generation for Deep Learning Networks by Using TensorRT