Release 3.5.0 corresponding to NGC container 20.10
PyProf - PyTorch Profiler for GPUs
PyProf is a tool that profiles and analyzes the GPU performance of PyTorch models. PyProf aggregates kernel performance from Nsight Systems or NvProf. For each kernel, the tool can identify the layer that launched it, report the tensor dimensions and precision, perform forward-backward correlation, determine Tensor Core usage, and trace the kernel back to the line in the user's code that launched it.
What's New In 3.5.0
- Nsight Systems database lookup has been improved, making runtime profile analysis 50x faster.
- Node names will now include class info and can be linked back to the original Python source.
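
The release notes do not say how the database lookup was made faster. As a general illustration of why lookup strategy matters when analyzing a profile stored in an SQLite database, the sketch below compares a full-table scan with an indexed lookup using Python's built-in `sqlite3` module. The `kernels` table, its columns, the `idx_corr` index, and the helper functions are all hypothetical; they are not the Nsight Systems schema or PyProf's actual code.

```python
import sqlite3


def build_db(num_rows=1000):
    """Create an in-memory database with a hypothetical kernel table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE kernels (correlation_id INTEGER, name TEXT)")
    conn.executemany(
        "INSERT INTO kernels VALUES (?, ?)",
        ((i, f"kernel_{i}") for i in range(num_rows)),
    )
    return conn


def query_plan(conn):
    """Return SQLite's description of how it would run the lookup query."""
    rows = conn.execute(
        "EXPLAIN QUERY PLAN "
        "SELECT name FROM kernels WHERE correlation_id = ?",
        (42,),
    ).fetchall()
    # The last column of each row holds the human-readable plan detail.
    return " ".join(row[-1] for row in rows)


def find_kernel(conn, correlation_id):
    """Look up one kernel name by its (hypothetical) correlation id."""
    row = conn.execute(
        "SELECT name FROM kernels WHERE correlation_id = ?",
        (correlation_id,),
    ).fetchone()
    return row[0] if row else None


conn = build_db()

# Without an index, every lookup scans the whole table.
plan_before = query_plan(conn)

# With an index on the lookup column, SQLite searches the index instead.
conn.execute("CREATE INDEX idx_corr ON kernels (correlation_id)")
plan_after = query_plan(conn)

print("before:", plan_before)
print("after: ", plan_after)
print(find_kernel(conn, 42))
```

When an analysis pass issues one lookup per kernel, replacing a scan with an index search turns each lookup from linear to logarithmic in the table size, which is one plausible source of large speedups of this kind.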