Skip to content

matrix72c/Daydream

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Intro

Experimental implementation for paper Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training.

Requirements

  • CUDA 11.7
  • linux-tools-aws
  • python3-dev
  • libunwind-dev

Test on Ubuntu 20.04, Python 3.9, PyTorch 1.13.0, CUDA 11.7.1

Usage

Import daydream, start trace at the beginning of your code and end trace at the end of your code. For an example, see train.py.

Roadmap

  • Trace six activities, including Memory Copy, Memory Set, Kernel Execution, CUDA Driver and CUDA Runtime.
  • Packaged as PyThon extension.
  • Add trace visualization and analysis tools.
  • Modify DNN framework for kernel to layer mapping.
  • Construct dependency graph and expose graph trasformation APIs for users to manipulate the graph.

Reference

NVIDIA/cuda-samples

NihalHarish/cupti-tracer

CUDA Driver API vs. CUDA runtime

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published