-
AMD
- Sunnyvale, CA
-
09:15
(UTC -08:00) - https://www.linkedin.com/in/junliume/
- @junliume
Pinned Loading
-
-
ROCm/composable_kernel
ROCm/composable_kernel PublicComposable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
-
ROCm/rocComposer
ROCm/rocComposer PublicAMD composer for High Performance Deep Learning Kernels and Libraries
-
ROCm/AITemplate
ROCm/AITemplate PublicForked from facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
-
TorchBench
TorchBench PublicForked from pytorch/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Python
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.