yiakwy-xpu-ml-framework-team

Follow

💭

I may be slow to respond.

yiakwy-xpu-ml-framework-team

💭

I may be slow to respond.

Follow

12 followers · 59 following

Graphcore
Bristol
00:26 (UTC -12:00)

Achievements

Achievements

yiakwy-xpu-ml-framework-team/README.md

👋 Hi, I’m @yiakwy-xpu-ml-framework-team
👀 I’m interested in accelerating the word through algorithms, chips and intelligence. (compiler/transpiler, c++ ops development/optimization for critical path of overall performance and python bindings for HPC application.)
🌱 I’m currently working on core framework infrastracture and AI compilier technologies.
📫 Please drop me a message through [email protected]

Popular repositories Loading

NV_grouped_gemm NV_grouped_gemm Public

Forked from fanshiqing/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM for MoE.

Cuda 4
Tooklkit-remote-pdb-for-pytorch-distributed Tooklkit-remote-pdb-for-pytorch-distributed Public

Debugging torch distributed program

Python 3
GC-OXFORD-CVPR2021-gbp-poplar GC-OXFORD-CVPR2021-gbp-poplar Public

Forked from joeaortiz/gbp-poplar

Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

C++ 2
NV-DOCA-code-examples NV-DOCA-code-examples Public

Forked from openhackathons-org/NVIDIA-DOCA-App-Code-Sharing

DOCA Application code sharing Contest

2
NV-nccl-tests NV-nccl-tests Public

Forked from NVIDIA/nccl-tests

NCCL Tests

Cuda 2
llama.cpp llama.cpp Public

Forked from ggerganov/llama.cpp

Port of Facebook's LLaMA model in C/C++

C 1