Popular repositories
- models (forked from onnx/models; Jupyter Notebook): A collection of pre-trained, state-of-the-art models in the ONNX format.
- onnxruntime (forked from microsoft/onnxruntime; C++): ONNX Runtime, a cross-platform, high-performance ML inferencing and training accelerator.
- onnxruntime-inference-examples (forked from microsoft/onnxruntime-inference-examples; Python): Examples for using ONNX Runtime for machine learning inferencing.
- optimum-intel (forked from huggingface/optimum-intel; Python): 🤗 Optimum Intel: Accelerate inference with Intel optimization tools.
- auto-round (forked from intel/auto-round; Python): SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".