Repositories list
540 repositories
- Fuser (Public)
- NeMo (Public): A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- cccl (Public): CUDA Core Compute Libraries
- TensorRT-LLM (Public): TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
- edk2 (Public)
- bionemo-framework (Public): BioNeMo Framework: For building and adapting AI models in drug discovery at scale
- spark-rapids-examples (Public)
- garak (Public): The LLM vulnerability scanner
- TensorRT-Model-Optimizer (Public): TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, and distillation. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
- NeMo-Curator (Public): Scalable data preprocessing and curation toolkit for LLMs
- spark-rapids (Public)
- multi-storage-client (Public)
- k8s-nim-operator (Public)
- cuda-quantum (Public): C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
- cuda-q-academic (Public)
- GenerativeAIExamples (Public): Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
- AIStore: scalable storage for AI applications
- spark-rapids-jni (Public)
- Documentation repository for NVIDIA Cloud Native Technologies
- topograph (Public)
- Megatron-LM (Public): Ongoing research training transformer models at scale
- Cosmos-Tokenizer (Public): A suite of image and video neural tokenizers