- NVIDIA, ex-Amazon, ex-AMD
- San Jose
- 18:43 (UTC -12:00)
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Mar 14, 2024
pytorch-lightning Public
Forked from Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Python Apache License 2.0 Updated Feb 27, 2024
LLMTest_NeedleInAHaystack Public
Forked from gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Jupyter Notebook Other Updated Feb 26, 2024
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
Python MIT License Updated Feb 24, 2024
FastChat Public
Forked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 Updated Feb 19, 2024
apex Public
Forked from NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License Updated Feb 9, 2024
moondream Public
Forked from vikhyat/moondream
tiny vision language model
Python Updated Jan 26, 2024
TransformerEngine Public
Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
Python Apache License 2.0 Updated Mar 17, 2023
NeMo Public
Forked from NVIDIA/NeMo
NeMo: a toolkit for conversational AI
Python Apache License 2.0 Updated Mar 17, 2023
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python Other Updated Mar 14, 2023
NeMo-Megatron-Launcher Public
Forked from NVIDIA/NeMo-Framework-Launcher
NeMo Megatron launcher and tools
Python Apache License 2.0 Updated Feb 24, 2023