Stars
A modern model graph visualizer and debugger
A framework for few-shot evaluation of language models.
Omnitrace: Application Profiling, Tracing, and Analysis
Advanced Profiling and Analytics for AMD Hardware
The ROCdebug-agent is a library that can be loaded by the ROCm Platform Runtime to provide debugging functionality.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Next generation BLAS implementation for ROCm platform
AMD lab notes with code examples demonstrating the use of AMD GPUs
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
Continuous builder and binary build scripts for pytorch
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our conti…
NVIDIA Linux open GPU with P2P support
⭐ CLI tool for sorting dependent repos by stars
Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.
A Native-PyTorch Library for LLM Fine-tuning
An innovative library for efficient LLM inference via low-bit quantization
Manage scalable open LLM inference endpoints in Slurm clusters
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
FlashInfer: Kernel Library for LLM Serving