Skip to content
View dukeboard's full-sized avatar

Organizations

@EnTiMid @kevoree @greycat-incubator

Block or report dukeboard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

14 stars written in Cuda
Clear filter

LLM training in simple, raw C/CUDA

Cuda 23,641 2,644 Updated Oct 2, 2024

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 15,872 1,907 Updated Apr 18, 2024

Fast parallel CTC.

Cuda 4,065 1,042 Updated Mar 4, 2024

Fully Convolutional Instance-aware Semantic Segmentation

Cuda 1,564 413 Updated Sep 27, 2021

GPU database engine

Cuda 1,170 120 Updated Jan 30, 2017

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,021 200 Updated Jun 8, 2023

Learn CUDA Programming, published by Packt

Cuda 993 234 Updated Dec 30, 2023

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 535 107 Updated Aug 14, 2024

Distributed multigrid linear solver library on GPU

Cuda 481 139 Updated Aug 14, 2024

A simple high performance CUDA GEMM implementation.

Cuda 326 36 Updated Jan 4, 2024

A CUDNN minimal deep learning training code sample using LeNet.

Cuda 258 92 Updated Jul 30, 2023

FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python

Cuda 105 20 Updated Oct 2, 2024

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 45 37 Updated Aug 26, 2024

webgpu GPU code implementation, including CUDA, OpenCL, OpenACC and C++ AMP

Cuda 18 6 Updated Apr 7, 2015