Starred repositories
Instant neural graphics primitives: lightning fast NeRF and more
Fully Convolutional Instance-aware Semantic Segmentation
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Learn CUDA Programming, published by Packt
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
A simple high performance CUDA GEMM implementation.
A minimal cuDNN deep learning training code sample using LeNet.
FLAME GPU 2 is a GPU-accelerated agent-based modelling framework for CUDA C++ and Python
webgpu GPU code implementation, including CUDA, OpenCL, OpenACC and C++ AMP