braintaken

Popular repositories

  1. cublas_gemm_benchmark Public

    Forked from zeng-zuoqi/cublas_gemm_benchmark

    cuBLAS GEMM benchmark (FP32, FP16, INT8, FP16 with Tensor Cores, INT8 with Tensor Cores)

    Cuda

  2. xfft Public

    Forked from XiuYuLi/xfft

    Highly optimized FFT library based on CUDA (as fast as cuFFT, and sometimes faster)

    C

  3. simdutf Public

    Forked from simdutf/simdutf

    Unicode routines (UTF8, UTF16, UTF32): billions of characters per second using SSE2, AVX2, NEON, AVX-512. Part of Node.js.

    C++

  4. lmdeploy Public

    Forked from InternLM/lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs

    C++

Repositories

Showing 4 of 4 repositories
  • lmdeploy Public Forked from InternLM/lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs

    C++ 0 Apache-2.0 392 0 0 Updated Aug 4, 2023
  • simdutf Public Forked from simdutf/simdutf

    Unicode routines (UTF8, UTF16, UTF32): billions of characters per second using SSE2, AVX2, NEON, AVX-512. Part of Node.js.

    C++ 0 Apache-2.0 71 0 0 Updated Jun 29, 2023
  • cublas_gemm_benchmark Public Forked from zeng-zuoqi/cublas_gemm_benchmark

    cuBLAS GEMM benchmark (FP32, FP16, INT8, FP16 with Tensor Cores, INT8 with Tensor Cores)

    Cuda 0 6 0 0 Updated Apr 12, 2019
  • xfft Public Forked from XiuYuLi/xfft

    Highly optimized FFT library based on CUDA (as fast as cuFFT, and sometimes faster)

    C 0 9 0 0 Updated Jun 13, 2017

People

This organization has no public members. You must be a member to see who’s a part of this organization.
