I'm just a generic research assistant who likes to program for fun. I'm curious about acceleration, specifically the effective and efficient use of LLMs on the edge. Think Mixtral 8x7B in your pocket.
Popular repositories
- qwen (Public, forked from QwenLM/Qwen)
  The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
  Language: Python
- pytorch (Public, forked from pytorch/pytorch)
  Tensors and Dynamic neural networks in Python with strong GPU acceleration
  Language: Python
- mistral (Public, forked from mistralai/mistral-inference)
  Reference implementation of Mistral AI 7B v0.1 model.
  Language: Jupyter Notebook
- chroma (Public, forked from chroma-core/chroma)
  The AI-native open-source embedding database
  Language: Python