Block or Report
Block or report senwang86
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (9)
Sort Oldest
LLM
LLM related repos, e.g., application, infra, frameworkPaper Source Code
Source code to reproduce the result in papersSmartCode-LLM
LLM applications regarding code-related tasks, for example, auto-completion, API documentation generation, etc.LLM-RAG
Vector database
Interview material
Web
Prompt Engineering
Stars
Language
Sort by: Recently starred
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
RES-Q: Evaluating the Code-Editing Capability of Large Language Model Systems at the Repository Scale
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Fast and memory-efficient exact attention
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Diffusion on syntax trees for program synthesis
A Python framework for high performance GPU simulation and graphics
A nanoGPT pipeline packed in a spreadsheet
"XRec: Large Language Models for Explainable Recommendation"
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
Mora: More like Sora for Generalist Video Generation
Large Action Model framework to develop AI Web Agents
Code for reproducing our paper "Not All Language Model Features Are Linear"
Scalable neural net training via automatic normalization in the modular norm.
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM