Skip to content
View senwang86's full-sized avatar
Block or Report

Block or report senwang86

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 46,380 4,483 Updated Jul 14, 2024

RES-Q: Evaluating the Code-Editing Capability of Large Language Model Systems at the Repository Scale

Python 20 1 Updated Jun 28, 2024

The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Python 47 2 Updated Jul 18, 2024

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 589 56 Updated Jun 24, 2024

LLM101n: Let's build a Storyteller

21,654 1,089 Updated Jul 18, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,017 66 Updated Jun 21, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,543 91 Updated Jul 10, 2024
Python 422 50 Updated Jul 12, 2024

Simplified Masked Diffusion Language Model

Python 118 9 Updated Jul 9, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 703 38 Updated Jul 11, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,039 38 Updated Jul 14, 2024

Fast and memory-efficient exact attention

Python 12,439 1,107 Updated Jul 15, 2024
Python 94 6 Updated May 23, 2024

Home of StarCoder2!

Python 1,622 151 Updated Mar 21, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,385 68 Updated Jul 3, 2024

Diffusion on syntax trees for program synthesis

Python 388 19 Updated Jun 27, 2024

gpt-2 from scratch in mlx

Python 328 22 Updated Jun 12, 2024

A Python framework for high performance GPU simulation and graphics

Python 3,868 213 Updated Jul 18, 2024

A nanoGPT pipeline packed in a spreadsheet

2,000 118 Updated Jun 17, 2024

Implementation for MatMul-free LM.

Python 2,689 163 Updated Jun 27, 2024

"XRec: Large Language Models for Explainable Recommendation"

Python 77 5 Updated Jun 21, 2024

Mamba SSM architecture

Python 11,774 967 Updated Jul 18, 2024

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Python 60 1 Updated Apr 4, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,439 91 Updated Jun 21, 2024

Large Action Model framework to develop AI Web Agents

Python 5,016 429 Updated Jul 18, 2024

Code for reproducing our paper "Not All Language Model Features Are Linear"

Python 50 2 Updated Jun 10, 2024

Scalable neural net training via automatic normalization in the modular norm.

Jupyter Notebook 79 4 Updated Jun 18, 2024

LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks

Python 32 1 Updated Jun 3, 2024

This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM

Python 52 2 Updated May 28, 2024
Next