Skip to content
View senwang86's full-sized avatar
Block or Report

Block or report senwang86

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,006 62 Updated Jun 21, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,438 88 Updated Jun 21, 2024
Python 409 50 Updated Jun 25, 2024

Simplified Masked Diffusion Language Model

Python 110 7 Updated Jun 28, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 671 35 Updated Jun 27, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 979 34 Updated Jun 29, 2024

Fast and memory-efficient exact attention

Python 11,905 1,055 Updated Jul 6, 2024
Python 90 6 Updated May 23, 2024

Home of StarCoder2!

Python 1,599 151 Updated Mar 21, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,258 63 Updated Jul 3, 2024

Diffusion on syntax trees for program synthesis

Python 386 19 Updated Jun 27, 2024

gpt-2 from scratch in mlx

Python 321 21 Updated Jun 12, 2024

A Python framework for high performance GPU simulation and graphics

Python 3,642 205 Updated Jul 7, 2024

A nanoGPT pipeline packed in a spreadsheet

1,987 116 Updated Jun 17, 2024

Implementation for MatMul-free LM.

Python 2,614 153 Updated Jun 27, 2024

"XRec: Large Language Models for Explainable Recommendation"

Python 76 5 Updated Jun 21, 2024

Mamba SSM architecture

Python 11,540 944 Updated Jul 3, 2024

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Python 58 Updated Apr 4, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,432 89 Updated Jun 21, 2024

Large Action Model framework to develop AI Web Agents

Python 4,961 421 Updated Jul 8, 2024

Code for reproducing our paper "Not All Language Model Features Are Linear"

Python 45 2 Updated Jun 10, 2024

Scalable neural net training via automatic normalization in the modular norm.

Jupyter Notebook 76 4 Updated Jun 18, 2024

LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks

Python 32 1 Updated Jun 3, 2024

This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM

Python 50 2 Updated May 28, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 1,865 175 Updated Apr 24, 2024

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

Python 32 1 Updated Jun 15, 2024

💎 Amber the programming language compiled to bash

Rust 3,570 70 Updated Jul 7, 2024

Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.

Go 2,575 78 Updated Jul 5, 2024
Jupyter Notebook 343 36 Updated May 18, 2024
Next