Skip to content
View NoahZhang's full-sized avatar
  • Admaster
  • BeiJing

Block or report NoahZhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
1325 results for source starred repositories
Clear filter

Next-Token Prediction is All You Need

Python 781 21 Updated Sep 30, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,183 50 Updated Oct 3, 2024

Model components of the Llama Stack APIs

Python 3,056 373 Updated Oct 4, 2024

A visual and transparent alternative to open-source ChatGPT O1

Python 554 55 Updated Sep 26, 2024

A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.

Python 78 8 Updated Sep 21, 2024

A system for agentic LLM-powered data processing

Python 737 72 Updated Oct 3, 2024

🕵️‍♂️ TUI for sniffing network traffic using eBPF on Linux

Rust 642 16 Updated Oct 2, 2024

MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Python 236 9 Updated Sep 30, 2024
Python 5,963 446 Updated Oct 2, 2024

📄 A curated list of awesome .cursorrules files

581 34 Updated Sep 19, 2024

A 4-hour coding workshop to understand how LLMs are implemented and used

Jupyter Notebook 652 161 Updated Sep 20, 2024

High Performance ServiceMesh Data Plane Based on Programmable Kernel

Go 426 60 Updated Oct 1, 2024
Python 9 Updated Nov 3, 2023
Python 23 3 Updated Sep 29, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 42,991 7,715 Updated Oct 2, 2024

Unified management of projects with large model APIs, unified conversion to OpenAI format, calling multiple backend services, OpenAI, Anthropic, Gemini, Vertex, Cloudflare, DeepBricks, OpenRouter, …

CSS 85 8 Updated Sep 25, 2024

🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用

Rust 29,617 5,117 Updated Sep 29, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,332 85 Updated Sep 23, 2024

AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Python 65 9 Updated Sep 4, 2024

Vision agent

Python 1,233 125 Updated Oct 3, 2024

A language model programming library.

Python 4,266 242 Updated Oct 3, 2024

face detection face recognition包含人脸检测(retinaface,yolov5face,yolov7face,yolov8face),人脸检测跟踪(ByteTracker),人脸角度计算(Face_Angle)人脸矫正(Face_Aligner),人脸识别(Arcface),口罩检测(MaskRecognitiion),年龄性别检测(Gender_age),静…

C++ 284 60 Updated Mar 4, 2024

[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner

Python 102 7 Updated Jul 12, 2024

This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)

Python 109 5 Updated Sep 9, 2024

Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch

Python 69 5 Updated Sep 21, 2024

High-resolution models for human tasks.

Python 4,121 216 Updated Oct 3, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,446 138 Updated Sep 24, 2024
7 Updated Aug 12, 2024

An open-source RAG-based tool for chatting with your documents.

Python 13,410 998 Updated Oct 2, 2024

Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.

Python 2,205 134 Updated Oct 3, 2024
Next