Skip to content
View chr2117216003's full-sized avatar
Block or Report

Block or report chr2117216003

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,690 4,388 Updated Jul 11, 2024

SALMONN: Speech Audio Language Music Open Neural Network

Python 897 63 Updated May 28, 2024

Diffusers wrapper to run Kwai-Kolors model

Python 325 14 Updated Jul 10, 2024

Kolors Team

Python 2,146 118 Updated Jul 12, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 412 37 Updated Jul 12, 2024

Multilingual Voice Understanding Model

Python 1,185 97 Updated Jul 12, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 1,718 148 Updated Jul 12, 2024

Understand Human Behavior to Align True Needs

Python 2,256 167 Updated Jul 12, 2024
C++ 3,079 424 Updated Jul 11, 2024

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python 2,055 137 Updated Dec 12, 2023

Project page of replacing the human motion in the video with a virtual 3D human

366 24 Updated May 9, 2024

🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮

Python 115 13 Updated Jun 27, 2024

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA

Python 385 13 Updated Jul 12, 2024

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Python 219 13 Updated Jul 10, 2024

Scripts for fine-tuning Llama2 via SFT and DPO.

Python 165 37 Updated Aug 14, 2023

Reference implementation for DPO (Direct Preference Optimization)

Python 1,860 144 Updated May 23, 2024
Jupyter Notebook 3,871 498 Updated Mar 28, 2024

Brand new TTS solution

Python 5,229 413 Updated Jul 11, 2024

Bring portraits to life!

Python 5,900 476 Updated Jul 12, 2024

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,265 126 Updated Dec 8, 2023

Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"

Python 85 21 Updated Feb 14, 2023

Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation

Python 215 44 Updated May 20, 2024
Python 113 15 Updated Jan 3, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,159 2,441 Updated Jul 9, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,079 734 Updated Jul 10, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 2,991 376 Updated Jul 8, 2024

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 624 52 Updated Jul 3, 2024

FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a mu…

Python 8,331 885 Updated Apr 18, 2024

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 3,197 423 Updated Jul 10, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 508 31 Updated Jul 11, 2024
Next