Skip to content
View soonwonh's full-sized avatar
Block or Report

Block or report soonwonh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of HierSpeech++

Python 1,133 134 Updated Feb 20, 2024
Jupyter Notebook 2,856 280 Updated Feb 27, 2023

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,203 215 Updated Jun 14, 2024

Official implementation of AnimateDiff.

Python 9,775 798 Updated Jul 7, 2024

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 901 167 Updated Sep 25, 2023

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Python 1,595 304 Updated Jun 8, 2023
Python 485 48 Updated Dec 26, 2023

Denoising Diffusion Implicit Models

Python 1,310 181 Updated Apr 1, 2024

PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"

Python 186 21 Updated Aug 8, 2023

꼼꼼한 딥러닝 논문 리뷰와 코드 실습

Jupyter Notebook 1,025 324 Updated Jun 28, 2022

TruFor

Python 123 8 Updated Apr 12, 2024

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 1,853 360 Updated Jun 7, 2022

POI-Forensics

Python 49 5 Updated Dec 22, 2023

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,611 1,953 Updated Apr 16, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 7,512 963 Updated Jun 27, 2024

Denoising Diffusion Probabilistic Models

Python 3,456 349 Updated Aug 29, 2023

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

726 51 Updated Jun 26, 2024
Python 3,174 352 Updated Jun 10, 2023

Collecting papers about new view synthesis

677 53 Updated Jun 22, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,567 3,423 Updated May 18, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,198 355 Updated Apr 9, 2024

A latent text-to-image diffusion model

Jupyter Notebook 66,623 9,976 Updated Jun 18, 2024

CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Attention Transformer for Image Restoration

Python 1,148 135 Updated Jun 2, 2024
Python 376 34 Updated Nov 1, 2023

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,184 704 Updated May 2, 2023

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

Python 1,721 293 Updated Jun 4, 2023

Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Python 726 141 Updated Apr 19, 2022

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,156 2,089 Updated Jun 26, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,260 463 Updated May 31, 2024
Next