The official implementation of HierSpeech++

Python 1,172 134 Updated Feb 20, 2024
Jupyter Notebook 2,903 279 Updated Feb 27, 2023

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,307 224 Updated Jun 14, 2024

Official implementation of AnimateDiff.

Python 10,368 851 Updated Jul 31, 2024

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Python 965 172 Updated Sep 25, 2023

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Python 1,661 308 Updated Jun 8, 2023
Python 501 50 Updated Dec 26, 2023

Denoising Diffusion Implicit Models

Python 1,401 202 Updated Jul 26, 2024

PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"

Python 204 22 Updated Aug 8, 2023

Thorough deep learning paper reviews and hands-on code practice

Jupyter Notebook 1,060 321 Updated Jun 28, 2022

TruFor

Python 136 9 Updated Apr 12, 2024

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 1,907 364 Updated Jun 7, 2022

POI-Forensics

Python 55 5 Updated Dec 22, 2023

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,782 1,973 Updated Apr 16, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,050 1,006 Updated Oct 6, 2024

Denoising Diffusion Probabilistic Models

Python 3,709 363 Updated Aug 29, 2023

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).

767 53 Updated Jul 10, 2024
Python 3,212 357 Updated Jun 10, 2023

Collecting papers about new view synthesis

690 54 Updated Aug 26, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,655 3,448 Updated May 18, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,334 388 Updated Aug 19, 2024

A latent text-to-image diffusion model

Jupyter Notebook 67,788 10,108 Updated Jun 18, 2024

CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Attention Transformer for Image Restoration

Python 1,205 148 Updated Jun 2, 2024
Python 397 35 Updated Nov 1, 2023

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,267 711 Updated May 2, 2023

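This project builds on SadTalkers to implement Wav2lip lip synthesis for video. It generates audio-driven lip shapes from a video file, and a configurable enhancement mode for the face region enhances the synthesized lip (face) area, improving the clarity of the generated lips. The DAIN deep-learning frame-interpolation algorithm is used to add frames to the generated video, filling in the motion transitions between frames so that the synthesized lips look smoother, more realistic, and more natural.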

Python 1,831 316 Updated Jun 4, 2023

Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Python 783 143 Updated Apr 19, 2022

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,744 2,191 Updated Jun 26, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,431 478 Updated May 31, 2024