Highlights
- Pro
Block or Report
Block or report Sparks-Lu
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[arXiv'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
A unified framework for 3D content generation.
A Python package for fast and robust Image Stitching
A curated list of resources for Image and Video Deblurring
Astronomy Engine: multi-language calculation of Sun, Moon, and planet positions. Predicts lunar phases, eclipses, transits, oppositions, conjunctions, equinoxes, solstices, rise/set times, and othe…
Raspberry Pi Photo booth with Canon Selphy 1200 photo printer.
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
State-of-the-art papers for depth estimation of 360 images.
SwinIR: Image Restoration Using Swin Transformer (official repository)
Collect super-resolution related papers, data, repositories
High dynamic range (HDR) image viewer for graphics people
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Use almost any camera as a webcam—DSLRs, mirrorless, camcorders, and even point-and-shoots
📻Terminal/ssh/telnet/serialport/RDP/VNC/sftp client(linux, mac, win)
A wrapper executable that can run any executable as a Windows service, in a permissive license.
3D Visualization Projects (Planet, Orbit, Solar System)
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…