Stars
BLSP-Emo: Towards Empathetic Large Speech-Language Models
Korean Sentence Embedding Repository
CoreNet: A library for training deep neural networks
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
A few demos with Unity VisionOS 2D Window and Fully Immersive VR modes.
Diffusion model papers, survey, and taxonomy
Generate images with predefined facial expressions.
[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
Official Code and Dataset for "High-fidelity 3D Human Digitization from Single 2K Resolution Images" (CVPR 2023 Highlight)
Reading list for research topics in multimodal machine learning
✨✨Latest Advances on Multimodal Large Language Models
Deep Learning Paper Reading Meeting-Archive
ImageBind: One Embedding Space to Bind Them All
[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning
StyleGAN2 - Official TensorFlow Implementation
Resources for participants of the Synthetic Data for Instrument Segmentation in Surgery (Syn-ISS) challenge at MICCAI 2023 organized by Surgical Science.