Skip to content

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Notifications You must be signed in to change notification settings

wscffaa/cv-arxiv-daily

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2024.07.10

Table of Contents
  1. Diffusion-Models
  2. Super-Resolution
  3. Image-Super-Resolution
  4. Video-Super-Resolution
  5. Image-Colorization
  6. Video-Colorization
  7. Image Restoration
  8. Image Reconstruction
  9. Image Denoising
  10. Image Inpainting
  11. Style Transfer

Diffusion-Models

Publish Date Title Authors PDF Code
2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077v1 link
2024-07-09 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models Bowen Zhang et.al. 2407.06938v1 null
2024-07-09 HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance Guian Fang et.al. 2407.06937v1 link
2024-07-09 A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term Romina Travaglini et.al. 2407.06802v1 null
2024-07-09 Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning Fanyue Wei et.al. 2407.06642v1 link
2024-07-09 Mobius: An High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task Yiran Yang et.al. 2407.06617v1 null
2024-07-09 VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving Yibo Liu et.al. 2407.06516v1 null
2024-07-09 Sketch-Guided Scene Image Generation Tianyu Zhang et.al. 2407.06469v1 null
2024-07-08 Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation Jianuo Huang et.al. 2407.06317v1 null
2024-07-08 VIMI: Grounding Video Generation through Multi-modal Instruction Yuwei Fang et.al. 2407.06304v1 null
2024-07-08 Beyond theory driven discovery: hot random search and datum derived structures Chris J. Pickard et.al. 2407.06294v1 null
2024-07-08 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation Yu Zeng et.al. 2407.06187v1 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174v1 null
2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135v1 link
2024-07-08 Structured Generations: Using Hierarchical Clusters to guide Diffusion Models Jorge da Silva Goncalves et.al. 2407.06124v1 null
2024-07-08 PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models Jinhua Zhang et.al. 2407.06109v1 link
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095v1 null
2024-07-08 Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis Emaad Khwaja et.al. 2407.06079v1 null
2024-07-08 Analysis and finite element approximation of a diffuse interface approach to the Stokes--Biot coupling Francis R. A. Aznaran et.al. 2407.05949v1 null
2024-07-08 Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling Lintao Zhang et.al. 2407.05875v1 link
2024-07-08 RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features Inye Na et.al. 2407.05683v1 link
2024-07-08 BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space Yumeng Zhang et.al. 2407.05679v1 link
2024-07-08 Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder Jia Liu et.al. 2407.05552v1 null
2024-07-08 Read, Watch and Scream! Sound Generation from Text and Video Yujin Jeong et.al. 2407.05551v1 null
2024-07-08 LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction Kanghao Chen et.al. 2407.05547v1 null
2024-07-07 Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation Marina Domínguez et.al. 2407.05428v1 link
2024-07-07 BiRoDiff: Diffusion policies for bipedal robot locomotion on unseen terrains GVS Mothish et.al. 2407.05424v1 null
2024-07-07 Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model Danni Yang et.al. 2407.05352v1 null
2024-07-07 Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models Chun-Mei Feng et.al. 2407.05323v1 null
2024-07-07 An Improved Method for Personalizing Diffusion Models Yan Zeng et.al. 2407.05312v1 null
2024-07-07 DM-MIMO: Diffusion Models for Robust Semantic Communications over MIMO Channels Yiheng Duan et.al. 2407.05289v1 null
2024-07-07 Gradient Diffusion: A Perturbation-Resilient Gradient Leakage Attack Xuan Liu et.al. 2407.05285v1 null
2024-07-07 Multi-scale Conditional Generative Modeling for Microscopic Image Restoration Luzhe Huang et.al. 2407.05259v1 null
2024-07-06 FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning Boyu Fan et.al. 2407.05098v1 null
2024-07-06 Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Bridge Diffusion Model Kyobin Choo et.al. 2407.05059v1 link
2024-07-06 Laminar-Turbulent Patterns in Shear Flows : Evasion of Tipping, Saddle-Loop Bifurcation and Log scaling of the Turbulent Fraction Pavan V. Kashyap et.al. 2407.04993v1 null
2024-07-06 FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior Zhekai Chen et.al. 2407.04947v1 null
2024-07-05 Improving ensemble extreme precipitation forecasts using generative artificial intelligence Yingkai Sha et.al. 2407.04882v1 null
2024-07-05 Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates Ryotaro Okabe et.al. 2407.04557v1 null
2024-07-05 Unified continuous-time q-learning for mean-field game and mean-field control problems Xiaoli Wei et.al. 2407.04521v1 null
2024-07-08 Speed-accuracy trade-off for the diffusion models: Wisdom from nonequilibrium thermodynamics and optimal transport Kotaro Ikeda et.al. 2407.04495v2 null
2024-07-05 PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation Yinghua Yao et.al. 2407.04493v1 null
2024-07-05 VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing Shang Liu et.al. 2407.04461v1 null
2024-07-05 Comparing metallicity correlations in nearby non-AGN and AGN-host galaxies Song-lin Li et.al. 2407.04252v1 null
2024-07-05 GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction Yuxuan Mu et.al. 2407.04237v1 null
2024-07-05 T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models Zhongqi Wang et.al. 2407.04215v1 link
2024-07-05 TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation Jian Qian et.al. 2407.04211v1 null
2024-07-04 Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions Panagiotis Alimisis et.al. 2407.04103v1 null
2024-07-04 Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection Federico Girella et.al. 2407.03961v1 link
2024-07-04 The second-order Esscher martingale densities for continuous-time market models Tahir Choulli et.al. 2407.03960v1 null
2024-07-04 Timestep-Aware Correction for Quantized Diffusion Models Yuzhe Yao et.al. 2407.03917v1 null
2024-07-04 Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy Lijun Bo et.al. 2407.03888v1 null
2024-07-04 Generative Technology for Human Emotion Recognition: A Scope Review Fei Ma et.al. 2407.03640v1 null
2024-07-04 Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration Yuhong Zhang et.al. 2407.03636v1 null
2024-07-04 MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration Yuhong Zhang et.al. 2407.03635v1 null
2024-07-04 Feedback-guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization Mehrdad Noori et.al. 2407.03588v1 link
2024-07-03 HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation Tao Chen et.al. 2407.03548v1 link
2024-07-03 BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement Ruirui Lin et.al. 2407.03535v1 null
2024-07-03 DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents Yilun Xu et.al. 2407.03300v1 null
2024-07-03 Improved Noise Schedule for Diffusion Training Tiankai Hang et.al. 2407.03297v1 null
2024-07-04 Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis Tong Zhou et.al. 2407.03089v2 null
2024-07-03 Electromagnetic Property Sensing Based on Diffusion Model in ISAC System Yuhua Jiang et.al. 2407.03075v1 null
2024-07-03 Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models Chunmei Xu et.al. 2407.03050v1 null
2024-07-03 SlerpFace: Face Template Protection via Spherical Linear Interpolation Zhizhou Zhong et.al. 2407.03043v1 null
2024-07-03 Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation Xiang Gao et.al. 2407.03006v1 link
2024-07-04 VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors Sungwon Hwang et.al. 2407.02945v2 null
2024-07-03 Single Image Rolling Shutter Removal with Diffusion Models Zhanglei Yang et.al. 2407.02906v1 null
2024-07-03 Robot Shape and Location Retention in Video Generation Using Diffusion Models Peng Wang et.al. 2407.02873v1 null
2024-07-03 Mirage Sources and Large TeV Halo-Pulsar Offsets: Exploring the Parameter Space Yiwei Bao et.al. 2407.02829v1 null
2024-07-03 Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models Jiayue Chu et.al. 2407.02744v1 null
2024-07-02 No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models Seyedmorteza Sadat et.al. 2407.02687v1 null
2024-07-02 Diffusion Models for Tabular Data Imputation and Synthetic Data Generation Mario Villaizán-Vallelado et.al. 2407.02549v1 null
2024-07-02 Magic Insert: Style-Aware Drag-and-Drop Nataniel Ruiz et.al. 2407.02489v1 null
2024-07-03 Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models Fei Shen et.al. 2407.02482v2 null
2024-07-02 GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models Jian Ma et.al. 2407.02252v1 link
2024-07-02 LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation Jiarui Xing et.al. 2407.02229v1 null
2024-07-04 UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Jingjing Ren et.al. 2407.02158v2 null
2024-07-02 Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection Chunjing Xiao et.al. 2407.02143v1 link
2024-07-04 Latent Diffusion Model for Generating Ensembles of Climate Simulations Johannes Meuer et.al. 2407.02070v2 null
2024-07-02 Accompanied Singing Voice Synthesis with Fully Text-controlled Melody Ruiqi Li et.al. 2407.02049v1 null
2024-07-02 ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation Zhiyuan Ma et.al. 2407.02040v1 link
2024-07-02 SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules Suyi Li et.al. 2407.02031v1 null
2024-07-02 Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model Cong Cao et.al. 2407.01960v1 null
2024-07-02 LDP: A Local Diffusion Planner for Efficient Robot Navigation and Collision Avoidance Wenhao Yu et.al. 2407.01950v1 null
2024-07-04 GVDIFF: Grounded Text-to-Video Generation with Diffusion Models Huanzhang Dou et.al. 2407.01921v2 null
2024-07-02 Enhancing Multi-Class Anomaly Detection via Diffusion Refinement with Dual Conditioning Jiawei Zhan et.al. 2407.01905v1 null
2024-07-02 Text-Aware Diffusion for Policy Learning Calvin Luo et.al. 2407.01903v1 null
2024-07-01 Equivariant Diffusion Policy Dian Wang et.al. 2407.01812v1 null
2024-07-01 Label-free Neural Semantic Image Synthesis Jiayi Wang et.al. 2407.01790v1 null
2024-07-01 Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization Siyi Gu et.al. 2407.01648v1 null
2024-06-29 Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization Taeyoung Yun et.al. 2407.01624v1 null
2024-07-01 Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing Bingliang Zhang et.al. 2407.01521v1 null
2024-07-01 DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models Chang-Han Yeh et.al. 2407.01519v1 null
2024-07-01 EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning Jingyun Yang et.al. 2407.01479v1 null
2024-07-01 FORA: Fast-Forward Caching in Diffusion Transformer Acceleration Pratheba Selvaraju et.al. 2407.01425v1 null
2024-07-04 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Boyuan Chen et.al. 2407.01392v3 link
2024-07-01 Learning data efficient coarse-grained molecular dynamics from forces and noise Aleksander E. P. Durumeric et.al. 2407.01286v1 null
2024-07-01 Semantic-guided Adversarial Diffusion Model for Self-supervised Shadow Removal Ziqi Zeng et.al. 2407.01104v1 null
2024-07-01 Blind Inversion using Latent Diffusion Priors Weimin Bai et.al. 2407.01027v1 null
2024-07-01 An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations Weimin Bai et.al. 2407.01014v1 null
2024-07-01 Hybrid RAG-empowered Multi-modal LLM for Secure Healthcare Data Management: A Diffusion-based Contract Theory Approach Cheng Su et.al. 2407.00978v1 null
2024-07-01 Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction Bin Huang et.al. 2407.00944v1 null
2024-07-01 Mittag-Leffler stability of complete monotonicity-preserving schemes for time-dependent coefficients sub-diffusion equations Wen Dong et.al. 2407.00893v1 null
2024-06-30 InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation Haofan Wang et.al. 2407.00788v1 null
2024-06-30 Diffusion Models and Representation Learning: A Survey Michael Fuest et.al. 2407.00783v1 link
2024-06-30 Chest-Diffusion: A Light-Weight Text-to-Image Model for Report-to-CXR Generation Peng Huang et.al. 2407.00752v1 null
2024-06-30 Posterior Sampling with Denoising Oracles via Tilted Transport Joan Bruna et.al. 2407.00745v1 null
2024-07-03 Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints Jianuo Huang et.al. 2407.00741v2 null
2024-06-30 LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation Mushui Liu et.al. 2407.00737v1 null
2024-06-30 Generative prediction of flow field based on the diffusion model Jiajun Hu et.al. 2407.00735v1 null
2024-06-30 Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models Sangwoong Yoon et.al. 2407.00626v1 null
2024-06-30 Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness Yiquan Li et.al. 2407.00623v1 null
2024-06-30 Diff-BBO: Diffusion-Based Inverse Modeling for Black-Box Optimization Dongxia Wu et.al. 2407.00610v1 null
2024-06-30 GenderBias-\emph{VL}: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing Yisong Xiao et.al. 2407.00600v1 null
2024-06-29 Accelerating Longitudinal MRI using Prior Informed Latent Diffusion Yonatan Urman et.al. 2407.00537v1 null
2024-06-29 Toward a Diffusion-Based Generalist for Dense Vision Tasks Yue Fan et.al. 2407.00503v1 null
2024-06-29 OccFusion: Rendering Occluded Humans with Generative Diffusion Priors Adam Sun et.al. 2407.00316v1 null
2024-06-29 A new characterization of the dissipation structure and the relaxation limit for the compressible Euler-Maxwell system Timothée Crin-Barat et.al. 2407.00277v1 null
2024-06-28 DiffuseDef: Improved Robustness to Adversarial Attacks Zhenhao Li et.al. 2407.00248v1 null
2024-06-28 HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model Hieu T. Nguyen et.al. 2406.20077v1 null
2024-06-28 Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence Xiantao Fan et.al. 2406.20047v1 null
2024-06-28 HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI Haykel Snoussi et.al. 2406.20042v1 null
2024-06-28 Deceptive Diffusion: Generating Synthetic Adversarial Examples Lucas Beerens et.al. 2406.19807v1 null
2024-06-28 Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting Wei Li et.al. 2406.19796v1 link
2024-06-28 Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels Jie Zhang et.al. 2406.19769v1 null
2024-06-28 DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems Kexiong Yu et.al. 2406.19705v1 null
2024-06-28 Network Bending of Diffusion Models for Audio-Visual Generation Luke Dzwonczyk et.al. 2406.19589v1 null
2024-06-27 A Thermal Study of Terahertz Induced Protein Interactions Hadeel Elayan et.al. 2406.19521v1 null
2024-06-27 pop-cosmos: Scaleable inference of galaxy properties and redshifts with a data-driven population model Stephen Thorp et.al. 2406.19437v1 null
2024-06-27 Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations Jaehong Chung et.al. 2406.19333v1 null
2024-06-27 Subtractive Training for Music Stem Insertion using Latent Diffusion Models Ivan Villa-Renteria et.al. 2406.19328v1 null
2024-06-27 Compositional Image Decomposition with Diffusion Models Jocelin Su et.al. 2406.19298v1 null
2024-06-27 Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model Jiangtong Tan et.al. 2406.19030v1 null
2024-06-28 AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation Yanan Sun et.al. 2406.18958v2 null
2024-06-27 Investigating and Defending Shortcut Learning in Personalized Diffusion Models Yixin Liu et.al. 2406.18944v1 null
2024-06-28 AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models Aishwarya Agarwal et.al. 2406.18893v2 null
2024-06-27 Chemical Continuous Time Random Walks under Anomalous Diffusion Hong Zhang et.al. 2406.18869v1 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524v1 null
2024-06-26 Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Kang Liao et.al. 2406.18516v1 link
2024-06-26 DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance Younghyun Kim et.al. 2406.18459v1 null
2024-06-26 Towards diffusion models for large-scale sea-ice modelling Tobias Sebastian Finn et.al. 2406.18417v1 null
2024-06-27 Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process Tianyu Lin et.al. 2406.18361v2 link
2024-06-26 Molecular Diffusion Models with Virtual Receptors Matan Halfon et.al. 2406.18330v1 null
2024-06-26 Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models Lars Doorenbos et.al. 2406.18175v1 link
2024-06-26 Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models Xiaolin Hong et.al. 2406.18159v1 null
2024-06-26 Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation Qilai Zhang et.al. 2406.18054v1 link
2024-06-25 DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang et.al. 2406.17763v1 link
2024-06-25 Unified Auto-Encoding with Masked Diffusion Philippe Hansen-Estruch et.al. 2406.17688v1 link
2024-06-25 LaTable: Towards Large Tabular Models Boris van Breugel et.al. 2406.17673v1 null
2024-06-25 Aligning Diffusion Models with Noise-Conditioned Perception Alexander Gambashidze et.al. 2406.17636v1 null
2024-06-25 Diffusion-based Adversarial Purification for Intrusion Detection Mohamed Amine Merzouk et.al. 2406.17606v1 null
2024-06-25 Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text Xinyang Li et.al. 2406.17601v1 link
2024-06-25 Detection of Synthetic Face Images: Accuracy, Robustness, Generalization Nela Petrzelkova et.al. 2406.17547v1 null
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541v1 null
2024-06-25 The Tree of Diffusion Life: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models Vidya Prasad et.al. 2406.17462v1 null
2024-06-25 SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing Ruihuang Li et.al. 2406.17396v1 null
2024-06-25 Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers Lei Chen et.al. 2406.17343v1 link
2024-06-25 Generative Modelling of Structurally Constrained Graphs Manuel Madeira et.al. 2406.17341v1 link
2024-06-25 Disentangled Motion Modeling for Video Frame Interpolation Jaihyun Lew et.al. 2406.17256v1 link
2024-06-25 Expansive Synthesis: Generating Large-Scale Datasets from Minimal Samples Vahid Jebraeeli et.al. 2406.17238v1 null
2024-06-25 LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing Aoyang Liu et.al. 2406.17236v1 null
2024-06-26 Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks Zihao Jin et.al. 2406.17173v2 null
2024-06-24 Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation Zhenyi Liao et.al. 2406.17100v1 null
2024-06-23 On Instabilities of Unsupervised Denoising Diffusion Models in Magnetic Resonance Imaging Reconstruction Tianyu Han et.al. 2406.16983v1 null
2024-06-24 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Haonan Qiu et.al. 2406.16863v1 link
2024-06-24 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation Junbang Liang et.al. 2406.16862v1 null
2024-06-24 General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design Yue Jian et.al. 2406.16821v1 null
2024-06-24 Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image Jinkun Hao et.al. 2406.16710v1 null
2024-07-01 Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling Min-Seop Kwak et.al. 2406.16695v2 null
2024-06-24 Repulsive Score Distillation for Diverse Sampling of Diffusion Models Nicolas Zilberstein et.al. 2406.16683v1 link
2024-06-24 OAML: Outlier Aware Metric Learning for OOD Detection Enhancement Heng Gao et.al. 2406.16525v1 link
2024-06-24 DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution Aiwen Jiang et.al. 2406.16477v1 null
2024-06-24 ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance Shuwei Shi et.al. 2406.16476v1 null
2024-06-24 Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models Yichen Sun et.al. 2406.16333v1 null
2024-06-24 YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals Sandeep Mishra et.al. 2406.16273v1 null
2024-06-24 Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement Zhiyuan Chang et.al. 2406.16272v1 null
2024-06-24 Video-Infinity: Distributed Long Video Generation Zhenxiong Tan et.al. 2406.16260v1 null
2024-06-23 Provable Statistical Rates for Consistency Diffusion Models Zehao Dou et.al. 2406.16213v1 null
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129v1 null
2024-06-23 Diffusion Spectral Representation for Reinforcement Learning Dmitry Shribak et.al. 2406.16121v1 null
2024-06-23 Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification Inès Hyeonsu Kim et.al. 2406.16042v1 null
2024-06-23 TimeAutoDiff: Combining Autoencoder and Diffusion model for time series tabular data synthesizing Namjoon Suh et.al. 2406.16028v1 null
2024-06-22 PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection Alvaro Lopez Pellcier et.al. 2406.15921v1 null
2024-06-22 Soft Masked Mamba Diffusion Model for CT to MRI Conversion Zhenbin Wang et.al. 2406.15910v1 link
2024-06-22 EmoAttack: Emotion-to-Image Diffusion Models for Emotional Backdoor Generation Tianyu Wei et.al. 2406.15863v1 null
2024-06-22 MVOC: a training-free multiple video object composition method with diffusion models Wei Wang et.al. 2406.15829v1 null
2024-06-22 PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud by 2D Inpainting Qiao Yu et.al. 2406.15811v1 link
2024-06-22 Rethinking the Diffusion Models for Numerical Tabular Data Imputation from the Perspective of Wasserstein Gradient Flow Zhichao Chen et.al. 2406.15762v1 null
2024-06-22 Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model Min Zhao et.al. 2406.15735v1 null
2024-06-21 Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction Mojtaba Safari et.al. 2406.15656v1 null
2024-06-21 Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild Nadav Orzech et.al. 2406.15331v1 null
2024-06-21 You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation Hongyu Chen et.al. 2406.15269v1 null
2024-06-21 Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior Junbo Peng et.al. 2406.15219v1 null
2024-06-21 A3D: Does Diffusion Dream about 3D Alignment? Savva Ignatyev et.al. 2406.15020v1 null
2024-06-21 Probabilistic and Differentiable Wireless Simulation with Geometric Transformers Thomas Hehn et.al. 2406.14995v1 null
2024-06-21 VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation Zixuan Chen et.al. 2406.14964v1 null
2024-06-24 LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multi-modal Foundation Models Mengdan Zhu et.al. 2406.14862v2 null
2024-06-21 Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models Jie Ren et.al. 2406.14855v1 null
2024-06-21 DExter: Learning and Controlling Performance Expression with Diffusion Models Huan Zhang et.al. 2406.14850v1 null
2024-06-21 Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning Xu Han et.al. 2406.14847v1 null
2024-06-21 Latent diffusion models for parameterization and data assimilation of facies-based geomodels Guido Di Federico et.al. 2406.14815v1 null
2024-06-21 Probabilistic Emulation of a Global Climate Model with Spherical DYffusion Salva Rühling Cachay et.al. 2406.14798v1 null
2024-06-20 Regularized Distribution Matching Distillation for One-step Unpaired Image-to-Image Translation Denis Rakitin et.al. 2406.14762v1 null
2024-06-20 Diffusion-Based Failure Sampling for Cyber-Physical Systems Harrison Delecki et.al. 2406.14761v1 link
2024-06-20 Computing Nonequilibrium Responses with Score-shifted Stochastic Differential Equations Jérémie Klinger et.al. 2406.14752v1 null
2024-06-20 Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models Matthew Zheng et.al. 2406.14599v1 null
2024-06-20 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models Xincheng Shuai et.al. 2406.14555v1 link
2024-06-21 Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation Eyal Michaeli et.al. 2406.14551v2 link
2024-06-20 Consistency Models Made Easy Zhengyang Geng et.al. 2406.14548v1 link
2024-06-20 Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps Nikita Starodubcev et.al. 2406.14539v1 null
2024-06-20 V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data Rotem Shalev-Arkushin et.al. 2406.14510v1 null
2024-06-20 SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset Josef Dai et.al. 2406.14477v1 link
2024-06-20 CollaFuse: Collaborative Diffusion Models Simeon Allmendinger et.al. 2406.14429v1 link
2024-06-20 Active Diffusion Subsampling Oisin Nolan et.al. 2406.14388v1 null
2024-06-20 In Tree Structure Should Sentence Be Generated Yaguang Li et.al. 2406.14189v1 link
2024-06-20 CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation Tingwei Liu et.al. 2406.14186v1 link
2024-06-20 ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning Zhongjie Duan et.al. 2406.14130v1 null
2024-06-20 HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models Xinrui Zhou et.al. 2406.14098v1 null
2024-06-20 Bridging bulk and surface: An interacting particle system towards the field-road diffusion model Matthieu Alfaro et.al. 2406.14093v1 null
2024-06-20 A Practical Diffusion Path for Sampling Omar Chehab et.al. 2406.14040v1 null
2024-06-20 Similarity-aware Syncretic Latent Diffusion Model for Medical Image Translation with Representation Learning Tingyi Lin et.al. 2406.13977v1 null
2024-06-20 Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models Yuan Zhong et.al. 2406.13942v1 null
2024-06-20 EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations Jie Ren et.al. 2406.13933v1 null
2024-06-19 INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction Yamin Arefeen et.al. 2406.13895v1 null
2024-06-19 Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics Weitong Zhang et.al. 2406.13652v1 null
2024-06-19 On AI-Inspired UI-Design Jialiang Wei et.al. 2406.13631v1 null
2024-06-19 Can AI be enabled to dynamical downscaling? Training a Latent Diffusion Model to mimic km-scale COSMO-CLM downscaling of ERA5 over Italy Elena Tomasi et.al. 2406.13627v1 null
2024-06-19 Enhance the Image: Super Resolution using Artificial Intelligence in MRI Ziyu Li et.al. 2406.13625v1 null
2024-06-19 Image Distillation for Safe Data Sharing in Histopathology Zhe Li et.al. 2406.13536v1 null
2024-06-19 Multi-messenger modeling of the Monogem pulsar halo Youyou Li et.al. 2406.13426v1 null
2024-06-24 Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images Haruo Fujiwara et.al. 2406.13393v2 null
2024-06-19 ARDuP: Active Region Video Diffusion for Universal Policies Shuaiyi Huang et.al. 2406.13301v1 null
2024-06-19 AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models Ken Chen et.al. 2406.13272v1 null
2024-06-19 Self-Supervised Diffusion Model for 3-D Seismic Data Reconstruction Xinyang Wang et.al. 2406.13252v1 null
2024-06-19 Neural Residual Diffusion Models for Deep Scalable Vision Generation Zhiyuan Ma et.al. 2406.13215v1 null
2024-06-24 Surgical Triplet Recognition via Diffusion Model Daochang Liu et.al. 2406.13210v2 null
2024-06-19 Diffusion Model-based FOD Restoration from High Distortion in dMRI Shuo Huang et.al. 2406.13209v1 null
2024-06-21 Conditional score-based diffusion models for solving inverse problems in mechanics Agnimitra Dasgupta et.al. 2406.13154v2 null
2024-06-19 MCAD: Multi-modal Conditioned Adversarial Diffusion Model for High-Quality PET Image Reconstruction Jiaqi Cui et.al. 2406.13150v1 null
2024-06-18 Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models Paul Henderson et.al. 2406.13099v1 null
2024-06-18 MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification Harrison Gietz et.al. 2406.13066v1 link
2024-06-18 Evaluating the design space of diffusion-based generative models Yuqing Wang et.al. 2406.12839v1 null
2024-06-18 Neural Approximate Mirror Maps for Constrained Diffusion Models Berthy T. Feng et.al. 2406.12816v1 null
2024-06-18 Extracting Training Data from Unconditional Diffusion Models Yunhao Chen et.al. 2406.12752v1 null
2024-06-18 Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation Miseul Kim et.al. 2406.12688v1 null
2024-06-21 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671v2 link
2024-06-18 Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images Shivank Garg et.al. 2406.12592v1 link
2024-06-18 Training Diffusion Models with Federated Learning Matthijs de Goede et.al. 2406.12575v1 null
2024-06-18 Variational Distillation of Diffusion Policies into Mixture of Experts Hongyi Zhou et.al. 2406.12538v1 null
2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors Panwang Pan et.al. 2406.12459v1 link
2024-06-18 Planning Using Schrödinger Bridge Diffusion Models Adarsh Srivastava et.al. 2406.12458v1 link
2024-06-18 Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models David Bergström et.al. 2406.12423v1 null
2024-06-18 TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI Mattia Litrico et.al. 2406.12411v1 null
2024-06-18 Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion Hao Zeng et.al. 2406.12349v1 null
2024-06-18 Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment Yiheng Li et.al. 2406.12303v1 null
2024-06-17 COT Flow: Learning Optimal-Transport Image Sampling and Editing by Contrastive Pairs Xinrui Zu et.al. 2406.12140v1 null
2024-06-17 Adding Conditional Control to Diffusion Models with Reinforcement Learning Yulai Zhao et.al. 2406.12120v1 null
2024-06-17 Optimal withdrawals in a general diffusion model with control rates subject to a state-dependent upper bound Hélène Guérin et.al. 2406.12067v1 null
2024-06-17 ARTIST: Improving the Generation of Text-rich Images by Disentanglement Jianyi Zhang et.al. 2406.12044v1 null
2024-06-17 Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models Alireza Ganjdanesh et.al. 2406.12042v1 null
2024-06-17 Decomposed evaluations of geographic disparities in text-to-image models Abhishek Sureddy et.al. 2406.11988v1 null
2024-06-17 Crossfusor: A Cross-Attention Transformer Enhanced Conditional Diffusion Model for Car-Following Trajectory Prediction Junwei You et.al. 2406.11941v1 null
2024-06-17 Bridging Design Gaps: A Parametric Data Completion Approach With Graph Guided Diffusion Models Rui Zhou et.al. 2406.11934v1 null
2024-06-16 Mixture-of-Subspaces in Low-Rank Adaptation Taiqiang Wu et.al. 2406.11909v1 null
2024-06-17 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Bingqi Ma et.al. 2406.11831v1 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819v1 null
2024-06-17 DiffMM: Multi-Modal Diffusion Model for Recommendation Yangqin Jiang et.al. 2406.11781v1 null
2024-06-17 Latent Denoising Diffusion GAN: Faster sampling, Higher image quality Luan Thanh Trinh et.al. 2406.11713v1 link
2024-06-17 MusicScore: A Dataset for Music Score Modeling and Generation Yuheng Lin et.al. 2406.11462v1 null
2024-06-17 AnyTrans: Translate AnyText in the Image with Large Scale Models Zhipeng Qian et.al. 2406.11432v1 null
2024-06-17 DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer Keon Lee et.al. 2406.11427v1 null
2024-06-17 Unfolding Time: Generative Modeling for Turbulent Flows in 4D Abdullah Saydemir et.al. 2406.11390v1 null
2024-06-17 Diffusion Models in Low-Level Vision: A Survey Chunming He et.al. 2406.11138v1 null
2024-06-16 Exploiting Diffusion Prior for Out-of-Distribution Detection Armando Zhu et.al. 2406.11105v1 null
2024-06-16 An Analysis on Quantizing Diffusion Transformers Yuewei Yang et.al. 2406.11100v1 null
2024-06-16 A Bayesian Drift-Diffusion Model of Schachter-Singer's Two Factor Theory of Emotion Lance Ying et.al. 2406.11086v1 null
2024-06-16 ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models Kaifeng Gao et.al. 2406.10981v1 null
2024-06-16 Graph Neural Reaction Diffusion Models Moshe Eliasof et.al. 2406.10871v1 null
2024-06-16 Diffusion Model With Optimal Covariance Matching Zijing Ou et.al. 2406.10808v1 null
2024-06-16 Diffusion Models Are Promising for Ab Initio Structure Solutions from Nanocrystalline Powder Diffraction Data Gabe Guo et.al. 2406.10796v1 link
2024-06-15 Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft Ian Vyse et.al. 2406.10724v1 link
2024-06-18 A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing Ming Meng et.al. 2406.10553v2 null
2024-06-15 Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On Lingxiao Lu et.al. 2406.10539v1 null
2024-06-15 Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space Mohamed Amine Ketata et.al. 2406.10513v1 null
2024-06-14 Consistency-diversity-realism Pareto fronts of conditional image generative models Pietro Astolfi et.al. 2406.10429v1 null
2024-06-14 SigDiffusions: Score-Based Diffusion Models for Long Time Series via Log-Signature Embeddings Barbora Barancikova et.al. 2406.10354v1 null
2024-06-14 SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models Zhaoxu Luo et.al. 2406.10225v1 null
2024-06-14 DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction Bowen Song et.al. 2406.10211v1 null
2024-06-14 Make It Count: Text-to-Image Generation with an Accurate Number of Objects Lital Binyamin et.al. 2406.10210v1 null
2024-06-14 Crafting Parts for Expressive Object Composition Harsh Rangwani et.al. 2406.10197v1 null
2024-06-14 Training-free Camera Control for Video Generation Chen Hou et.al. 2406.10126v1 null
2024-06-14 Group and Shuffle: Efficient Structured Orthogonal Parametrization Mikhail Gorbunov et.al. 2406.10019v1 null
2024-06-14 OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control Yuzhong Huang et.al. 2406.10000v1 null
2024-06-14 InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning Tiancheng Li et.al. 2406.09973v1 null
2024-06-14 GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion Trapoom Ukarapol et.al. 2406.09850v1 link
2024-06-14 Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion Runze Liu et.al. 2406.09782v1 null
2024-06-14 Bayesian Conditioned Diffusion Models for Inverse Problems Alper Güngör et.al. 2406.09768v1 null
2024-06-14 Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting Ce Hao et.al. 2406.09767v1 null
2024-06-14 ControlVAR: Exploring Controllable Visual Autoregressive Modeling Xiang Li et.al. 2406.09750v1 null
2024-06-14 Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses Seungwoo Yoo et.al. 2406.09728v1 null
2024-06-14 Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models Changjiang Li et.al. 2406.09669v1 null
2024-06-14 New algorithms for sampling and diffusion models Xicheng Zhang et.al. 2406.09665v1 null
2024-06-13 Turns Out I'm Not Real: Towards Robust Detection of AI-Generated Videos Qingyuan Liu et.al. 2406.09601v1 null
2024-06-13 Improving Consistency Models with Generator-Induced Coupling Thibaut Issenhuth et.al. 2406.09570v1 link
2024-06-13 e-COP : Episodic Constrained Optimization of Policies Akhil Agnihotri et.al. 2406.09563v1 null
2024-06-13 My Body My Choice: Human-Centric Full-Body Anonymization Umur Aybars Ciftci et.al. 2406.09553v1 null
2024-06-13 Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale A. Feder Cooper et.al. 2406.09548v1 null
2024-06-13 CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making Zibin Dong et.al. 2406.09509v1 link
2024-06-13 Fair Data Generation via Score-based Diffusion Model Yujie Lin et.al. 2406.09495v1 null
2024-06-13 Language-driven Grasp Detection An Dinh Vuong et.al. 2406.09489v1 null
2024-06-13 Is Diffusion Model Safe? Severe Data Leakage via Gradient-Guided Diffusion Model Jiayang Meng et.al. 2406.09484v1 null
2024-06-13 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models Qihao Liu et.al. 2406.09416v1 null
2024-06-13 An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Duy-Kien Nguyen et.al. 2406.09415v1 null
2024-06-13 Interpreting the Weight Space of Customized Diffusion Models Amil Dravid et.al. 2406.09413v1 link
2024-06-13 ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing Jun-Kun Chen et.al. 2406.09404v1 null
2024-06-13 Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion Linzhan Mou et.al. 2406.09402v1 null
2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang et.al. 2406.09399v1 link
2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation Yunsong Zhou et.al. 2406.09386v1 null
2024-06-13 CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models Yigit Ekin et.al. 2406.09368v1 null
2024-06-13 Understanding Hallucinations in Diffusion Models through Mode Interpolation Sumukh K Aithal et.al. 2406.09358v1 link
2024-06-13 Advancing Graph Generation through Beta Diffusion Yilin He et.al. 2406.09357v1 null
2024-06-13 StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning Giuseppe Vecchio et.al. 2406.09293v1 null
2024-06-13 Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models Ziyi Wu et.al. 2406.09292v1 null
2024-06-14 Generative Inverse Design of Crystal Structures via Diffusion Models with Transformers Izumi Takahara et.al. 2406.09263v2 null
2024-06-13 EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Yucheng Han et.al. 2406.09162v1 null
2024-06-13 Complex Image-Generative Diffusion Transformer for Audio Denoising Junhui Li et.al. 2406.09161v1 null
2024-06-13 Diffusion Gaussian Mixture Audio Denoise Pu Wang et.al. 2406.09154v1 null
2024-06-13 Operator-informed score matching for Markov diffusion models Zheyang Shen et.al. 2406.09084v1 null
2024-06-13 EquiPrompt: Debiasing Diffusion Models via Iterative Bootstrapping in Chain of Thoughts Zahraa Al Sahili et.al. 2406.09070v1 null
2024-06-13 Preserving Identity with Variational Score for General-purpose 3D Editing Duong H. Le et.al. 2406.08953v1 null
2024-06-13 Step-by-Step Diffusion: An Elementary Tutorial Preetum Nakkiran et.al. 2406.08929v1 null
2024-06-13 Heuristics for Influence Maximization with Tiered Influence and Activation thresholds Rahul Kumar Gautam et.al. 2406.08876v1 null
2024-06-13 COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing Jiangshan Wang et.al. 2406.08850v1 null
2024-06-13 FouRA: Fourier Low Rank Adaptation Shubhankar Borse et.al. 2406.08798v1 null
2024-06-13 Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis Xinrui Yang et.al. 2406.08713v1 null
2024-06-12 Vivid-ZOO: Multi-View Video Generation with Diffusion Model Bing Li et.al. 2406.08659v1 null
2024-06-12 How to Distinguish AI-Generated Images from Authentic Photographs Negar Kamali et.al. 2406.08651v1 null
2024-06-12 FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion George Cazenavette et.al. 2406.08603v1 null
2024-06-12 Predicting Cascading Failures with a Hyperparametric Diffusion Model Bin Xiang et.al. 2406.08522v1 null
2024-06-12 Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation Raphael Tang et.al. 2406.08482v1 null
2024-06-12 Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models Yuxuan Xue et.al. 2406.08475v1 null
2024-06-12 ** $\texttt{DiffLense}$ : A Conditional Diffusion Model for Super-Resolution of Gravitational Lensing Data** Pranath Reddy et.al. 2406.08442v1 null
2024-06-12 Diffusion Soup: Model Merging for Text-to-Image Diffusion Models Benjamin Biggs et.al. 2406.08431v1 null
2024-06-12 FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Xinzhi Mu et.al. 2406.08392v1 null
2024-06-12 Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models Javier Nistal et.al. 2406.08384v1 null
2024-06-12 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction Tianqi Chen et.al. 2406.08374v1 null
2024-06-12 WMAdapter: Adding WaterMark Control to Latent Diffusion Models Hai Ci et.al. 2406.08337v1 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249v1 link
2024-06-12 Diffusion-Promoted HDR Video Reconstruction Yuanshen Guan et.al. 2406.08204v1 null
2024-06-12 LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation Wenhao Guan et.al. 2406.08203v1 null
2024-06-14 One-Step Effective Diffusion Network for Real-World Image Super-Resolution Rongyuan Wu et.al. 2406.08177v2 link
2024-06-12 Defect-related Anomalous Mobility of Small polarons in Oxides: the Case of Congruent Lithium Niobate Anton Pfannstiel et.al. 2406.08123v1 null
2024-06-12 Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement Runyi Yu et.al. 2406.08096v1 null
2024-06-12 CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models Hyungjin Chung et.al. 2406.08070v1 null
2024-06-12 Ablation Based Counterfactuals Zheng Dai et.al. 2406.07908v1 null
2024-06-12 DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition Jiacheng Liu et.al. 2406.07852v1 null
2024-06-12 Hierarchical Patch Diffusion Models for High-Resolution Video Generation Ivan Skorokhodov et.al. 2406.07792v1 null
2024-06-11 HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness Zihui Xue et.al. 2406.07754v1 null
2024-06-11 CUPID: Contextual Understanding of Prompt-conditioned Image Distributions Yayan Zhao et.al. 2406.07699v1 null
2024-06-11 Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted Trees Nicolas Beltran-Velez et.al. 2406.07658v1 link
2024-06-11 Pre-training Feature Guided Diffusion Model for Speech Enhancement Yiyuan Yang et.al. 2406.07646v1 null
2024-06-11 An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu et.al. 2406.07550v1 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540v1 null
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524v1 link
2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion Haian Jin et.al. 2406.07520v1 null
2024-06-11 Instant 3D Human Avatar Generation using Image Diffusion Models Nikos Kolotouros et.al. 2406.07516v1 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507v1 null
2024-06-11 GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection Hang Yao et.al. 2406.07487v1 null
2024-06-11 Image Neural Field Diffusion Models Yinbo Chen et.al. 2406.07480v1 null
2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu et.al. 2406.07472v1 null
2024-06-11 Noise-robust Speech Separation with Fast Generative Correction Helin Wang et.al. 2406.07461v1 null
2024-06-11 DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling Sixian Wang et.al. 2406.07390v1 null
2024-06-12 Towards Realistic Data Generation for Real-World Super-Resolution Long Peng et.al. 2406.07255v2 null
2024-06-12 Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models Athanasios Tragakis et.al. 2406.07251v2 link
2024-06-11 Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models Sooyeon Go et.al. 2406.07008v1 null
2024-06-11 DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach Zhang Liu et.al. 2406.06986v1 null
2024-06-11 Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey Ping Liu et.al. 2406.06965v1 null
2024-06-11 Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems Jiawei Zhang et.al. 2406.06959v1 link
2024-06-11 AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising Zigeng Chen et.al. 2406.06911v1 link
2024-06-11 Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation Yuanhao Zhai et.al. 2406.06890v1 null
2024-06-09 Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises Jianhua Pei et.al. 2406.06644v1 null
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527v1 null
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525v1 link
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508v1 link
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465v1 null
2024-06-10 Cometh: A continuous-time discrete-state graph diffusion model Antoine Siraudin et.al. 2406.06449v1 null
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424v1 null
2024-06-10 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Yi Gu et.al. 2406.06382v1 link
2024-06-10 Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models Marek Wodzinski et.al. 2406.06372v1 null
2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi et.al. 2406.06367v1 null
2024-06-11 Tuning-Free Visual Customization via View Iterative Self-Attention Control Xiaojie Li et.al. 2406.06258v2 null
2024-06-10 Data Augmentation in Earth Observation: A Diffusion Model Approach Tiago Sousa et.al. 2406.06218v1 null
2024-06-10 The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems Philippe Gonzalez et.al. 2406.06160v1 null
2024-06-10 Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge Thanapat Trachu et.al. 2406.06139v1 null
2024-06-10 DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection Donggeun Ko et.al. 2406.06134v1 null
2024-06-10 ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Meng-Li Shih et.al. 2406.06133v1 null
2024-06-10 Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks Victor Boutin et.al. 2406.06079v1 null
2024-06-10 Generalizable Human Gaussians from Single-View Image Jinnan Chen et.al. 2406.06050v1 link
2024-06-10 Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training Ke Niu et.al. 2406.06045v1 link
2024-06-10 FRAG: Frequency Adapting Group for Diffusion Video Editing Sunjae Yoon et.al. 2406.06044v1 null
2024-06-09 Improving Antibody Design with Force-Guided Sampling in Diffusion Models Paulina Kulytė et.al. 2406.05832v1 null
2024-06-12 MLCM: Multistep Consistency Distillation of Latent Diffusion Model Qingsong Xie et.al. 2406.05768v3 null
2024-06-09 Binarized Diffusion Model for Image Super-Resolution Zheng Chen et.al. 2406.05723v1 link
2024-06-11 Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling Yuepeng Jiang et.al. 2406.05681v2 null
2024-06-09 PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction Shangyu Chen et.al. 2406.05641v1 null
2024-06-08 Autoregressive Diffusion Transformer for Text-to-Speech Synthesis Zhijun Liu et.al. 2406.05551v1 null
2024-06-08 Exploring Bridges Between Creative Coding and Visual Generative AI Jiaqi Wu et.al. 2406.05508v1 null
2024-06-08 Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis Zanlin Ni et.al. 2406.05478v1 null
2024-06-08 3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Regimes Aghiles Kebaili et.al. 2406.05421v1 link
2024-06-08 Mean-field Chaos Diffusion Models Sungwoo Park et.al. 2406.05396v1 null
2024-06-12 MotionClone: Training-Free Motion Cloning for Controllable Video Generation Pengyang Ling et.al. 2406.05338v2 null
2024-06-08 LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance Shihao Chen et.al. 2406.05325v1 null
2024-06-08 CoBL-Diffusion: Diffusion-Based Conditional Robot Planning in Dynamic Environments Using Control Barrier and Lyapunov Functions Kazuki Mizuta et.al. 2406.05309v1 null
2024-06-07 Modelling effects of moisture on mechanical properties of crosslinked polyurethane adhesives S. P. Josyula et.al. 2406.05278v1 null
2024-06-07 Efficient Differentially Private Fine-Tuning of Diffusion Models Jing Liu et.al. 2406.05257v1 null
2024-06-07 DiffusionPID: Interpreting Diffusion via Partial Information Decomposition Shaurya Dewan et.al. 2406.05191v1 null
2024-06-07 CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion Xingrui Wang et.al. 2406.05082v1 null
2024-06-07 Generative diffusion models for synthetic trajectories of heavy and light particles in turbulence Tianyi Li et.al. 2406.05008v1 null
2024-06-07 Learning Divergence Fields for Shift-Robust Graph Representations Qitian Wu et.al. 2406.04963v1 link
2024-06-07 Combinatorial Complex Score-based Diffusion Modelling through Stochastic Differential Equations Adrien Carrel et.al. 2406.04916v1 link
2024-06-07 Online Continual Learning of Video Diffusion Models From a Single Video Stream Jason Yoo et.al. 2406.04814v1 null
2024-06-07 TEDi Policy: Temporally Entangled Diffusion for Robotic Control Sigmund H. Høeg et.al. 2406.04806v1 null
2024-06-07 Diffusion-based Generative Image Outpainting for Recovery of FOV-Truncated CT Images Michelle Espranita Liman et.al. 2406.04769v1 null
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746v1 link
2024-06-07 FlowMM: Generating Materials with Riemannian Flow Matching Benjamin Kurt Miller et.al. 2406.04713v1 null
2024-06-07 MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models Sanjoy Chowdhury et.al. 2406.04673v1 null
2024-06-07 GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models Diptanu De et.al. 2406.04654v1 null
2024-06-07 Boosting Diffusion Model for Spectrogram Up-sampling in Text-to-speech: An Empirical Study Chong Zhang et.al. 2406.04633v1 null
2024-06-07 STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting Zenghao Chai et.al. 2406.04629v1 link
2024-06-07 CTSyn: A Foundational Model for Cross Tabular Data Generation Xiaofeng Lin et.al. 2406.04619v1 null
2024-06-07 Diverse Intra- and Inter-Domain Activity Style Fusion for Cross-Person Generalization in Activity Recognition Junru Zhang et.al. 2406.04609v1 null
2024-06-06 Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance Reyhane Askari Hemmat et.al. 2406.04551v1 null
2024-06-06 Single Exposure Quantitative Phase Imaging with a Conventional Microscope using Diffusion Models Gabriel della Maggiora et.al. 2406.04388v1 null
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338v2 null
2024-06-08 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337v2 null
2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui et.al. 2406.04333v1 link
2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi et.al. 2406.04329v1 null
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324v1 null
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323v1 null
2024-06-07 DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data Qihao Liu et.al. 2406.04322v2 link
2024-06-06 Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Zhanhao Liang et.al. 2406.04314v1 null
2024-06-06 Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment Jiayi Guo et.al. 2406.04295v1 link
2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian et.al. 2406.04277v1 link
2024-06-06 A Survey on 3D Human Avatar Modeling -- From Reconstruction to Generation Ruihe Wang et.al. 2406.04253v1 null
2024-06-06 Diffusion-based image inpainting with internal learning Nicolas Cherel et.al. 2406.04206v1 null
2024-06-06 Multistep Distillation of Diffusion Models via Moment Matching Tim Salimans et.al. 2406.04103v1 null
2024-06-06 Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models Jan Martinů et.al. 2406.04099v1 null
2024-06-06 LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression Junhui Li et.al. 2406.03961v1 null
2024-06-06 LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model Yixuan Yang et.al. 2406.03866v1 null
2024-06-06 Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data Jingyang Ou et.al. 2406.03736v1 null
2024-06-06 JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits Minzhou Pan et.al. 2406.03720v1 link
2024-06-06 Pi-fusion: Physics-informed diffusion model for learning fluid dynamics Jing Qiu et.al. 2406.03711v1 null
2024-06-06 Mean-variance portfolio selection in jump-diffusion model under no-shorting constraint: A viscosity solution approach Xiaomin Shi et.al. 2406.03709v1 null
2024-06-06 BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning Artem Zholus et.al. 2406.03686v1 null
2024-06-06 Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models Ding Huang et.al. 2406.03683v1 link
2024-06-05 Understanding the Limitations of Diffusion Concept Algebra Through Food E. Zhixuan Zeng et.al. 2406.03582v1 null
2024-06-05 A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari et.al. 2406.03537v1 null
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439v1 null
2024-06-05 Text-to-Image Rectified Flow as Plug-and-Play Priors Xiaofeng Yang et.al. 2406.03293v1 link
2024-06-05 Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN Mikołaj Kita et.al. 2406.03233v1 null
2024-06-05 Searching Priors Makes Text-to-Video Synthesis Better Haoran Cheng et.al. 2406.03215v1 null
2024-06-05 Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Hao Wen et.al. 2406.03184v1 link
2024-06-05 Tiny models from tiny data: Textual and null-text inversion for few-shot distillation Erik Landolsi et.al. 2406.03146v1 link
2024-06-05 Floating Anchor Diffusion Model for Multi-motif Scaffolding Ke Liu et.al. 2406.03141v1 link
2024-06-05 Phy-Diff: Physics-guided Hourglass Diffusion Model for Diffusion MRI Synthesis Juanhua Zhang et.al. 2406.03002v1 null
2024-06-05 Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models Zihan Ye et.al. 2406.02929v1 null
2024-06-06 U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation Chenxin Li et.al. 2406.02918v2 null
2024-06-05 TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems Ryo Yonetani et.al. 2406.02858v1 null
2024-06-04 ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models Kiymet Akdemir et.al. 2406.02820v1 null
2024-06-04 Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following Qiaomu Miao et.al. 2406.02774v1 null
2024-06-04 Neural Representations of Dynamic Visual Stimuli Jacob Yeung et.al. 2406.02659v1 null
2024-06-04 Pancreatic Tumor Segmentation as Anomaly Detection in CT Images Using Denoising Diffusion Models Reza Babaei et.al. 2406.02653v1 null
2024-06-04 Dreamguider: Improved Training free Diffusion-based Conditional Generation Nithin Gopalakrishnan Nair et.al. 2406.02549v1 null
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541v3 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509v1 null
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507v1 null
2024-06-04 Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation Jiajun Wang et.al. 2406.02485v1 link
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477v1 null
2024-06-04 Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems Jason Hu et.al. 2406.02462v1 null
2024-06-04 RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting Qi Wang et.al. 2406.02461v1 null
2024-06-04 Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models Dominik Hintersdorf et.al. 2406.02366v1 link
2024-06-05 Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Clement Chadebec et.al. 2406.02347v2 link
2024-06-05 SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models Dongchao Yang et.al. 2406.02328v2 null
2024-06-04 A Survey of Transformer Enabled Time Series Synthesis Alexander Sommers et.al. 2406.02322v1 null
2024-06-04 Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models Bálint Máté et.al. 2406.02313v1 null
2024-06-04 I4VGen: Image as Stepping Stone for Text-to-Video Generation Xiefan Guo et.al. 2406.02230v1 null
2024-06-04 GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon Sanhita Pathak et.al. 2406.02184v1 null
2024-06-04 The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise Yuanhao Ban et.al. 2406.01970v1 null
2024-06-04 Plug-and-Play Diffusion Distillation Yi-Ting Hsiao et.al. 2406.01954v1 null
2024-06-04 Generating Synthetic Net Load Data with Physics-informed Diffusion Model Shaorong Zhang et.al. 2406.01913v1 null
2024-06-06 Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation Yue Ma et.al. 2406.01900v2 null
2024-06-04 Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models Wenzhuo Tang et.al. 2406.01899v1 link
2024-06-04 MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training Kengo Uchida et.al. 2406.01867v1 null
2024-06-03 L-MAGIC: Language Model Assisted Generation of Images with Coherence Zhipeng Cai et.al. 2406.01843v1 link
2024-06-03 Diffusion Boosted Trees Xizewen Han et.al. 2406.01813v1 null
2024-06-03 DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$ -transform Alexander Denker et.al. 2406.01781v1 null
2024-06-03 A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization Sebastian Sanokowski et.al. 2406.01661v1 link
2024-06-03 CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations Franz Motzkus et.al. 2406.01649v1 null
2024-06-03 DiffUHaul: A Training-Free Method for Object Dragging in Images Omri Avrahami et.al. 2406.01594v1 null
2024-06-03 ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation Guanxing Lu et.al. 2406.01586v1 null
2024-06-03 Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation Mingyuan Zhou et.al. 2406.01561v1 null
2024-06-03 Robust Classification by Coupling Data Mollification with Label Smoothing Markus Heinonen et.al. 2406.01494v1 null
2024-06-04 DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention Yang Liu et.al. 2406.01489v2 null
2024-06-03 DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors Tianyu Huang et.al. 2406.01476v1 link
2024-06-03 ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models Thanh-Dat Truong et.al. 2406.01432v1 null
2024-06-03 Differentially Private Fine-Tuning of Diffusion Models Yu-Lin Tsai et.al. 2406.01355v1 null
2024-06-03 Important node identification for complex networks based on improved Electre Multi-Attribute fusion Qi Cao et.al. 2406.01341v1 null
2024-06-03 HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models Mengcheng Li et.al. 2406.01334v1 null
2024-06-03 Report on Methods and Applications for Crafting 3D Humans Lei Liu et.al. 2406.01223v1 null
2024-06-03 UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation Xiang Wang et.al. 2406.01188v1 null
2024-06-03 Dimba: Transformer-Mamba Diffusion Models Zhengcong Fei et.al. 2406.01159v1 null
2024-06-04 Towards Practical Single-shot Motion Synthesis Konstantinos Roditakis et.al. 2406.01136v2 null
2024-06-03 ** $Δ$ -DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers** Pengtao Chen et.al. 2406.01125v1 null
2024-06-03 SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models Qilong Zhangli et.al. 2406.01062v1 null
2024-06-03 Constraint-Aware Diffusion Models for Trajectory Optimization Anjian Li et.al. 2406.00990v1 null
2024-06-03 MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models Mingzhen Huang et.al. 2406.00985v1 null
2024-06-03 Faster Diffusion-based Sampling with Randomized Midpoints: Sequential and Parallel Shivam Gupta et.al. 2406.00924v1 null
2024-06-03 Demystifying SGD with Doubly Stochastic Gradients Kyurae Kim et.al. 2406.00920v1 null
2024-06-03 ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation Shaoshu Yang et.al. 2406.00908v1 link
2024-06-02 DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection Yewon Lim et.al. 2406.00856v1 link
2024-06-02 Diffusion-Inspired Quantum Noise Mitigation in Parameterized Quantum Circuits Hoang-Quan Nguyen et.al. 2406.00843v1 null
2024-06-02 Invisible Backdoor Attacks on Diffusion Models Sen Li et.al. 2406.00816v1 link
2024-05-31 Mixed Diffusion for 3D Indoor Scene Synthesis Siyi Hu et.al. 2405.21066v1 null
2024-05-31 Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models Jingjing Wang et.al. 2405.21059v1 null
2024-05-31 Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models Xinxi Zhang et.al. 2405.21050v1 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048v1 null
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971v1 link
2024-05-31 Flow matching achieves minimax optimal convergence Kenji Fukumizu et.al. 2405.20879v1 null
2024-05-31 MegActor: Harness the Power of Raw Video for Vivid Portrait Animation Shurong Yang et.al. 2405.20851v1 link
2024-06-03 Stratified Avatar Generation from Sparse Observations Han Feng et.al. 2405.20786v2 null
2024-05-31 Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning Aditya Shankar et.al. 2405.20761v1 link
2024-05-31 Information Theoretic Text-to-Image Alignment Chao Wang et.al. 2405.20759v1 null
2024-05-31 Diffusion Models Are Innate One-Step Generators Bowen Zheng et.al. 2405.20750v1 link
2024-05-31 Unleashing the Potential of Diffusion Models for Incomplete Data Imputation Hengrui Zhang et.al. 2405.20690v1 null
2024-05-31 Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling Kidist Amde Mekonnen et.al. 2405.20675v1 link
2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang et.al. 2405.20674v1 null
2024-05-31 Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation Shuzhou Yang et.al. 2405.20669v1 null
2024-05-31 GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification Hansang Lee et.al. 2405.20650v1 null
2024-06-03 Stochastic Optimal Control for Diffusion Bridges in Function Spaces Byoungwoo Park et.al. 2405.20630v2 null
2024-05-31 Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization Yisu Liu et.al. 2405.20584v1 null
2024-05-31 Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning Linjiajie Fang et.al. 2405.20555v1 link
2024-05-30 Diffusion On Syntax Trees For Program Synthesis Shreyas Kapur et.al. 2405.20519v1 null
2024-05-30 Slight Corruption in Pre-training Data Makes Better Diffusion Models Hao Chen et.al. 2405.20494v1 null
2024-05-30 Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images Krishnakant Singh et.al. 2405.20469v1 null
2024-05-30 P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation Qi Zhang et.al. 2405.20443v1 null
2024-05-30 Gradient Inversion of Federated Diffusion Models Jiyue Huang et.al. 2405.20380v1 null
2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Kailu Wu et.al. 2405.20343v1 null
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334v1 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325v1 null
2024-05-30 Don't drop your samples! Coherence-aware training benefits Conditional diffusion Nicolas Dufour et.al. 2405.20324v1 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320v1 link
2024-05-30 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2405.20289v1 null
2024-06-02 MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model Muyao Niu et.al. 2405.20222v2 link
2024-05-30 Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Sanghyeon Na et.al. 2405.20216v1 null
2024-05-30 MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models Lukas Uzolas et.al. 2405.20155v1 null
2024-06-03 DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild Honghao Fu et.al. 2405.19996v3 link
2024-05-30 DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World Wenli Sun et.al. 2405.19990v1 null
2024-06-04 PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting Qiaowei Miao et.al. 2405.19957v2 null
2024-05-30 Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks Xiaoyu Wu et.al. 2405.19931v1 null
2024-05-30 Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models Zeyu Fang et.al. 2405.19878v1 null
2024-05-31 HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization Wenxuan Liu et.al. 2405.19751v2 null
2024-05-30 Streaming Video Diffusion: Online Video Editing with Diffusion Models Feng Chen et.al. 2405.19726v1 link
2024-05-30 Text Guided Image Editing with Automatic Concept Locating and Forgetting Jia Li et.al. 2405.19708v1 null
2024-05-31 Diffusion Policies creating a Trust Region for Offline Reinforcement Learning Tianyu Chen et.al. 2405.19690v2 link
2024-05-31 Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2405.19673v2 null
2024-05-29 Blind Image Restoration via Fast Diffusion Inversion Hamadi Chihaoui et.al. 2405.19572v1 link
2024-05-29 Predicting Long-Term Human Behaviors in Discrete Representations via Physics-Guided Diffusion Zhitian Zhang et.al. 2405.19528v1 null
2024-05-29 MemControl: Mitigating Memorization in Medical Diffusion Models via Automated Parameter Selection Raman Dutt et.al. 2405.19458v1 null
2024-05-29 Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies Yipu Chen et.al. 2405.19424v1 null
2024-05-29 ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning Ruchika Chavhan et.al. 2405.19237v1 link
2024-05-30 ** $E^{3}$ Gen: Efficient, Expressive and Editable Avatars Generation** Weitian Zhang et.al. 2405.19203v2 null
2024-05-29 Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning Hanye Zhao et.al. 2405.19189v1 null
2024-05-29 Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang et.al. 2405.18881v1 null
2024-05-29 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors Zihui Wu et.al. 2405.18782v1 null
2024-05-29 RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching Divya Nori et.al. 2405.18768v1 link
2024-05-29 Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein's method Han L. Gan et.al. 2405.18763v1 null
2024-05-29 Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning Tianle Zhang et.al. 2405.18729v1 null
2024-05-29 Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI Che Liu et.al. 2405.18726v1 null
2024-05-29 Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization Mohammadjavad Matinkia et.al. 2405.18684v1 link
2024-05-29 Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering Ido Sobol et.al. 2405.18677v1 null
2024-05-28 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention Lianghui Zhu et.al. 2405.18428v1 link
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407v1 null
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406v1 link
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304v1 link
2024-05-28 CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths Reihaneh Teimouri et.al. 2405.18267v1 null
2024-05-28 EG4D: Explicit Generation of 4D Object without Score Distillation Qi Sun et.al. 2405.18132v1 link
2024-05-28 Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? Zebin You et.al. 2405.18029v1 null
2024-05-28 Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval Dvir Samuel et.al. 2405.18025v1 null
2024-05-28 MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling Bowen Zhang et.al. 2405.18003v1 link
2024-05-28 AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization Junjie Shentu et.al. 2405.17965v1 null
2024-05-28 Improving Discrete Diffusion Models via Structured Preferential Generation Severi Rissanen et.al. 2405.17889v1 null
2024-05-28 Diffusion Rejection Sampling Byeonghu Na et.al. 2405.17880v1 link
2024-05-30 MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization Tianchen Zhao et.al. 2405.17873v2 null
2024-05-28 Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation Akio Hayakawa et.al. 2405.17842v1 null
2024-05-28 LDMol: Text-Conditioned Molecule Diffusion Model Leveraging Chemically Informative Latent Space Jinho Chang et.al. 2405.17829v1 null
2024-05-30 Diffusion Model Patching via Mixture-of-Prompts Seokil Ham et.al. 2405.17825v2 null
2024-05-28 ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models Wei Pang et.al. 2405.17724v1 null
2024-05-28 MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI Inhwa Han et.al. 2405.17720v1 null
2024-05-27 RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance Jiaojiao Fan et.al. 2405.17661v1 null
2024-05-27 Alignment is Key for Applying Diffusion Models to Retrosynthesis Najwa Laabid et.al. 2405.17656v1 null
2024-05-27 ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance Jiannan Huang et.al. 2405.17532v1 null
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405v1 null
2024-05-27 A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training Kai Wang et.al. 2405.17403v1 link
2024-05-27 RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control Litu Rout et.al. 2405.17401v1 null
2024-05-27 EASI-Tex: Edge-Aware Mesh Texturing from Single Image Sai Raj Kishore Perla et.al. 2405.17393v1 null
2024-05-28 Controllable Longer Image Animation with Diffusion Models Qiang Wang et.al. 2405.17306v2 null
2024-05-27 Does Diffusion Beat GAN in Image Super Resolution? Denis Kuznedelev et.al. 2405.17261v1 null
2024-05-27 DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models Yuqing Zhang et.al. 2405.17176v1 null
2024-05-27 Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction Wenhao Zhang et.al. 2405.17167v1 null
2024-05-27 PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution Yong Liu et.al. 2405.17158v1 link
2024-05-27 Ensembling Diffusion Models via Adaptive Feature Aggregation Cong Wang et.al. 2405.17082v1 null
2024-05-27 The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models Saravanan Kandasamy et.al. 2405.17068v1 null
2024-05-27 Glauber Generative Model: Discrete Diffusion Models via Binary Classification Harshit Varma et.al. 2405.17035v1 null
2024-05-27 ** $\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation** Weiquan Wang et.al. 2405.17016v1 null
2024-05-28 MotionLLM: Multimodal Motion-Language Learning with Large Language Models Qi Wu et.al. 2405.17013v2 null
2024-05-27 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition Zilu Guo et.al. 2405.16952v1 null
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947v1 null
2024-05-27 PASTA: Pathology-Aware MRI to PET Cross-Modal Translation with Diffusion Models Yitong Li et.al. 2405.16942v1 null
2024-05-28 GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning Jaewoo Lee et.al. 2405.16907v2 link
2024-05-27 Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation Liang Shi et.al. 2405.16895v1 null
2024-05-27 Part123: Part-aware 3D Reconstruction from a Single-view Image Anran Liu et.al. 2405.16888v1 null
2024-05-28 Transfer Learning for Diffusion Models Yidong Ouyang et.al. 2405.16876v2 null
2024-05-27 CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild Xingqun Qi et.al. 2405.16874v1 null
2024-05-27 NCIDiff: Non-covalent Interaction-generative Diffusion Model for Improving Reliability of 3D Molecule Generation Inside Protein Pocket Joongwon Lee et.al. 2405.16861v1 null
2024-05-27 EM Distillation for One-step Diffusion Models Sirui Xie et.al. 2405.16852v1 null
2024-05-27 Enhancing Accuracy in Generative Models via Knowledge Transfer Xinyu Tian et.al. 2405.16837v1 null
2024-05-27 Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection Gihyun Kwon et.al. 2405.16823v1 null
2024-05-27 Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model Shoma Iwai et.al. 2405.16817v1 link
2024-05-27 TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing Xinyu Zhang et.al. 2405.16803v1 null
2024-05-27 PromptFix: You Prompt and We Fix the Photo Yongsheng Yu et.al. 2405.16785v1 null
2024-05-27 Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models Cristina N. Vasconcelos et.al. 2405.16759v1 null
2024-05-27 DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models Hengkang Wang et.al. 2405.16749v1 link
2024-05-26 Towards Multi-Task Multi-Modal Models: A Video Generative Perspective Lijun Yu et.al. 2405.16728v1 null
2024-05-26 Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models Hanwen Liang et.al. 2405.16645v1 null
2024-05-26 A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing Yusaku Ando et.al. 2405.16580v1 null
2024-05-28 ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling Francesca Babiloni et.al. 2405.16570v2 null
2024-05-26 I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models Wenqi Ouyang et.al. 2405.16537v1 null
2024-05-26 Pruning for Robust Concept Erasing in Diffusion Models Tianyun Yang et.al. 2405.16534v1 null
2024-05-26 Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors Soumava Paul et.al. 2405.16517v1 null
2024-05-26 Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models Kun Huang et.al. 2405.16516v1 null
2024-05-26 Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective Jiuxiang Gu et.al. 2405.16418v1 null
2024-05-28 Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation Jinlin Liu et.al. 2405.16393v2 null
2024-05-26 Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference Xunpeng Huang et.al. 2405.16387v1 null
2024-05-25 Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups Yuchen Zhu et.al. 2405.16381v1 null
2024-05-25 R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model Changhoon Kim et.al. 2405.16341v1 null
2024-05-25 ModelLock: Locking Your Model With a Spell Yifeng Gao et.al. 2405.16285v1 null
2024-05-25 Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination Shelly Golan et.al. 2405.16260v1 null
2024-05-25 Underwater Image Enhancement by Diffusion Model with Customized CLIP-Classifier Shuaixin Liu et.al. 2405.16214v1 null
2024-05-25 Analytical photoresponses of Schottky contact MoS2 phototransistors Jianyong Wei et.al. 2405.16209v1 null
2024-05-25 Diffusion-Reward Adversarial Imitation Learning Chun-Mao Lai et.al. 2405.16194v1 null
2024-05-25 Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization Shutong Ding et.al. 2405.16173v1 null
2024-05-24 Looking Backward: Streaming Video-to-Video Translation with Feature Banks Feng Liang et.al. 2405.15757v1 link
2024-05-24 Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems Lorenzo Baldassari et.al. 2405.15676v1 null
2024-05-24 Reducing the cost of posterior sampling in linear inverse problems via task-dependent score learning Fabian Schneider et.al. 2405.15643v1 null
2024-05-24 DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation Xiankang He et.al. 2405.15619v1 null
2024-05-24 Learning to Discretize Denoising Diffusion ODEs Vinh Tong et.al. 2405.15506v1 null
2024-05-24 Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2 Yeqing Lin et.al. 2405.15489v1 null
2024-05-24 NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer Meng You et.al. 2405.15364v1 link
2024-05-24 SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation Xinlei Niu et.al. 2405.15338v1 null
2024-05-24 Challenges and Opportunities in 3D Content Generation Ke Zhao et.al. 2405.15335v1 null
2024-05-24 Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model Mingyang Yi et.al. 2405.15330v1 null
2024-05-24 SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance Guibao Shen et.al. 2405.15321v1 null
2024-05-24 Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion Aoxue Li et.al. 2405.15313v1 null
2024-05-24 Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient Yongliang Wu et.al. 2405.15304v1 null
2024-05-24 StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models Chengming Xu et.al. 2405.15287v1 null
2024-05-24 Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving Jia He et.al. 2405.15241v1 null
2024-05-24 Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models Yimeng Zhang et.al. 2405.15234v1 link
2024-05-24 DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception Run Luo et.al. 2405.15232v1 null
2024-05-24 NIVeL: Neural Implicit Vector Layers for Text-to-Vector Generation Vikas Thamizharasan et.al. 2405.15217v1 null
2024-05-24 ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models Jingyuan Zhu et.al. 2405.15199v1 null
2024-05-24 Diffusion Actor-Critic with Entropy Regulator Yinuo Wang et.al. 2405.15177v1 null
2024-05-23 AdjointDEIS: Efficient Gradients for Diffusion Models Zander W. Blasingame et.al. 2405.15020v1 null
2024-05-23 CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner Weiyu Li et.al. 2405.14979v1 link
2024-05-23 SFDDM: Single-fold Distillation for Diffusion models Chi Hong et.al. 2405.14961v1 null
2024-05-23 PILOT: Equivariant diffusion for pocket conditioned de novo ligand generation with multi-objective guidance via importance sampling Julian Cremer et.al. 2405.14925v1 null
2024-05-24 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867v2 null
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864v1 null
2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models Gen Li et.al. 2405.14861v1 null
2024-05-23 Semantica: An Adaptable Image-Conditioned Diffusion Model Manoj Kumar et.al. 2405.14857v1 null
2024-05-23 TerDiT: Ternary Diffusion Models with Transformers Xudong Lu et.al. 2405.14854v1 link
2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer Shuang Wu et.al. 2405.14832v1 null
2024-05-23 Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models Katherine Xu et.al. 2405.14828v1 null
2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Dongjun Kim et.al. 2405.14822v1 null
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802v2 link
2024-05-23 Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy Shengfang Zhai et.al. 2405.14800v1 null
2024-05-23 EditWorld: Simulating World Dynamics for Instruction-Following Image Editing Ling Yang et.al. 2405.14785v1 null
2024-05-23 Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography Shuo Han et.al. 2405.14770v1 null
2024-05-23 RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance Zhicheng Sun et.al. 2405.14677v1 link
2024-05-23 Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models Jingyi Chen et.al. 2405.14632v1 null
2024-05-23 Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields Tom Fischer et.al. 2405.14599v1 null
2024-05-24 Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation Shiqi Yang et.al. 2405.14598v2 null
2024-05-23 LDM: Large Tensorial SDF Model for Textured Mesh Generation Rengan Xie et.al. 2405.14580v1 null
2024-05-23 Regressor-free Molecule Generation to Support Drug Response Prediction Kun Li et.al. 2405.14536v1 null
2024-05-23 LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models Seyedmorteza Sadat et.al. 2405.14477v1 null
2024-05-23 TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing Teng Xu et.al. 2405.14455v1 null
2024-05-23 Adversarial Schrödinger Bridge Matching Nikita Gushchin et.al. 2405.14449v1 null
2024-05-23 Reliable Trajectory Prediction and Uncertainty Quantification with Conditioned Diffusion Models Marion Neumeier et.al. 2405.14384v1 null
2024-05-24 Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI Guanxiong Luo et.al. 2405.14327v2 null
2024-05-23 Exposure Diffusion: HDR Image Generation by Consistent LDR denoising Mojtaba Bemana et.al. 2405.14304v1 null
2024-05-23 Diffusion-based Quantum Error Mitigation using Stochastic Differential Equation Joo Yong Shim et.al. 2405.14283v1 null
2024-05-23 Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors Emile Pierret et.al. 2405.14250v1 null
2024-05-23 DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis Yao Teng et.al. 2405.14224v1 null
2024-05-23 Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization Zhibo Chen et.al. 2405.14221v1 null
2024-05-23 FreeTuner: Any Subject in Any Style with Training-free Diffusion Youcan Xu et.al. 2405.14201v1 null
2024-05-23 The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks Bum Jun Kim et.al. 2405.14126v1 null
2024-05-23 Enhancing Image Layout Control with Loss-Guided Diffusion Models Zakaria Patel et.al. 2405.14101v1 null
2024-05-22 Particle physics DL-simulation with control over generated data properties Karol Rogoziński et.al. 2405.14049v1 null
2024-05-22 A Study of Posterior Stability for Time-Series Latent Diffusion Yangming Li et.al. 2405.14021v1 null
2024-05-22 Design Editing for Offline Model-based Optimization Ye Yuan et.al. 2405.13964v1 null
2024-05-22 Learning Latent Space Hierarchical EBM Diffusion Models Jiali Cui et.al. 2405.13910v1 null
2024-05-22 ReVideo: Remake a Video with Motion and Content Control Chong Mou et.al. 2405.13865v1 null
2024-05-22 Diffusion-Based Cloud-Edge-Device Collaborative Learning for Next POI Recommendations Jing Long et.al. 2405.13811v1 null
2024-05-22 Conditioning diffusion models by explicit forward-backward bridging Adrien Corenflos et.al. 2405.13794v1 null
2024-05-22 A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation Gwanghyun Kim et.al. 2405.13762v1 null
2024-05-22 InstaDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos Yujun Shi et.al. 2405.13722v1 null
2024-05-22 Learning Diffusion Priors from Observations by Expectation Maximization François Rozet et.al. 2405.13712v1 null
2024-05-22 Prompt Mixing in Diffusion Models using the Black Scholes Algorithm Divya Kothandaraman et.al. 2405.13685v1 null
2024-05-22 MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation Zhiping Yu et.al. 2405.13570v1 null
2024-05-22 MotionCraft: Physics-based Zero-Shot Video Generation Luca Savant Aira et.al. 2405.13557v1 null
2024-05-22 Directly Denoising Diffusion Model Dan Zhang et.al. 2405.13540v1 null
2024-05-22 Class-Conditional self-reward mechanism for improved Text-to-Image models Safouane El Ghazouali et.al. 2405.13473v1 link
2024-05-22 Enhanced Creativity and Ideation through Stable Video Synthesis Elijah Miller et.al. 2405.13357v1 null
2024-05-22 SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic Injection with Large-Scale Pre-Training Diffusion Models Qingrong Cheng et.al. 2405.13336v1 null
2024-05-21 TauAD: MRI-free Tau Anomaly Detection in PET Imaging via Conditioned Diffusion Models Lujia Zhong et.al. 2405.13199v1 null
2024-05-21 Personalized Residuals for Concept-Driven Text-to-Image Generation Cusuh Ham et.al. 2405.12978v1 null
2024-05-21 Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Yue Han et.al. 2405.12970v1 null
2024-05-21 Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra Álvaro Tovar-Pardo et.al. 2405.12918v1 null
2024-05-21 Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images Xiaofei Yu et.al. 2405.12875v1 link
2024-05-21 Model Free Prediction with Uncertainty Assessment Yuling Jiao et.al. 2405.12684v1 null
2024-05-21 CustomText: Customized Textual Image Generation using Diffusion Models Shubham Paliwal et.al. 2405.12531v1 null
2024-05-21 Customize Your Own Paired Data via Few-shot Way Jinshu Chen et.al. 2405.12490v1 null
2024-05-21 One-step data-driven generative model via Schrödinger Bridge Hanwen Huang et.al. 2405.12453v1 null
2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari Eloi Alonso et.al. 2405.12399v1 link
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221v1 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211v1 null
2024-05-20 Nonequilbrium physics of generative diffusion models Zhendong Yu et.al. 2405.11932v1 null
2024-05-20 "Set It Up!": Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2405.11928v1 null
2024-05-20 Diff-BGM: A Diffusion Model for Video Background Music Generation Sizhe Li et.al. 2405.11913v1 null
2024-05-20 Out-of-Distribution Detection with a Single Unconditional Diffusion Model Alvin Heng et.al. 2405.11881v1 link
2024-05-20 Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models Xiyu Wang et.al. 2405.11852v1 null
2024-05-20 Alternators For Sequence Modeling Mohammad Reza Rezaei et.al. 2405.11848v1 null
2024-05-20 ViViD: Video Virtual Try-on using Diffusion Models Zixun Fang et.al. 2405.11794v1 null
2024-05-20 Guided Multi-objective Generative AI to Enhance Structure-based Drug Design Amit Kadan et.al. 2405.11785v1 null
2024-05-20 Diffusion Models for Generating Ballistic Spacecraft Trajectories Tyler Presser et.al. 2405.11738v1 null
2024-05-19 InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios Yinghao Huang et.al. 2405.11690v1 null
2024-05-19 Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models Omer Belhasin et.al. 2405.11566v1 null
2024-05-19 Diffusion-Based Hierarchical Image Steganography Youmin Xu et.al. 2405.11523v1 null
2024-05-19 FIFO-Diffusion: Generating Infinite Videos from Text without Training Jihwan Kim et.al. 2405.11473v1 null
2024-05-19 Discrete-state Continuous-time Diffusion for Graph Generation Zhe Xu et.al. 2405.11416v1 null
2024-05-18 On the Trajectory Regularity of ODE-based Diffusion Sampling Defang Chen et.al. 2405.11326v1 null
2024-05-18 Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification Ming Hu et.al. 2405.11289v1 null
2024-05-18 HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos Qifeng Chen et.al. 2405.11270v1 null
2024-05-18 AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA Weitao Feng et.al. 2405.11135v1 null
2024-05-17 Flexible Motion In-betweening with Diffusion Models Setareh Cohan et.al. 2405.11126v1 null
2024-05-16 Flow Score Distillation for Diverse Text-to-3D Generation Runjie Yan et.al. 2405.10988v1 null
2024-05-17 Improving face generation quality and prompt following with synthetic captions Michail Tarasiou et.al. 2405.10864v1 null
2024-05-17 Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems Hanyu Chen et.al. 2405.10748v1 link
2024-05-17 Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement Bangti Jin et.al. 2405.10708v1 null
2024-05-17 LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion Zihao Zhu et.al. 2405.10691v1 null
2024-05-17 LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion Tong Chen et.al. 2405.10550v1 link
2024-05-17 ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation Pengzhi Li et.al. 2405.10508v1 null
2024-05-20 Text-to-Vector Generation with Neural Path Representation Peiying Zhang et.al. 2405.10317v2 null
2024-05-16 Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model Zheng Gu et.al. 2405.10316v1 null
2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models Ruiqi Gao et.al. 2405.10314v1 null
2024-05-16 Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks João Bordalo et.al. 2405.10122v1 null
2024-05-16 Spurious reconstruction from brain activity Ken Shirakawa et.al. 2405.10078v1 null
2024-05-16 Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution Xingjian Wang et.al. 2405.10014v1 null
2024-05-16 VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing Binghui Chen et.al. 2405.09985v1 null
2024-05-16 Language-Oriented Semantic Latent Representation for Image Transmission Giordano Cicchetti et.al. 2405.09976v1 link
2024-05-16 Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models Ziyu Wang et.al. 2405.09901v1 link
2024-05-16 DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection Yuhao Sun et.al. 2405.09882v1 link
2024-05-16 Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion Xinyang Li et.al. 2405.09874v1 null
2024-05-16 Rethinking Multi-User Semantic Communications with Deep Generative Models Eleonora Grassucci et.al. 2405.09866v1 null
2024-05-16 MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis Joseph Cho et.al. 2405.09806v1 null
2024-05-15 A Survey of Generative Techniques for Spatial-Temporal Data Mining Qianru Zhang et.al. 2405.09592v1 null
2024-05-16 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539v2 link
2024-05-15 Diffusion-based Contrastive Learning for Sequential Recommendation Ziqiang Cui et.al. 2405.09369v1 null
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266v1 null
2024-05-15 SOEDiff: Efficient Distillation for Small Object Editing Qihe Pan et.al. 2405.09114v1 null
2024-05-15 RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing Jiamei Xiong et.al. 2405.09083v1 link
2024-05-17 Naturalistic Music Decoding from EEG Data via Latent Diffusion Models Emilian Postolache et.al. 2405.09062v2 null
2024-05-15 Response Matching for generating materials and molecules Bingqing Cheng et.al. 2405.09057v1 null
2024-05-15 CTS: A Consistency-Based Medical Image Segmentation Model Kejia Zhang et.al. 2405.09056v1 link
2024-05-14 Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models Bingdong Li et.al. 2405.08674v1 null
2024-05-14 Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach Yaju Liu et.al. 2405.08328v1 null
2024-05-14 Compositional Text-to-Image Generation with Dense Blob Representations Weili Nie et.al. 2405.08246v1 null
2024-05-13 Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis Yifan Wang et.al. 2405.08210v1 null
2024-05-13 Do Bayesian imaging methods report trustworthy probabilities? David Y. W. Thong et.al. 2405.08179v1 null
2024-05-13 DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation Ziang Cao et.al. 2405.08055v1 link
2024-05-13 Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Wenqi Dong et.al. 2405.08054v1 null
2024-05-11 Diff-ETS: Learning a Diffusion Probabilistic Model for Electromyography-to-Speech Conversion Zhao Ren et.al. 2405.08021v1 null
2024-05-13 Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data Mahdi Morafah et.al. 2405.07925v1 null
2024-05-13 CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models Nick Stracke et.al. 2405.07913v1 null
2024-05-13 SAR Image Synthesis with Diffusion Models Denisa Qosja et.al. 2405.07776v1 null
2024-05-13 CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution Qingguo Liu et.al. 2405.07648v1 link
2024-05-13 De novo antibody design with SE(3) diffusion Daniel Cutting et.al. 2405.07622v1 null
2024-05-13 Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models Andrii Tytarenko et.al. 2405.07603v1 null
2024-05-13 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator Hanshu Yan et.al. 2405.07510v1 link
2024-05-13 GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting Haodong Chen et.al. 2405.07472v1 null
2024-05-12 Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning Masane Fuchi et.al. 2405.07288v1 link
2024-05-12 Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising Yao Liu et.al. 2405.07164v1 null
2024-05-12 Stable Signature is Unstable: Removing Image Watermark from Diffusion Models Yuepeng Hu et.al. 2405.07145v1 null
2024-05-11 Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems Katsiaryna Haitsiukevich et.al. 2405.07097v1 null
2024-05-11 Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior Ce Wang et.al. 2405.07044v1 link
2024-05-11 Non-confusing Generation of Customized Concepts in Diffusion Models Wang Lin et.al. 2405.06914v1 null
2024-05-10 Self-Consistent Recursive Diffusion Bridge for Medical Image Translation Fuat Arslan et.al. 2405.06789v1 link
2024-05-10 Shape Conditioned Human Motion Generation with Diffusion Model Kebing Xue et.al. 2405.06778v1 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547v1 link
2024-05-14 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461v2 null
2024-05-10 PUMA: margin-based data pruning Javier Maroto et.al. 2405.06298v1 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175v1 null
2024-05-09 Distilling Diffusion Models into Conditional GANs Minguk Kang et.al. 2405.05967v1 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959v1 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953v1 null
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876v1 null
2024-05-09 Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control Gunshi Gupta et.al. 2405.05852v1 link
2024-05-09 Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models Zhe Ma et.al. 2405.05846v1 null
2024-05-09 MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction Pinhuang Tan et.al. 2405.05814v1 null
2024-05-10 MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation Yuxiang Wei et.al. 2405.05806v2 link
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800v1 null
2024-05-09 Sequential Amodal Segmentation via Cumulative Occlusion Learning Jiayang Ao et.al. 2405.05791v1 null
2024-05-09 DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models Mengxiao Geng et.al. 2405.05763v1 null
2024-05-09 LatentColorization: Latent Diffusion-Based Speaker Video Colorization Rory Ward et.al. 2405.05707v1 null
2024-05-09 StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework Yiheng Huang et.al. 2405.05691v1 null
2024-05-09 SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning Jiying Zhang et.al. 2405.05665v1 null
2024-05-09 AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models Mingming Wang et.al. 2405.05627v1 null
2024-05-09 Denoising Diffusion Delensing Delight: Reconstructing the Non-Gaussian CMB Lensing Potential with Diffusion Models Thomas Flöss et.al. 2405.05598v1 link
2024-05-09 Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft Debabrata Pal et.al. 2405.05574v1 null
2024-05-09 A Survey on Personalized Content Synthesis with Diffusion Models Xulu Zhang et.al. 2405.05538v1 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255v1 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252v1 null
2024-05-08 Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation Jonas Kohler et.al. 2405.05224v1 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216v1 link
2024-05-08 An anti-noise seismic inversion method based on diffusion model Yingtian Liu et.al. 2405.05026v1 null
2024-05-08 Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI Keqiang Fan et.al. 2405.04974v1 null
2024-05-08 Empowering Wireless Networks with Artificial Intelligence Generated Graph Jiacheng Wang et.al. 2405.04907v1 null
2024-05-08 Fast LiDAR Upsampling using Conditional Diffusion Models Sander Elias Magnussen Helgesen et.al. 2405.04889v1 null
2024-05-08 FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation Xuehai He et.al. 2405.04834v1 null
2024-05-08 Variational Schrödinger Diffusion Models Wei Deng et.al. 2405.04795v1 null
2024-05-07 Remote Diffusion Kunal Sunil Kasodekar et.al. 2405.04717v1 null
2024-05-07 TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model Yongming Zhang et.al. 2405.04675v1 null
2024-05-07 Tactile-Augmented Radiance Fields Yiming Dou et.al. 2405.04534v1 link
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496v1 null
2024-05-07 CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model Haixia Xiao et.al. 2405.04483v1 null
2024-05-07 Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos Junyi Ma et.al. 2405.04370v1 null
2024-05-07 Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation Jihyun Kim et.al. 2405.04356v1 null
2024-05-08 Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer Zhuoyi Yang et.al. 2405.04312v2 link
2024-05-07 BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models Eloi Moliner et.al. 2405.04272v1 null
2024-05-07 Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models Fan Bao et.al. 2405.04233v1 null
2024-05-07 Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model Joo Young Choi et.al. 2405.03958v1 null
2024-05-06 MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View Emmanuelle Bourigault et.al. 2405.03894v1 null
2024-05-06 MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization Massimiliano Pappa et.al. 2405.03803v1 null
2024-05-06 Synthetic Data from Diffusion Models Improve Drug Discovery Prediction Bing Hu et.al. 2405.03799v1 null
2024-05-06 GraphSL: An Open-Source Library for Graph Source Localization Approaches and Benchmark Datasets Junxiang Wang et.al. 2405.03724v1 link
2024-05-06 Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models Ludwig Winkler et.al. 2405.03549v1 null
2024-05-06 CCDM: Continuous Conditional Diffusion Models for Image Generation Xin Ding et.al. 2405.03546v1 link
2024-05-06 LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model Haowen Sun et.al. 2405.03485v1 link
2024-05-06 Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond Jiuxiang Gu et.al. 2405.03251v1 null
2024-05-06 Hyperbolic Geometric Latent Diffusion Model for Graph Generation Xingcheng Fu et.al. 2405.03188v1 link
2024-05-06 DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging Wenxin Fan et.al. 2405.03159v1 null
2024-05-06 Video Diffusion Models: A Survey Andrew Melnik et.al. 2405.03150v1 null
2024-05-06 AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding Tao Liu et.al. 2405.03121v1 link
2024-05-05 Matten: Video Generation with Mamba-Attention Yu Gao et.al. 2405.03025v1 null
2024-05-05 Exploring Text-based Realistic Building Facades Editing Applicaiton Jing Wang et.al. 2405.02967v1 null
2024-05-05 Efficient Text-driven Motion Generation via Latent Consistency Training Mengxian Hu et.al. 2405.02791v1 null
2024-05-04 DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model Liangqi Lei et.al. 2405.02696v1 null
2024-05-08 Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI Minhui Yu et.al. 2405.02504v2 null
2024-05-03 Continuous Learned Primal Dual Christina Runkel et.al. 2405.02478v1 null
2024-05-03 CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding Kaiyuan Chen et.al. 2405.02384v1 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280v1 null
2024-05-03 Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling Radek Erban et.al. 2405.02117v1 null
2024-05-03 DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model Peijin Jia et.al. 2405.02008v1 null
2024-05-03 Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition Yichun Tai et.al. 2405.01872v1 null
2024-05-03 Creation of Novel Soft Robot Designs using Generative AI Wee Kiat Chan et.al. 2405.01824v1 null
2024-05-03 Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics Rucha Deshpande et.al. 2405.01822v1 null
2024-05-02 Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model Zongyang Du et.al. 2405.01730v1 null
2024-05-02 Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning Rafael Elberg et.al. 2405.01705v1 link
2024-05-02 LocInv: Localization-aware Inversion for Text-Guided Image Editing Chuanming Tang et.al. 2405.01496v1 link
2024-05-02 Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models Matias Mendieta et.al. 2405.01494v1 null
2024-05-02 Statistical algorithms for low-frequency diffusion data: A PDE approach Matteo Giordano et.al. 2405.01372v1 link
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248v1 null
2024-05-02 Automated Virtual Product Placement and Assessment in Images using Diffusion Models Mohammad Mahmudul Alam et.al. 2405.01130v1 null
2024-05-02 Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields Yuhang Huang et.al. 2405.00998v1 null
2024-05-02 Generative manufacturing systems using diffusion models and ChatGPT Xingyu Li et.al. 2405.00958v1 null
2024-05-02 EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion Guangyao Zhai et.al. 2405.00915v1 null
2024-05-01 SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models Burak Can Biner et.al. 2405.00878v1 null
2024-05-01 Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers Palawat Busaranuvong et.al. 2405.00858v1 null
2024-05-01 ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties Jiahui Li et.al. 2405.00797v1 link
2024-05-01 Obtaining Favorable Layouts for Multiple Object Generation Barak Battash et.al. 2405.00791v1 null
2024-05-01 Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models Xiaoshi Wu et.al. 2405.00760v1 null
2024-05-01 TexSliders: Diffusion-Based Texture Editing in CLIP Space Julia Guerrero-Viu et.al. 2405.00672v1 null
2024-05-01 RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models Zheng Zeng et.al. 2405.00666v1 null
2024-05-01 Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure Assefa Seyoum Wahd et.al. 2405.00631v1 null
2024-05-01 Lane Segmentation Refinement with Diffusion Models Antonio Ruiz et.al. 2405.00620v1 null
2024-05-01 Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus Ayub Ahmadi et.al. 2405.00473v1 null
2024-05-01 Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable Haozhe Liu et.al. 2405.00466v1 null
2024-05-01 Detail-Enhancing Framework for Reference-Based Image Super-Resolution Zihan Wang et.al. 2405.00431v1 null
2024-05-01 Streamlining Image Editing with Layered Diffusion Brushes Peyman Gholami et.al. 2405.00313v1 null
2024-05-02 An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions Samuel A. Isaacson et.al. 2405.00283v2 null
2024-05-01 ASAM: Boosting Segment Anything Model with Adversarial Tuning Bo Li et.al. 2405.00256v1 link
2024-04-30 Semantically Consistent Video Inpainting with Conditional Diffusion Models Dylan Green et.al. 2405.00251v1 null
2024-04-30 IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images Shadab Ahamed et.al. 2405.00239v1 link
2024-04-30 SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Haohe Liu et.al. 2405.00233v1 null
2024-04-30 Target-Specific De Novo Peptide Binder Design with DiffPepBuilder Fanhao Wang et.al. 2405.00128v1 null
2024-04-30 MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Wenxun Dai et.al. 2404.19759v1 null
2024-04-30 Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting Paul Engstler et.al. 2404.19758v1 null
2024-04-30 Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation Ian Dunn et.al. 2404.19739v1 link
2024-04-30 X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models Emmanuelle Bourigault et.al. 2404.19604v1 null
2024-04-30 MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction Luxi Chen et.al. 2404.19525v1 link
2024-04-30 TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models Teng Zhou et.al. 2404.19475v1 null
2024-04-30 Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective Xiaoxuan Han et.al. 2404.19382v1 link
2024-04-30 Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model Wentao Lei et.al. 2404.19277v1 null
2024-04-30 DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets Xiaoyu Huang et.al. 2404.19264v1 null
2024-04-30 CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition Jianzong Wang et.al. 2404.19187v1 null
2024-04-29 Stylus: Automatic Adapter Selection for Diffusion Models Michael Luo et.al. 2404.18928v1 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919v1 link
2024-04-29 Learning general Gaussian mixtures with efficient score matching Sitan Chen et.al. 2404.18893v1 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886v1 link
2024-04-29 Learning Mixtures of Gaussians Using Diffusion Models Khashayar Gatmiry et.al. 2404.18869v1 null
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820v1 null
2024-04-29 Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting Yifei Gao et.al. 2404.18669v1 null
2024-04-29 FlexiFilm: Long Video Generation with Flexible Conditions Yichen Ouyang et.al. 2404.18620v1 link
2024-04-29 Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting Tianyidan Xie et.al. 2404.18598v1 null
2024-04-26 FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion Abhishek Kumar Singh et.al. 2404.18591v1 null
2024-05-01 U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models Song Mei et.al. 2404.18444v2 null
2024-04-28 Fisher Information Improved Training-Free Conditional Diffusion Model Kaiyu Song et.al. 2404.18252v1 null
2024-04-28 Paint by Inpaint: Learning to Add Image Objects by Removing Them First Navve Wasserman et.al. 2404.18212v1 link
2024-04-28 Generative AI for Visualization: State of the Art and Future Directions Yilin Ye et.al. 2404.18144v1 null
2024-04-28 Generative AI for Low-Carbon Artificial Intelligence of Things Jinbo Wen et.al. 2404.18077v1 null
2024-04-28 Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model Xiaolong Li et.al. 2404.18065v1 null
2024-04-28 Exposing Text-Image Inconsistency Using Diffusion Models Mingzhen Huang et.al. 2404.18033v1 link
2024-04-30 Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching Robert Denkert et.al. 2404.17939v2 null
2024-04-27 Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling Di Wu et.al. 2404.17900v1 null
2024-04-27 DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction Chenhe Du et.al. 2404.17890v1 null
2024-04-27 Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission Mingyu Yang et.al. 2404.17736v1 null
2024-04-27 Causal Diffusion Autoencoders: Toward Counterfactual Generation via Diffusion Probabilistic Models Aneesh Komanduri et.al. 2404.17735v1 null
2024-04-26 Stocking and Harvesting Effects in Advection-Reaction-Diffusion Model: Exploring Decoupled Algorithms and Analysis Mayesha Sharmim Tisha et.al. 2404.17702v1 null
2024-04-26 MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Shangzhan Zhang et.al. 2404.17569v1 null
2024-04-26 Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations Pierluigi Colli et.al. 2404.17506v1 null
2024-04-26 Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation Seungwook Kim et.al. 2404.17419v1 null
2024-04-29 MV-VTON: Multi-View Virtual Try-On with Diffusion Models Haoyu Wang et.al. 2404.17364v2 link
2024-04-26 Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model Yushen Xu et.al. 2404.17357v1 null
2024-04-26 Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection Jiawei Song et.al. 2404.17254v1 null
2024-04-26 Few-shot Calligraphy Style Learning Fangda Chen et.al. 2404.17199v1 link
2024-04-25 CyNetDiff -- A Python Library for Accelerated Implementation of Network Diffusion Models Eliot W. Robson et.al. 2404.17059v1 link
2024-04-25 Universal fragmentation in annihilation reactions with constrained kinetics Enrique Rozas Garcia et.al. 2404.16950v1 null
2024-04-25 Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method A. Emir Gumrukcuoglu et.al. 2404.16658v1 null
2024-04-29 MuseumMaker: Continual Style Customization without Catastrophic Forgetting Chenxi Liu et.al. 2404.16612v2 null
2024-04-29 Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models Parul Gupta et.al. 2404.16556v2 null
2024-04-25 DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference Zhihao Shuai et.al. 2404.16474v1 null
2024-04-25 TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models Haomiao Ni et.al. 2404.16306v1 null
2024-04-25 CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions Haoyuan Li et.al. 2404.16302v1 link
2024-04-25 One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns Arman Maesumi et.al. 2404.16292v1 null
2024-04-24 Editable Image Elements for Controllable Synthesis Jiteng Mu et.al. 2404.16029v1 null
2024-04-24 RetinaRegNet: A Versatile Approach for Retinal Image Registration Vishal Balaji Sivaraman et.al. 2404.16017v1 link
2024-04-24 MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User's Preference Yexin Liu et.al. 2404.15801v1 null
2024-04-24 MotionMaster: Training-free Camera Motion Transfer For Video Generation Teng Hu et.al. 2404.15789v1 null
2024-04-24 Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations Kaiwen Xue et.al. 2404.15766v1 link
2024-04-24 DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images Orazio Pontorno et.al. 2404.15697v1 null
2024-04-24 Generative Diffusion Model (GDM) for Optimization of Wi-Fi Networks Tie Liu et.al. 2404.15684v1 null
2024-04-24 AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI Yiming Che et.al. 2404.15683v1 null
2024-04-27 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models Qinghe Wang et.al. 2404.15677v2 link
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625v1 null
2024-04-26 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620v2 link
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449v1 null
2024-04-23 GLoD: Composing Global Contexts and Local Details in Image Generation Moyuru Yamada et.al. 2404.15447v1 null
2024-04-23 ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model Yuanshao Zhu et.al. 2404.15380v1 null
2024-04-23 Heat flow, log-concavity, and Lipschitz transport maps Giovanni Brigati et.al. 2404.15205v1 null
2024-04-23 CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method Mingbao Lin et.al. 2404.15141v1 link
2024-04-23 Taming Diffusion Probabilistic Models for Character Control Rui Chen et.al. 2404.15121v1 null
2024-04-23 Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models Jingyao Xu et.al. 2404.15081v1 null
2024-04-23 Music Style Transfer With Diffusion Model Hong Huang et.al. 2404.14771v1 null
2024-04-23 Gradient Guidance for Diffusion Models: An Optimization Perspective Yingqing Guo et.al. 2404.14743v1 null
2024-04-25 FlashSpeech: Efficient Zero-Shot Speech Synthesis Zhen Ye et.al. 2404.14700v3 null
2024-04-23 DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance Linxuan Xin et.al. 2404.14676v1 null
2024-04-22 UVMap-ID: A Controllable and Personalized UV Map Generative Model Weijie Wang et.al. 2404.14568v1 link
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507v1 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410v1 null
2024-04-22 GeoDiffuser: Geometry-Based Image Editing with Diffusion Models Rahul Sajnani et.al. 2404.14403v1 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381v1 link
2024-04-22 Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion Alexander Shmakov et.al. 2404.14332v1 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329v1 null
2024-04-22 Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity Yu Hou et.al. 2404.14240v1 link
2024-04-22 MultiBooth: Towards Generating All Your Concepts in an Image from Text Chenyang Zhu et.al. 2404.14239v1 link
2024-04-22 Face2Face: Label-driven Facial Retouching Restoration Guanhua Zhao et.al. 2404.14177v1 null
2024-04-22 FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on Chenhui Wang et.al. 2404.14162v1 null
2024-04-22 Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments Jiacheng Wang et.al. 2404.14140v1 null
2024-04-23 RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification Hai Ci et.al. 2404.14055v2 link
2024-04-22 RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance Chengrui Wang et.al. 2404.13984v1 null
2024-04-24 MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets Zeyu Li et.al. 2404.13923v2 null
2024-04-23 Accelerating Image Generation with Sub-path Linear Approximation Model Chen Xu et.al. 2404.13903v2 null
2024-04-22 Towards Better Text-to-Image Generation Alignment via Attention Modulation Yihang Wu et.al. 2404.13899v1 null
2024-04-23 Decoherence of a charged Brownian particle in a magnetic field : an analysis of the roles of coupling via position and momentum variables Suraka Bhattacharjee et.al. 2404.13883v2 null
2024-04-21 Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions Steven A. Grosz et.al. 2404.13791v1 null
2024-04-21 Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control Maria Mihaela Trusca et.al. 2404.13766v1 null
2024-04-21 A Splice Method for Local-to-Nonlocal Coupling of Weak Forms Shuai Jiang et.al. 2404.13744v1 null
2024-04-21 Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models Vitali Petsiuk et.al. 2404.13706v1 null
2024-04-21 Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis Yuxi Ren et.al. 2404.13686v1 null
2024-04-21 An Integrated Communication and Computing Scheme for Wi-Fi Networks based on Generative AI and Reinforcement Learning Xinyang Du et.al. 2404.13598v1 null
2024-04-21 Motion-aware Latent Diffusion Models for Video Frame Interpolation Zhilin Huang et.al. 2404.13534v1 null
2024-04-21 Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion Hongyu Zhu et.al. 2404.13518v1 null
2024-04-21 ODE-DPS: ODE-based Diffusion Posterior Sampling for Inverse Problems in Partial Differential Equation Enze Jiang et.al. 2404.13496v1 null
2024-04-21 Accelerating the Generation of Molecular Conformations with Progressive Distillation of Equivariant Latent Diffusion Models Romain Lacombe et.al. 2404.13491v1 link
2024-04-20 Music Consistency Models Zhengcong Fei et.al. 2404.13358v1 null
2024-04-20 Generating Daylight-driven Architectural Design via Diffusion Models Pengzhi Li et.al. 2404.13353v1 null
2024-04-20 Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think Haotian Xue et.al. 2404.13320v1 link
2024-04-20 Latent Schr{ö}dinger Bridge Diffusion Model for Generative Learning Yuling Jiao et.al. 2404.13309v1 null
2024-04-20 PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition Xi Fang et.al. 2404.13299v1 null
2024-04-20 Optimal Control of a Sub-diffusion Model using Dirichlet-Neumann and Neumann-Neumann Waveform Relaxation Algorithms Soura Sana et.al. 2404.13283v1 null
2024-04-20 A Massive MIMO Sampling Detection Strategy Based on Denoising Diffusion Model Lanxin He et.al. 2404.13281v1 null
2024-04-20 FilterPrompt: Guiding Image Transfer in Diffusion Models Xi Wang et.al. 2404.13263v1 null
2024-04-19 DISC: Latent Diffusion Models with Self-Distillation from Separated Conditions for Prostate Cancer Grading Man M. Ho et.al. 2404.13097v1 null
2024-04-19 Analysis of Classifier-Free Guidance Weight Schedulers Xi Wang et.al. 2404.13040v1 null
2024-04-19 RadRotator: 3D Rotation of Radiographs with Diffusion Models Pouria Rouzrokh et.al. 2404.13000v1 null
2024-04-19 Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics Xiaofei Wang et.al. 2404.12973v1 null
2024-04-19 Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling Grigory Bartosh et.al. 2404.12940v1 null
2024-04-19 Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models Konstantinos Vilouras et.al. 2404.12920v1 null
2024-04-19 Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images Santosh et.al. 2404.12908v1 link
2024-04-19 ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model Dingming Liu et.al. 2404.12903v1 null
2024-04-19 Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing Teng-Fang Hsiao et.al. 2404.12900v1 link
2024-04-19 MCM: Multi-condition Motion Synthesis Framework Zeyu Ling et.al. 2404.12886v1 null
2024-04-19 Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models Georges Le Bellier et.al. 2404.12667v1 null
2024-04-19 F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation Man M. Ho et.al. 2404.12650v1 null
2024-04-19 Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework Sheng Wang et.al. 2404.12624v1 null
2024-04-19 Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization Junjie Li et.al. 2404.12611v1 null
2024-04-18 GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models Sai Sree Harsha et.al. 2404.12541v1 null
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383v1 null
2024-04-18 Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models Trevor J. Chan et.al. 2404.12361v1 null
2024-04-18 AniClipart: Clipart Animation with Text-to-Video Priors Ronghuan Wu et.al. 2404.12347v1 null
2024-04-18 Guided Discrete Diffusion for Electronic Health Record Generation Zixiang Chen et.al. 2404.12314v1 null
2024-04-18 StyleBooth: Image Style Editing with Multimodal Instruction Zhen Han et.al. 2404.12154v1 link
2024-04-18 LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Thibault Castells et.al. 2404.11936v1 null
2024-04-18 FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models Wei Wu et.al. 2404.11895v1 null
2024-04-17 Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning Marzi Heidari et.al. 2404.11795v1 null
2024-04-17 Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning Muheng Li et.al. 2404.11741v1 null
2024-04-17 Factorized Diffusion: Perceptual Illusions by Noise Decomposition Daniel Geng et.al. 2404.11615v1 null
2024-04-17 IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination Xi Chen et.al. 2404.11593v1 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589v1 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh Wang et.al. 2404.11565v1 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554v1 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537v1 null
2024-04-17 Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt Zhanjie Zhang et.al. 2404.11474v1 link
2024-04-17 Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption Buzhen Huang et.al. 2404.11291v1 link
2024-04-17 Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case João Gabriel Vinholi et.al. 2404.11243v1 null
2024-04-17 RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models Han Huang et.al. 2404.11199v1 link
2024-04-19 LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models Dingkun Zhang et.al. 2404.11098v3 null
2024-04-16 Molecular relaxation by reverse diffusion with time step prediction Khaled Kahouli et.al. 2404.10935v1 link
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765v1 null
2024-04-16 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang et.al. 2404.10763v1 link
2024-04-19 GazeHTA: End-to-end Gaze Target Detection with Head-Target Association Zhi-Yi Lin et.al. 2404.10718v2 null
2024-04-16 Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution Yutao Yuan et.al. 2404.10688v1 link
2024-04-16 Generating Human Interaction Motions in Scenes with Text Control Hongwei Yi et.al. 2404.10685v1 null
2024-04-16 StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization Yingshu Chen et.al. 2404.10681v1 null
2024-04-18 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay Jinmei Liu et.al. 2404.10662v2 link
2024-04-16 Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences Seungwook Kim et.al. 2404.10603v1 null
2024-04-17 Do Counterfactual Examples Complicate Adversarial Training? Eric Yeats et.al. 2404.10588v2 null
2024-04-17 AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation Lijun Liu et.al. 2404.10573v2 null
2024-04-16 A bridge between spatial and first-passage properties of continuous and discrete time stochastic processes: from hard walls to absorbing boundary conditions Mathis Guéneau et.al. 2404.10537v1 null
2024-04-16 Four-hour thunderstorm nowcasting using deep diffusion models of satellite Kuai Dai et.al. 2404.10512v1 null
2024-04-16 SparseDM: Toward Sparse Efficient Diffusion Models Kafeng Wang et.al. 2404.10445v1 null
2024-04-16 Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior Yiqian Wu et.al. 2404.10394v1 null
2024-04-16 Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery Payal Varshney et.al. 2404.10356v1 null
2024-04-18 Efficiently Adversarial Examples Generation for Visual-Language Models under Targeted Transfer Scenarios using Diffusion Models Qi Guo et.al. 2404.10335v2 null
2024-04-17 OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model Runyi Li et.al. 2404.10312v2 null
2024-04-16 EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion Cindy Le et.al. 2404.10279v1 null
2024-04-16 OneActor: Consistent Character Generation via Cluster-Conditioned Guidance Jiahao Wang et.al. 2404.10267v1 null
2024-04-16 Diffusion assisted image reconstruction in optoacoustic tomography M. G. González et.al. 2404.10239v1 null
2024-04-15 Salient Object-Aware Background Generation using Text-Guided Diffusion Models Amir Erfan Eshratifar et.al. 2404.10157v1 link
2024-04-15 Taming Latent Diffusion Model for Neural Radiance Field Inpainting Chieh Hubert Lin et.al. 2404.09995v1 null
2024-04-15 in2IN: Leveraging individual Information to Generate Human INteractions Pablo Ruiz Ponce et.al. 2404.09988v1 null
2024-04-15 MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models Nithin Gopalakrishnan Nair et.al. 2404.09977v1 null
2024-04-15 Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers Nithin Gopalakrishnan Nair et.al. 2404.09976v1 null
2024-04-15 Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model Han Lin et.al. 2404.09967v1 null
2024-04-16 Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization Navonil Majumder et.al. 2404.09956v2 link
2024-04-15 A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance Eran Bamani et.al. 2404.09846v1 null
2024-04-17 Digging into contrastive learning for robust depth estimation with diffusion models Jiyuan Wang et.al. 2404.09831v2 null
2024-04-15 Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian et.al. 2404.09735v1 link
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732v1 link
2024-04-15 All-in-one simulation-based inference Manuel Gloeckler et.al. 2404.09636v1 link
2024-04-15 TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models Haojun Sun et.al. 2404.09532v1 null
2024-04-15 Magic Clothing: Controllable Garment-Driven Image Synthesis Weifeng Chen et.al. 2404.09512v1 link
2024-04-15 PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI Yandan Yang et.al. 2404.09465v1 null
2024-04-15 Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models Peifei Zhu et.al. 2404.09401v1 null
2024-04-14 Fault Detection in Mobile Networks Using Diffusion Models Mohamad Nabeel et.al. 2404.09240v1 null
2024-04-14 DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling Xuening Yuan et.al. 2404.09227v1 null
2024-04-16 LoopAnimate: Loopable Salient Object Animation Fanyi Wang et.al. 2404.09172v2 null
2024-04-14 RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion Guoxuan Chi et.al. 2404.09140v1 link
2024-04-13 Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective Yuguang Shi et.al. 2404.09051v1 null
2024-04-13 Theoretical research on generative diffusion models: an overview Melike Nur Yeğin et.al. 2404.09016v1 null
2024-04-13 Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles Abhijnan Nath et.al. 2404.08949v1 link
2024-04-13 Enforcing Paraphrase Generation via Controllable Latent Diffusion Wei Zou et.al. 2404.08938v1 link
2024-04-17 Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives Yidan Liu et.al. 2404.08926v2 null
2024-04-13 ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model Kai Tang et.al. 2404.08892v1 null
2024-04-12 Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation Brinnae Bent et.al. 2404.08799v1 null
2024-04-12 Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models Katie Christensen et.al. 2404.08797v1 null
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580v1 null
2024-04-12 PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction Siming Shan et.al. 2404.08412v1 null
2024-04-12 Struggle with Adversarial Defense? Try Diffusion Yujie Li et.al. 2404.08273v1 null
2024-04-12 Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models Zeyu Yang et.al. 2404.08254v1 null
2024-04-12 Interest Maximization in Social Networks Rahul Kumar Gautam et.al. 2404.08236v1 null
2024-04-11 ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Ming Li et.al. 2404.07987v1 null
2024-04-11 Taming Stable Diffusion for Text to 360° Panorama Image Generation Cheng Zhang et.al. 2404.07949v1 link
2024-04-11 Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations Yunhong Deng et.al. 2404.07844v1 null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773v1 null
2024-04-11 An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization Minshuo Chen et.al. 2404.07771v1 null
2024-04-11 Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations Yufeng Yue et.al. 2404.07770v1 null
2024-04-11 Diffusing in Someone Else's Shoes: Robotic Perspective Taking with Diffusion Josua Spisak et.al. 2404.07735v1 null
2024-04-11 Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Tuomas Kynkäänniemi et.al. 2404.07724v1 null
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600v1 null
2024-04-11 ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation Stanislav Frolov et.al. 2404.07564v1 null
2024-04-11 Effects of phase separation on extinction times in population models Janik Schüttler et.al. 2404.07563v1 null
2024-04-11 CAT: Contrastive Adapter Training for Personalized Image Generation Jae Wan Park et.al. 2404.07554v1 link
2024-04-10 Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models Yasi Zhang et.al. 2404.07389v1 null
2024-04-10 GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models Zewei Zhang et.al. 2404.07206v1 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199v1 null
2024-04-14 InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models Jiale Xu et.al. 2404.07191v2 link
2024-04-10 Move Anything with Layered Scene Diffusion Jiawei Ren et.al. 2404.07178v1 null
2024-04-10 Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion Alexander Lobashev et.al. 2404.07029v1 link
2024-04-10 DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting Shijie Zhou et.al. 2404.06903v1 null
2024-04-10 Fine color guidance in diffusion models and its application to image compression at extremely low bitrates Tom Bordin et.al. 2404.06865v1 null
2024-04-10 UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion Junsheng Zhou et.al. 2404.06851v1 null
2024-04-10 Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer Yanqi Ge et.al. 2404.06835v1 null
2024-04-10 Zero-shot Point Cloud Completion Via 2D Priors Tianxin Huang et.al. 2404.06814v1 null
2024-04-10 Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior Fan Lu et.al. 2404.06780v1 null
2024-04-10 DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space Jianxiang Xiang et.al. 2404.06760v1 null
2024-04-11 Disguised Copyright Infringement of Latent Diffusion Models Yiwei Lu et.al. 2404.06737v2 null
2024-04-10 Efficient Denoising using Score Embedding in Score-based Diffusion Models Andrew S. Na et.al. 2404.06661v1 null
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542v1 null
2024-04-09 GeoDirDock: Guiding Docking Along Geodesic Paths Raúl Miñán et.al. 2404.06481v1 null
2024-04-09 Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion Fan Yang et.al. 2404.06429v1 null
2024-04-09 ZeST: Zero-Shot Material Transfer from a Single Image Ta-Ying Cheng et.al. 2404.06425v1 null
2024-04-09 Policy-Guided Diffusion Matthew Thomas Jackson et.al. 2404.06356v1 link
2024-04-09 Quantum State Generation with Structure-Preserving Diffusion Model Yuchen Zhu et.al. 2404.06336v1 null
2024-04-09 DiffHarmony: Latent Diffusion Model Meets Image Harmonization Pengfei Zhou et.al. 2404.06139v1 null
2024-04-09 Hash3D: Training-free Acceleration for 3D Generation Xingyi Yang et.al. 2404.06091v1 link
2024-04-09 Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data Kai Luan et.al. 2404.06012v1 null
2024-04-13 Tackling Structural Hallucination in Image Translation with Local Diffusion Seunghoi Kim et.al. 2404.05980v2 null
2024-04-09 Map Optical Properties to Subwavelength Structures Directly via a Diffusion Model Shijie Rao et.al. 2404.05959v1 null
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674v1 null
2024-04-08 YaART: Yet Another ART Rendering Technology Sergey Kastryulin et.al. 2404.05666v1 null
2024-04-08 BinaryDM: Towards Accurate Binarization of Diffusion Model Xingyu Zheng et.al. 2404.05662v1 link
2024-04-08 Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model Jichang Yang et.al. 2404.05648v1 null
2024-04-08 Learning a Category-level Object Pose Estimator without Pose Annotations Fengrui Tian et.al. 2404.05626v1 null
2024-04-08 UniFL: Improve Stable Diffusion via Unified Feedback Learning Jiacheng Zhang et.al. 2404.05595v1 null
2024-04-08 Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed et.al. 2404.05519v1 null
2024-04-08 Taming Transformers for Realistic Lidar Point Cloud Generation Hamed Haghighi et.al. 2404.05505v1 link
2024-04-08 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance Dazhong Shen et.al. 2404.05384v1 link
2024-04-08 Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Zhiqi Huang et.al. 2404.05331v1 null
2024-04-08 Text-to-Image Synthesis for Any Artistic Styles: Advancements in Personalized Artistic Image Generation via Subdivision and Dual Binding Junseo Park et.al. 2404.05256v1 null
2024-04-08 DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation Yingtao Tian et.al. 2404.05212v1 null
2024-04-07 Context-dependent Causality (the Non-Nonotonic Case) Nir Billfeld et.al. 2404.05021v1 null
2024-04-07 Generative downscaling of PDE solvers with physics-guided diffusion models Yulong Lu et.al. 2404.05009v1 link
2024-04-07 Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models Zijin Yang et.al. 2404.04956v1 null
2024-04-07 Regularized Conditional Diffusion Model for Multi-Task Preference Alignment Xudong Yu et.al. 2404.04920v1 null
2024-04-07 Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder Yiyang Ma et.al. 2404.04916v1 null
2024-04-07 ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model Binghui Chen et.al. 2404.04833v1 null
2024-04-07 Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving Jinlong Li et.al. 2404.04804v1 null
2024-04-07 Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution Guangyuan Li et.al. 2404.04785v1 link
2024-04-06 InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization Xiefan Guo et.al. 2404.04650v1 link
2024-04-06 DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation Duy-Tho Le et.al. 2404.04629v1 null
2024-04-11 Diffusion Time-step Curriculum for One Image to 3D Generation Xuanyu Yi et.al. 2404.04562v2 link
2024-04-06 BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion Gwanghyun Kim et.al. 2404.04544v1 null
2024-04-06 DATENeRF: Depth-Aware Text-based Editing of NeRFs Sara Rojas et.al. 2404.04526v1 null
2024-04-06 Latent-based Diffusion Model for Long-tailed Recognition Pengxiao Han et.al. 2404.04517v1 null
2024-04-06 Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models Zhengcong Fei et.al. 2404.04478v1 link
2024-04-06 Aligning Diffusion Models by Optimizing Human Utility Shufan Li et.al. 2404.04465v1 null
2024-04-05 Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback Mo Kordzanganeh et.al. 2404.04356v1 null
2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models Sangwon Jang et.al. 2404.04243v1 null
2024-04-05 ToolEENet: Tool Affordance 6D Pose Estimation Yunlong Wang et.al. 2404.04193v1 null
2024-04-05 Dynamic Prompt Optimizing for Text-to-Image Generation Wenyi Mo et.al. 2404.04095v1 link
2024-04-05 Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation Mingyuan Zhou et.al. 2404.04057v1 null
2024-04-05 Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Gihyun Kwon et.al. 2404.03913v1 null
2024-04-04 Bi-level Guided Diffusion Models for Zero-Shot Medical Imaging Inverse Problems Hossein Askari et.al. 2404.03706v1 null
2024-04-04 Mitigating analytical variability in fMRI results with style transfer Elodie Germani et.al. 2404.03703v1 null
2024-04-04 MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation Hanzhe Hu et.al. 2404.03656v1 null
2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang et.al. 2404.03653v1 link
2024-04-04 The More You See in 2D, the More You Perceive in 3D Xinyang Han et.al. 2404.03652v1 null
2024-04-04 DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior Yiming Zhang et.al. 2404.03642v1 null
2024-04-04 LCM-Lookahead for Encoder-based Text-to-Image Personalization Rinon Gal et.al. 2404.03620v1 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595v1 link
2024-04-04 PointInfinity: Resolution-Invariant Point Diffusion Models Zixuan Huang et.al. 2404.03566v1 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541v1 null
2024-04-04 A Directional Diffusion Graph Transformer for Recommendation Zixuan Yi et.al. 2404.03326v1 null
2024-04-04 SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models Aditya Shankar et.al. 2404.03299v1 null
2024-04-04 Future-Proofing Class Incremental Learning Quentin Jodelet et.al. 2404.03200v1 null
2024-04-04 HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud Wencan Cheng et.al. 2404.03159v1 link
2024-04-04 DreamWalk: Style Space Exploration using Diffusion Guidance Michelle Shu et.al. 2404.03145v1 null
2024-04-04 Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Kaixin Zhang et.al. 2404.03144v1 null
2024-04-04 The Diffusive Ultrasound Modulated Bioluminescence Tomography with Partial Data and Uncertain Optical Parameters Tianyu Yang et.al. 2404.03124v1 null
2024-04-03 Many-to-many Image Generation with Auto-regressive Diffusion Models Ying Shen et.al. 2404.03109v1 null
2024-04-03 Computing macroscopic reaction rates in reaction-diffusion systems using Monte Carlo simulations Mohamed Swailem et.al. 2404.03089v1 null
2024-04-03 ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale Jinbin Huang et.al. 2404.02990v1 null
2024-04-03 Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections Gabriel Loaiza-Ganem et.al. 2404.02954v1 null
2024-04-02 Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models Jiachen Ma et.al. 2404.02928v1 null
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 link
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767v1 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747v1 link
2024-04-03 Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition Behrooz Razeghi et.al. 2404.02696v1 null
2024-04-03 Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models Matteo Pennisi et.al. 2404.02618v1 null
2024-04-03 A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion Zeyu Zhao et.al. 2404.02411v1 null
2024-04-03 Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint Yukun Li et.al. 2404.02396v1 null
2024-04-02 Semantic Augmentation in Images using Language Sahiti Yerramilli et.al. 2404.02353v1 null
2024-04-02 Heat Death of Generative Models in Closed-Loop Learning Matteo Marchi et.al. 2404.02325v1 null
2024-04-02 APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models Apan Dastider et.al. 2404.02284v1 null
2024-04-08 Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better Enshu Liu et.al. 2404.02241v2 link
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082v1 link
2024-04-03 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003v2 null
2024-04-07 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959v2 null
2024-04-02 Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model Xu He et.al. 2404.01862v1 link
2024-04-02 Upsample Guidance: Scale Up Diffusion Models without Training Juno Hwang et.al. 2404.01709v1 null
2024-04-05 FashionEngine: Interactive Generation and Editing of 3D Clothed Humans Tao Hu et.al. 2404.01655v2 null
2024-04-02 Diffusion Deepfake Chaitali Bhattacharyya et.al. 2404.01579v1 null
2024-04-01 Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction Jiacheng Xie et.al. 2404.01448v1 null
2024-04-01 DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery Yixuan Zhu et.al. 2404.01424v1 link
2024-04-01 Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data Matthias Gerstgrasser et.al. 2404.01413v1 null
2024-04-01 Bigger is not Always Better: Scaling Properties of Latent Diffusion Models Kangfu Mei et.al. 2404.01367v1 null
2024-04-01 MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space Armand Comas-Massagué et.al. 2404.01296v1 null
2024-04-01 CosmicMan: A Text-to-Image Foundation Model for Humans Shikai Li et.al. 2404.01294v1 null
2024-04-01 Measuring Style Similarity in Diffusion Models Gowthami Somepalli et.al. 2404.01292v1 link
2024-04-01 A Unified and Interpretable Emotion Representation and Expression Generation Reni Paskaleva et.al. 2404.01243v1 null
2024-04-02 StructLDM: Structured Latent Diffusion for 3D Human Generation Tao Hu et.al. 2404.01241v2 null
2024-04-01 Video Interpolation with Diffusion Models Siddhant Jain et.al. 2404.01203v1 null
2024-04-01 Uncovering the Text Embedding in Text-to-Image Diffusion Models Hu Yu et.al. 2404.01154v1 null
2024-04-01 UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models Zihan Guan et.al. 2404.01101v1 null
2024-04-01 Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On Xu Yang et.al. 2404.01089v1 null
2024-04-01 PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation Yunze Liu et.al. 2404.01081v1 null
2024-04-01 Towards Memorization-Free Diffusion Models Chen Chen et.al. 2404.00922v1 null
2024-04-01 The long-time behavior of solutions of a three-component reaction-diffusion model for the population dynamics of farmers and hunter-gatherers: the different motility case Dongyuan Xiao et.al. 2404.00907v1 null
2024-04-01 Model-Agnostic Human Preference Inversion in Diffusion Models Jeeyung Kim et.al. 2404.00879v1 null
2024-04-01 TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On Jiazheng Xing et.al. 2404.00878v1 link
2024-04-01 DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF Jie Long Lee et.al. 2404.00874v1 null
2024-04-01 Generating Content for HDR Deghosting from Frequency View Tao Hu et.al. 2404.00849v1 null
2024-04-01 Nonlinear ensemble filtering with diffusion models: Application to the surface quasi-geostrophic dynamics Feng Bao et.al. 2404.00844v1 null
2024-03-31 Towards Realistic Scene Generation with LiDAR Diffusion Models Haoxi Ran et.al. 2404.00815v1 link
2024-03-31 Unknown Prompt, the only Lacuna: Unveiling CLIP's Potential for Open Domain Generalization Mainak Singha et.al. 2404.00710v1 null
2024-03-31 DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion Chunyang Bi et.al. 2404.00661v1 null
2024-03-31 CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models Xiang Li et.al. 2404.00569v1 link
2024-04-02 Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction Junuk Cha et.al. 2404.00562v2 null
2024-03-31 Creating synthetic energy meter data using conditional diffusion and building metadata Chun Fu et.al. 2404.00525v1 null
2024-03-30 Denoising Monte Carlo Renders With Diffusion Models Vaibhav Vavilala et.al. 2404.00491v1 null
2024-03-30 DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans Akash Sengupta et.al. 2404.00485v1 null
2024-03-30 Score-Based Diffusion Models for Photoacoustic Tomography Image Reconstruction Sreemanti Dey et.al. 2404.00471v1 null
2024-03-30 Joint Pedestrian Trajectory Prediction through Posterior Sampling Haotian Lin et.al. 2404.00237v1 null
2024-03-30 Grid Diffusion Models for Text-to-Video Generation Taegyeong Lee et.al. 2404.00234v1 null
2024-03-30 Latent Watermark: Inject and Detect Watermarks in Latent Diffusion Space Zheling Meng et.al. 2404.00230v1 null
2024-03-29 FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model Molin Zhang et.al. 2404.00132v1 null
2024-04-02 GDA: Generalized Diffusion for Robust Test-time Adaptation Yun-Yun Tsai et.al. 2404.00095v2 null
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249v1 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193v1 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105v1 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079v1 null
2024-03-29 Probing solar modulation analytic models with cosmic ray periodic spectra Wei-Cheng Long et.al. 2403.20038v1 null
2024-04-01 Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting Haipeng Liu et.al. 2403.19898v2 link
2024-03-28 Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks Pooria Ashrafian et.al. 2403.19880v1 link
2024-03-28 ShapeFusion: A 3D diffusion model for localized shape editing Rolandos Alexandros Potamias et.al. 2403.19773v1 null
2024-03-28 MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models Hidir Yesiltepe et.al. 2403.19738v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-28 In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles Samir H. A. Mohammad et.al. 2403.19637v1 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600v1 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593v1 null
2024-03-28 Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings Marola W. Issa et.al. 2403.19544v1 null
2024-03-28 Debiasing Cardiac Imaging with Controlled Latent Diffusion Models Grzegorz Skorupko et.al. 2403.19508v1 link
2024-03-28 Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality Kyotaro Tokoro et.al. 2403.19428v1 link
2024-03-28 Imperceptible Protection against Style Imitation from Diffusion Models Namhyuk Ahn et.al. 2403.19254v1 null
2024-03-28 RecDiffusion: Rectangling for Image Stitching with Diffusion Models Tianhao Zhou et.al. 2403.19164v1 link
2024-03-28 MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation Seyeon Kim et.al. 2403.19144v1 link
2024-03-28 QNCD: Quantization Noise Correction for Diffusion Models Huanpeng Chu et.al. 2403.19140v1 link
2024-03-30 Egocentric Scene-aware Human Trajectory Prediction Weizhuo Wang et.al. 2403.19026v2 null
2024-03-27 TextCraftor: Your Text Encoder Can be Image Quality Controller Yanyu Li et.al. 2403.18978v1 null
2024-03-27 CPR: Retrieval Augmented Generation for Copyright Protection Aditya Golatkar et.al. 2403.18920v1 null
2024-03-27 A Geometric Explanation of the Likelihood OOD Detection Paradox Hamidreza Kamkari et.al. 2403.18910v1 link
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-04-01 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v3 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791v1 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775v1 link
2024-03-27 A Diffusion-Based Generative Equalizer for Music Restoration Eloi Moliner et.al. 2403.18636v1 link
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575v1 link
2024-03-27 Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review Mohammadreza Amirian et.al. 2403.18565v1 null
2024-03-27 CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection Jiayi Zhu et.al. 2403.18554v1 null
2024-03-27 CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans Aissam Djahnine et.al. 2403.18514v1 null
2024-03-27 Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models Guido Klein et.al. 2403.18486v1 null
2024-03-27 DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis Zhongxi Chen et.al. 2403.18471v1 link
2024-03-27 DiffStyler: Diffusion-based Localized Image Style Transfer Shaoxu Li et.al. 2403.18461v1 null
2024-03-27 SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model Inhwan Bae et.al. 2403.18452v1 link
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425v1 null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417v1 null
2024-03-27 Ship in Sight: Diffusion Models for Ship-Image Super Resolution Luigi Sigillo et.al. 2403.18370v1 link
2024-03-27 DODA: Diffusion for Object-detection Domain Adaptation in Agriculture Shuai Xiang et.al. 2403.18334v1 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259v1 null
2024-03-27 NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation Jingyang Huo et.al. 2403.18211v1 null
2024-03-28 Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models Kartikeya Bhardwaj et.al. 2403.18159v2 null
2024-03-26 Tutorial on Diffusion Models for Imaging and Vision Stanley H. Chan et.al. 2403.18103v1 null
2024-03-26 Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance Zan Wang et.al. 2403.18036v1 link
2024-03-30 Bidirectional Consistency Models Liangchen Li et.al. 2403.18035v2 null
2024-03-26 Mixing Artificial and Natural Intelligence: From Statistical Mechanics to AI and Back to Turbulence Michael et.al. 2403.17993v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870v1 null
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827v1 null
2024-03-26 Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields Rüveyda Yilmaz et.al. 2403.17808v1 null
2024-03-26 GenesisTex: Adapting Image Denoising Diffusion to Texture Space Chenjian Gao et.al. 2403.17782v1 null
2024-03-26 CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation Yongrui Yu et.al. 2403.17770v1 null
2024-03-26 AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation Huawei Wei et.al. 2403.17694v1 link
2024-03-26 Manifold-Guided Lyapunov Control with Diffusion Models Amartya Mukherjee et.al. 2403.17692v1 null
2024-03-26 Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright Disputes Uri Hacohen et.al. 2403.17691v1 null
2024-03-26 DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation Qilin Wang et.al. 2403.17664v1 null
2024-03-26 AniArtAvatar: Animatable 3D Art Avatar from a Single Image Shaoxu Li et.al. 2403.17631v1 null
2024-03-26 DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° Images Chuhan Jiao et.al. 2403.17477v1 null
2024-03-26 LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection Yunpeng Luo et.al. 2403.17465v1 null
2024-03-26 Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model Runmin Dong et.al. 2403.17460v1 link
2024-03-26 InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion Jihyun Lee et.al. 2403.17422v1 null
2024-03-26 A framework to identify supercritical and subcritical Turing bifurcations: Case study of a system sustaining cubic and quadratic autocatalysis Deepak Kumar et.al. 2403.17386v1 null
2024-03-26 Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance Donghoon Ahn et.al. 2403.17377v1 null
2024-03-25 Diffusion-based Negative Sampling on Graphs for Link Prediction Trung-Kien Nguyen et.al. 2403.17259v1 link
2024-03-25 Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models Li Qiao et.al. 2403.17256v1 null
2024-03-25 DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment Stella Bounareli et.al. 2403.17217v1 null
2024-03-25 AnimateMe: 4D Facial Expressions via Diffusion Models Dimitrios Gerogiannis et.al. 2403.17213v1 null
2024-03-25 Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions Stefan Andreas Baumann et.al. 2403.17064v1 link
2024-03-25 Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction Xingyu Xu et.al. 2403.17042v1 null
2024-03-25 Invertible Diffusion Models for Compressed Sensing Bin Chen et.al. 2403.17006v1 null
2024-03-25 TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models Zhongwei Zhang et.al. 2403.17005v1 null
2024-03-25 SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer Rui Zhu et.al. 2403.17004v1 null
2024-03-25 VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation Yang Chen et.al. 2403.17001v1 null
2024-03-25 Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution Zhikai Chen et.al. 2403.17000v1 null
2024-03-25 Comp4D: LLM-Guided Compositional 4D Scene Generation Dejia Xu et.al. 2403.16993v1 null
2024-03-25 Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation Omer Dahary et.al. 2403.16990v1 null
2024-03-25 Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance Jingyuan Zhu et.al. 2403.16954v1 null
2024-03-25 Multiple-Source Localization from a Single-Snapshot Observation Using Graph Bayesian Optimization Zonghan Zhang et.al. 2403.16818v1 link
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776v1 null
2024-03-25 Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 null
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627v1 null
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605v1 null
2024-03-25 Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization Xiangxin Zhou et.al. 2403.16576v1 null
2024-03-25 An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models Zizhao Hu et.al. 2403.16530v1 null
2024-03-25 Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models Ziyou Liang et.al. 2403.16513v1 null
2024-03-25 Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework Ziyao Huang et.al. 2403.16510v1 link
2024-03-25 Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Sanyam Lakhanpal et.al. 2403.16422v1 null
2024-03-25 FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models Lin Zhao et.al. 2403.16379v1 null
2024-03-24 Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis Atefeh Khoshkhahtinat et.al. 2403.16258v1 null
2024-03-24 Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing Yongqing Liang et.al. 2403.16207v1 null
2024-03-24 Diffusion Model is a Good Pose Estimator from 3D RF-Vision Junqiao Fan et.al. 2403.16198v1 null
2024-03-24 Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery Siddharth Tourani et.al. 2403.16194v1 link
2024-03-26 Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method Jie Tian et.al. 2403.16169v2 null
2024-03-24 Robust Diffusion Models for Adversarial Purification Guang Lin et.al. 2403.16067v1 null
2024-03-24 A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA Ayush Thakur et.al. 2403.16024v1 null
2024-03-23 Feature Manipulation for DDPM based Change Detection Zhenglin Li et.al. 2403.15943v1 null
2024-03-26 X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention You Xie et.al. 2403.15931v2 null
2024-03-23 Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance Jia-Wei Liao et.al. 2403.15878v1 link
2024-03-23 In-Context Matting He Guo et.al. 2403.15789v1 null
2024-03-23 Time-dependent localized patterns in a predator-prey model Fahad Al Saadi et.al. 2403.15788v1 null
2024-03-23 BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion Jia Wei et.al. 2403.15766v1 null
2024-03-22 An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models Zhengyi Zhao et.al. 2403.15559v1 null
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 Ultrasound Imaging based on the Variance of a Diffusion Restoration Model Yuxin Zhang et.al. 2403.15316v1 null
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309v1 null
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249v1 null
2024-03-22 Shadow Generation for Composite Image Using Diffusion model Qingyang Liu et.al. 2403.15234v1 link
2024-03-22 MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration Zhichao Wei et.al. 2403.15059v1 null
2024-03-22 Toward Tiny and High-quality Facial Makeup with Data Amplify Learning Qiaoqiao Jin et.al. 2403.15033v1 null
2024-03-22 Dynamics of a memory-based diffusion model with spatial heterogeneity and nonlinear boundary condition Quanli Ji et.al. 2403.14969v1 null
2024-03-22 DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow Kyungmin Lee et.al. 2403.14966v1 null
2024-03-22 CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model Seungdae Han et.al. 2403.14944v1 null
2024-03-22 STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians Yifei Zeng et.al. 2403.14939v1 null
2024-03-21 Osmosis: RGBD Diffusion Prior for Underwater Image Restoration Opher Bar Nathan et.al. 2403.14837v1 null
2024-03-25 Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2403.14828v2 link
2024-03-21 Latent Diffusion Models for Attribute-Preserving Image Anonymization Luca Piano et.al. 2403.14790v1 null
2024-03-21 Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance Shenhao Zhu et.al. 2403.14781v1 null
2024-03-21 StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text Roberto Henschel et.al. 2403.14773v1 link
2024-03-21 Open Knowledge Base Canonicalization with Multi-task Learning Bingchen Liu et.al. 2403.14733v1 null
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602v1 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499v1 link
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429v1 null
2024-03-21 DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning Jonathan Lebensold et.al. 2403.14421v1 null
2024-03-21 Physics-Informed Diffusion Models Jan-Hendrik Bastek et.al. 2403.14404v1 null
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291v1 link
2024-03-21 Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation Francesco Di Felice et.al. 2403.14279v1 null
2024-03-21 Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection Finn Behrendt et.al. 2403.14262v1 link
2024-03-21 Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition Sihyun Yu et.al. 2403.14148v1 null
2024-03-21 Protein Conformation Generation via Force-Guided SE(3) Diffusion Models Yan Wang et.al. 2403.14088v1 null
2024-03-21 QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping Zhuang Xiong et.al. 2403.14070v1 null
2024-03-21 LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models Hantao Zhang et.al. 2403.14066v1 null
2024-03-21 DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models Divyanshu Daiya et.al. 2403.14063v1 null
2024-03-20 Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques W. Tang et.al. 2403.13916v1 null
2024-03-20 Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models Richard Osuala et.al. 2403.13890v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 null
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788v1 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745v1 link
2024-03-20 DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance Zixuan Wang et.al. 2403.13667v1 link
2024-03-20 ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer Hiroki Azuma et.al. 2403.13652v1 null
2024-03-20 ReGround: Improving Textual and Spatial Grounding at No Cost Yuseung Lee et.al. 2403.13589v1 null
2024-03-20 Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing Hangeol Chang et.al. 2403.13551v1 null
2024-03-20 Compress3D: a Compressed Latent Space for 3D Generation from a Single Image Bowen Zhang et.al. 2403.13524v1 null
2024-03-20 VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis Yumeng Li et.al. 2403.13501v1 null
2024-03-20 Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion Lucas Nunes et.al. 2403.13470v1 link
2024-03-22 S2DM: Sector-Shaped Diffusion Models for Video Generation Haoran Lang et.al. 2403.13408v2 null
2024-03-20 IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis Feng Liu et.al. 2403.13378v1 link
2024-03-24 AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Jingkun An et.al. 2403.13352v2 null
2024-03-21 LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment Peishan Cong et.al. 2403.13307v2 null
2024-03-20 DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception Yibo Wang et.al. 2403.13304v1 null
2024-03-20 Building Optimal Neural Architectures using Interpretable Knowledge Keith G. Mills et.al. 2403.13293v1 link
2024-03-20 Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation Qitong Yang et.al. 2403.13238v1 null
2024-03-20 A Contact Model based on Denoising Diffusion to Learn Variable Impedance Control for Contact-rich Manipulation Masashi Okada et.al. 2403.13221v1 null
2024-03-20 Diffusion Model for Data-Driven Black-Box Optimization Zihao Li et.al. 2403.13219v1 null
2024-03-19 Depth-guided NeRF Training via Earth Mover's Distance Anita Rau et.al. 2403.13206v1 null
2024-03-19 Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos Hadi Alzayer et.al. 2403.13044v1 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962v1 link
2024-03-19 Zero-Reference Low-Light Enhancement via Physical Quadruple Priors Wenjing Wang et.al. 2403.12933v1 null
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915v1 null
2024-03-19 D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation Jun Yamada et.al. 2403.12861v1 null
2024-03-19 Generative Enhancement for 3D Medical Images Lingting Zhu et.al. 2403.12852v1 link
2024-03-19 Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation Yao Wei et.al. 2403.12848v1 null
2024-03-19 DreamDA: Generative Data Augmentation with Diffusion Models Yunxiang Fu et.al. 2403.12803v1 link
2024-03-19 WaveFace: Authentic Face Restoration with Efficient Frequency Recovery Yunqi Miao et.al. 2403.12760v1 null
2024-03-19 Towards Controllable Face Generation with Semantic Latent Diffusion Models Alex Ergasti et.al. 2403.12743v1 link
2024-03-19 AnimateDiff-Lightning: Cross-Model Diffusion Distillation Shanchuan Lin et.al. 2403.12706v1 null
2024-03-19 Tuning-Free Image Customization with Image and Text Guidance Pengzhi Li et.al. 2403.12658v1 null
2024-03-19 LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing Yazeed Alharbi et.al. 2403.12585v1 null
2024-03-19 Generalized Consistency Trajectory Models for Image Manipulation Beomsu Kim et.al. 2403.12510v1 link
2024-03-19 SC-Diff: 3D Shape Completion with Latent Diffusion Models Juan D. Galvis et.al. 2403.12470v1 null
2024-03-19 Do Generated Data Always Help Contrastive Learning? Yifei Wang et.al. 2403.12448v1 link
2024-03-19 Precise-Physics Driven Text-to-3D Generation Qingshan Xu et.al. 2403.12438v1 null
2024-03-19 ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance Yongwei Chen et.al. 2403.12409v1 null
2024-03-19 Understanding Training-free Diffusion Guidance: Mechanisms and Limitations Yifei Shen et.al. 2403.12404v1 null
2024-03-19 OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation Junhao Cai et.al. 2403.12396v1 null
2024-03-18 Removing Undesirable Concepts in Text-to-Image Generative Models with Learnable Prompts Anh Bui et.al. 2403.12326v1 null
2024-03-18 Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat? Melanie Mathys et.al. 2403.12207v1 null
2024-03-18 Latent CLAP Loss for Better Foley Sound Synthesis Tornike Karchkhadze et.al. 2403.12182v1 null
2024-03-18 Graph-Jigsaw Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection Ali Karami et.al. 2403.12172v1 null
2024-03-18 Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation Zixin Zhu et.al. 2403.12042v1 link
2024-03-19 MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control Enshen Zhou et.al. 2403.12037v2 link
2024-03-18 One-Step Image Translation with Text-to-Image Models Gaurav Parmar et.al. 2403.12036v1 link
2024-03-18 VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models Junlin Han et.al. 2403.12034v1 null
2024-03-19 Generic 3D Diffusion Adapter Using Controlled Multi-View Editing Hansheng Chen et.al. 2403.12032v2 null
2024-03-18 LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation Yushi Lan et.al. 2403.12019v1 null
2024-03-18 Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation Axel Sauer et.al. 2403.12015v1 null
2024-03-18 GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image Xiao Fu et.al. 2403.12013v1 null
2024-03-18 HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data Mengqi Zhang et.al. 2403.12011v1 null
2024-03-18 VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model Qi Zuo et.al. 2403.12010v1 null
2024-03-18 SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion Vikram Voleti et.al. 2403.12008v1 null
2024-03-18 SceneSense: Diffusion Models for 3D Occupancy Synthesis from Partial Observation Alec Reed et.al. 2403.11985v1 null
2024-03-18 Diffusion Denoising as a Certified Defense against Clean-label Poisoning Sanghyun Hong et.al. 2403.11981v1 null
2024-03-18 Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory Hengyu Fu et.al. 2403.11968v1 null
2024-03-18 LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model Runhui Huang et.al. 2403.11929v1 null
2024-03-18 Dual-Energy Cone-Beam CT Using Two Complementary Limited-Angle Scans with A Projection-Consistent Diffusion Model Junbo Peng et.al. 2403.11890v1 null
2024-03-18 SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules Xiangyu Chen et.al. 2403.11887v1 null
2024-03-18 IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images Meilin Wang et.al. 2403.11870v1 null
2024-03-18 Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm Yi Wu et.al. 2403.11781v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v2 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667v1 null
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641v1 null
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627v1 link
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614v1 null
2024-03-18 EffiVED:Efficient Video Editing via Text-instruction Diffusion Models Zhenghao Zhang et.al. 2403.11568v1 null
2024-03-18 EchoReel: Enhancing Action Generation of Existing Video Diffusion Models Jianzhi liu et.al. 2403.11535v1 link
2024-03-18 Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors Ruicheng Wang et.al. 2403.11503v1 null
2024-03-18 SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction Shuang Wang et.al. 2403.11482v1 link
2024-03-18 ALDM-Grasping: Diffusion-aided Zero-Shot Sim-to-Real Transfer for Robot Grasping Yiwei Li et.al. 2403.11459v1 null
2024-03-18 CasSR: Activating Image Power for Real-World Image Super-Resolution Haolan Chen et.al. 2403.11451v1 null
2024-03-18 VmambaIR: Visual State Space Model for Image Restoration Yuan Shi et.al. 2403.11423v1 link
2024-03-18 DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation Jeongsol Kim et.al. 2403.11415v1 null
2024-03-18 Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors Yazid Janati et.al. 2403.11407v1 null
2024-03-17 StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining Tushar Kataria et.al. 2403.11340v1 null
2024-03-17 Fast Personalized Text-to-Image Syntheses With Attention Injection Yuxuan Zhang et.al. 2403.11284v1 null
2024-03-17 Understanding Diffusion Models by Feynman's Path Integral Yuji Hirono et.al. 2403.11262v1 null
2024-03-17 THOR: Text to Human-Object Interaction Diffusion via Relation Intervention Qianyang Wu et.al. 2403.11208v1 null
2024-03-17 MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation Yasufumi Kawano et.al. 2403.11194v1 link
2024-03-17 Artifact Feature Purification for Cross-domain Detection of AI-generated Images Zheling Meng et.al. 2403.11172v1 null
2024-03-17 CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion Xiaoyu Wu et.al. 2403.11162v1 null
2024-03-17 Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model Dian Zheng et.al. 2403.11157v1 link
2024-03-17 Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications Yonggan Fu et.al. 2403.11131v1 null
2024-03-17 3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models Yongtao Ge et.al. 2403.11111v1 null
2024-03-17 Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models Ruibin Li et.al. 2403.11105v1 link
2024-03-19 Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model Kangyang Xie et.al. 2403.11077v2 null
2024-03-17 Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention Jie Ren et.al. 2403.11052v1 null
2024-03-16 Reward Guided Latent Consistency Distillation Jiachen Li et.al. 2403.11027v1 null
2024-03-16 OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Zhe Kong et.al. 2403.10983v1 link
2024-03-16 Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription Hongxiang Zhao et.al. 2403.10953v1 null
2024-03-19 Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation Yeongtak Oh et.al. 2403.10911v2 null
2024-03-19 Urban Sound Propagation: a Benchmark for 1-Step Generative Modeling of Complex Physical Systems Martin Spitznagel et.al. 2403.10904v2 null
2024-03-16 A Watermark-Conditioned Diffusion Model for IP Protection Rui Min et.al. 2403.10893v1 null
2024-03-16 stMCDI: Masked Conditional Diffusion Model with Graph Neural Network for Spatial Transcriptomics Data Imputation Xiaoyu Li et.al. 2403.10863v1 null
2024-03-16 MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections Mude Hui et.al. 2403.10815v1 link
2024-03-16 Efficient Trajectory Forecasting and Generation with Conditional Flow Matching Sean Ye et.al. 2403.10809v1 null
2024-03-16 Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference Fan Zhang et.al. 2403.10805v1 null
2024-03-16 Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games Zixuan Wu et.al. 2403.10794v1 link
2024-03-16 ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models Yuwen Chen et.al. 2403.10786v1 null
2024-03-15 Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation Anton Pelykh et.al. 2403.10731v1 null
2024-03-15 Debiasing with Diffusion: Probabilistic reconstruction of Dark Matter fields from galaxies with CAMELS Victoria Ono et.al. 2403.10648v1 null
2024-03-15 LightIt: Illumination Modeling and Control for Diffusion Models Peter Kocsis et.al. 2403.10615v1 null
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding Pengkun Liu et.al. 2403.10395v1 link
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348v1 null
2024-03-15 Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis Jai Tushar et.al. 2403.10282v1 null
2024-03-15 Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder Jinseok Kim et.al. 2403.10255v1 null
2024-03-15 FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model Qijun Feng et.al. 2403.10242v1 null
2024-03-15 BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution Feng Li et.al. 2403.10211v1 link
2024-03-15 Spectral CT Two-step and One-step Material Decomposition using Diffusion Posterior Sampling Corentin Vazia et.al. 2403.10183v1 null
2024-03-15 Animate Your Motion: Turning Still Images into Dynamic Videos Mingxiao Li et.al. 2403.10179v1 null
2024-03-15 Being heterogeneous is disadvantageous: Brownian non-Gaussian searches Vittoria Sposini et.al. 2403.10138v1 null
2024-03-15 DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration Nan Gao et.al. 2403.10098v1 null
2024-03-15 RangeLDM: Fast Realistic LiDAR Point Cloud Generation Qianjiang Hu et.al. 2403.10094v1 null
2024-03-15 SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model Tao Wu et.al. 2403.10044v1 null
2024-03-15 ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images Xiangtian Xue et.al. 2403.10004v1 null
2024-03-15 Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting Zhiqi Li et.al. 2403.09981v1 null
2024-03-14 ProMark: Proactive Diffusion Watermarking for Causal Attribution Vishal Asnani et.al. 2403.09914v1 null
2024-03-14 DTG : Diffusion-based Trajectory Generation for Mapless Global Navigation Jing Liang et.al. 2403.09900v1 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623v1 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616v1 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471v1 null
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468v1 link
2024-03-14 Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk Zhangheng Li et.al. 2403.09450v1 link
2024-03-14 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation Frank Zhang et.al. 2403.09439v1 null
2024-03-14 LM2D: Lyrics- and Music-Driven Dance Synthesis Wenjie Yin et.al. 2403.09407v1 null
2024-03-14 Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction Hanyu Chen et.al. 2403.09355v1 null
2024-03-14 HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation Duotun Wang et.al. 2403.09326v1 null
2024-03-14 Regularity and trend to equilibrium for a non-local advection-diffusion model of active particles Luca Alasio et.al. 2403.09282v1 null
2024-03-14 XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model Anees Ur Rehman Hashmi et.al. 2403.09240v1 null
2024-03-14 Intention-driven Ego-to-Exo Video Generation Hongchen Luo et.al. 2403.09194v1 null
2024-03-14 Intention-aware Denoising Diffusion Model for Trajectory Prediction Chen Liu et.al. 2403.09190v1 null
2024-03-14 Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts Byeongjun Park et.al. 2403.09176v1 link
2024-03-14 Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior Cheng Chen et.al. 2403.09140v1 null
2024-03-14 Rethinking Referring Object Removal Xiangtian Xue et.al. 2403.09128v1 null
2024-03-14 StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control Jaerin Lee et.al. 2403.09055v1 link
2024-03-13 Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images Giuseppe Cartella et.al. 2403.08933v1 link
2024-03-13 Envision3D: One Image to 3D with Anchor Views Interpolation Yatian Pang et.al. 2403.08902v1 link
2024-03-13 Federated Data Model Xiao Chen et.al. 2403.08887v1 null
2024-03-13 NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation PengFei Zheng et.al. 2403.08840v1 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758v1 null
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749v1 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733v2 null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728v1 link
2024-03-13 Data Augmentation in Human-Centric Vision Wentao Jiang et.al. 2403.08650v1 null
2024-03-13 ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos Lei Shi et.al. 2403.08591v1 null
2024-03-13 Federated Knowledge Graph Unlearning via Diffusion Model Bingchen Liu et.al. 2403.08554v1 null
2024-03-13 Model Will Tell: Training Membership Inference for Diffusion Models Xiaomeng Fu et.al. 2403.08487v1 null
2024-03-13 MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction Linjie Fu et.al. 2403.08479v1 link
2024-03-13 An Analysis of Human Alignment of Latent Diffusion Models Lorenz Linhardt et.al. 2403.08469v1 null
2024-03-13 Diffusion Models with Implicit Guidance for Medical Anomaly Detection Cosmin I. Bercea et.al. 2403.08464v1 link
2024-03-13 Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model Ruibin Zhang et.al. 2403.08460v1 null
2024-03-13 PFStorer: Personalized Face Restoration and Super-Resolution Tuomas Varanka et.al. 2403.08436v1 null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407v1 null
2024-03-13 Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models Pengze Zhang et.al. 2403.08381v1 link
2024-03-13 Mitigate Target-level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling Haoqing Li et.al. 2403.08380v1 link
2024-03-13 VIGFace: Virtual Identity Generation Model for Face Image Synthesis Minsoo Kim et.al. 2403.08277v1 null
2024-03-13 Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models Jian Lin et.al. 2403.08266v1 null
2024-03-13 Make Me Happier: Evoking Emotions Through Image Diffusion Models Qing Lin et.al. 2403.08255v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-12 MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model Guibo Luo et.al. 2403.07838v1 null
2024-03-13 SemCity: Semantic Scene Generation with Triplane Diffusion Jumin Lee et.al. 2403.07773v2 link
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764v1 null
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711v1 link
2024-03-12 Visual Privacy Auditing with Diffusion Models Kristian Schwethelm et.al. 2403.07588v1 null
2024-03-12 D4D: An RGBD diffusion model to boost monocular depth estimation L. Papa et.al. 2403.07516v1 link
2024-03-12 Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation Likun Li et.al. 2403.07500v1 null
2024-03-12 Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models Phuong Dam et.al. 2403.07371v1 null
2024-03-12 Efficient Diffusion Model for Image Restoration by Residual Shifting Zongsheng Yue et.al. 2403.07319v1 link
2024-03-12 It's All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234v1 link
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214v1 null
2024-03-11 3M-Diffusion: Latent Multi-Modal Diffusion for Text-Guided Generation of Molecular Graphs Huaisheng Zhu et.al. 2403.07179v1 null
2024-03-11 One Category One Prompt: Dataset Distillation using Diffusion Models Ali Abbasi et.al. 2403.07142v1 null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976v1 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973v1 null
2024-03-11 POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations Bosco Garcia-Archilla et.al. 2403.06967v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951v2 null
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940v1 null
2024-03-11 Estimation of parameters and local times in a discretely observed threshold diffusion model Sara Mazzonetto et.al. 2403.06858v1 null
2024-03-11 Multistep Consistency Models Jonathan Heek et.al. 2403.06807v1 null
2024-03-11 Distribution-Aware Data Expansion with Diffusion Models Haowei Zhu et.al. 2403.06741v1 link
2024-03-11 V3D: Video Diffusion Models are Effective 3D Generators Zilong Chen et.al. 2403.06738v1 link
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517v1 null
2024-03-11 Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning Woojung Han et.al. 2403.06516v1 null
2024-03-11 Incorporating Improved Sinusoidal Threshold-based Semi-supervised Method and Diffusion Models for Osteoporosis Diagnosis Wenchi Ke et.al. 2403.06498v1 null
2024-03-11 Are you sure? Modelling Drivers' Confidence Judgments in Left-Turn Gap Acceptance Decisions Arkady Zgonnikov et.al. 2403.06496v1 null
2024-03-13 Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation Guangyang Wu et.al. 2403.06452v2 null
2024-03-11 DivCon: Divide and Conquer for Progressive Text-to-Image Generation Yuhao Jia et.al. 2403.06400v1 link
2024-03-13 FSViewFusion: Few-Shots View Generation of Novel Objects Rukhshanda Hussain et.al. 2403.06394v2 null
2024-03-11 Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models Yang Zhang et.al. 2403.06381v1 null
2024-03-12 Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style Shuai Tan et.al. 2403.06365v2 null
2024-03-10 Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu et.al. 2403.06328v1 null
2024-03-10 Spectral Diffusion Posterior Sampling for Synergistic Reconstruction in Spectral Computed Tomography Corentin Vazia et.al. 2403.06308v1 null
2024-03-12 Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond Wenpin Tang et.al. 2403.06279v2 null
2024-03-10 FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing Youyuan Zhang et.al. 2403.06269v1 null
2024-03-10 DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation Xiaobin Hu et.al. 2403.06168v1 null
2024-03-10 Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation Paweł A. Pierzchlewicz et.al. 2403.06164v1 link
2024-03-10 MACE: Mass Concept Erasure in Diffusion Models Shilin Lu et.al. 2403.06135v1 link
2024-03-10 VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Wenhao Wang et.al. 2403.06098v1 link
2024-03-10 Diffusion Models Trained with Large Data Are Transferable Visual Models Guangkai Xu et.al. 2403.06090v1 null
2024-03-10 Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising Yuang Wang et.al. 2403.06069v1 null
2024-03-12 Decoupled Data Consistency with Diffusion Purification for Image Restoration Xiang Li et.al. 2403.06054v2 null
2024-03-09 CoNFiLD: Conditional Neural Field Latent Diffusion Model Generating Spatiotemporal Turbulence Pan Du et.al. 2403.05940v1 null
2024-03-12 SEMRes-DDPM: Residual Network Based Diffusion Modelling Applied to Imbalanced Data Ming Zheng et.al. 2403.05918v2 null
2024-03-09 Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines Michael Toker et.al. 2403.05846v1 null
2024-03-12 An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data Yudong Yang et.al. 2403.05820v2 null
2024-03-09 Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution Junxiong Lin et.al. 2403.05808v1 null
2024-03-09 Privacy-Preserving Diffusion Model Using Homomorphic Encryption Yaojian Chen et.al. 2403.05794v1 null
2024-03-09 Large Generative Model Assisted 3D Semantic Communication Feibo Jiang et.al. 2403.05783v1 null
2024-03-09 MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning Process Xinyao Fan et.al. 2403.05751v1 link
2024-03-08 Non-robustness of diffusion estimates on networks with measurement error Arun G. Chandrasekhar et.al. 2403.05704v1 null
2024-03-08 Audio-Synchronized Visual Animation Lin Zhang et.al. 2403.05659v1 null
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438v1 link
2024-03-08 DiffSF: Diffusion Models for Scene Flow Estimation Yushan Zhang et.al. 2403.05327v1 null
2024-03-08 Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI Shoujin Huang et.al. 2403.05245v1 null
2024-03-08 Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang et.al. 2403.05239v1 null
2024-03-08 Denoising Autoregressive Representation Learning Yazhe Li et.al. 2403.05196v1 null
2024-03-08 DiffuLT: How to Make Diffusion Model Useful for Long-tail Recognition Jie Shao et.al. 2403.05170v1 null
2024-03-08 GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting Francesco Palandra et.al. 2403.05154v1 null
2024-03-08 Improving Diffusion Models for Virtual Try-on Yisol Choi et.al. 2403.05139v1 null
2024-03-08 ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Xiwei Hu et.al. 2403.05135v1 null
2024-03-08 CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion Wendi Zheng et.al. 2403.05121v1 null
2024-03-08 Face2Diffusion for Fast and Editable Face Personalization Kaede Shiohara et.al. 2403.05094v1 link
2024-03-08 Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile Seokjun Lee et.al. 2403.05093v1 null
2024-03-08 Improving Diffusion-Based Generative Models via Approximated Optimal Transport Daegyu Kim et.al. 2403.05069v1 null
2024-03-08 XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution Yunpeng Qu et.al. 2403.05049v1 null
2024-03-08 BjTT: A Large-scale Multimodal Dataset for Traffic Prediction Chengyang Zhang et.al. 2403.05029v1 link
2024-03-08 InstructGIE: Towards Generalizable Image Editing Zichong Meng et.al. 2403.05018v1 null
2024-03-08 DiffClass: Diffusion-Based Class Incremental Learning Zichong Meng et.al. 2403.05016v1 null
2024-03-08 RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction Peng Liu et.al. 2403.05010v1 link
2024-03-08 StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models Lezhong Wang et.al. 2403.04965v1 null
2024-03-07 AFreeCA: Annotation-Free Counting for All Adriano D'Alessandro et.al. 2403.04943v1 null
2024-03-07 An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control Aosong Feng et.al. 2403.04880v1 null
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701v1 null
2024-03-07 Delving into the Trajectory Long-tail Distribution for Muti-object Tracking Sijia Chen et.al. 2403.04700v1 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692v1 null
2024-03-08 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634v2 null
2024-03-07 A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images Cristiana Tiago et.al. 2403.04612v1 null
2024-03-07 Anatomy-Guided Surface Diffusion Model for Alzheimer's Disease Normative Modeling Jianwei Zhang et.al. 2403.04531v1 null
2024-03-07 Effect of turbulent diffusion in modeling anaerobic digestion Jeremy Z. Yan et.al. 2403.04457v1 null
2024-03-07 Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser Qingyuan Cai et.al. 2403.04444v1 null
2024-03-07 StableDrag: Stable Dragging for Point-based Image Editing Yutao Cui et.al. 2403.04437v1 null
2024-03-07 On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks Bingkun Lai et.al. 2403.04430v1 null
2024-03-07 Controllable Generation with Text-to-Image Diffusion Models: A Survey Pu Cao et.al. 2403.04279v1 link
2024-03-06 PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement Zhijie Wang et.al. 2403.04014v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881v1 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852v1 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726v1 null
2024-03-06 Efficient Search and Learning for Agile Locomotion on Stepping Stones Adithya Kumar Chinnakkonda Ravi et.al. 2403.03639v1 null
2024-03-06 Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation Benedikt Fesl et.al. 2403.03545v1 link
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485v1 null
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463v1 null
2024-03-06 Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing Bingyan Liu et.al. 2403.03431v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194v1 null
2024-03-05 NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Zeqian Ju et.al. 2403.03100v1 null
2024-03-05 Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings Naoya Torii et.al. 2403.03012v1 null
2024-03-05 Cross-Domain Image Conversion by CycleDM Sho Shimotsumagari et.al. 2403.02919v1 null
2024-03-05 MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model Sen Wang et.al. 2403.02905v1 null
2024-03-05 Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders Daniele Mari et.al. 2403.02887v1 null
2024-03-05 Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement Jinhong He et.al. 2403.02879v1 null
2024-03-05 Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation Keke Huang et.al. 2403.02867v1 null
2024-03-05 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation Weijie Li et.al. 2403.02827v1 null
2024-03-05 Fast, Scale-Adaptive, and Uncertainty-Aware Downscaling of Earth System Model Fields with Generative Foundation Models Philipp Hess et.al. 2403.02774v1 null
2024-03-05 Few-shot Learner Parameterization by Diffusion Time-steps Zhongqi Yue et.al. 2403.02649v1 null
2024-03-05 Semantic Human Mesh Reconstruction with Textures Xiaoyu Zhan et.al. 2403.02561v1 null
2024-03-05 Updating the Minimum Information about CLinical Artificial Intelligence (MI-CLAIM) checklist for generative modeling research Brenda Y. Miao et.al. 2403.02558v1 link
2024-03-06 UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control Xuweiyi Chen et.al. 2403.02332v3 link
2024-03-04 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors Fangzhou Hong et.al. 2403.02234v1 link
2024-03-04 DragTex: Generative Point-Based Texture Editing on 3D Mesh Yudi Zhang et.al. 2403.02217v1 null
2024-03-04 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Jiaxiang Cheng et.al. 2403.02084v1 link
2024-03-04 FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio Chao Xu et.al. 2403.01901v1 link
2024-03-04 ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models Lukas Höllein et.al. 2403.01807v1 link
2024-03-07 OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on Yuhao Xu et.al. 2403.01779v2 link
2024-03-04 Differentially Private Synthetic Data via Foundation Model APIs 2: Text Chulin Xie et.al. 2403.01749v1 link
2024-03-04 Soft-constrained Schrodinger Bridge: a Stochastic Control Approach Jhanvi Garg et.al. 2403.01717v1 null
2024-03-04 HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances Supreeth Narasimhaswamy et.al. 2403.01693v1 null
2024-03-07 Reaction-diffusion models of biological invasion: Open source computational tools, key concepts and analysis Matthew J Simpson et.al. 2403.01667v4 link
2024-03-03 Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models Yuchen Wu et.al. 2403.01639v1 null
2024-03-03 Critical windows: non-asymptotic theory for feature emergence in diffusion models Marvin Li et.al. 2403.01633v1 null
2024-03-03 Neural Graph Generator: Feature-Conditioned Graph Generation using Latent Diffusion Models Iakovos Evdaimon et.al. 2403.01535v1 link
2024-03-03 SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation Hongjian Liu et.al. 2403.01505v1 null
2024-03-03 Learning A Physical-aware Diffusion Model Based on Transformer for Underwater Image Enhancement Chen Zhao et.al. 2403.01497v1 null
2024-03-03 Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection Sam Dauncey et.al. 2403.01485v1 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226v1 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212v1 null
2024-03-02 Training Unbiased Diffusion Models From Biased Dataset Yeongmin Kim et.al. 2403.01189v1 link
2024-03-02 Volume diffusion modelling of a sheared granular gas Duncan Dockar et.al. 2403.01188v1 null
2024-03-02 Text-guided Explorable Image Super-resolution Kanchana Vaishnavi Gandikota et.al. 2403.01124v1 null
2024-03-02 Face Swap via Diffusion Model Feifei Wang et.al. 2403.01108v1 null
2024-03-01 A time-stepping deep gradient flow method for option pricing in (rough) diffusion models Antonis Papapantoleon et.al. 2403.00746v1 null
2024-03-01 Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks Yuhao Liu et.al. 2403.00644v1 null
2024-03-01 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Ander Salaberria et.al. 2403.00587v1 link
2024-03-01 Rethinking cluster-conditioned diffusion models Nikolas Adaloglou et.al. 2403.00570v1 null
2024-03-01 Waves, patterns and bifurcations: a tutorial review on the vertebrate segmentation clock Paul François et.al. 2403.00457v1 null
2024-03-01 An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels Shumpei Takezaki et.al. 2403.00452v1 null
2024-03-01 LoMOE: Localized Multi-Object Editing via Multi-Diffusion Goirik Chakrabarty et.al. 2403.00437v1 null
2024-03-01 Abductive Ego-View Accident Video Understanding for Safe Driving Perception Jianwu Fang et.al. 2403.00436v1 null
2024-03-01 HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation Zhiying Leng et.al. 2403.00372v1 null
2024-03-05 Robust Policy Learning via Offline Skill Diffusion Woo Kyung Kim et.al. 2403.00225v2 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 null
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470v1 null
2024-02-29 Listening to the Noise: Blind Denoising with Gibbs Diffusion David Heurtel-Depeiges et.al. 2402.19455v1 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369v1 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330v1 null
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302v1 link
2024-02-29 TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings Alexander Shabalin et.al. 2402.19097v1 null
2024-03-01 Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach Sarina Thomas et.al. 2402.19062v2 null
2024-02-29 WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis Paul Friedrich et.al. 2402.19043v1 link
2024-02-29 Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding Guangyi Liu et.al. 2402.19009v1 null
2024-02-29 ViewFusion: Towards Multi-View Consistency via Interpolated Denoising Xianghui Yang et.al. 2402.18842v1 link
2024-03-03 Extended Flow Matching: a Method of Conditional Generation with Generalized Continuity Equation Noboru Isobe et.al. 2402.18839v2 null
2024-02-29 A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D Xiaohan Fei et.al. 2402.18780v1 null
2024-03-04 Exploring Privacy and Fairness Risks in Sharing Diffusion Models: An Adversarial Perspective Xinjian Luo et.al. 2402.18607v2 null
2024-02-28 Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations Elie Abdo et.al. 2402.18572v1 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491v1 null
2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking Generalization Gabriele Corso et.al. 2402.18396v1 link
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362v1 null
2024-02-28 FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes Ziying Pan et.al. 2402.18331v1 link
2024-02-28 Balancing Act: Distribution-Guided Debiasing in Diffusion Models Rishubh Parihar et.al. 2402.18206v1 null
2024-02-28 Diffusion-based Neural Network Weights Generation Bedionita Soro et.al. 2402.18153v1 null
2024-02-28 Context-aware Talking Face Video Generation Meidai Xuanyuan et.al. 2402.18092v1 null
2024-02-28 Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Yanzuo Lu et.al. 2402.18078v1 link
2024-03-05 SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model Bin Cao et.al. 2402.18068v2 null
2024-02-28 Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints Lingkai Kong et.al. 2402.18012v1 null
2024-03-01 Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning Zeyang Liu et.al. 2402.17978v2 null
2024-02-27 Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models Ashkan Taghipour et.al. 2402.17910v1 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768v1 null
2024-03-04 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563v2 null
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525v1 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517v1 link
2024-02-27 EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485v1 null
2024-02-28 DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models Shyam Marjit et.al. 2402.17412v2 null
2024-02-27 Generative diffusion model for surface structure discovery Nikolaj Rønne et.al. 2402.17404v1 null
2024-02-27 Denoising Diffusion Models for Inpainting of Healthy Brain Tissue Alicia Durrer et.al. 2402.17307v1 null
2024-02-27 DivAvatar: Diverse 3D Avatar Generation with a Single Prompt Weijing Tao et.al. 2402.17292v1 null
2024-02-27 Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network Zhaoyang Wang et.al. 2402.17285v1 null
2024-02-29 DiFashion: Towards Personalized Outfit Generation and Recommendation Yiyan Xu et.al. 2402.17279v2 null
2024-02-27 One-Shot Structure-Aware Stylized Image Synthesis Hansam Cho et.al. 2402.17275v1 null
2024-02-27 Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation Daiqing Li et.al. 2402.17245v1 null
2024-02-28 CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization Hao-Yang Peng et.al. 2402.17214v2 null
2024-02-27 Generative Learning for Forecasting the Dynamics of Complex Systems Han Gao et.al. 2402.17157v1 null
2024-02-27 TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation Lin Zongying et.al. 2402.17156v1 link
2024-02-27 SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution Chengcheng Wang et.al. 2402.17133v1 link
2024-03-01 Transparent Image Layer Diffusion using Latent Transparency Lvmin Zhang et.al. 2402.17113v3 link
2024-03-01 Renormalization Group flow, Optimal Transport and Diffusion-based Generative Model Artan Sheshmani et.al. 2402.17090v2 null
2024-02-26 A Phase Transition in Diffusion Models Reveals the Hierarchical Nature of Data Antonio Sclocchi et.al. 2402.16991v1 null
2024-02-25 Diffusion Posterior Proximal Sampling for Image Restoration Hongjie Wu et.al. 2402.16907v1 null
2024-02-26 Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing Ling Yang et.al. 2402.16627v1 link
2024-02-27 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v2 null
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421v1 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392v1 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369v1 null
2024-02-27 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359v2 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302v1 link
2024-02-25 Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation Christopher Wiedeman et.al. 2402.16212v1 null
2024-02-25 Towards Efficient Quantum Hybrid Diffusion Models Francesca De Falco et.al. 2402.16147v1 null
2024-02-25 Cinematographic Camera Diffusion Model Hongda Jiang et.al. 2402.16143v1 null
2024-02-25 Behavioral Refinement via Interpolant-based Policy Diffusion Kaiqi Chen et.al. 2402.16075v1 null
2024-02-24 HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models Li Pang et.al. 2402.15865v1 link
2024-02-23 Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions Kaihong Zhang et.al. 2402.15602v1 null
2024-02-21 The Bass diffusion model: agent-based implementation on arbitrary networks L. Di Lucchio et.al. 2402.15528v1 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429v1 link
2024-02-23 Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289v1 link
2024-02-23 Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes Blanca Climent-Ezquerra et.al. 2402.15221v1 null
2024-02-23 Label-efficient Multi-organ Segmentation Method with Diffusion Model Yongzhi Huang et.al. 2402.15216v1 null
2024-02-23 Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control Masatoshi Uehara et.al. 2402.15194v1 null
2024-02-23 Dynamics-Guided Diffusion Model for Robot Manipulator Design Xiaomeng Xu et.al. 2402.15038v1 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780v1 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577v1 null
2024-02-22 Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems Christina Schenk et.al. 2402.14446v1 null
2024-02-22 Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning Haoran He et.al. 2402.14407v1 null
2024-02-22 Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment Zhaoyang Wang et.al. 2402.14401v1 null
2024-02-22 Typographic Text Generation with Off-the-Shelf Diffusion Model KhayTze Peong et.al. 2402.14314v1 null
2024-02-22 Font Style Interpolation with Diffusion Models Tetta Kondo et.al. 2402.14311v1 null
2024-02-23 Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion Yujia Huang et.al. 2402.14285v2 link
2024-02-22 MVD $^2$ : Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253v1 null
2024-02-21 T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching Zizheng Pan et.al. 2402.14167v1 link
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901v1 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809v1 null
2024-02-26 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777v4 null
2024-02-21 Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion Lianghu Guo et.al. 2402.13776v1 null
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763v1 null
2024-02-21 SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model Xudong Ling et.al. 2402.13737v1 null
2024-02-21 Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation Kihong Kim et.al. 2402.13729v1 null
2024-02-21 Flexible Physical Camouflage Generation Based on a Differential Approach Yang Li et.al. 2402.13575v1 null
2024-02-21 ToDo: Token Downsampling for Efficient Generation of High-Resolution Images Ethan Smith et.al. 2402.13573v1 null
2024-02-21 Generative AI for Secure Physical Layer Communications: A Survey Changyuan Zhao et.al. 2402.13553v1 null
2024-02-21 DiffPLF: A Conditional Diffusion Model for Probabilistic Forecasting of EV Charging Load Siyang Li et.al. 2402.13548v1 link
2024-02-21 Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models Chen Wu et.al. 2402.13490v1 null
2024-02-20 Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control Denis Lukovnikov et.al. 2402.13404v1 null
2024-02-20 The Uncanny Valley: A Comprehensive Analysis of Diffusion Models Karam Ghanem et.al. 2402.13369v1 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144v1 link
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040v1 link
2024-02-21 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974v2 null
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927v1 null
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908v1 link
2024-02-20 Two-stage Rainfall-Forecasting Diffusion Model XuDong Ling et.al. 2402.12779v1 link
2024-02-20 MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion Sen Li et.al. 2402.12741v1 link
2024-02-20 Diffusion Posterior Sampling is Computationally Intractable Shivam Gupta et.al. 2402.12727v1 null
2024-02-20 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Shitao Tang et.al. 2402.12712v1 null
2024-02-20 SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion Liumeng Xue et.al. 2402.12660v1 link
2024-02-20 DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation Takuya Ikeda et.al. 2402.12647v1 null
2024-02-19 Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning Kaan Ozkara et.al. 2402.12537v1 null
2024-02-22 Improving Deep Generative Models on Many-To-One Image-to-Image Translation Sagar Saxena et.al. 2402.12531v2 null
2024-02-19 On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models Miri Varshavsky Hassid et.al. 2402.12423v1 null
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242v1 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187v1 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099v1 null
2024-02-19 Direct Consistency Optimization for Compositional Text-to-Image Personalization Kyungmin Lee et.al. 2402.12004v1 null
2024-02-19 Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models Zihao Luo et.al. 2402.11989v1 link
2024-02-19 DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Chong Zeng et.al. 2402.11929v1 null
2024-02-20 A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning Yuan Yuan et.al. 2402.11922v2 link
2024-02-19 ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image Yan Hong et.al. 2402.11849v1 null
2024-02-19 UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models Yihua Zhang et.al. 2402.11846v1 link
2024-02-19 WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection Yan Hong et.al. 2402.11843v1 null
2024-02-19 Statistical Test for Generated Hypotheses by Diffusion Models Teruyuki Katsuoka et.al. 2402.11789v1 null
2024-02-19 Towards Theoretical Understandings of Self-Consuming Generative Models Shi Fu et.al. 2402.11778v1 null
2024-02-18 SDiT: Spiking Diffusion Model with Transformer Shu Yang et.al. 2402.11588v1 null
2024-02-18 CaloGraph: Graph-based diffusion model for fast shower generation in calorimeters with irregular geometry Dmitrii Kobylianskii et.al. 2402.11575v1 null
2024-02-18 Temporal Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation Yakun Chen et.al. 2402.11558v1 null
2024-02-18 Visual Concept-driven Image Generation with Text-to-Image Diffusion Model Tanzila Rahman et.al. 2402.11487v1 null
2024-02-17 Partial Ly $α$ thermalization in an analytic nonlinear diffusion model Georg Wolschin et.al. 2402.11320v1 null
2024-02-17 TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method Chenyan Zhang et.al. 2402.11274v1 link
2024-02-17 DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model Yu Feng et.al. 2402.11241v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821v1 link
2024-02-16 VATr++: Choose Your Words Wisely for Handwritten Text Generation Bram Vanherle et.al. 2402.10798v1 null
2024-02-16 Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Hongbin Na et.al. 2402.10699v1 null
2024-02-16 Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving Patrick Ebel et.al. 2402.10664v1 null
2024-02-16 Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model Xiangyu Zhang et.al. 2402.10642v1 null
2024-02-16 U $^2$ MRPD: Unsupervised undersampled MRI reconstruction by prompting a large latent diffusion model Ziqi Gao et.al. 2402.10609v1 null
2024-02-16 A maximum likelihood estimation of Lévy-driven stochastic systems for univariate and multivariate time series of observations Babak M. S. Arani et.al. 2402.10608v1 null
2024-02-16 Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation Lanqing Guo et.al. 2402.10491v1 link
2024-02-16 Explaining generative diffusion models via visual analysis for interpretable decision-making process Ji-Hoon Park et.al. 2402.10404v1 null
2024-02-20 GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting Chen Yang et.al. 2402.10259v2 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-19 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v2 null
2024-02-20 Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model Mariia Drozdova et.al. 2402.10204v2 link
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095v1 null
2024-02-15 Diffusion Models Meet Contextual Bandits with Large Action Spaces Imad Aouali et.al. 2402.10028v1 null
2024-02-16 Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Hila Manor et.al. 2402.10009v2 null
2024-02-15 Accelerating Parallel Sampling of Diffusion Models Zhiwei Tang et.al. 2402.09970v1 null
2024-02-15 Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation Junjie Shentu et.al. 2402.09966v1 link
2024-02-15 Lester: rotoscope animation through video object segmentation and tracking Ruben Tous et.al. 2402.09883v1 link
2024-02-15 Diffusion Models for Audio Restoration Jean-Marie Lemercier et.al. 2402.09821v1 null
2024-02-15 DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization Jisu Nam et.al. 2402.09812v1 link
2024-02-15 Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement Tao Yang et.al. 2402.09712v1 null
2024-02-12 Rolling Diffusion Models David Ruhe et.al. 2402.09470v1 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242v1 link
2024-02-14 Semi-Supervised Diffusion Model for Brain Age Prediction Ayodeji Ijishakin et.al. 2402.09137v1 null
2024-02-14 L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Yutaro Yamada et.al. 2402.09052v1 null
2024-02-14 Extreme Video Compression with Pre-trained Diffusion Models Bohan Li et.al. 2402.08934v1 link
2024-02-14 The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes Myeongseob Ko et.al. 2402.08922v1 null
2024-02-13 Percolating transition to turbulence without puffs or bands Sébastien Gomé et.al. 2402.08829v1 null
2024-02-13 LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models Angus Fung et.al. 2402.08774v1 null
2024-02-13 Towards the Detection of AI-Synthesized Human Face Images Yuhang Lu et.al. 2402.08750v1 null
2024-02-13 PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models Fei Deng et.al. 2402.08714v1 null
2024-02-13 Zero Shot Molecular Generation via Similarity Kernels Rokas Elijošius et.al. 2402.08708v1 link
2024-02-13 Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? Guilherme S. Y. Giardini et.al. 2402.08681v1 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667v1 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654v1 null
2024-02-14 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563v2 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552v1 null
2024-02-13 A Dense Reward View on Aligning Text-to-Image Diffusion with Preference Shentao Yang et.al. 2402.08265v1 link
2024-02-13 Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation AprilPyone MaungMaung et.al. 2402.08200v1 null
2024-02-14 Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization Hongrui Chen et.al. 2402.08095v2 null
2024-02-12 Nearest Neighbour Score Estimators for Diffusion Generative Models Matthew Niedoba et.al. 2402.08018v1 null
2024-02-12 Towards a mathematical theory for consistency training in diffusion models Gen Li et.al. 2402.07802v1 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754v1 null
2024-02-12 Cosmology at the Field Level with Probabilistic Machine Learning Adam Rouhiainen et.al. 2402.07694v1 null
2024-02-12 Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback Cansu Korkmaz et.al. 2402.07597v1 null
2024-02-12 Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial Wenpin Tang et.al. 2402.07487v1 null
2024-02-13 SALAD: Smart AI Language Assistant Daily Ragib Amin Nihal et.al. 2402.07431v2 null
2024-02-12 Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation Tonglong Wei et.al. 2402.07369v1 null
2024-02-15 Re-DiffiNet: Modeling discrepancies loss in tumor segmentation using diffusion models Tianyi Ren et.al. 2402.07354v3 null
2024-02-11 Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL Sungyoon Kim et.al. 2402.07226v1 link
2024-02-13 Towards Fast Stochastic Sampling in Diffusion Generative Models Kushagra Pandey et.al. 2402.07211v2 null
2024-02-10 Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models Ayman Abaid et.al. 2402.06969v1 null
2024-02-09 Towards Principled Assessment of Tabular Data Synthesis Algorithms Yuntao Du et.al. 2402.06806v1 link
2024-02-08 Social Physics Informed Diffusion Model for Crowd Simulation Hongyi Chen et.al. 2402.06680v1 link
2024-02-06 Weather Prediction with Diffusion Guided by Realistic Forecast Processes Zhanxiang Hua et.al. 2402.06666v1 null
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559v1 null
2024-02-15 Sequential Flow Straightening for Generative Modeling Jongmin Yoon et.al. 2402.06461v2 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446v1 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436v1 null
2024-02-09 Particle Denoising Diffusion Sampler Angus Phillips et.al. 2402.06320v1 link
2024-02-09 Controllable seismic velocity synthesis using generative diffusion models Fu Wang et.al. 2402.06277v1 null
2024-02-09 MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models Yixiao Zhang et.al. 2402.06178v1 null
2024-02-08 CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models Maitreya Suin et.al. 2402.06106v1 null
2024-02-08 Animated Stickers: Bringing Stickers to Life with Video Diffusion David Yan et.al. 2402.06088v1 null
2024-02-08 DiscDiff: Latent Diffusion Model for DNA Sequence Generation Zehui Li et.al. 2402.06079v1 null
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803v1 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712v1 link
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608v1 link
2024-02-08 Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models Senmao Li et.al. 2402.05375v1 link
2024-02-08 Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model Junghun Cha et.al. 2402.05350v1 null
2024-02-07 SPAD : Spatially Aware Multiview Diffusers Yash Kant et.al. 2402.05235v1 null
2024-02-09 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Nicholas Konz et.al. 2402.05210v2 link
2024-02-07 ** $λ$ -ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space** Maitreya Patel et.al. 2402.05195v1 null
2024-02-13 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v2 link
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073v1 null
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054v1 null
2024-02-07 Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design Andrew Campbell et.al. 2402.04997v1 link
2024-02-07 Blue noise for diffusion models Xingchang Huang et.al. 2402.04930v1 null
2024-02-07 Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation Shivang Chopra et.al. 2402.04929v1 null
2024-02-07 Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints Jian Chen et.al. 2402.04754v1 link
2024-02-07 Cortical Surface Diffusion Generative Models Zhenshan Xie et.al. 2402.04753v1 null
2024-02-07 EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions Shashank Kotyan et.al. 2402.04699v1 link
2024-02-07 Noise Map Guidance: Inversion with Spatial Context for Real Image Editing Hansam Cho et.al. 2402.04625v1 link
2024-02-07 BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception Aniket Roy et.al. 2402.04541v1 link
2024-02-07 Text2Street: Controllable Text-to-image Generation for Street Views Jinming Su et.al. 2402.04504v1 null
2024-02-06 Fine-Tuned Language Models Generate Stable Inorganic Materials as Text Nate Gruver et.al. 2402.04379v1 link
2024-02-06 Bidirectional Autoregressive Diffusion Model for Dance Generation Canyu Zhang et.al. 2402.04356v1 null
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031v1 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992v1 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981v1 null
2024-02-03 IMUSIC: IMU-based Facial Expression Capture Youjia Wang et.al. 2402.03944v1 null
2024-02-06 EscherNet: A Generative Model for Scalable View Synthesis Xin Kong et.al. 2402.03908v1 null
2024-02-06 On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models Christian Horvat et.al. 2402.03845v1 null
2024-02-06 SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising Yu-Tung Liu et.al. 2402.03808v1 link
2024-02-06 FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution Qi Zhou et.al. 2402.03705v1 null
2024-02-06 Improving and Unifying Discrete&Continuous-time Discrete Denoising Diffusion Lingxiao Zhao et.al. 2402.03701v1 null
2024-02-06 Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation Lingxiao Zhao et.al. 2402.03687v1 null
2024-02-06 QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning Haoxuan Wang et.al. 2402.03666v1 null
2024-02-11 Diffusion World Model Zihan Ding et.al. 2402.03570v2 null
2024-02-05 Projected Generative Diffusion Models for Constraint Satisfaction Jacob K Christopher et.al. 2402.03559v1 null
2024-02-05 AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising Maham Tanveer et.al. 2402.03549v1 null
2024-02-05 Hyper-Diffusion: Estimating Epistemic and Aleatoric Uncertainty with a Single Model Matthew A. Chan et.al. 2402.03478v1 null
2024-02-05 Denoising Diffusion via Image-Based Rendering Titas Anciukevicius et.al. 2402.03445v1 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305v1 null
2024-02-07 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292v2 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290v1 link
2024-02-06 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214v2 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207v1 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201v1 null
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162v1 null
2024-02-05 PFDM: Parser-Free Virtual Try-on via Diffusion Model Yunfang Niu et.al. 2402.03047v1 null
2024-02-05 Diffusive Gibbs Sampling Wenlin Chen et.al. 2402.03008v1 null
2024-02-05 DexDiffuser: Generating Dexterous Grasps with Diffusion Models Zehang Weng et.al. 2402.02989v1 null
2024-02-05 Retrieval-Augmented Score Distillation for Text-to-3D Generation Junyoung Seo et.al. 2402.02972v1 null
2024-02-05 ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis Bernard Spiegl et.al. 2402.02906v1 link
2024-02-05 SynthVision -- Harnessing Minimal Input for Maximal Output in Computer Vision Models using Synthetic Image data Yudara Kularathne et.al. 2402.02826v1 null
2024-02-05 Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun et.al. 2402.02800v1 link
2024-02-06 Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning Yixiang Shan et.al. 2402.02772v2 null
2024-02-05 DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models Yang Sui et.al. 2402.02739v1 null
2024-02-04 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Chong Mou et.al. 2402.02583v1 link
2024-02-04 Latent Graph Diffusion: A Unified Framework for Generation and Prediction on Graphs Zhou Cai et.al. 2402.02518v1 null
2024-02-04 PoCo: Policy Composition from and for Heterogeneous Robot Learning Lirui Wang et.al. 2402.02511v1 null
2024-02-04 PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal Tao Wang et.al. 2402.02374v1 link
2024-02-07 Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models Fangzhao Zhang et.al. 2402.02347v2 link
2024-02-04 Closed-Loop Unsupervised Representation Disentanglement with $β$ -VAE Distillation and Diffusion Probabilistic Feedback Xin Jin et.al. 2402.02346v1 null
2024-02-04 Your Diffusion Model is Secretly a Certifiably Robust Classifier Huanran Chen et.al. 2402.02316v1 null
2024-02-03 Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance Xinyu Peng et.al. 2402.02149v1 link
2024-02-03 Risk-Sensitive Diffusion: Learning the Underlying Distribution from Noisy Samples Yangming Li et.al. 2402.02081v1 null
2024-02-03 DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication Yanjun Liu et.al. 2402.02060v1 null
2024-02-03 GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning Yaning Zhang et.al. 2402.02003v1 null
2024-02-06 Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization Fangzhao Zhang et.al. 2402.01965v2 null
2024-02-02 Robust Inverse Graphics via Probabilistic Inference Tuan Anh Le et.al. 2402.01915v1 null
2024-02-06 Carthago Delenda Est: Co-opetitive Indirect Information Diffusion Model for Influence Operations on Online Social Media Jwen Fai Low et.al. 2402.01905v2 null
2024-02-02 Mobile Fitting Room: On-device Virtual Try-on via Diffusion Models Justin Blalock et.al. 2402.01877v1 null
2024-02-01 Plug-and-Play image restoration with Stochastic deNOising REgularization Marien Renaud et.al. 2402.01779v1 link
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590v1 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566v1 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516v1 null
2024-02-02 Conditioning non-linear and infinite-dimensional diffusion processes Elizabeth Louise Baker et.al. 2402.01434v1 null
2024-02-02 Bass Accompaniment Generation via Latent Diffusion Marco Pasini et.al. 2402.01412v1 null
2024-02-02 Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors Dingcheng Yang et.al. 2402.01369v1 link
2024-02-02 Unsupervised Generation of Pseudo Normal PET from MRI with Diffusion Model for Epileptic Focus Localization Wentao Chen et.al. 2402.01191v1 null
2024-02-01 Unconditional Latent Diffusion Models Memorize Patient Imaging Data Salman Ul Hassan Dar et.al. 2402.01054v1 null
2024-02-01 pop-cosmos: A comprehensive picture of the galaxy population from COSMOS data Justin Alsing et.al. 2402.00935v1 null
2024-02-01 Data-Space Validation of High-Dimensional Models by Comparing Sample Quantiles Stephen Thorp et.al. 2402.00930v1 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864v1 link
2024-02-01 An Analysis of the Variance of Diffusion-based Speech Enhancement Bunlong Lay et.al. 2402.00811v1 null
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807v1 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769v1 link
2024-01-31 SeFi-IDE: Semantic-Fidelity Identity Embedding for Personalized Diffusion-Based Generation Yang Li et.al. 2402.00631v1 null
2024-02-01 Cylindrically symmetric diffusion model for relativistic heavy-ion collisions Johannes Hoelck et.al. 2402.00628v1 null
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627v1 link
2024-02-01 Masked Conditional Diffusion Model for Enhancing Deepfake Detection Tiewen Chen et.al. 2402.00541v1 null
2024-02-01 Energetic Particles in the Central Starburst, Disc, and Halo of NGC253 Yoel Rephaeli et.al. 2402.00523v1 null
2024-02-01 LRDif: Diffusion Models for Under-Display Camera Emotion Recognition Zhifeng Wang et.al. 2402.00250v1 null
2024-02-02 SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors Samuel Yuan et.al. 2402.00198v2 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085v1 null
2024-01-31 Ljusternik-Schnirelmann eigenvalues for the fractional $m-$ Laplacian without the $Δ_2$ condition Julian Fernandez Bonder et.al. 2401.18041v1 null
2024-01-31 Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations Qi-Zuo Wu et.al. 2401.17982v1 null
2024-01-31 Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances Xuefeng Gao et.al. 2401.17958v1 null
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879v1 null
2024-01-31 Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks Lucila G. Alvarez-Zuzek et.al. 2401.17846v1 null
2024-01-31 A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes M. Tavelli et.al. 2401.17806v1 null
2024-01-31 Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models Sifei Li et.al. 2401.17800v1 null
2024-01-31 Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation Yuanhuiyi Lyu et.al. 2401.17664v1 null
2024-01-31 Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models Kyungsung Lee et.al. 2401.17629v1 null
2024-01-31 Topology-Aware Latent Diffusion for 3D Shape Generation Jiangbei Hu et.al. 2401.17603v1 null
2024-01-31 Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model Yafei Dong et.al. 2401.17593v1 null
2024-01-31 Task-Oriented Diffusion Model Compression Geonung Kim et.al. 2401.17547v1 null
2024-01-31 Enhancing Score-Based Sampling Methods with Ensembles Tobias Bischoff et.al. 2401.17539v1 null
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258v1 null
2024-02-03 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212v2 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181v1 null
2024-01-30 PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering Rong Huang et.al. 2401.17120v1 null
2024-01-30 Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation Xiangcheng Zheng et.al. 2401.16885v1 null
2024-01-30 A Literature Review on Fetus Brain Motion Correction in MRI Haoran Zhang et.al. 2401.16782v1 null
2024-01-30 BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion Yonghao Yu et.al. 2401.16764v1 null
2024-01-30 Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization Henglei Lv et.al. 2401.16762v1 null
2024-01-30 Diffusion model for relational inference Shuhan Zheng et.al. 2401.16755v1 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459v1 null
2024-01-29 Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model Qiyao Peng et.al. 2401.16261v1 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224v1 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157v1 null
2024-01-29 DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems Youcheng Zeng et.al. 2401.16017v1 null
2024-01-31 Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling Xiaoyu Shi et.al. 2401.15977v2 null
2024-01-29 EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization Xueming Yan et.al. 2401.15931v1 null
2024-01-28 Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding Jianxiang Lu et.al. 2401.15708v1 null
2024-01-30 Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Qingcheng Zhao et.al. 2401.15687v2 null
2024-01-28 CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement Xiaowen Shi et.al. 2401.15649v1 null
2024-01-28 FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models Feihong He et.al. 2401.15636v1 null
2024-01-28 Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study Cong T. Nguyen et.al. 2401.15625v1 null
2024-01-28 Diffusion-based graph generative methods Hongyang Chen et.al. 2401.15617v1 link
2024-01-28 Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization Yinbin Han et.al. 2401.15604v1 null
2024-01-28 BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry Xiang Xu et.al. 2401.15563v1 null
2024-01-31 Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models Fabio Merizzi et.al. 2401.15469v2 null
2024-01-27 A Survey on Data Augmentation in Large Model Era Yue Zhou et.al. 2401.15422v1 link
2024-01-27 GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis Jing Hao et.al. 2401.15282v1 link
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075v1 link
2024-01-26 Text Image Inpainting via Global Structure-Guided Diffusion Models Shipeng Zhu et.al. 2401.14832v1 link
2024-01-25 Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory Dong Liu et.al. 2401.14506v1 null
2024-01-24 No Longer Trending on Artstation: Prompt Analysis of Generative AI Art Jon McCormack et.al. 2401.14425v1 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404v1 null
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398v1 link
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379v1 null
2024-01-27 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257v2 null
2024-01-26 Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs Rameshwar Mishra et.al. 2401.14111v2 null
2024-01-30 CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion Nisha Huang et.al. 2401.14066v2 null
2024-01-25 Diffusion-based Data Augmentation for Object Counting Problems Zhen Wang et.al. 2401.13992v1 null
2024-01-25 BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models Senthil Purushwalkam et.al. 2401.13974v1 null
2024-01-25 StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models Yalong Bai et.al. 2401.13942v1 null
2024-01-24 Inverse Molecular Design with Multi-Conditional Diffusion Guidance Gang Liu et.al. 2401.13858v1 link
2024-01-24 Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Mehmet Saygin Seyfioglu et.al. 2401.13795v1 null
2024-01-24 Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials Yanyan Yang et.al. 2401.13570v1 null
2024-01-25 UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion Wei Li et.al. 2401.13388v2 null
2024-01-31 Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model Zhelin Li et.al. 2401.13192v2 link
2024-01-24 Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model Yuanming Li et.al. 2401.13191v1 null
2024-01-24 Compositional Generative Inverse Design Tailin Wu et.al. 2401.13171v1 link
2024-01-24 Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation Cheng Jiang et.al. 2401.13162v1 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979v1 null
2024-01-24 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978v2 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945v1 null
2024-01-23 UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators Hengjia Li et.al. 2401.12596v1 null
2024-01-23 ToDA: Target-oriented Diffusion Attacker against Recommendation System Xiaohao Liu et.al. 2401.12578v1 null
2024-01-23 DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations Dogyun Park et.al. 2401.12517v1 null
2024-01-20 Large-scale Reinforcement Learning for Diffusion Models Yinan Zhang et.al. 2401.12244v1 null
2024-01-22 DITTO: Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2401.12179v1 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175v1 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949v1 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739v1 null
2024-01-22 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Ling Yang et.al. 2401.11708v1 link
2024-01-21 Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers Katherine Crowson et.al. 2401.11605v1 link
2024-01-20 Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient Weiguo Lu et.al. 2401.11261v1 null
2024-01-20 Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles Yanlong Zang et.al. 2401.11239v1 null
2024-01-24 MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation Nhat M. Hoang et.al. 2401.11115v3 null
2024-01-20 UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures Mingyuan Zhou et.al. 2401.11078v1 null
2024-01-20 Make-A-Shape: a Ten-Million-scale 3D Shape Model Ka-Hei Hui et.al. 2401.11067v1 null
2024-01-17 A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model Hao Yang et.al. 2401.10934v1 link
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889v1 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822v1 null
2024-01-19 From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models Tobias Friedrich et.al. 2401.10818v1 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786v1 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700v1 link
2024-01-19 MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images Rui Xu et.al. 2401.10561v1 null
2024-01-18 Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution Xin Yuan et.al. 2401.10404v1 null
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227v1 link
2024-01-22 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150v3 null
2024-01-18 DiffusionGPT: LLM-Driven Text-to-Image Generation System Jie Qin et.al. 2401.10061v1 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962v1 null
2024-01-18 BlenDA: Domain Adaptive Object Detection through diffusion-based blending Tzuhsuan Huang et.al. 2401.09921v1 null
2024-01-18 Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework Junkun Jiang et.al. 2401.09836v1 null
2024-01-18 Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing Gwanhyeong Koo et.al. 2401.09794v1 null
2024-01-18 Image Translation as Diffusion Visual Programmers Cheng Han et.al. 2401.09742v1 null
2024-01-17 Total fraction of drug released from diffusion-controlled delivery systems with binding reactions Elliot J. Carr et.al. 2401.09644v1 null
2024-01-17 Efficient generative adversarial networks using linear additive-attention Transformers Emilio Morales-Juarez et.al. 2401.09596v1 link
2024-01-17 TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion Yu-Ying Yeh et.al. 2401.09416v1 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 On the $\varepsilon$ -Euler-Maruyama scheme for time inhomogeneous jump-driven SDEs Mireille Bossy et.al. 2401.09338v1 null
2024-01-17 Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery Jia Jia et.al. 2401.09325v1 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294v1 null
2024-01-17 Training-Free Semantic Video Composition via Pre-trained Diffusion Model Jiaqi Guo et.al. 2401.09195v1 null
2024-01-17 Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior Zike Wu et.al. 2401.09050v1 null
2024-01-17 Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis Jonghyun Lee et.al. 2401.09048v1 link
2024-01-17 VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models Haoxin Chen et.al. 2401.09047v1 link
2024-01-21 Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation Tong Xie et.al. 2401.09031v2 null
2024-01-17 3D Human Pose Analysis via Diffusion Synthesis Haorui Ji et.al. 2401.08930v1 null
2024-01-16 Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive Yumeng Li et.al. 2401.08815v1 link
2024-01-16 Fixed Point Diffusion Models Xingjian Bai et.al. 2401.08741v1 null
2024-01-16 SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers Nanye Ma et.al. 2401.08740v1 link
2024-01-18 NODI: Out-Of-Distribution Detection with Noise from Diffusion Jingqiu Zhou et.al. 2401.08689v2 null
2024-01-16 RoHM: Robust Human Motion Reconstruction via Diffusion Siwei Zhang et.al. 2401.08570v1 null
2024-01-16 Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation Mathis Petrovich et.al. 2401.08559v1 null
2024-01-16 Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing Bin Zhang et.al. 2401.08275v1 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232v1 null
2024-01-16 Photonic Modes Prediction via Multi-Modal Diffusion Model Jinyang Sun et.al. 2401.08199v1 null
2024-01-16 Key-point Guided Deformable Image Manipulation Using Diffusion Model Seok-Hwan Oh et.al. 2401.08178v1 null
2024-01-23 SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting Lequan Lin et.al. 2401.08119v2 null
2024-01-16 DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech Jaekwon Im et.al. 2401.08102v1 null
2024-01-16 EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model Bingyuan Zhang et.al. 2401.08049v1 null
2024-01-16 Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities Xu Yan et.al. 2401.08045v1 link
2024-01-15 Regularity in diffusion models with gradient activation Damião Araújo et.al. 2401.07979v1 null
2024-01-15 HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation Antoine Mercier et.al. 2401.07727v1 null
2024-01-15 Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks Siyu Zou et.al. 2401.07709v1 null
2024-01-15 Multifractal-spectral features enhance classification of anomalous diffusion Henrik Seckler et.al. 2401.07646v1 null
2024-01-15 InstantID: Zero-shot Identity-Preserving Generation in Seconds Qixun Wang et.al. 2401.07519v1 link
2024-01-15 Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation Yuanchen Ju et.al. 2401.07487v1 null
2024-01-20 Hierarchical Fashion Design with Multi-stage Diffusion Models Zhifeng Xie et.al. 2401.07450v3 null
2024-01-14 A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models Namjoon Suh et.al. 2401.07187v1 null
2024-01-13 Exploring Adversarial Attacks against Latent Diffusion Model from the Perspective of Adversarial Transferability Junxi Chen et.al. 2401.07087v1 null
2024-01-13 Quantum Denoising Diffusion Models Michael Kölle et.al. 2401.07049v1 null
2024-01-13 Quantum Generative Diffusion Model Chuangtao Chen et.al. 2401.07039v1 null
2024-01-17 Denoising Diffusion Recommender Model Jujia Zhao et.al. 2401.06982v2 null
2024-01-12 A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models Emmanuil H. Georgoulis et.al. 2401.06740v1 null
2024-01-12 Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks Stefan Blücher et.al. 2401.06654v1 link
2024-01-17 Adversarial Examples are Misaligned in Diffusion Model Manifolds Peter Lorenz et.al. 2401.06637v3 null
2024-01-12 Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking Wei Cao et.al. 2401.06614v1 null
2024-01-12 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model Qian Wang et.al. 2401.06578v1 null
2024-01-12 RotationDrag: Point-based Image Editing with Rotated Diffusion Features Minxing Luo et.al. 2401.06442v1 link
2024-01-12 Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering Chang Yu et.al. 2401.06345v1 null
2024-01-11 Frequency-Time Diffusion with Neural Cellular Automata John Kalkhof et.al. 2401.06291v1 null
2024-01-11 Demystifying Variational Diffusion Models Fabio De Sousa Ribeiro et.al. 2401.06281v1 null
2024-01-11 Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Yuwen Xiong et.al. 2401.06197v1 link
2024-01-11 TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Rajaei Khatib et.al. 2401.06191v1 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127v1 null
2024-01-11 DiffDA: a diffusion model for weather-scale data assimilation Langwen Huang et.al. 2401.05932v1 null
2024-01-11 Efficient Image Deblurring Networks based on Diffusion Models Kang Chen et.al. 2401.05907v1 link
2024-01-11 HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models Hanzhang Wang et.al. 2401.05870v1 null
2024-01-11 EraseDiff: Erasing Data Influence in Diffusion Models Jing Wu et.al. 2401.05779v1 null
2024-01-10 Diffusion Priors for Dynamic View Synthesis from Monocular Videos Chaoyang Wang et.al. 2401.05583v1 null
2024-01-10 From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage Marcellus Amadeus et.al. 2401.05520v1 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335v1 null
2024-01-10 Score Distillation Sampling with Learned Manifold Corrective Thiemo Alldieck et.al. 2401.05293v1 null
2024-01-10 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen et.al. 2401.05252v1 link
2024-01-05 Tailoring Frictional Properties of Surfaces Using Diffusion Models Even Marius Nordhagen et.al. 2401.05206v1 null
2024-01-10 Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN Muhammad Ali Farooq et.al. 2401.05159v1 null
2024-01-13 CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model Yinghui Xing et.al. 2401.05153v2 null
2024-01-10 SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image Jiayuan Tian et.al. 2401.05093v1 null
2024-01-10 A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization Lili Ju et.al. 2401.04973v1 null
2024-01-09 Transmission-eigenchannel velocity and diffusion Azriel Z. Genack et.al. 2401.04818v1 null
2024-01-09 DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation Junming Chen et.al. 2401.04747v1 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728v1 null
2024-01-09 Efficient estimation for ergodic diffusion processes sampled at high frequency Michael Sørensen et.al. 2401.04689v1 null
2024-01-09 EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models Jingyuan Yang et.al. 2401.04608v1 null
2024-01-09 Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Xuewen Liu et.al. 2401.04585v1 null
2024-01-09 MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Weimin Wang et.al. 2401.04468v1 null
2024-01-09 D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection Justin Tebbe et.al. 2401.04463v1 null
2024-01-09 SonicVisionLM: Playing Sound with Vision Language Models Zhifeng Xie et.al. 2401.04394v1 null
2024-01-09 Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example Kwan Yun et.al. 2401.04362v1 null
2024-01-09 Memory-Efficient Personalization using Quantized Diffusion Model Hyogon Ryu et.al. 2401.04339v1 null
2024-01-08 FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation Yang Liu et.al. 2401.04283v1 null
2024-01-08 Robust Image Watermarking using Stable Diffusion Lijun Zhang et.al. 2401.04247v1 null
2024-01-07 The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline Haonan Wang et.al. 2401.04136v1 null
2024-01-08 scDiffusion: conditional generation of high-quality single-cell data using diffusion model Erpai Luo et.al. 2401.03968v1 link
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914v1 null
2024-01-08 DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement Jiaqi Liu et.al. 2401.03629v1 null
2024-01-09 ROIC-DM: Robust Text Inference and Classification via Diffusion Model Shilong Yuan et.al. 2401.03514v2 null
2024-01-07 Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness Sicheng Yang et.al. 2401.03476v1 null
2024-01-07 Deep Learning-based Image and Video Inpainting: A Survey Weize Quan et.al. 2401.03395v1 null
2024-01-06 Reflected Schrödinger Bridge for Constrained Generative Modeling Wei Deng et.al. 2401.03228v1 null
2024-01-06 MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond Yupei Lin et.al. 2401.03221v1 null
2024-01-09 Fair Sampling in Diffusion Models through Switching Mechanism Yujin Choi et.al. 2401.03140v2 link
2024-01-05 Latte: Latent Diffusion Transformer for Video Generation Xin Ma et.al. 2401.03048v1 link
2024-01-05 The Rise of Diffusion Models in Time-Series Forecasting Caspar Meijer et.al. 2401.03006v1 link
2024-01-08 Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction Yuxin Yang et.al. 2401.02916v2 null
2024-01-05 Plug-in Diffusion Model for Sequential Recommendation Haokai Ma et.al. 2401.02913v1 link
2024-01-05 Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Top Piriyakulkij et.al. 2401.02739v1 null
2024-01-05 Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation Can Xu et.al. 2401.02683v1 null
2024-01-04 Comprehensive Exploration of Synthetic Data Generation: A Survey André Bauer et.al. 2401.02524v1 null
2024-01-04 VASE: Object-Centric Appearance and Shape Manipulation of Real Videos Elia Peruzzo et.al. 2401.02473v1 null
2024-01-04 Bring Metric Functions into Diffusion Models Jie An et.al. 2401.02414v1 null
2024-01-06 GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation Xuehao Gao et.al. 2401.02142v2 null
2024-01-04 Preserving Image Properties Through Initializations in Diffusion Models Jeffrey Zhang et.al. 2401.02097v1 null
2024-01-04 Energy based diffusion generator for efficient sampling of Boltzmann distributions Yan Wang et.al. 2401.02080v1 null
2024-01-09 DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection Yunfan Ye et.al. 2401.02032v2 link
2024-01-04 Improving Diffusion-Based Image Synthesis with Context Prediction Ling Yang et.al. 2401.02015v1 null
2024-01-03 Instruct-Imagen: Image Generation with Multi-modal Instruction Hexiang Hu et.al. 2401.01952v1 null
2024-01-03 Can We Generate Realistic Hands Only Using Convolution? Mehran Hosseini et.al. 2401.01951v1 null
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827v1 link
2024-01-03 DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models Yichen Liu et.al. 2401.01659v1 null
2024-01-03 SIGNeRF: Scene Integrated Generation for Neural Radiance Fields Jan-Niklas Dihlmann et.al. 2401.01647v1 null
2024-01-03 S $^{2}$ -DMs:Skip-Step Diffusion Models Yixuan Wang et.al. 2401.01520v1 link
2024-01-02 ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text Dingkun Yan et.al. 2401.01456v1 link
2024-01-02 VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics Ammar A. Siddiqui et.al. 2401.01414v1 null
2024-01-01 DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition Parul Gupta et.al. 2401.01387v1 null
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256v1 null
2024-01-02 Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation Renshuai Liu et.al. 2401.01207v1 null
2024-01-02 A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation Øystein Håvard Færder et.al. 2401.01177v1 null
2024-01-02 Joint Generative Modeling of Scene Graphs and Images via Diffusion Models Bicheng Xu et.al. 2401.01130v1 null
2024-01-02 Robust single-particle cryo-EM image denoising and restoration Jing Zhang et.al. 2401.01097v1 null
2024-01-02 Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation Jinlong Xue et.al. 2401.01044v1 link
2023-12-30 Improving the Stability of Diffusion Models for Content Consistent Super-Resolution Lingchen Sun et.al. 2401.00877v1 link
2023-12-30 FlashVideo: A Framework for Swift Inference in Text-to-Video Generation Bin Lei et.al. 2401.00869v1 null
2024-01-01 DiffMorph: Text-less Image Morphing with Diffusion Models Shounak Chatterjee et.al. 2401.00739v1 null
2024-01-01 Diffusion Models, Image Super-Resolution And Everything: A Survey Brian B. Moser et.al. 2401.00736v1 null
2024-01-02 GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields Xiao Pan et.al. 2401.00616v2 null
2024-01-03 Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration Qianliang Wu et.al. 2401.00436v2 null
2023-12-31 SynCDR : Training Cross Domain Retrieval Models with Synthetic Data Samarth Mishra et.al. 2401.00420v1 link
2023-12-31 Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion Wei-Jer Chang et.al. 2401.00391v1 null
2023-12-30 Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins Karim Kadry et.al. 2401.00247v1 null
2023-12-30 Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models Han Jiang et.al. 2401.00208v1 null
2024-01-03 Diffusion Model with Perceptual Loss Shanchuan Lin et.al. 2401.00110v2 null
2023-12-29 Generating Enhanced Negatives for Training Language-Based Object Detectors Shiyu Zhao et.al. 2401.00094v1 null
2024-01-02 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029v2 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681v1 null
2023-12-29 Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models Kay Liu et.al. 2312.17679v1 link
2023-12-29 Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation Tuan-Anh Vu et.al. 2312.17505v1 null
2023-12-28 Classifier-free graph diffusion for molecular property targeting Matteo Ninniri et.al. 2312.17397v1 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250v1 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234v1 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225v1 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161v1 null
2023-12-28 DiffKG: Knowledge Graph Diffusion Model for Recommendation Yangqin Jiang et.al. 2312.16890v1 link
2023-12-29 DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors Biwen Lei et.al. 2312.16837v2 null
2023-12-27 I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models Xun Guo et.al. 2312.16693v1 null
2023-12-27 Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection Huan Liu et.al. 2312.16649v1 null
2023-12-27 Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance Tomer Garber et.al. 2312.16519v1 null
2023-12-29 PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Guansong Lu et.al. 2312.16486v2 null
2024-01-03 SVGDreamer: Text Guided SVG Generation with Diffusion Model Ximing Xing et.al. 2312.16476v2 null
2023-12-27 Natural Adversarial Patch Generation Method Based on Latent Diffusion Model Xianyi Chen et.al. 2312.16401v1 null
2023-12-24 Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks Christian Simon et.al. 2312.16218v1 null
2023-12-23 Iterative Prompt Relabeling for diffusion model with RLDF Jiaxin Ge et.al. 2312.16204v1 null
2023-12-26 One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Mengyao Lyu et.al. 2312.16145v1 null
2023-12-26 Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models Grzegorz Kaszuba et.al. 2312.16073v1 null
2023-12-26 HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Sangmin Woo et.al. 2312.15980v1 link
2023-12-26 Semantic Guidance Tuning for Text-To-Image Diffusion Models Hyun Kang et.al. 2312.15964v1 null
2023-12-26 Implied volatility (also) is path-dependent Hervé Andrès et.al. 2312.15950v1 link
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946v1 link
2023-12-26 Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection Songmin Dai et.al. 2312.15911v1 null
2023-12-26 Cross Initialization for Personalized Text-to-Image Generation Lianyu Pang et.al. 2312.15905v1 link
2024-01-02 Adversarial Item Promotion on Visually-Aware Recommender Systems by Guided Diffusion Lijian Chen et.al. 2312.15826v3 null
2023-12-28 High-Fidelity Diffusion-based Image Editing Chen Hou et.al. 2312.15707v2 null
2023-12-25 A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation Yongkang Wang et.al. 2312.15665v1 link
2023-12-25 Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation Bingzhi Liu et.al. 2312.15628v1 null
2023-12-25 Conversational Co-Speech Gesture Generation via Modeling Dialog Intention, Emotion, and Context with Diffusion Models Haiwei Xue et.al. 2312.15567v1 null
2023-12-27 A-SDM: Accelerating Stable Diffusion through Redundancy Removal and Performance Optimization Jinchao Zhu et.al. 2312.15516v2 null
2023-12-24 Diffusion-EXR: Controllable Review Generation for Explainable Recommendation via Diffusion Models Ling Li et.al. 2312.15490v1 null
2023-12-24 A Two-stage Personalized Virtual Try-on Framework with Shape Control and Texture Guidance Shufang Zhang et.al. 2312.15480v1 null
2023-12-23 Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models Gurusha Juneja et.al. 2312.15247v1 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242v1 null
2023-12-23 Majority-based Preference Diffusion on Social Networks Ahad N. Zehmakan et.al. 2312.15140v1 null
2023-12-22 Spectrally Decomposed Diffusion Models for Generative Turbulence Recovery Mohammed Sardar et.al. 2312.15029v1 null
2023-12-22 FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing Mingyuan Zhang et.al. 2312.15004v1 null
2023-12-22 Emage: Non-Autoregressive Text-to-Image Generation Zhangyin Feng et.al. 2312.14988v1 null
2023-12-21 Diffusion Models for Generative Artificial Intelligence: An Introduction for Applied Mathematicians Catherine F. Higham et.al. 2312.14977v1 null
2023-12-21 Gaussian Harmony: Attaining Fairness in Diffusion-based Face Generation Models Basudha Pal et.al. 2312.14976v1 null
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929v1 null
2023-12-22 BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction Honghao Fu et.al. 2312.14871v1 null
2023-12-22 Neural-network-based regularization methods for inverse problems in imaging Andreas Habring et.al. 2312.14849v1 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830v1 null
2023-12-22 Neural network models for preferential concentration of particles in two-dimensional turbulence Thibault Maurel-Oujia et.al. 2312.14829v1 null
2023-12-22 Plan, Posture and Go: Towards Open-World Text-to-Motion Generation Jinpeng Liu et.al. 2312.14828v1 null
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733v1 link
2023-12-22 FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection Dongmei Zhang et.al. 2312.14465v1 null
2023-12-22 Generative AI Beyond LLMs: System Implications of Multi-Modal Generation Alicia Golden et.al. 2312.14385v1 null
2023-12-21 Single-Cell RNA-seq Synthesis with Latent Diffusion Model Yixuan Wang et.al. 2312.14220v1 null
2023-12-21 DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models Brian Nlong Zhao et.al. 2312.14216v1 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-21 Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation Philipp Schröppel et.al. 2312.14124v1 link
2023-12-25 HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models Hayk Manukyan et.al. 2312.14091v2 link
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980v1 null
2023-12-22 Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Xianfang Zeng et.al. 2312.13913v2 link
2023-12-20 Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis Bichen Wu et.al. 2312.13834v1 null
2023-12-21 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763v1 null
2023-12-21 Free-Editor: Zero-shot Text-driven 3D Scene Editing Nazmul Karim et.al. 2312.13663v1 null
2023-12-21 Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents Jing Li et.al. 2312.13631v1 null
2023-12-21 Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion Nishtha Madaan et.al. 2312.13616v1 null
2023-12-21 Front stability of infinitely steep travelling waves in population biology Matthew J Simpson et.al. 2312.13601v1 link
2023-12-20 Unlocking Pre-trained Image Backbones for Semantic Image Synthesis Tariq Berrada et.al. 2312.13314v1 null
2023-12-20 Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style Haohan Wang et.al. 2312.13309v1 null
2023-12-20 Not All Steps are Equal: Efficient Generation with Progressive Diffusion Models Wenhao Li et.al. 2312.13307v1 null
2023-12-27 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271v3 link
2023-12-20 Conditional Image Generation with Pretrained Generative Model Rajesh Shrestha et.al. 2312.13253v1 null
2023-12-20 Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model Saurabh Saxena et.al. 2312.13252v1 null
2023-12-20 Diffusion Models With Learned Adaptive Noise Subham Sekhar Sahoo et.al. 2312.13236v1 link
2023-12-22 DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Yuming Gu et.al. 2312.13016v3 link
2023-12-21 RadEdit: stress-testing biomedical vision models via diffusion image editing Fernando Pérez-García et.al. 2312.12865v2 null
2023-12-20 ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement Yuhui Wu et.al. 2312.12826v1 null
2023-12-20 All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models Seunghoo Hong et.al. 2312.12807v1 null
2023-12-21 AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion Beibei Jing et.al. 2312.12763v2 null
2023-12-20 How Good Are Deep Generative Models for Solving Inverse Problems? Shichong Peng et.al. 2312.12691v1 null
2023-12-19 Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation Fahim Ahmed Zaman et.al. 2312.12649v1 null
2023-12-19 Fixed-point Inversion for Text-to-image diffusion models Barak Meiri et.al. 2312.12540v1 null
2023-12-19 StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Akio Kodaira et.al. 2312.12491v1 link
2023-12-19 InstructVideo: Instructing Video Diffusion Models with Human Feedback Hangjie Yuan et.al. 2312.12490v1 null
2023-12-19 Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models Angela Castillo et.al. 2312.12487v1 null
2023-12-19 Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion Fan Zhang et.al. 2312.12471v1 link
2023-12-19 MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers Haoyu Ma et.al. 2312.12468v1 null
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431v1 link
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419v1 null
2023-12-19 Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models Shweta Mahajan et.al. 2312.12416v1 null
2023-12-19 Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model Paul Carter et.al. 2312.12277v1 null
2023-12-19 Intrinsic Image Diffusion for Single-view Material Estimation Peter Kocsis et.al. 2312.12274v1 null
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232v1 link
2023-12-19 HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback Gaoge Han et.al. 2312.12227v1 null
2023-12-19 FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning Zhenhua Yang et.al. 2312.12142v1 link
2023-12-19 GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction Haodong Yan et.al. 2312.12090v1 null
2023-12-19 Learning Subject-Aware Cropping by Outpainting Professional Photos James Hong et.al. 2312.12080v1 null
2023-12-19 Resource-efficient Generative Mobile Edge Networks in 6G Era: Fundamentals, Framework and Case Study Bingkun Lai et.al. 2312.12063v1 null
2023-12-19 Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method Jiachun Pan et.al. 2312.12030v1 null
2023-12-19 Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling Leander van den Heuvel et.al. 2312.12000v1 null
2023-12-19 Optimizing Diffusion Noise Can Serve As Universal Motion Priors Korrawe Karunratanakul et.al. 2312.11994v1 null
2023-12-19 Extending intraday solar forecast horizons with deep generative models Alberto Carpentieri et.al. 2312.11966v1 link
2023-12-19 Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation Yuze He et.al. 2312.11774v1 null
2023-12-18 Learning a Diffusion Model Policy from Rewards via Q-Score Matching Michael Psenka et.al. 2312.11752v1 null
2023-12-18 Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics Yesukhei Jagvaral et.al. 2312.11707v1 null
2023-12-18 HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Vanessa Sklyarova et.al. 2312.11666v1 null
2023-12-18 SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution Zhixuan Liang et.al. 2312.11598v1 null
2023-12-18 TIP: Text-Driven Image Processing with Semantic and Restoration Instructions Chenyang Qi et.al. 2312.11595v1 null
2023-12-15 Iterative Motion Editing with Natural Language Purvi Goel et.al. 2312.11538v1 null
2023-12-15 Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior Nan Huang et.al. 2312.11535v1 null
2023-12-18 VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder Zhicong Tang et.al. 2312.11459v1 link
2023-12-18 PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models Antonio Alliegro et.al. 2312.11417v1 null
2023-12-21 MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance Qi Mao et.al. 2312.11396v2 null
2023-12-18 SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing Zeyinzi Jiang et.al. 2312.11392v1 null
2023-12-18 Adv-Diffusion: Imperceptible Adversarial Face Identity Attack via Latent Diffusion Model Decheng Liu et.al. 2312.11285v1 link
2023-12-18 GraspLDM: Generative 6-DoF Grasp Synthesis using Latent Diffusion Models Kuldeep R Barad et.al. 2312.11243v1 null
2023-12-18 Multi-scale Reconstruction of Turbulent Rotating Flows with Generative Diffusion Models Tianyi Li et.al. 2312.11121v1 null
2023-12-20 DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models Jiachen Zhou et.al. 2312.11057v2 link
2023-12-18 Realistic Human Motion Generation with Cross-Diffusion Models Zeping Ren et.al. 2312.10993v1 null
2023-12-18 Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model Zhenyu Xie et.al. 2312.10960v1 null
2023-12-20 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885v2 null
2023-12-17 Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models Nikita Starodubcev et.al. 2312.10835v1 link
2023-12-17 CogCartoon: Towards Practical Story Visualization Zhongyang Zhu et.al. 2312.10718v1 null
2023-12-19 VidToMe: Video Token Merging for Zero-Shot Video Editing Xirui Li et.al. 2312.10656v2 null
2023-12-16 VecFusion: Vector Font Generation with Diffusion Vikas Thamizharasan et.al. 2312.10540v1 null
2023-12-16 A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter Feng Bao et.al. 2312.10503v1 null
2023-12-16 Continuous Diffusion for Mixed-Type Tabular Data Markus Mueller et.al. 2312.10431v1 null
2023-12-16 Lecture Notes in Probabilistic Diffusion Models Inga Strümke et.al. 2312.10393v1 null
2023-12-16 Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge Conghan Yue et.al. 2312.10299v1 null
2023-12-15 Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components Francisco J. Vielma-Leal et.al. 2312.10231v1 null
2023-12-15 Tell Me What You See: Text-Guided Real-World Image Denoising Erez Yosef et.al. 2312.10191v1 null
2023-12-19 Improving new physics searches with diffusion models for event observables and jet constituents Debajyoti Sengupta et.al. 2312.10130v2 null
2023-12-15 MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation Suyi Jiang et.al. 2312.10120v1 null
2023-12-15 Plasticine3D: Non-rigid 3D editting with text guidance Yige Chen et.al. 2312.10111v1 null
2023-12-15 Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology Pedro Osorio et.al. 2312.09792v1 null
2023-12-15 DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models Yifeng Ma et.al. 2312.09767v1 null
2023-12-19 PPFM: Image denoising in photon-counting CT using single-step posterior sampling Poisson flow generative models Dennis Hein et.al. 2312.09754v2 link
2023-12-15 Positivity and global existence for nonlocal advection-diffusion models of interacting populations Valeria Giunta et.al. 2312.09692v1 null
2023-12-15 Exploring the Feasibility of Generating Realistic 3D Models of Endangered Species Using DreamGaussian: An Analysis of Elevation Angle's Impact on Model Generation Selcuk Anil Karatopak et.al. 2312.09682v1 null
2023-12-15 Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models Senmao Li et.al. 2312.09608v1 link
2023-12-15 Single PW takes a shortcut to compound PW in US imaging Zhiqiang Li et.al. 2312.09514v1 null
2023-12-15 Fast Sampling generative model for Ultrasound image reconstruction Hengrong Lan et.al. 2312.09510v1 null
2023-12-18 Unbiasing Enhanced Sampling on a High-dimensional Free Energy Surface with Deep Generative Model Yikai Liu et.al. 2312.09404v2 null
2023-12-14 LatentEditor: Text Driven Local Editing of 3D Scenes Umar Khalid et.al. 2312.09313v1 link
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256v1 null
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252v1 null
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250v1 null
2023-12-14 A framework for conditional diffusion modelling with applications in motif scaffolding for protein design Kieran Didi et.al. 2312.09236v1 null
2023-12-14 Mosaic-SDF for 3D Generative Models Lior Yariv et.al. 2312.09222v1 null
2023-12-14 Fast Sampling via De-randomization for Discrete Diffusion Models Zixiang Chen et.al. 2312.09193v1 null
2023-12-14 Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures Huijie Zhang et.al. 2312.09181v1 null
2023-12-14 DiffusionLight: Light Probes for Free by Painting a Chrome Ball Pakkapon Phongthawee et.al. 2312.09168v1 link
2023-12-14 Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers Zi-Xin Zou et.al. 2312.09147v1 null
2023-12-14 VideoLCM: Video Latent Consistency Model Xiang Wang et.al. 2312.09109v1 null
2023-12-14 PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion Ying-Tian Liu et.al. 2312.09069v1 null
2023-12-14 Brain Diffuser with Hierarchical Transformer for MCI Causality Analysis Qiankun Zuo et.al. 2312.09022v1 null
2023-12-18 OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers Han Liang et.al. 2312.08985v2 null
2023-12-14 Motion Flow Matching for Human Motion Synthesis and Editing Vincent Tao Hu et.al. 2312.08895v1 null
2023-12-14 VaLID: Variable-Length Input Diffusion for Novel View Synthesis Shijie Li et.al. 2312.08892v1 null
2023-12-13 SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance Yuanyou Xu et.al. 2312.08889v1 null
2023-12-15 SpeedUpNet: A Plug-and-Play Hyper-Network for Accelerating Text-to-Image Diffusion Models Weilong Chai et.al. 2312.08887v2 null
2023-12-13 Diffusion-based Blind Text Image Super-Resolution Yuzhe Zhang et.al. 2312.08886v1 null
2023-12-13 SceneWiz3D: Towards Text-guided 3D Scene Composition Qihang Zhang et.al. 2312.08885v1 null
2023-12-13 Semantic-Driven Initial Image Construction for Guided Image Synthesis in Diffusion Model Jiafeng Mao et.al. 2312.08872v1 null
2023-12-14 Diffusion-C: Unveiling the Generative Challenges of Diffusion Models through Corrupted Data Keywoong Bae et.al. 2312.08843v1 null
2023-12-14 Speeding up Photoacoustic Imaging using Diffusion Models Irem Loc et.al. 2312.08834v1 link
2023-12-14 Guided Diffusion from Self-Supervised Diffusion Features Vincent Tao Hu et.al. 2312.08825v1 null
2023-12-14 Reconstruction of Sound Field through Diffusion Models Federico Miotello et.al. 2312.08821v1 null
2023-12-14 Local Conditional Controlling for Text-to-Image Diffusion Models Yibo Zhao et.al. 2312.08768v1 link
2023-12-14 UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation Zexiang Liu et.al. 2312.08754v1 null
2023-12-17 DreamDrone Hanyang Kong et.al. 2312.08746v2 null
2023-12-14 GOEnFusion: Gradient Origin Encodings for 3D Forward Diffusion Models Animesh Karnewar et.al. 2312.08744v1 null
2023-12-14 Joint2Human: High-quality 3D Human Generation via Compact Spherical Embedding of 3D Joints Muxin Zhang et.al. 2312.08591v1 null
2023-12-13 NViST: In the Wild New View Synthesis from a Single Image with Transformers Wonbong Jang et.al. 2312.08568v1 null
2023-12-13 Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models Liangchen Song et.al. 2312.08563v1 null
2023-12-13 World Models via Policy-Guided Trajectory Diffusion Marc Rigter et.al. 2312.08533v1 link
2023-12-13 PerMod: Perceptually Grounded Voice Modification with Latent Diffusion Models Robin Netzorg et.al. 2312.08494v1 null
2023-12-13 FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models Shivangi Aneja et.al. 2312.08459v1 null
2023-12-13 PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models Anis Bourou et.al. 2312.08290v1 link
2023-12-13 Black-box Membership Inference Attacks against Fine-tuned Diffusion Models Yan Pang et.al. 2312.08207v1 null
2023-12-13 Concept-centric Personalization with Large-scale Diffusion Priors Pu Cao et.al. 2312.08195v1 link
2023-12-13 ** $ρ$ -Diffusion: A diffusion-based density estimation framework for computational physics** Maxwell X. Cai et.al. 2312.08153v1 null
2023-12-13 Clockwork Diffusion: Efficient Generation With Model-Step Distillation Amirhossein Habibian et.al. 2312.08128v1 null
2023-12-13 Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision Shengguang Wu et.al. 2312.08056v1 null
2023-12-14 Compositional Inversion for Stable Diffusion Models Xu-Lu Zhang et.al. 2312.08048v2 link
2023-12-13 AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing Zhiyuan Ma et.al. 2312.08019v1 link
2023-12-13 Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation Haiming Yi et.al. 2312.07981v1 null
2023-12-13 LMD: Faster Image Reconstruction with Latent Masking Diffusion Zhiyuan Ma et.al. 2312.07971v1 link
2023-12-13 Semantic-aware Data Augmentation for Text-to-image Synthesis Zhaorui Tan et.al. 2312.07951v1 null
2023-12-13 BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics Wenqian Zhang et.al. 2312.07937v1 null
2023-12-13 SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models Feifei Wang et.al. 2312.07865v1 null
2023-12-13 Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users Tianxun Zhou et.al. 2312.07854v1 null
2023-12-14 Noise in the reverse process improves the approximation capabilities of diffusion models Karthik Elamvazhuthi et.al. 2312.07851v2 null
2023-12-13 Stable Rivers: A Case Study in the Application of Text-to-Image Generative Models for Earth Sciences C Kupferschmidt et.al. 2312.07833v1 null
2023-12-12 Brain-optimized inference improves reconstructions of fMRI brain activity Reese Kneeland et.al. 2312.07705v1 null
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536v1 null
2023-12-12 Cosmological Field Emulation and Parameter Inference with Diffusion Models Nayantara Mudur et.al. 2312.07534v1 null
2023-12-12 MinD-3D: Reconstruct High-quality 3D objects in Human Brain Jianxiong Gao et.al. 2312.07485v1 null
2023-12-12 DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing Kaiwen Zhang et.al. 2312.07409v1 null
2023-12-12 Boosting Latent Diffusion with Flow Matching Johannes S. Fischer et.al. 2312.07360v1 link
2023-12-12 Learned representation-guided diffusion models for large-image generation Alexandros Graikos et.al. 2312.07330v1 null
2023-12-12 GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos Tomáš Souček et.al. 2312.07322v1 link
2023-12-12 Scalable Motion Style Transfer with Constrained Diffusion Generation Wenjie Yin et.al. 2312.07311v1 null
2023-12-12 A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models Enshu Liu et.al. 2312.07243v1 null
2023-12-12 Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation Shentong Mo et.al. 2312.07231v1 null
2023-12-12 Equivariant Flow Matching with Hybrid Probability Transport Yuxuan Song et.al. 2312.07168v1 null
2023-12-12 Text2AC-Zero: Consistent Synthesis of Animated Characters using 2D Diffusion Abdelrahman Eldesokey et.al. 2312.07133v1 null
2023-12-12 Generating High-Resolution Regional Precipitation Using Conditional Diffusion Model Naufal Shidqi et.al. 2312.07112v1 null
2023-12-12 Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation Xianghui Xie et.al. 2312.07063v1 null
2023-12-12 Diff-OP3D: Bridging 2D Diffusion for Open Pose 3D Zero-Shot Classification Weiguang Zhao et.al. 2312.07039v1 null
2023-12-12 Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning Wei Geng et.al. 2312.07025v1 null
2023-12-12 On the notion of Hallucinations from the lens of Bias and Validity in Synthetic CXR Images Gauri Bhardwaj et.al. 2312.06979v1 null
2023-12-12 CCM: Adding Conditional Controls to Text-to-Image Consistency Models Jie Xiao et.al. 2312.06971v1 null
2023-12-12 LoRA-Enhanced Distillation on Guided Diffusion Models Pareesa Ameneh Golnari et.al. 2312.06899v1 null
2023-12-11 Relightful Harmonization: Lighting-aware Portrait Background Replacement Mengwei Ren et.al. 2312.06886v1 null
2023-12-11 Adversarial Estimation of Topological Dimension with Harmonic Score Maps Eric Yeats et.al. 2312.06869v1 null
2023-12-11 SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models Yuzhou Huang et.al. 2312.06739v1 link
2023-12-11 InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following Shufan Li et.al. 2312.06738v1 link
2023-12-11 EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion Zehuan Huang et.al. 2312.06725v1 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661v1 null
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655v1 link
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640v1 null
2023-12-11 DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection Haoyang He et.al. 2312.06607v1 link
2023-12-11 ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Denis Zavadski et.al. 2312.06573v1 link
2023-12-11 HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models Xiaogang Peng et.al. 2312.06553v1 null
2023-12-11 STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction Xi Ye et.al. 2312.06486v1 link
2023-12-11 Semantic Image Synthesis for Abdominal CT Yan Zhuang et.al. 2312.06453v1 null
2023-12-11 DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior Tianyu Huang et.al. 2312.06439v1 null
2023-12-11 DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers Aaron Mir et.al. 2312.06400v1 null
2023-12-11 PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization Xu Peng et.al. 2312.06354v1 null
2023-12-12 DiffAIL: Diffusion Adversarial Imitation Learning Bingzheng Wang et.al. 2312.06348v2 link
2023-12-11 Compensation Sampling for Improved Convergence in Diffusion Models Hui Lu et.al. 2312.06285v1 null
2023-12-11 UIEDP:Underwater Image Enhancement with Diffusion Prior Dazhao Du et.al. 2312.06240v1 null
2023-12-11 The Journey, Not the Destination: How Data Guides Diffusion Models Kristian Georgiev et.al. 2312.06205v1 null
2023-12-11 Offloading and Quality Control for AI Generated Content Services in Edge Computing Networks Yitong Wang et.al. 2312.06203v1 null
2023-12-11 Optimized View and Geometry Distillation from Multi-view Diffuser Youjia Zhang et.al. 2312.06198v1 null
2023-12-11 SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction Based on Multi-Scale Fusion of Anatomical Structures, Guided by SwinTransformer and Projector Linjie Fu et.al. 2312.06187v1 null
2023-12-11 ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank Zhanjie Zhang et.al. 2312.06135v1 null
2023-12-11 Probabilistic Precipitation Downscaling with Optical Flow-Guided Diffusion Prakhar Srivastava et.al. 2312.06071v1 null
2023-12-11 PCRDiffusion: Diffusion Probabilistic Models for Point Cloud Registration Yue Wu et.al. 2312.06063v1 null
2023-12-11 CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models Tuna Han Salih Meral et.al. 2312.06059v1 null
2023-12-10 Correcting Diffusion Generation through Resampling Yujian Liu et.al. 2312.06038v1 link
2023-12-10 A Note on the Convergence of Denoising Diffusion Probabilistic Models Sokhna Diarra Mbacke et.al. 2312.05989v1 null
2023-12-10 InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models Jiun Tian Hoe et.al. 2312.05849v1 null
2023-12-10 Toward Open-ended Embodied Tasks Solving William Wei Wang et.al. 2312.05822v1 null
2023-12-10 HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model Yi Wang et.al. 2312.05804v1 null
2023-12-10 AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model Teng Hu et.al. 2312.05767v1 null
2023-12-09 Iterative Token Evaluation and Refinement for Real-World Super-Resolution Chaofeng Chen et.al. 2312.05616v1 link
2023-12-09 Generative AI for Physical Layer Communications: A Survey Nguyen Van Huynh et.al. 2312.05594v1 null
2023-12-09 DPoser: Diffusion Model as Robust 3D Human Pose Prior Junzhe Lu et.al. 2312.05541v1 link
2023-12-09 BARET : Balanced Attention based Real image Editing driven by Target-text Inversion Yuming Qiao et.al. 2312.05482v1 null
2023-12-09 Spectroscopy-Guided Discovery of Three-Dimensional Structures of Disordered Materials with Diffusion Models Hyuna Kwon et.al. 2312.05472v1 link
2023-12-09 Identifying and Mitigating Model Failures through Few-shot CLIP-aided Diffusion Generation Atoosa Chegini et.al. 2312.05464v1 null
2023-12-09 Efficient Quantization Strategies for Latent Diffusion Models Yuewei Yang et.al. 2312.05431v1 null
2023-12-08 CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling Ruihan Yang et.al. 2312.05412v1 null
2023-12-08 NoiseCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models Yusuf Dalva et.al. 2312.05390v1 null
2023-12-08 Cross Domain Generative Augmentation: Domain Generalization with Latent Diffusion Models Sobhan Hemati et.al. 2312.05387v1 null
2023-12-08 MotionCrafter: One-Shot Motion Customization of Diffusion Models Yuxin Zhang et.al. 2312.05288v1 link
2023-12-08 KBFormer: A Diffusion Model for Structured Entity Completion Ouail Kitouni et.al. 2312.05253v1 null
2023-12-08 SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation Thuan Hoang Nguyen et.al. 2312.05239v1 null
2023-12-08 Membership Inference Attacks on Diffusion Models via Quantile Regression Shuai Tang et.al. 2312.05140v1 null
2023-12-11 DreaMoving: A Human Video Generation Framework based on Diffusion Models Mengyang Feng et.al. 2312.05107v2 null
2023-12-08 SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control Jaskirat Singh et.al. 2312.05039v1 null
2023-12-07 Customizing Motion in Text-to-Video Diffusion Models Joanna Materzynska et.al. 2312.04966v1 null
2023-12-07 Inversion-Free Image Editing with Natural Language Sihan Xu et.al. 2312.04965v1 null
2023-12-08 UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models Yiming Zhao et.al. 2312.04884v1 link
2023-12-08 MVDD: Multi-View Depth Diffusion Models Zhen Wang et.al. 2312.04875v1 null
2023-12-08 HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models Pei Lin et.al. 2312.04867v1 null
2023-12-08 Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting Xiaofeng Yang et.al. 2312.04820v1 null
2023-12-08 A Unified Particle-Based Solver for Non-Newtonian Behaviors Simulation Chunlei Li et.al. 2312.04814v1 null
2023-12-08 RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models Yue Jiang et.al. 2312.04810v1 null
2023-12-08 RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation Aradhya N. Mathur et.al. 2312.04806v1 null
2023-12-08 MimicDiffusion: Purifying Adversarial Perturbation via Mimicking Clean Diffusion Model Kaiyu Song et.al. 2312.04802v1 null
2023-12-08 Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video Yuchen Rao et.al. 2312.04784v1 null
2023-12-08 Fine-Tuning InstructPix2Pix for Advanced Image Colorization Zifeng An et.al. 2312.04780v1 null
2023-12-07 Diffence: Fencing Membership Privacy With Diffusion Models Yuefeng Peng et.al. 2312.04692v1 null
2023-12-07 ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations Maitreya Patel et.al. 2312.04655v1 null
2023-12-07 NeuSD: Surface Completion with Multi-View Text-to-Image Diffusion Savva Ignatyev et.al. 2312.04654v1 null
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566v1 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560v1 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559v1 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552v1 null
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549v1 null
2023-12-07 Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance Yuto Enyo et.al. 2312.04529v1 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524v1 link
2023-12-07 Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation Zhiwu Qing et.al. 2312.04483v1 null
2023-12-07 Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion Kiran Chhatre et.al. 2312.04466v1 link
2023-12-07 FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models Stathis Galanakis et.al. 2312.04465v1 null
2023-12-07 DreamVideo: Composing Your Dream Videos with Customized Subject and Motion Yujie Wei et.al. 2312.04433v1 null
2023-12-07 Approximate Caching for Efficiently Serving Diffusion Models Shubham Agarwal et.al. 2312.04429v1 null
2023-12-07 Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views Yabo Chen et.al. 2312.04424v1 null
2023-12-07 Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models Jiayi Guo et.al. 2312.04410v1 link
2023-12-07 Adversarial Denoising Diffusion Model for Unsupervised Anomaly Detection Jongmin Yu et.al. 2312.04382v1 null
2023-12-07 Generating Multiphase Fluid Configurations in Fractures using Diffusion Models Jaehong Chung et.al. 2312.04375v1 null
2023-12-07 Investigating the Design Space of Diffusion Models for Speech Enhancement Philippe Gonzalez et.al. 2312.04370v1 null
2023-12-07 Improved Efficient Two-Stage Denoising Diffusion Power System Measurement Recovery Against False Data Injection Attacks and Data Losses Jianhua Pei et.al. 2312.04346v1 null
2023-12-07 Multi-View Unsupervised Image Generation with Cross Attention Guidance Llukman Cerkezi et.al. 2312.04337v1 null
2023-12-07 iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design Ruyi Gan et.al. 2312.04326v1 null
2023-12-07 Guided Reconstruction with Conditioned Diffusion Models for Unsupervised Anomaly Detection in Brain MRIs Finn Behrendt et.al. 2312.04215v1 link
2023-12-07 Diffusing Colors: Image Colorization with Text Guided Diffusion Nir Zabari et.al. 2312.04145v1 null
2023-12-07 DiffusionPhase: Motion Diffusion in Frequency Domain Weilin Wan et.al. 2312.04036v1 null
2023-12-07 KOALA: Self-Attention Matters in Knowledge Distillation of Latent Diffusion Models for Memory-Efficient and Fast Image Synthesis Youngwan Lee et.al. 2312.04005v1 null
2023-12-07 Stable diffusion for Data Augmentation in COCO and Weed Datasets Boyang Deng et.al. 2312.03996v1 null
2023-12-06 Adapting HouseDiffusion for conditional Floor Plan generation on Modified Swiss Dwellings dataset Emanuel Kuhn et.al. 2312.03938v1 null
2023-12-06 Controllable Human-Object Interaction Synthesis Jiaman Li et.al. 2312.03913v1 null
2023-12-06 Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion Kira Prabhu et.al. 2312.03869v1 null
2023-12-06 Diffusion Illusions: Hiding Images in Plain Sight Ryan Burgert et.al. 2312.03817v1 null
2023-12-06 AVID: Any-Length Video Inpainting with Diffusion Model Zhixing Zhang et.al. 2312.03816v1 link
2023-12-06 XCube ( $\mathcal{X}^3$ ): Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Xuanchi Ren et.al. 2312.03806v1 null
2023-12-06 AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation Xinzhou Wang et.al. 2312.03795v1 null
2023-12-06 AnimateZero: Video Diffusion Models are Zero-Shot Image Animators Jiwen Yu et.al. 2312.03793v1 link
2023-12-06 FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability Linze Li et.al. 2312.03775v1 null
2023-12-08 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701v2 link
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692v1 null
2023-12-06 WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on xujie zhang et.al. 2312.03667v1 null
2023-12-06 TokenCompose: Grounding Diffusion with Token-level Supervision Zirui Wang et.al. 2312.03626v1 link
2023-12-06 DreamComposer: Controllable 3D Object Generation via Multi-View Conditions Yunhan Yang et.al. 2312.03611v1 null
2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery Samar Khanna et.al. 2312.03606v1 null
2023-12-06 MMM: Generative Masked Motion Model Ekkasit Pinyoanuntapong et.al. 2312.03596v1 null
2023-12-06 Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention Jianjin Xu et.al. 2312.03556v1 null
2023-12-06 FoodFusion: A Latent Diffusion Model for Realistic Food Image Generation Olivia Markham et.al. 2312.03540v1 null
2023-12-06 FRDiff: Feature Reuse for Exquisite Zero-shot Acceleration of Diffusion Models Junhyuk So et.al. 2312.03517v1 null
2023-12-06 Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Zehua Chen et.al. 2312.03491v1 null
2023-12-06 F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis Sitong Su et.al. 2312.03459v1 null
2023-12-06 Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning Sangwoong Yoon et.al. 2312.03397v1 null
2023-12-06 Diffused Task-Agnostic Milestone Planner Mineui Hong et.al. 2312.03395v1 null
2023-12-06 DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction Yanlong Li et.al. 2312.03298v1 null
2023-12-06 Cache Me if You Can: Accelerating Diffusion Models through Block Caching Felix Wimbauer et.al. 2312.03209v1 null
2023-12-05 ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet Soon Yau Cheong et.al. 2312.03154v1 null
2023-12-05 DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration Zhi Chen et.al. 2312.03053v1 null
2023-12-05 DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control Yuru Jia et.al. 2312.03048v1 null
2023-12-05 MagicStick: Controllable Video Editing via Control Handle Transformations Yue Ma et.al. 2312.03047v1 link
2023-12-05 Customization Assistant for Text-to-image Generation Yufan Zhou et.al. 2312.03045v1 null
2023-12-05 DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance Cong Wang et.al. 2312.03018v1 null
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970v1 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967v1 null
2023-12-05 Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Cheng-Ju Ho et.al. 2312.02966v1 link
2023-12-05 A Diffusion Model of Dynamic Participant Inflow Management Baris Ata et.al. 2312.02927v1 null
2023-12-05 Deterministic Guidance Diffusion Model for Probabilistic Weather Forecasting Donggeun Yoon et.al. 2312.02819v1 link
2023-12-05 BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models Fengyuan Shi et.al. 2312.02813v1 null
2023-12-05 Generating Fine-Grained Human Motions Using ChatGPT-Refined Descriptions Xu Shi et.al. 2312.02772v1 null
2023-12-05 Neural Sign Actors: A diffusion model for 3D sign language production from text Vasileios Baltatzis et.al. 2312.02702v1 null
2023-12-05 Analyzing and Improving the Training Dynamics of Diffusion Models Tero Karras et.al. 2312.02696v1 null
2023-12-05 Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler Philippe Gonzalez et.al. 2312.02683v1 null
2023-12-05 TPA3D: Triplane Attention for Fast Text-to-3D Generation Hong-En Chen et.al. 2312.02647v1 null
2023-12-05 Diffusion Noise Feature: Accurate and Fast Generated Image Detection Yichi Zhang et.al. 2312.02625v1 null
2023-12-05 Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models Sungik Choi et.al. 2312.02615v1 null
2023-12-05 GeNIe: Generative Hard Negative Images Through Diffusion Soroush Abbasi Koohpayegani et.al. 2312.02548v1 link
2023-12-05 Retrieving Conditions from Reference Images for Diffusion Models Haoran Tang et.al. 2312.02521v1 null
2023-12-05 Creative Agents: Empowering Agents with Imagination for Creative Tasks Chi Zhang et.al. 2312.02519v1 link
2023-12-05 Orthogonal Adaptation for Modular Customization of Diffusion Models Ryan Po et.al. 2312.02432v1 null
2023-12-05 Towards Granularity-adjusted Pixel-level Semantic Annotation Rohit Kundu et.al. 2312.02420v1 null
2023-12-04 EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Motion Generation Wenyang Zhou et.al. 2312.02256v1 null
2023-12-04 Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images Zhuoran Yu et.al. 2312.02253v1 null
2023-12-04 Conditional Variational Diffusion Models Gabriel della Maggiora et.al. 2312.02246v1 null
2023-12-04 X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model Lingmin Ran et.al. 2312.02238v1 null
2023-12-03 Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction Yizhi Wang et.al. 2312.02221v1 null
2023-12-03 DragVideo: Interactive Drag-style Video Editing Yufan Deng et.al. 2312.02216v1 link
2023-12-03 Portrait Diffusion: Training-free Face Stylization with Chain-of-Painting Jin Liu et.al. 2312.02212v1 link
2023-12-02 ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation Peng Wang et.al. 2312.02201v1 null
2023-12-02 Exploiting Diffusion Priors for All-in-One Image Restoration Yuanbiao Gou et.al. 2312.02197v1 null
2023-12-04 Latent Feature-Guided Diffusion Models for Shadow Removal Kangfu Mei et.al. 2312.02156v1 null
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150v1 null
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145v1 link
2023-12-04 DiffiT: Diffusion Vision Transformers for Image Generation Ali Hatamizadeh et.al. 2312.02139v1 link
2023-12-04 Stochastic Optimal Control Matching Carles Domingo-Enrich et.al. 2312.02027v1 null
2023-12-04 UniGS: Unified Representation for Image Generation and Segmentation Lu Qi et.al. 2312.01985v1 link
2023-12-04 Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation Joshua Niemeijer et.al. 2312.01850v1 null
2023-12-04 Collaborative Neural Painting Nicola Dall'Asen et.al. 2312.01800v1 null
2023-12-04 Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation Qiaole Dong et.al. 2312.01746v1 link
2023-12-04 Fully Spiking Denoising Diffusion Implicit Models Ryo Watanabe et.al. 2312.01742v1 null
2023-12-04 StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On Jeongho Kim et.al. 2312.01725v1 link
2023-12-04 ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning Shi Zhenning et.al. 2312.01682v1 null
2023-12-03 CalliPaint: Chinese Calligraphy Inpainting with Diffusion Model Qisheng Liao et.al. 2312.01536v1 null
2023-12-03 CityGen: Infinite and Controllable 3D City Layout Generation Jie Deng et.al. 2312.01508v1 null
2023-12-03 Existence of finite time blow-up in Keller-Segel system Federico Buseghin et.al. 2312.01475v1 null
2023-12-03 Distilling Functional Rearrangement Priors from Large Models Yiming Zeng et.al. 2312.01474v1 null
2023-12-03 Diffusion Posterior Sampling for Nonlinear CT Reconstruction Shudong Li et.al. 2312.01464v1 null
2023-12-03 Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models Shengqu Cai et.al. 2312.01409v1 null
2023-12-03 Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts Tianqi Chen et.al. 2312.01408v1 null
2023-12-03 ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models Jeong-gi Kwak et.al. 2312.01305v1 null
2023-12-03 Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series Ying Liu et.al. 2312.01294v1 null
2023-12-02 PAC Privacy Preserving Diffusion Models Qipan Xu et.al. 2312.01201v1 null
2023-12-02 Ultra-Resolution Cascaded Diffusion Model for Gigapixel Image Synthesis in Histopathology Sarah Cechnicka et.al. 2312.01152v1 null
2023-12-02 ControlDreamer: Stylized 3D Generation with Multi-View ControlNet Yeongtak Oh et.al. 2312.01129v1 null
2023-12-02 Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty Cheng-Fu Yang et.al. 2312.01097v1 link
2023-12-02 Taming Latent Diffusion Models to See in the Dark Qiang Wen et.al. 2312.01027v1 null
2023-12-01 Consistent Mesh Diffusion Julian Knodt et.al. 2312.00971v1 null
2023-12-01 Enhancing Diffusion Models with 3D Perspective Geometry Constraints Rishi Upadhyay et.al. 2312.00944v1 null
2023-12-01 Assessment of the Flamelet Generated Manifold method with preferential diffusion modelling for the prediction of partially premixed hydrogen flames Eduardo Javier Pérez-Sánchez et.al. 2312.00929v1 null
2023-12-01 3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing Balamurugan Thambiraja et.al. 2312.00870v1 null
2023-12-01 DeepCache: Accelerating Diffusion Models for Free Xinyin Ma et.al. 2312.00858v1 link
2023-12-01 Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution Xi Yang et.al. 2312.00853v1 null
2023-12-01 Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion Litu Rout et.al. 2312.00852v1 null
2023-12-01 VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models Hyeonho Jeong et.al. 2312.00845v1 null
2023-11-30 Lasagna: Layered Score Distillation for Disentangled Object Relighting Dina Bashkirova et.al. 2312.00833v1 null
2023-11-30 Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples Phillip Howard et.al. 2312.00825v1 null
2023-12-01 TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models Pengxiang Li et.al. 2312.00651v1 null
2023-11-30 LucidDreaming: Controllable Object-Centric 3D Generation Zhaoning Wang et.al. 2312.00588v1 null
2023-12-01 Text-Guided 3D Face Synthesis -- From Generation to Editing Yunjie Wu et.al. 2312.00375v1 null
2023-11-30 DREAM: Diffusion Rectification and Estimation-Adaptive Models Jinxin Zhou et.al. 2312.00210v1 null
2023-11-30 S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion Or Greenberg et.al. 2312.00116v1 null
2023-11-30 Fast ODE-based Sampling for Diffusion Models in Around 5 Steps Zhenyu Zhou et.al. 2312.00094v1 null
2023-11-30 GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs Gege Gao et.al. 2312.00093v1 null
2023-11-30 Generative Artificial Intelligence in Learning Analytics: Contextualising Opportunities and Challenges through the Learning Analytics Cycle Lixiang Yan et.al. 2312.00087v1 null
2023-11-30 X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation Yiwei Ma et.al. 2312.00085v1 null
2023-11-30 Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion? Zhengyue Zhao et.al. 2312.00084v1 null
2023-11-30 HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models Zhonghao Wang et.al. 2312.00079v1 null
2023-11-29 Unsupervised Keypoints from Pretrained Diffusion Models Eric Hedlin et.al. 2312.00065v1 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832v1 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-12-05 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828v3 null
2023-11-30 ElasticDiffusion: Training-free Arbitrary Size Image Generation Moayed Haji-Ali et.al. 2311.18822v1 link
2023-11-30 Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters James Seale Smith et.al. 2311.18763v1 null
2023-11-30 Detailed Human-Centric Text Description-Driven Large Scene Synthesis Gwanghyun Kim et.al. 2311.18654v1 null
2023-11-30 Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing Hyelin Nam et.al. 2311.18608v1 null
2023-11-30 DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution Axi Niu et.al. 2311.18508v1 null
2023-11-30 Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis Zipeng Qi et.al. 2311.18435v1 null
2023-11-30 CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model Jianhao Zeng et.al. 2311.18405v1 link
2023-11-30 Age Effects on Decision-Making, Drift Diffusion Model Zahra Kavian et.al. 2311.18376v1 null
2023-11-30 Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning Ruxiao Duan et.al. 2311.18266v1 link
2023-11-30 Diffusion Models Without Attention Jing Nathan Yan et.al. 2311.18257v1 null
2023-11-30 SMaRt: Improving GANs with Score Matching Regularity Mengfei Xia et.al. 2311.18208v1 null
2023-11-30 HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation Yifan Zhang et.al. 2311.18158v1 null
2023-11-29 Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing Piper Wolters et.al. 2311.18082v1 link
2023-11-29 DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model Yuyang Hu et.al. 2311.18073v1 null
2023-11-29 Turn Down the Noise: Leveraging Diffusion Models for Test-time Adaptation via Pseudo-label Ensembling Mrigank Raman et.al. 2311.18071v1 null
2023-11-29 GELDA: A generative language annotation framework to reveal visual biases in datasets Krish Kabra et.al. 2311.18064v1 null
2023-11-29 Echoes in the Noise: Posterior Samples of Faint Galaxy Surface Brightness Profiles with Score-Based Likelihoods and Priors Alexandre Adam et.al. 2311.18002v1 null
2023-11-29 4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling Sherwin Bahmani et.al. 2311.17984v1 null
2023-12-01 GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation Baorui Ma et.al. 2311.17971v2 link
2023-11-29 HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting Wenquan Lu et.al. 2311.17957v1 link
2023-11-29 C3Net: Compound Conditioned ControlNet for Multimodal Content Generation Juntao Zhang et.al. 2311.17951v1 null
2023-11-28 Unlocking Spatial Comprehension in Text-to-Image Diffusion Models Mohammad Mahdi Derakhshani et.al. 2311.17937v1 null
2023-11-30 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v2 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919v1 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917v1 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907v1 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901v1 null
2023-11-29 Leveraging Graph Diffusion Models for Network Refinement Tasks Puja Trivedi et.al. 2311.17856v1 null
2023-11-30 SPiC-E : Structural Priors in 3D Diffusion Models using Cross-Entity Attention Etai Sella et.al. 2311.17834v2 null
2023-11-29 Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers Chi-Pin Huang et.al. 2311.17717v1 null
2023-11-29 Fair Text-to-Image Diffusion via Fair Mapping Jia Li et.al. 2311.17695v1 null
2023-11-29 AnyLens: A Generative Diffusion Model with Any Rendering Lens Andrey Voynov et.al. 2311.17609v1 null
2023-11-29 Query-Relevant Images Jailbreak Large Multi-Modal Models Xin Liu et.al. 2311.17600v1 null
2023-11-29 Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning Liang Peng et.al. 2311.17536v1 link
2023-11-29 HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models Shen Zhang et.al. 2311.17528v1 null
2023-11-29 MMA-Diffusion: MultiModal Attack on Diffusion Models Yijun Yang et.al. 2311.17516v1 null
2023-11-29 When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation Xiaoming Li et.al. 2311.17461v1 link
2023-11-29 DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Diffusion Model Jiuming Liu et.al. 2311.17456v1 null
2023-11-29 Wireless Network Digital Twin for 6G: Generative AI as A Key Enabler Zhenyu Tao et.al. 2311.17451v1 null
2023-12-01 VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model Haoyu Zhao et.al. 2311.17338v2 null
2023-11-28 Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation Hang Li et.al. 2311.17216v1 null
2023-11-28 A point cloud approach to generative modeling for galaxy surveys at the field level Carolina Cuesta-Lazaro et.al. 2311.17141v1 link
2023-11-28 Generative Models: What do they know? Do they know things? Let's find out! Xiaodan Du et.al. 2311.17137v1 null
2023-11-28 Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis Xiaohui Chen et.al. 2311.17126v1 null
2023-11-28 ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis Xiangjun Gao et.al. 2311.17123v1 null
2023-11-28 Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation Jacob Schnell et.al. 2311.17121v1 null
2023-11-28 Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation Li Hu et.al. 2311.17117v1 null
2023-11-28 Robust Diffusion GAN using Semi-Unbalanced Optimal Transport Quan Dao et.al. 2311.17101v1 null
2023-11-28 PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation Jian Ma et.al. 2311.17086v1 link
2023-11-28 DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling Linqi Zhou et.al. 2311.17082v1 link
2023-11-28 Material Palette: Extraction of Materials from a Single Image Ivan Lopes et.al. 2311.17060v1 null
2023-11-28 DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models Tsun-Hsuan Wang et.al. 2311.17053v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-28 Adversarial Diffusion Distillation Axel Sauer et.al. 2311.17042v1 link
2023-11-28 Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer Danah Yatim et.al. 2311.17009v1 null
2023-11-30 Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Yutong Feng et.al. 2311.17002v2 null
2023-11-28 COLE: A Hierarchical Generation Framework for Graphic Design Peidong Jia et.al. 2311.16974v1 null
2023-11-28 HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion Jingbo Zhang et.al. 2311.16961v1 null
2023-11-28 SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models Yuwei Guo et.al. 2311.16933v1 null
2023-11-28 RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D Lingteng Qiu et.al. 2311.16918v1 null
2023-11-28 On the existence of optimal multi-valued decoders and their accuracy bounds for undersampled inverse problems Nina Maria Gottschling et.al. 2311.16898v1 null
2023-11-28 Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration Chen Zhao et.al. 2311.16845v1 null
2023-11-28 As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors Seungwoo Yoo et.al. 2311.16739v1 null
2023-11-28 LEDITS++: Limitless Image Editing using Text-to-Image Models Manuel Brack et.al. 2311.16711v1 null
2023-11-28 MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices Yang Zhao et.al. 2311.16567v1 null
2023-11-28 DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser Peng Chen et.al. 2311.16565v1 null
2023-11-28 Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models Ling Fu et.al. 2311.16555v1 null
2023-11-28 Federated Learning with Diffusion Models for Privacy-Sensitive Vision Tasks Ye Lin Tun et.al. 2311.16538v1 null
2023-11-27 SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution Rongyuan Wu et.al. 2311.16518v1 link
2023-11-27 LFSRDiff: Light Field Image Super-Resolution via Diffusion Models Wentao Chao et.al. 2311.16517v1 link
2023-11-27 Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach Ayush K. Rai et.al. 2311.16514v1 null
2023-11-27 CoSeR: Bridging Image and Language for Cognitive Super-Resolution Haoze Sun et.al. 2311.16512v1 null
2023-11-28 Exploring Straighter Trajectories of Flow Matching with Diffusion Guidance Siyu Xing et.al. 2311.16507v1 null
2023-11-27 TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models Yushi Huang et.al. 2311.16503v1 null
2023-11-27 Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images Shiu-hong Kao et.al. 2311.16499v1 null
2023-11-27 MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model Zhongcong Xu et.al. 2311.16498v1 null
2023-11-28 Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net Zizhao Hu et.al. 2311.16488v1 null
2023-11-28 TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering Jingye Chen et.al. 2311.16465v1 null
2023-11-28 Manifold Preserving Guided Diffusion Yutong He et.al. 2311.16424v1 null
2023-11-29 ChatTraffic: Text-to-Traffic Generation via Diffusion Model Chengyang Zhang et.al. 2311.16203v2 link
2023-11-27 Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for Molecule Generation Ameya Daigavane et.al. 2311.16199v1 null
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102v1 link
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090v1 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060v1 link
2023-11-27 Exploring Attribute Variations in Style-based GANs using Diffusion Models Rishubh Parihar et.al. 2311.16052v1 null
2023-11-27 GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions Jiemin Fang et.al. 2311.16037v1 null
2023-11-27 Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation Teo Deveney et.al. 2311.15996v1 null
2023-11-27 DiffAnt: Diffusion Models for Action Anticipation Zeyun Zhong et.al. 2311.15991v1 null
2023-11-27 Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion Yuanxun Lu et.al. 2311.15980v1 null
2023-11-27 Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models Claudio Rota et.al. 2311.15908v1 link
2023-11-27 InterControl: Generate Human Motion Interactions by Controlling Every Joint Zhenzhi Wang et.al. 2311.15864v1 link
2023-11-27 SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion Hsuan-I Ho et.al. 2311.15855v1 null
2023-11-27 FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax Yu Lu et.al. 2311.15813v1 null
2023-11-27 Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation Biao Gong et.al. 2311.15773v1 null
2023-11-27 One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls Minghui Hu et.al. 2311.15744v1 null
2023-11-27 SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent Diffusion Models Zhiming Guo et.al. 2311.15736v1 null
2023-11-27 Regularization by Texts for Latent Diffusion Inverse Solvers Jeongsol Kim et.al. 2311.15658v1 null
2023-11-27 Enhancing Diffusion Models with Text-Encoder Reinforcement Learning Chaofeng Chen et.al. 2311.15657v1 link
2023-11-27 ET3D: Efficient Text-to-3D Generation via Multi-View Distillation Yiming Chen et.al. 2311.15561v1 null
2023-11-27 Instruct2Attack: Language-Guided Semantic Adversarial Attacks Jiang Liu et.al. 2311.15551v1 null
2023-11-27 Efficient Dataset Distillation via Minimax Diffusion Jianyang Gu et.al. 2311.15529v1 link
2023-11-27 AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image Divya Kothandaraman et.al. 2311.15478v1 null
2023-11-26 DISYRE: Diffusion-Inspired SYnthetic REstoration for Unsupervised Anomaly Detection Sergio Naval Marimont et.al. 2311.15453v1 null
2023-11-26 Quantum Diffusion Models Andrea Cacioppo et.al. 2311.15444v1 null
2023-11-26 Functional Diffusion Biao Zhang et.al. 2311.15435v1 null
2023-11-26 Wired Perspectives: Multi-View Wire Art Embraces Generative AI Zhiyu Qu et.al. 2311.15421v1 null
2023-11-26 Flow-Guided Diffusion for Video Inpainting Bohai Gu et.al. 2311.15368v1 link
2023-11-26 BS-Diff: Effective Bone Suppression Using Conditional Diffusion Models from Chest X-Ray Images Zhanghao Chen et.al. 2311.15328v1 null
2023-11-26 Learning Coarse Propagators in Parareal Algorithm Bangti Jin et.al. 2311.15320v1 null
2023-11-25 Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets Andreas Blattmann et.al. 2311.15127v1 link
2023-11-25 Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision Nicholas Lui et.al. 2311.15108v1 null
2023-11-25 InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser Xing Cui et.al. 2311.15040v1 null
2023-11-25 Point Cloud Pre-training with Diffusion Models Xiao Zheng et.al. 2311.14960v1 null
2023-11-25 FreePIH: Training-Free Painterly Image Harmonization with Diffusion Model Ruibin Li et.al. 2311.14926v1 null
2023-11-25 GBD-TS: Goal-based Pedestrian Trajectory Prediction with Diffusion using Tree Sampling Algorithm Ge Sun et.al. 2311.14922v1 null
2023-11-25 Resfusion: Prior Residual Noise embedded Denoising Diffusion Probabilistic Models Shi Zhenning et.al. 2311.14900v1 null
2023-11-24 Geometric theory on large-scale and local determination of density dependence of a recovering large carnivore population Yunyi Shen et.al. 2311.14815v1 null
2023-11-24 AdaDiff: Adaptive Step Selection for Fast Diffusion Hui Zhang et.al. 2311.14768v1 null
2023-11-24 CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization Ruoyu Zhao et.al. 2311.14631v1 null
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603v1 null
2023-11-24 ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model Eslam Mohamed Bakr et.al. 2311.14542v1 null
2023-11-24 GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Yiwen Chen et.al. 2311.14521v1 null
2023-11-27 MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation Zhiqi Li et.al. 2311.14494v2 null
2023-11-24 Joint Diffusion: Mutual Consistency-Driven Diffusion Model for PET-MRI Co-Reconstruction Taofeng Xie et.al. 2311.14473v1 null
2023-11-24 Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion Minshan Xie et.al. 2311.14343v1 null
2023-11-24 Decouple Content and Motion for Conditional Image-to-Video Generation Cuifeng Shen et.al. 2311.14294v1 null
2023-11-24 Paragraph-to-Image Generation with Information-Enriched Diffusion Model Weijia Wu et.al. 2311.14284v1 link
2023-11-24 Image Super-Resolution with Text Prompt Diffusion Zheng Chen et.al. 2311.14282v1 link
2023-11-24 Latent Diffusion Prior Enhanced Deep Unfolding for Spectral Image Reconstruction Zongliang Wu et.al. 2311.14280v1 null
2023-11-23 HACD: Hand-Aware Conditional Diffusion for Monocular Hand-Held Object Reconstruction Bowen Fu et.al. 2311.14189v1 null
2023-11-23 ACT: Adversarial Consistency Models Fei Kong et.al. 2311.14097v1 null
2023-11-23 RetroDiff: Retrosynthesis as Multi-stage Distribution Interpolation Yiming Wang et.al. 2311.14077v1 null
2023-11-23 Continual Learning of Diffusion Models with Generative Distillation Sergi Masip et.al. 2311.14028v1 link
2023-11-23 Touring sampling with pushforward maps Vivien Cabannes et.al. 2311.13845v1 null
2023-11-23 Adversarial defense based on distribution transfer Jiahao Chen et.al. 2311.13841v1 null
2023-11-23 Lego: Learning to Disentangle and Invert Concepts Beyond Object Appearance in Text-to-Image Diffusion Models Saman Motamed et.al. 2311.13833v1 null
2023-11-23 Posterior Distillation Sampling Juil Koo et.al. 2311.13831v1 null
2023-11-23 Sample-Efficient Training for Diffusion Shivam Gupta et.al. 2311.13745v1 null
2023-11-22 A Somewhat Robust Image Watermark against Diffusion-based Editing Models Mingtian Tan et.al. 2311.13713v1 null
2023-11-22 Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse Shaoju Wu et.al. 2311.13688v1 null
2023-11-22 Diffusion models meet image counter-forensics Matías Tailanian et.al. 2311.13629v1 link
2023-11-22 TDiffDe: A Truncated Diffusion Model for Remote Sensing Hyperspectral Image Denoising Jiang He et.al. 2311.13622v1 null
2023-11-21 Breathing Life Into Sketches Using Text-to-Video Priors Rinon Gal et.al. 2311.13608v1 null
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570v1 null
2023-11-22 **ADriver-I: A General Wo

About

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%