Image Generation - 2024-10

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|:---|:---|:---|:---|:---|:---|:---|
| 2024-10-31 | Generative modelling for mass-mapping with fast uncertainty quantification | Jessica J. Whitney et.al. | 2410.24197 | translate | read | null |
| 2024-10-31 | A Practical Style Transfer Pipeline for 3D Animation: Insights from Production R&D | Hideki Todo et.al. | 2410.24123 | translate | read | null |
| 2024-10-31 | DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Jia Fu et.al. | 2410.24006 | translate | read | null |
| 2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | translate | read | null |
| 2024-10-31 | EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching | Xinwang Chen et.al. | 2410.23788 | translate | read | link |
| 2024-10-31 | SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation | Aditya Agarwal et.al. | 2410.23643 | translate | read | null |
| 2024-10-31 | Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization | Xiao Guo et.al. | 2410.23556 | translate | read | null |
| 2024-10-30 | MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts | Jie Zhu et.al. | 2410.23332 | translate | read | null |
| 2024-10-30 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | translate | read | null |
| 2024-10-30 | Multi-student Diffusion Distillation for Better One-step Generators | Yanke Song et.al. | 2410.23274 | translate | read | null |
| 2024-10-30 | Controllable Game Level Generation: Assessing the Effect of Negative Examples in GAN Models | Mahsa Bazzaz et.al. | 2410.23108 | translate | read | null |
| 2024-10-30 | Private Synthetic Text Generation with Diffusion Models | Sebastian Ochs et.al. | 2410.22971 | translate | read | null |
| 2024-10-30 | An Individual Identity-Driven Framework for Animal Re-Identification | Yihao Wu et.al. | 2410.22927 | translate | read | link |
| 2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | translate | read | null |
| 2024-10-30 | Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad et.al. | 2410.22775 | translate | read | null |
| 2024-10-30 | st-DTPM: Spatial-Temporal Guided Diffusion Transformer Probabilistic Model for Delayed Scan PET Image Prediction | Ran Hong et.al. | 2410.22732 | translate | read | null |
| 2024-10-30 | Identifying Drift, Diffusion, and Causal Structure from Temporal Snapshots | Vincent Guan et.al. | 2410.22729 | translate | read | null |
| 2024-10-30 | FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution | Shuai Wang et.al. | 2410.22655 | translate | read | null |
| 2024-10-29 | Multimodal Semantic Communication for Generative Audio-Driven Video Conferencing | Haonan Tong et.al. | 2410.22112 | translate | read | null |
| 2024-10-29 | PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference | Kendong Liu et.al. | 2410.21966 | translate | read | null |
| 2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | translate | read | link |
| 2024-10-29 | HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion | Yu Zeng et.al. | 2410.21789 | translate | read | null |
| 2024-10-29 | Exploring Local Memorization in Diffusion Models via Bright Ending Attention | Chen Chen et.al. | 2410.21665 | translate | read | null |
| 2024-10-29 | Fingerprints of Super Resolution Networks | Jeremy Vonderfecht et.al. | 2410.21653 | translate | read | null |
| 2024-10-29 | Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Deepak Sridhar et.al. | 2410.21638 | translate | read | null |
| 2024-10-28 | CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation | Claudius Krause et.al. | 2410.21611 | translate | read | null |
| 2024-10-30 | A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth | Noel Elias et.al. | 2410.21557 | translate | read | null |
| 2024-10-28 | Denoising Diffusion Planner: Learning Complex Paths from Low-Quality Demonstrations | Michiel Nikken et.al. | 2410.21497 | translate | read | null |
| 2024-10-28 | ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization | Christian J. Steinmetz et.al. | 2410.21233 | translate | read | null |
| 2024-10-28 | SeriesGAN: Time Series Generation via Adversarial and Autoregressive Learning | MohammadReza EskandariNasab et.al. | 2410.21203 | translate | read | link |
| 2024-10-28 | Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences | Zhihao Zhao et.al. | 2410.21130 | translate | read | null |
| 2024-10-28 | Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models | Wenda Li et.al. | 2410.21088 | translate | read | link |
| 2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | translate | read | null |
| 2024-10-28 | Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models | Piotr Przybyła et.al. | 2410.20940 | translate | read | null |
| 2024-10-28 | Markov spin models for image generation : explicit large deviations with respect to the number of pixels | Cecile Monthus et.al. | 2410.20906 | translate | read | null |
| 2024-10-28 | Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models | Weijian Luo et.al. | 2410.20898 | translate | read | null |
| 2024-10-28 | zGAN: An Outlier-focused Generative Adversarial Network For Realistic Synthetic Data Generation | Azizjon Azimi et.al. | 2410.20808 | translate | read | null |
| 2024-10-28 | Murine AI excels at cats and cheese: Structural differences between human and mouse neurons and their implementation in generative AIs | Rino Saiga et.al. | 2410.20735 | translate | read | null |
| 2024-10-25 | Microplastic Identification Using AI-Driven Image Segmentation and GAN-Generated Ecological Context | Alex Dils et.al. | 2410.19604 | translate | read | null |
| 2024-10-25 | Generative Diffusion Models for Sequential Recommendations | Sharare Zolghadr et.al. | 2410.19429 | translate | read | null |
| 2024-10-25 | Unified Cross-Modal Image Synthesis with Hierarchical Mixture of Product-of-Experts | Reuben Dorent et.al. | 2410.19378 | translate | read | null |
| 2024-10-25 | High Resolution Seismic Waveform Generation using Denoising Diffusion | Andreas Bergmeister et.al. | 2410.19343 | translate | read | null |
| 2024-10-25 | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion | Emiel Hoogeboom et.al. | 2410.19324 | translate | read | null |
| 2024-10-24 | Generation of synthetic financial time series by diffusion models | Tomonori Takahashi et.al. | 2410.18897 | translate | read | null |
| 2024-10-24 | Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences | Weijian Luo et.al. | 2410.18881 | translate | read | null |
| 2024-10-24 | Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation | Xiaoyu Zhang et.al. | 2410.18830 | translate | read | null |
| 2024-10-24 | Towards Visual Text Design Transfer Across Languages | Yejin Choi et.al. | 2410.18823 | translate | read | null |
| 2024-10-24 | Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model | Ali Hamza et.al. | 2410.18678 | translate | read | null |
| 2024-10-24 | FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation | Christopher T. H Teo et.al. | 2410.18615 | translate | read | null |
| 2024-10-24 | FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling | Zhengqiang Zhang et.al. | 2410.18410 | translate | read | link |
| 2024-10-23 | Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | Dongliang Guo et.al. | 2410.18267 | translate | read | null |
| 2024-10-23 | FreeVS: Generative View Synthesis on Free Driving Trajectory | Qitai Wang et.al. | 2410.18079 | translate | read | null |
| 2024-10-23 | Scalable Ranked Preference Optimization for Text-to-Image Generation | Shyamgopal Karthik et.al. | 2410.18013 | translate | read | null |
| 2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | translate | read | null |
| 2024-10-23 | Medical Imaging Complexity and its Effects on GAN Performance | William Cagas et.al. | 2410.17959 | translate | read | null |
| 2024-10-23 | Variational MineGAN: A Data-efficient Knowledge Transfer Architecture for Generative AI-assisted Design of Nanophotonic Structures | Shahriar Tarvir Nushin et.al. | 2410.17889 | translate | read | null |
| 2024-10-23 | TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation | Ruicheng Zhang et.al. | 2410.17855 | translate | read | null |
| 2024-10-23 | Longitudinal Causal Image Synthesis | Yujia Li et.al. | 2410.17691 | translate | read | null |
| 2024-10-23 | Deep Generative Models for 3D Medical Image Synthesis | Paul Friedrich et.al. | 2410.17664 | translate | read | null |
| 2024-10-23 | Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets | Jesús Bobadilla et.al. | 2410.17651 | translate | read | null |
| 2024-10-22 | Offline Evaluation of Set-Based Text-to-Image Generation | Negar Arabzadeh et.al. | 2410.17331 | translate | read | null |
| 2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251 | translate | read | null |
| 2024-10-22 | PGCS: Physical Law embedded Generative Cloud Synthesis in Remote Sensing Images | Liying Xu et.al. | 2410.16955 | translate | read | null |
| 2024-10-22 | IdenBAT: Disentangled Representation Learning for Identity-Preserved Brain Age Transformation | Junyeong Maeng et.al. | 2410.16945 | translate | read | link |
| 2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | translate | read | null |
| 2024-10-22 | Hierarchical Clustering for Conditional Diffusion in Image Generation | Jorge da Silva Goncalves et.al. | 2410.16910 | translate | read | link |
| 2024-10-22 | CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare | Nicholas I-Hsien Kuo et.al. | 2410.16872 | translate | read | null |
| 2024-10-22 | MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | Meng Xu et.al. | 2410.16840 | translate | read | null |
| 2024-10-22 | Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection | Laurent Colbois et.al. | 2410.16802 | translate | read | link |
| 2024-10-22 | Progressive Compositionality In Text-to-Image Generative Models | Xu Han et.al. | 2410.16719 | translate | read | null |
| 2024-10-22 | Privacy-hardened and hallucination-resistant synthetic data generation with logic-solvers | Mark A. Burgess et.al. | 2410.16705 | translate | read | null |
| 2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | translate | read | null |
| 2024-10-21 | Elucidating the design space of language models for image generation | Xuantong Liu et.al. | 2410.16257 | translate | read | null |
| 2024-10-21 | A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data | Simon Deltadahl et.al. | 2410.16177 | translate | read | null |
| 2024-10-21 | Continuous Speech Synthesis using per-token Latent Diffusion | Arnon Turetzky et.al. | 2410.16048 | translate | read | null |
| 2024-10-20 | MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications | Yongrui Yu et.al. | 2410.15432 | translate | read | null |
| 2024-10-20 | Synthetic Data Generation for Residential Load Patterns via Recurrent GAN and Ensemble Method | Xinyu Liang et.al. | 2410.15379 | translate | read | null |
| 2024-10-19 | Group Diffusion Transformers are Unsupervised Multitask Learners | Lianghua Huang et.al. | 2410.15027 | translate | read | null |
| 2024-10-19 | DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer | Ying Hu et.al. | 2410.15007 | translate | read | null |
| 2024-10-19 | SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning | Zhewei Dai et.al. | 2410.14987 | translate | read | null |
| 2024-10-19 | Non-Invasive to Invasive: Enhancing FFA Synthesis from CFP with a Benchmark Dataset and a Novel Network | Hongqiu Wang et.al. | 2410.14965 | translate | read | null |
| 2024-10-18 | BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Shaozhe Hao et.al. | 2410.14672 | translate | read | link |
| 2024-10-18 | FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models | Rui Hu et.al. | 2410.14429 | translate | read | null |
| 2024-10-18 | HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation | Bo Cheng et.al. | 2410.14324 | translate | read | link |
| 2024-10-18 | HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects | Oliverio Theophilus Nathanael et.al. | 2410.14265 | translate | read | null |
| 2024-10-18 | Text-to-Image Representativity Fairness Evaluation Framework | Asma Yamani et.al. | 2410.14201 | translate | read | null |
| 2024-10-18 | Personalized Image Generation with Large Multimodal Models | Yiyan Xu et.al. | 2410.14170 | translate | read | null |
| 2024-10-18 | Assessing Open-world Forgetting in Generative Image Model Customization | Héctor Laria et.al. | 2410.14159 | translate | read | null |
| 2024-10-17 | Inference of morphology and dynamical state of nearby $Planck$-SZ galaxy clusters with Zernike polynomials | Valentina Capalbo et.al. | 2410.13929 | translate | read | null |
| 2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | translate | read | null |
| 2024-10-17 | PUMA: Empowering Unified MLLM with Multi-granular Visual Generation | Rongyao Fang et.al. | 2410.13861 | translate | read | link |
| 2024-10-17 | Diffusing States and Matching Scores: A New Framework for Imitation Learning | Runzhe Wu et.al. | 2410.13855 | translate | read | link |
| 2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | translate | read | link |
| 2024-10-18 | Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion | Yijun Liang et.al. | 2410.13674 | translate | read | link |
| 2024-10-17 | An Active Learning Framework for Inclusive Generation by Large Language Models | Sabit Hassan et.al. | 2410.13641 | translate | read | null |
| 2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | translate | read | link |
| 2024-10-17 | GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning | Shrishti Saha Shetu et.al. | 2410.13599 | translate | read | null |
| 2024-10-17 | AI-based 3-Lead to 12-Lead ECG Reconstruction: Towards Smartphone-based Public Healthcare | Aditya Mallick et.al. | 2410.13528 | translate | read | null |
| 2024-10-17 | MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models | Donghao Zhou et.al. | 2410.13370 | translate | read | null |
| 2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | translate | read | link |
| 2024-10-16 | 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation | Dewei Zhou et.al. | 2410.12669 | translate | read | null |
| 2024-10-16 | Evaluating Utility of Memory Efficient Medical Image Generation: A Study on Lung Nodule Segmentation | Kathrin Khadra et.al. | 2410.12542 | translate | read | null |
| 2024-10-16 | Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | Yongxin Zhu et.al. | 2410.12490 | translate | read | link |
| 2024-10-16 | Synthetic Augmentation for Anatomical Landmark Localization using DDPMs | Arnela Hadzic et.al. | 2410.12489 | translate | read | null |
| 2024-10-16 | Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks | Pranjali Pathre et.al. | 2410.12432 | translate | read | null |
| 2024-10-16 | GAN Based Top-Down View Synthesis in Reinforcement Learning Environments | Usama Younus et.al. | 2410.12372 | translate | read | null |
| 2024-10-16 | FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization | Cheng Yu et.al. | 2410.12312 | translate | read | null |
| 2024-10-16 | NSSI-Net: Multi-Concept Generative Adversarial Network for Non-Suicidal Self-Injury Detection Using High-Dimensional EEG Signals in a Semi-Supervised Learning Framework | Zhen Liang et.al. | 2410.12159 | translate | read | null |
| 2024-10-16 | Facing Identity: The Formation and Performance of Identity via Face-Based Artificial Intelligence Technologies | Wells Lucas Santo et.al. | 2410.12148 | translate | read | null |
| 2024-10-15 | On the Effectiveness of Dataset Alignment for Fake Image Detection | Anirudh Sundara Rajan et.al. | 2410.11835 | translate | read | null |
| 2024-10-15 | KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities | Hsin-Ping Huang et.al. | 2410.11824 | translate | read | null |
| 2024-10-15 | Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Zhiyuan Ma et.al. | 2410.11795 | translate | read | null |
| 2024-10-15 | Generative Image Steganography Based on Point Cloud | Zhong Yangjie et.al. | 2410.11673 | translate | read | null |
| 2024-10-15 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Jiayi Lin et.al. | 2410.11473 | translate | read | null |
| 2024-10-15 | A Simple Approach to Unifying Diffusion-based Conditional Generation | Xirui Li et.al. | 2410.11439 | translate | read | null |
| 2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | translate | read | null |
| 2024-10-15 | Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling | Guiyu Zhang et.al. | 2410.11236 | translate | read | null |
| 2024-10-14 | When Does Perceptual Alignment Benefit Vision Representations? | Shobhita Sundaram et.al. | 2410.10817 | translate | read | null |
| 2024-10-14 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Haotian Tang et.al. | 2410.10812 | translate | read | link |
| 2024-10-14 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Prababilistic Modeling | Jian Yang et.al. | 2410.10798 | translate | read | null |
| 2024-10-14 | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Litu Rout et.al. | 2410.10792 | translate | read | null |
| 2024-10-14 | Evaluating SQL Understanding in Large Language Models | Ananya Rahaman et.al. | 2410.10680 | translate | read | null |
| 2024-10-14 | SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers | Enze Xie et.al. | 2410.10629 | translate | read | null |
| 2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et.al. | 2410.10554 | translate | read | link |
| 2024-10-14 | Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling | Wenze Liu et.al. | 2410.10511 | translate | read | link |
| 2024-10-14 | Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing | Kejie Wang et.al. | 2410.10496 | translate | read | null |
| 2024-10-14 | 4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting | Wanlin Liang et.al. | 2410.10412 | translate | read | null |
| 2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | translate | read | link |
| 2024-10-11 | MiRAGeNews: Multimodal Realistic AI-Generated News Detection | Runsheng Huang et.al. | 2410.09045 | translate | read | link |
| 2024-10-11 | One-shot Generative Domain Adaptation in 3D GANs | Ziqiang Li et.al. | 2410.08824 | translate | read | link |
| 2024-10-11 | Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting | Purushothaman Natarajan et.al. | 2410.08612 | translate | read | link |
| 2024-10-11 | Text-To-Image with Generative Adversarial Networks | Mehrshad Momen-Tayefeh et.al. | 2410.08608 | translate | read | null |
| 2024-10-11 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascl Zwick et.al. | 2410.08551 | translate | read | null |
| 2024-10-11 | Score Neural Operator: A Generative Model for Learning and Generalizing Across Multiple Probability Distributions | Xinyu Liao et.al. | 2410.08549 | translate | read | null |
| 2024-10-11 | Diffusion Models Need Visual Priors for Image Generation | Xiaoyu Yue et.al. | 2410.08531 | translate | read | null |
| 2024-10-10 | Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis | Jinbin Bai et.al. | 2410.08261 | translate | read | link |
| 2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | translate | read | null |
| 2024-10-10 | Scaling Laws For Diffusion Transformers | Zhengyang Liang et.al. | 2410.08184 | translate | read | null |
| 2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | translate | read | null |
| 2024-10-10 | RayEmb: Arbitrary Landmark Detection in X-Ray Images Using Ray Embedding Subspace | Pragyan Shrestha et.al. | 2410.08152 | translate | read | link |
| 2024-10-10 | Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models | Abhishek Mandal et.al. | 2410.07884 | translate | read | null |
| 2024-10-10 | MinorityPrompt: Text to Minority Image Generation via Prompt Optimization | Soobin Um et.al. | 2410.07838 | translate | read | link |
| 2024-10-10 | MGMD-GAN: Generalization Improvement of Generative Adversarial Networks with Multiple Generator Multiple Discriminator Framework Against Membership Inference Attacks | Nirob Arefin et.al. | 2410.07803 | translate | read | null |
| 2024-10-10 | Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models | Danush Kumar Venkatesh et.al. | 2410.07753 | translate | read | link |
| 2024-10-10 | Relational Diffusion Distillation for Efficient Image Generation | Weilun Feng et.al. | 2410.07679 | translate | read | link |
| 2024-10-10 | FLIER: Few-shot Language Image Models Embedded with Latent Representations | Zhinuo Zhou et.al. | 2410.07648 | translate | read | null |
| 2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171 | translate | read | link |
| 2024-10-09 | EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Rui Zhao et.al. | 2410.07133 | translate | read | link |
| 2024-10-09 | Personalized Visual Instruction Tuning | Renjie Pi et.al. | 2410.07113 | translate | read | link |
| 2024-10-09 | Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis | Ahmed Abdullah et.al. | 2410.06841 | translate | read | null |
| 2024-10-09 | Decouple-Then-Merge: Towards Better Training for Diffusion Models | Qianli Ma et.al. | 2410.06664 | translate | read | link |
| 2024-10-09 | On the Solution of Linearized Inverse Scattering Problems in Near-Field Microwave Imaging by Operator Inversion and Matched Filtering | Matthias M. Saurer et.al. | 2410.06465 | translate | read | null |
| 2024-10-08 | Story-Adapter: A Training-free Iterative Framework for Long Story Visualization | Jiawei Mao et.al. | 2410.06244 | translate | read | link |
| 2024-10-08 | SD-$π$XL: Generating Low-Resolution Quantized Imagery via Score Distillation | Alexandre Binninger et.al. | 2410.06236 | translate | read | null |
| 2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | translate | read | null |
| 2024-10-08 | Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning | Barak Gahtan et.al. | 2410.06140 | translate | read | null |
| 2024-10-07 | Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer | Siyuan Hou et.al. | 2410.05151 | translate | read | null |
| 2024-10-07 | Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning | Ayano Hiranaka et.al. | 2410.05116 | translate | read | null |
| 2024-10-07 | Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization | Rohan Reddy Mekala et.al. | 2410.05114 | translate | read | null |
| 2024-10-07 | Bi-Directional MS Lesion Filling and Synthesis Using Denoising Diffusion Implicit Model-based Lesion Repainting | Jinwei Zhang et.al. | 2410.05027 | translate | read | null |
| 2024-10-07 | OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction | Leheng Li et.al. | 2410.04932 | translate | read | null |
| 2024-10-07 | PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing | Feng Tian et.al. | 2410.04844 | translate | read | null |
| 2024-10-07 | Transforming Color: A Novel Image Colorization Method | Hamza Shafiq et.al. | 2410.04799 | translate | read | null |
| 2024-10-07 | Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models | Aye Phyu Phyu Aung et.al. | 2410.04764 | translate | read | null |
| 2024-10-07 | Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models | Yuchen Wu et.al. | 2410.04760 | translate | read | null |
| 2024-10-06 | Video Summarization Techniques: A Comprehensive Review | Toqa Alaa et.al. | 2410.04449 | translate | read | null |
| 2024-10-04 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | translate | read | link |
| 2024-10-04 | Dynamic Diffusion Transformer | Wangbo Zhao et.al. | 2410.03456 | translate | read | link |
| 2024-10-04 | Images Speak Volumes: User-Centric Assessment of Image Generation for Accessible Communication | Miriam Anschütz et.al. | 2410.03430 | translate | read | null |
| 2024-10-04 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | Doohyuk Jang et.al. | 2410.03355 | translate | read | null |
| 2024-10-04 | Learning test generators for cyber-physical systems | Jarkko Peltomäki et.al. | 2410.03202 | translate | read | null |
| 2024-10-04 | MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech | Taejun Bak et.al. | 2410.03192 | translate | read | null |
| 2024-10-04 | Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization | Zichen Miao et.al. | 2410.03190 | translate | read | null |
| 2024-10-04 | Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach | Yaofang Liu et.al. | 2410.03160 | translate | read | link |
| 2024-10-03 | Revealing the Unseen: Guiding Personalized Diffusion Models to Expose Training Data | Xiaoyu Wu et.al. | 2410.03039 | translate | read | null |
| 2024-10-03 | PixelShuffler: A Simple Image Translation Through Pixel Rearrangement | Omar Zamzam et.al. | 2410.03021 | translate | read | null |
| 2024-10-03 | SteerDiff: Steering towards Safe Text-to-Image Diffusion Models | Hongxiang Zhang et.al. | 2410.02710 | translate | read | null |
| 2024-10-03 | ControlAR: Controllable Image Generation with Autoregressive Models | Zongming Li et.al. | 2410.02705 | translate | read | link |
| 2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | translate | read | null |
| 2024-10-03 | Event-Customized Image Generation | Zhen Wang et.al. | 2410.02483 | translate | read | null |
| 2024-10-03 | Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Muzhi Zhu et.al. | 2410.02369 | translate | read | link |
| 2024-10-03 | SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2410.02367 | translate | read | link |
| 2024-10-03 | Plug-and-Play Controllable Generation for Discrete Masked Models | Wei Guo et.al. | 2410.02143 | translate | read | null |
| 2024-10-02 | EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing | Haotian Sun et.al. | 2410.02098 | translate | read | null |
| 2024-10-02 | DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation | Jing He et.al. | 2410.02067 | translate | read | null |
| 2024-10-02 | Normalizing Flow Based Metric for Image Generation | Pranav Jeevan et.al. | 2410.02004 | translate | read | link |
| 2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796 | translate | read | null |
| 2024-10-02 | ImageFolder: Autoregressive Image Generation with Folded Tokens | Xiang Li et.al. | 2410.01756 | translate | read | link |
| 2024-10-02 | ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation | Rinon Gal et.al. | 2410.01731 | translate | read | null |
| 2024-10-02 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Yao Teng et.al. | 2410.01699 | translate | read | link |
| 2024-10-02 | Data Extrapolation for Text-to-image Generation on Small Datasets | Senmao Ye et.al. | 2410.01638 | translate | read | link |
| 2024-10-02 | KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models | Pouyan Navard et.al. | 2410.01595 | translate | read | link |
| 2024-10-02 | Edge-preserving noise for diffusion models | Jente Vandersanden et.al. | 2410.01540 | translate | read | null |
| 2024-10-02 | Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer | Kento Masui et.al. | 2410.01366 | translate | read | null |
| 2024-10-02 | Aggregation of Multi Diffusion Models for Enhancing Learned Representations | Conghan Yue et.al. | 2410.01262 | translate | read | link |
| 2024-10-02 | The SynCOM Flow Tracking Challenge | Valmir Moraes Filho et.al. | 2410.01233 | translate | read | null |
| 2024-10-01 | Enhancing GANs with Contrastive Learning-Based Multistage Progressive Finetuning SNN and RL-Based External Optimization | Osama Mustafa et.al. | 2409.20340 | translate | read | null |
