Image Generation - 2024-09

Publish Date Title Authors PDF Translate Read Code
2024-09-30 Inverse Painting: Reconstructing The Painting Process Bowei Chen et.al. 2409.20556 translate read null
2024-09-30 Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images Bahri Batuhan Bilecen et.al. 2409.20530 translate read null
2024-09-30 All-optical autoencoder machine learning framework using diffractive processors Peijie Feng et.al. 2409.20346 translate read null
2024-09-30 Illustrious: an Open Advanced Illustration Model Sang Hyun Park et.al. 2409.19946 translate read null
2024-09-30 MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation Wenchao Chen et.al. 2409.19937 translate read null
2024-09-29 OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines Daniel Silver et.al. 2409.19823 translate read null
2024-09-29 When Molecular GAN Meets Byte-Pair Encoding Huidong Tang et.al. 2409.19740 translate read null
2024-09-29 Simple and Fast Distillation of Diffusion Models Zhenyu Zhou et.al. 2409.19681 translate read link
2024-09-29 Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection Yuhang Ma et.al. 2409.19624 translate read null
2024-09-27 Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis Songrui Wang et.al. 2409.18897 translate read null
2024-09-27 Explainable Artifacts for Synthetic Western Blot Source Attribution João Phillipe Cardenuto et.al. 2409.18881 translate read null
2024-09-27 Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks Richard Osuala et.al. 2409.18872 translate read null
2024-09-27 Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models Nguyen Gia Bach et.al. 2409.18476 translate read link
2024-09-27 Gradient-free Decoder Inversion in Latent Diffusion Models Seongmin Hong et.al. 2409.18442 translate read null
2024-09-27 Adaptive Learning of the Latent Space of Wasserstein Generative Adversarial Networks Yixuan Qiu et.al. 2409.18374 translate read null
2024-09-26 DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning Hui Lin et.al. 2409.18340 translate read null
2024-09-26 Realistic Evaluation of Model Merging for Compositional Generalization Derek Tam et.al. 2409.18314 translate read link
2024-09-26 Harnessing Wavelet Transformations for Generalizable Deepfake Forgery Detection Lalith Bharadwaj Baru et.al. 2409.18301 translate read link
2024-09-26 Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study Qing Lyu et.al. 2409.18282 translate read null
2024-09-26 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Wenliang Zhao et.al. 2409.18128 translate read link
2024-09-26 Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Jing He et.al. 2409.18124 translate read link
2024-09-26 DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models Helin Cao et.al. 2409.18092 translate read null
2024-09-26 Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion Hengrui Gu et.al. 2409.17928 translate read null
2024-09-26 Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation Qihan Huang et.al. 2409.17920 translate read link
2024-09-26 WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians Dmytro Kotovenko et.al. 2409.17917 translate read null
2024-09-26 Text Image Generation for Low-Resource Languages with Dual Translation Learning Chihiro Noguchi et.al. 2409.17747 translate read null
2024-09-26 AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status Jinghao Zhang et.al. 2409.17740 translate read null
2024-09-26 ID $^3$ : Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition Shen Li et.al. 2409.17576 translate read null
2024-09-26 Pixel-Space Post-Training of Latent Diffusion Models Christina Zhang et.al. 2409.17565 translate read null
2024-09-25 GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design Phillip Mueller et.al. 2409.17045 translate read null
2024-09-25 Enhanced Wavelet Scattering Network for image inpainting detection Barglazan Adrian-Alin et.al. 2409.17023 translate read null
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 translate read link
2024-09-25 Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation Yulin Wang et.al. 2409.16818 translate read link
2024-09-25 Pose-Guided Fine-Grained Sign Language Video Generation Tongkai Shi et.al. 2409.16709 translate read null
2024-09-25 Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation Youngwan Jin et.al. 2409.16706 translate read link
2024-09-25 Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement Yihao Zhou et.al. 2409.16661 translate read null
2024-09-25 ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis Matthew A. Reyna et.al. 2409.16612 translate read null
2024-09-25 Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models Deepak Sridhar et.al. 2409.16535 translate read link
2024-09-24 MonoFormer: One Transformer for Both Diffusion and Autoregression Chuyang Zhao et.al. 2409.16280 translate read link
2024-09-24 Label-Augmented Dataset Distillation Seoungyoon Kang et.al. 2409.16239 translate read null
2024-09-24 MaskBit: Embedding-free Image Generation via Bit Tokens Mark Weber et.al. 2409.16211 translate read link
2024-09-24 Machine learning approaches for automatic defect detection in photovoltaic systems Swayam Rajat Mohanty et.al. 2409.16069 translate read null
2024-09-24 Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients Wanchen Zhao et.al. 2409.16042 translate read null
2024-09-24 Deep chroma compression of tone-mapped images Xenios Milidonis et.al. 2409.16032 translate read link
2024-09-24 Improvements to SDXL in NovelAI Diffusion V3 Juan Ossa et.al. 2409.15997 translate read null
2024-09-24 StyleSinger 2: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control Yu Zhang et.al. 2409.15977 translate read link
2024-09-24 Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI Liang Zhang et.al. 2409.15631 translate read null
2024-09-23 Critic Loss for Image Classification Brendan Hogan Rappazzo et.al. 2409.15565 translate read null
2024-09-18 Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance Jaehoon Joo et.al. 2409.12099 translate read null
2024-09-18 ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation Peiyu Li et.al. 2409.12010 translate read link
2024-09-18 Tracking Any Point with Frame-Event Fusion Network at High Frame Rate Jiaxiong Liu et.al. 2409.11953 translate read null
2024-09-18 Agglomerative Token Clustering Joakim Bruslund Haurum et.al. 2409.11923 translate read link
2024-09-18 Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation Dimitrios Christodoulou et.al. 2409.11904 translate read null
2024-09-18 RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets Jikai Ye et.al. 2409.11831 translate read null
2024-09-18 Latent fingerprint enhancement for accurate minutiae detection Abdul Wahab et.al. 2409.11802 translate read null
2024-09-18 METEOR: Melody-aware Texture-controllable Symbolic Orchestral Music Generation Dinh-Viet-Toan Le et.al. 2409.11753 translate read link
2024-09-18 GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation Shuowen Liang et.al. 2409.11689 translate read link
2024-09-17 Using Physics Informed Generative Adversarial Networks to Model 3D porous media Zihan Ren et.al. 2409.11541 translate read null
2024-09-17 Training Datasets Generation for Machine Learning: Application to Vision Based Navigation Jérémy Lebreton et.al. 2409.11383 translate read null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 translate read link
2024-09-17 OmniGen: Unified Image Generation Shitao Xiao et.al. 2409.11340 translate read link
2024-09-17 Improving the Efficiency of Visually Augmented Language Models Paula Ontalvilla et.al. 2409.11148 translate read null
2024-09-17 MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance Debin Meng et.al. 2409.11010 translate read link
2024-09-16 A Missing Data Imputation GAN for Character Sprite Generation Flávio Coutinho et.al. 2409.10721 translate read link
2024-09-16 Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models Bingchen Liu et.al. 2409.10695 translate read null
2024-09-16 Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation Noah Buchanan et.al. 2409.10494 translate read null
2024-09-16 SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing Qi Qian et.al. 2409.10476 translate read null
2024-09-16 Mamba-ST: State Space Model for Efficient Style Transfer Filippo Botti et.al. 2409.10385 translate read null
2024-09-16 Robust image representations with counterfactual contrastive learning Mélanie Roschewitz et.al. 2409.10365 translate read link
2024-09-16 VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation Aaron Mark Thomas et.al. 2409.10339 translate read null
2024-09-16 On Synthetic Texture Datasets: Challenges, Creation, and Curation Blaine Hoak et.al. 2409.10297 translate read null
2024-09-16 MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior Weijing Tao et.al. 2409.10090 translate read null
2024-09-16 Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models Alexander Koch et.al. 2409.10089 translate read null
2024-09-16 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction Atsuya Nakata et.al. 2409.09969 translate read link
2024-09-15 GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion Vitor Guizilini et.al. 2409.09896 translate read null
2024-09-13 InstantDrag: Improving Interactivity in Drag-based Image Editing Joonghyuk Shin et.al. 2409.08857 translate read null
2024-09-13 GroundingBooth: Grounding Text-to-Image Customization Zhexiao Xiong et.al. 2409.08520 translate read null
2024-09-13 Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning Dixi Yao et.al. 2409.08503 translate read null
2024-09-13 Cross-conditioned Diffusion Model for Medical Image to Image Translation Zhaohu Xing et.al. 2409.08500 translate read null
2024-09-12 Learned Compression for Images and Point Clouds Mateen Ulhaq et.al. 2409.08376 translate read link
2024-09-12 Impact of Stain Variation and Color Normalization for Prognostic Predictions in Pathology Siyu et.al. 2409.08338 translate read null
2024-09-12 Click2Mask: Local Editing with Dynamic Mask Generation Omer Regev et.al. 2409.08272 translate read link
2024-09-12 Improving Virtual Try-On with Garment-focused Diffusion Models Siqi Wan et.al. 2409.08258 translate read link
2024-09-12 TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder NaHyeon Park et.al. 2409.08248 translate read link
2024-09-12 IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation Yinwei Wu et.al. 2409.08240 translate read null
2024-09-12 High-Frequency Anti-DreamBooth: Robust Defense Against Image Synthesis Takuto Onikubo et.al. 2409.08167 translate read null
2024-09-12 EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance Zicheng Duan et.al. 2409.08091 translate read null
2024-09-12 Scribble-Guided Diffusion for Training-free Text-to-Image Generation Seonho Lee et.al. 2409.08026 translate read link
2024-09-12 FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection Xinying Lu et.al. 2409.07839 translate read null
2024-09-11 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Haibo Yang et.al. 2409.07452 translate read link
2024-09-11 FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process Yang Luo et.al. 2409.07451 translate read null
2024-09-11 Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy Somayeh Pakdelmoez et.al. 2409.07422 translate read null
2024-09-11 Some effects of limited wall-sensor availability on flow estimation with 3D-GANs Antonio Cuéllar et.al. 2409.07348 translate read null
2024-09-11 CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals Weixiang Gao et.al. 2409.07271 translate read link
2024-09-11 Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education Ali Forootani et.al. 2409.07110 translate read null
2024-09-11 Fidelity-optimized quantum surface code via GAN decoder and application to quantum teleportation Jiaxin Li et.al. 2409.06984 translate read null
2024-09-10 DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images Taslim Murad et.al. 2409.06694 translate read null
2024-09-10 Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements Antonio Cuéllar et.al. 2409.06548 translate read null
2024-09-10 PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation Ginger Delmas et.al. 2409.06535 translate read null
2024-09-10 DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement Jia-Wei Liao et.al. 2409.06355 translate read link
2024-09-10 Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time Yue Li et.al. 2409.06274 translate read null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 translate read link
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 translate read null
2024-09-09 Statistical Mechanics of Min-Max Problems Yuma Ichikawa et.al. 2409.06053 translate read null
2024-09-09 SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values Chengwei Sun et.al. 2409.05926 translate read null
2024-09-09 Quantum Wasserstein Compilation: Unitary Compilation using the Quantum Earth Mover’s Distance Marvin Richter et.al. 2409.05849 translate read null
2024-09-09 CipherDM: Secure Three-Party Inference for Diffusion Model Sampling Xin Zhao et.al. 2409.05414 translate read null
2024-09-09 Sequential Posterior Sampling with Diffusion Models Tristan S. W. Stevens et.al. 2409.05399 translate read null
2024-09-09 Decoupling Contact for Fine-Grained Motion Style Transfer Xiangjun Tang et.al. 2409.05387 translate read null
2024-09-09 TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors Yichuan Mo et.al. 2409.05294 translate read null
2024-09-09 Disentangled Representations for Short-Term and Long-Term Person Re-Identification Chanho Eom et.al. 2409.05277 translate read null
2024-09-09 MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference Jiancheng Huang et.al. 2409.05250 translate read null
2024-09-08 Can OOD Object Detectors Learn from Foundation Models? Jiahui Liu et.al. 2409.05162 translate read link
2024-09-08 Physics-augmented Deep Learning with Adversarial Domain Adaptation: Applications to STM Image Denoising Jianxin Xie et.al. 2409.05118 translate read null
2024-09-07 Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation Jiaxin Cheng et.al. 2409.04847 translate read link
2024-09-06 VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Yecheng Wu et.al. 2409.04429 translate read link
2024-09-06 Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Zhuoyan Luo et.al. 2409.04410 translate read link
2024-09-06 How Fair is Your Diffusion Recommender Model? Daniele Malitesta et.al. 2409.04339 translate read null
2024-09-06 Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks Hangcheng Cao et.al. 2409.04133 translate read null
2024-09-06 Bi-modality Images Transfer with a Discrete Process Matching Method Zhe Xiong et.al. 2409.03977 translate read null
2024-09-05 Generating High Dimensional User-Specific Wireless Channels using Diffusion Models Taekyun Lee et.al. 2409.03924 translate read null
2024-09-05 ArtiFade: Learning to Generate High-quality Subject from Blemished Images Shuya Yang et.al. 2409.03745 translate read null
2024-09-05 Unsupervised Anomaly Detection and Localization with Generative Adversarial Networks Khouloud Abdelli et.al. 2409.03657 translate read null
2024-09-05 RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images Benzhi Wang et.al. 2409.03644 translate read null
2024-09-05 VFLGAN-TS: Vertical Federated Learning-based Generative Adversarial Networks for Publication of Vertically Partitioned Time-Series Data Xun Yuan et.al. 2409.03612 translate read null
2024-09-05 TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces Bernardo Biesseck et.al. 2409.03600 translate read link
2024-09-05 Blended Latent Diffusion under Attention Control for Real-World Video Editing Deyin Liu et.al. 2409.03514 translate read null
2024-09-05 Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks Akshay Jain et.al. 2409.03458 translate read link
2024-09-05 Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities Wei Lu et.al. 2409.03444 translate read link
2024-09-05 RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning Lawrence Yunliang Chen et.al. 2409.03403 translate read null
2024-09-05 Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images Shaohua You et.al. 2409.03265 translate read null
2024-09-04 HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts Xinyu Liu et.al. 2409.02919 translate read link
2024-09-04 Independence Constrained Disentangled Representation Learning from Epistemological Perspective Ruoyu Wang et.al. 2409.02672 translate read null
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 translate read null
2024-09-04 StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models Wen Li et.al. 2409.02543 translate read link
2024-09-04 A Learnable Color Correction Matrix for RAW Reconstruction Anqi Liu et.al. 2409.02497 translate read null
2024-09-04 Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis Aishwarya Agarwal et.al. 2409.02429 translate read null
2024-09-04 Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing Siyi Chen et.al. 2409.02374 translate read link
2024-09-03 QID $^2$ : An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data Zijian Chen et.al. 2409.02309 translate read null
2024-09-03 FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Takuhiro Kaneko et.al. 2409.02245 translate read null
2024-09-03 LSTM-QGAN: Scalable NISQ Generative Adversarial Network Cheng Chu et.al. 2409.02212 translate read null
2024-09-02 Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis Theodoros Kouzelis et.al. 2408.16845 translate read null

(<a href=../Image_Generation.md>back to Image Generation</a>)