Image Generation - 2024-09 | Paper Arxiv Daily

Image Generation - 2024-09

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-09-30	Inverse Painting: Reconstructing The Painting Process	Bowei Chen et.al.	2409.20556	translate	read	null
2024-09-30	Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images	Bahri Batuhan Bilecen et.al.	2409.20530	translate	read	null
2024-09-30	All-optical autoencoder machine learning framework using diffractive processors	Peijie Feng et.al.	2409.20346	translate	read	null
2024-09-30	Illustrious: an Open Advanced Illustration Model	Sang Hyun Park et.al.	2409.19946	translate	read	null
2024-09-30	MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation	Wenchao Chen et.al.	2409.19937	translate	read	null
2024-09-29	OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines	Daniel Silver et.al.	2409.19823	translate	read	null
2024-09-29	When Molecular GAN Meets Byte-Pair Encoding	Huidong Tang et.al.	2409.19740	translate	read	null
2024-09-29	Simple and Fast Distillation of Diffusion Models	Zhenyu Zhou et.al.	2409.19681	translate	read	link
2024-09-29	Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection	Yuhang Ma et.al.	2409.19624	translate	read	null
2024-09-27	Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis	Songrui Wang et.al.	2409.18897	translate	read	null
2024-09-27	Explainable Artifacts for Synthetic Western Blot Source Attribution	João Phillipe Cardenuto et.al.	2409.18881	translate	read	null
2024-09-27	Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks	Richard Osuala et.al.	2409.18872	translate	read	null
2024-09-27	Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models	Nguyen Gia Bach et.al.	2409.18476	translate	read	link
2024-09-27	Gradient-free Decoder Inversion in Latent Diffusion Models	Seongmin Hong et.al.	2409.18442	translate	read	null
2024-09-27	Adaptive Learning of the Latent Space of Wasserstein Generative Adversarial Networks	Yixuan Qiu et.al.	2409.18374	translate	read	null
2024-09-26	DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning	Hui Lin et.al.	2409.18340	translate	read	null
2024-09-26	Realistic Evaluation of Model Merging for Compositional Generalization	Derek Tam et.al.	2409.18314	translate	read	link
2024-09-26	Harnessing Wavelet Transformations for Generalizable Deepfake Forgery Detection	Lalith Bharadwaj Baru et.al.	2409.18301	translate	read	link
2024-09-26	Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study	Qing Lyu et.al.	2409.18282	translate	read	null
2024-09-26	FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Wenliang Zhao et.al.	2409.18128	translate	read	link
2024-09-26	Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction	Jing He et.al.	2409.18124	translate	read	link
2024-09-26	DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models	Helin Cao et.al.	2409.18092	translate	read	null
2024-09-26	Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion	Hengrui Gu et.al.	2409.17928	translate	read	null
2024-09-26	Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation	Qihan Huang et.al.	2409.17920	translate	read	link
2024-09-26	WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians	Dmytro Kotovenko et.al.	2409.17917	translate	read	null
2024-09-26	Text Image Generation for Low-Resource Languages with Dual Translation Learning	Chihiro Noguchi et.al.	2409.17747	translate	read	null
2024-09-26	AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status	Jinghao Zhang et.al.	2409.17740	translate	read	null
2024-09-26	ID $^3$ : Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition	Shen Li et.al.	2409.17576	translate	read	null
2024-09-26	Pixel-Space Post-Training of Latent Diffusion Models	Christina Zhang et.al.	2409.17565	translate	read	null
2024-09-25	GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design	Phillip Mueller et.al.	2409.17045	translate	read	null
2024-09-25	Enhanced Wavelet Scattering Network for image inpainting detection	Barglazan Adrian-Alin et.al.	2409.17023	translate	read	null
2024-09-25	WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks	Alberto Bacchin et.al.	2409.16999	translate	read	link
2024-09-25	Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation	Yulin Wang et.al.	2409.16818	translate	read	link
2024-09-25	Pose-Guided Fine-Grained Sign Language Video Generation	Tongkai Shi et.al.	2409.16709	translate	read	null
2024-09-25	Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation	Youngwan Jin et.al.	2409.16706	translate	read	link
2024-09-25	Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement	Yihao Zhou et.al.	2409.16661	translate	read	null
2024-09-25	ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis	Matthew A. Reyna et.al.	2409.16612	translate	read	null
2024-09-25	Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models	Deepak Sridhar et.al.	2409.16535	translate	read	link
2024-09-24	MonoFormer: One Transformer for Both Diffusion and Autoregression	Chuyang Zhao et.al.	2409.16280	translate	read	link
2024-09-24	Label-Augmented Dataset Distillation	Seoungyoon Kang et.al.	2409.16239	translate	read	null
2024-09-24	MaskBit: Embedding-free Image Generation via Bit Tokens	Mark Weber et.al.	2409.16211	translate	read	link
2024-09-24	Machine learning approaches for automatic defect detection in photovoltaic systems	Swayam Rajat Mohanty et.al.	2409.16069	translate	read	null
2024-09-24	Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients	Wanchen Zhao et.al.	2409.16042	translate	read	null
2024-09-24	Deep chroma compression of tone-mapped images	Xenios Milidonis et.al.	2409.16032	translate	read	link
2024-09-24	Improvements to SDXL in NovelAI Diffusion V3	Juan Ossa et.al.	2409.15997	translate	read	null
2024-09-24	StyleSinger 2: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control	Yu Zhang et.al.	2409.15977	translate	read	link
2024-09-24	Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI	Liang Zhang et.al.	2409.15631	translate	read	null
2024-09-23	Critic Loss for Image Classification	Brendan Hogan Rappazzo et.al.	2409.15565	translate	read	null
2024-09-18	Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance	Jaehoon Joo et.al.	2409.12099	translate	read	null
2024-09-18	ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation	Peiyu Li et.al.	2409.12010	translate	read	link
2024-09-18	Tracking Any Point with Frame-Event Fusion Network at High Frame Rate	Jiaxiong Liu et.al.	2409.11953	translate	read	null
2024-09-18	Agglomerative Token Clustering	Joakim Bruslund Haurum et.al.	2409.11923	translate	read	link
2024-09-18	Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation	Dimitrios Christodoulou et.al.	2409.11904	translate	read	null
2024-09-18	RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets	Jikai Ye et.al.	2409.11831	translate	read	null
2024-09-18	Latent fingerprint enhancement for accurate minutiae detection	Abdul Wahab et.al.	2409.11802	translate	read	null
2024-09-18	METEOR: Melody-aware Texture-controllable Symbolic Orchestral Music Generation	Dinh-Viet-Toan Le et.al.	2409.11753	translate	read	link
2024-09-18	GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation	Shuowen Liang et.al.	2409.11689	translate	read	link
2024-09-17	Using Physics Informed Generative Adversarial Networks to Model 3D porous media	Zihan Ren et.al.	2409.11541	translate	read	null
2024-09-17	Training Datasets Generation for Machine Learning: Application to Vision Based Navigation	Jérémy Lebreton et.al.	2409.11383	translate	read	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	translate	read	link
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340	translate	read	link
2024-09-17	Improving the Efficiency of Visually Augmented Language Models	Paula Ontalvilla et.al.	2409.11148	translate	read	null
2024-09-17	MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance	Debin Meng et.al.	2409.11010	translate	read	link
2024-09-16	A Missing Data Imputation GAN for Character Sprite Generation	Flávio Coutinho et.al.	2409.10721	translate	read	link
2024-09-16	Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models	Bingchen Liu et.al.	2409.10695	translate	read	null
2024-09-16	Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation	Noah Buchanan et.al.	2409.10494	translate	read	null
2024-09-16	SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing	Qi Qian et.al.	2409.10476	translate	read	null
2024-09-16	Mamba-ST: State Space Model for Efficient Style Transfer	Filippo Botti et.al.	2409.10385	translate	read	null
2024-09-16	Robust image representations with counterfactual contrastive learning	Mélanie Roschewitz et.al.	2409.10365	translate	read	link
2024-09-16	VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation	Aaron Mark Thomas et.al.	2409.10339	translate	read	null
2024-09-16	On Synthetic Texture Datasets: Challenges, Creation, and Curation	Blaine Hoak et.al.	2409.10297	translate	read	null
2024-09-16	MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior	Weijing Tao et.al.	2409.10090	translate	read	null
2024-09-16	Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models	Alexander Koch et.al.	2409.10089	translate	read	null
2024-09-16	2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction	Atsuya Nakata et.al.	2409.09969	translate	read	link
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	translate	read	null
2024-09-13	InstantDrag: Improving Interactivity in Drag-based Image Editing	Joonghyuk Shin et.al.	2409.08857	translate	read	null
2024-09-13	GroundingBooth: Grounding Text-to-Image Customization	Zhexiao Xiong et.al.	2409.08520	translate	read	null
2024-09-13	Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning	Dixi Yao et.al.	2409.08503	translate	read	null
2024-09-13	Cross-conditioned Diffusion Model for Medical Image to Image Translation	Zhaohu Xing et.al.	2409.08500	translate	read	null
2024-09-12	Learned Compression for Images and Point Clouds	Mateen Ulhaq et.al.	2409.08376	translate	read	link
2024-09-12	Impact of Stain Variation and Color Normalization for Prognostic Predictions in Pathology	Siyu et.al.	2409.08338	translate	read	null
2024-09-12	Click2Mask: Local Editing with Dynamic Mask Generation	Omer Regev et.al.	2409.08272	translate	read	link
2024-09-12	Improving Virtual Try-On with Garment-focused Diffusion Models	Siqi Wan et.al.	2409.08258	translate	read	link
2024-09-12	TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder	NaHyeon Park et.al.	2409.08248	translate	read	link
2024-09-12	IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation	Yinwei Wu et.al.	2409.08240	translate	read	null
2024-09-12	High-Frequency Anti-DreamBooth: Robust Defense Against Image Synthesis	Takuto Onikubo et.al.	2409.08167	translate	read	null
2024-09-12	EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance	Zicheng Duan et.al.	2409.08091	translate	read	null
2024-09-12	Scribble-Guided Diffusion for Training-free Text-to-Image Generation	Seonho Lee et.al.	2409.08026	translate	read	link
2024-09-12	FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection	Xinying Lu et.al.	2409.07839	translate	read	null
2024-09-11	Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models	Haibo Yang et.al.	2409.07452	translate	read	link
2024-09-11	FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process	Yang Luo et.al.	2409.07451	translate	read	null
2024-09-11	Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy	Somayeh Pakdelmoez et.al.	2409.07422	translate	read	null
2024-09-11	Some effects of limited wall-sensor availability on flow estimation with 3D-GANs	Antonio Cuéllar et.al.	2409.07348	translate	read	null
2024-09-11	CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals	Weixiang Gao et.al.	2409.07271	translate	read	link
2024-09-11	Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education	Ali Forootani et.al.	2409.07110	translate	read	null
2024-09-11	Fidelity-optimized quantum surface code via GAN decoder and application to quantum teleportation	Jiaxin Li et.al.	2409.06984	translate	read	null
2024-09-10	DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images	Taslim Murad et.al.	2409.06694	translate	read	null
2024-09-10	Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements	Antonio Cuéllar et.al.	2409.06548	translate	read	null
2024-09-10	PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation	Ginger Delmas et.al.	2409.06535	translate	read	null
2024-09-10	DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement	Jia-Wei Liao et.al.	2409.06355	translate	read	link
2024-09-10	Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time	Yue Li et.al.	2409.06274	translate	read	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	translate	read	link
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	translate	read	null
2024-09-09	Statistical Mechanics of Min-Max Problems	Yuma Ichikawa et.al.	2409.06053	translate	read	null
2024-09-09	SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values	Chengwei Sun et.al.	2409.05926	translate	read	null
2024-09-09	Quantum Wasserstein Compilation: Unitary Compilation using the Quantum Earth Mover’s Distance	Marvin Richter et.al.	2409.05849	translate	read	null
2024-09-09	CipherDM: Secure Three-Party Inference for Diffusion Model Sampling	Xin Zhao et.al.	2409.05414	translate	read	null
2024-09-09	Sequential Posterior Sampling with Diffusion Models	Tristan S. W. Stevens et.al.	2409.05399	translate	read	null
2024-09-09	Decoupling Contact for Fine-Grained Motion Style Transfer	Xiangjun Tang et.al.	2409.05387	translate	read	null
2024-09-09	TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors	Yichuan Mo et.al.	2409.05294	translate	read	null
2024-09-09	Disentangled Representations for Short-Term and Long-Term Person Re-Identification	Chanho Eom et.al.	2409.05277	translate	read	null
2024-09-09	MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference	Jiancheng Huang et.al.	2409.05250	translate	read	null
2024-09-08	Can OOD Object Detectors Learn from Foundation Models?	Jiahui Liu et.al.	2409.05162	translate	read	link
2024-09-08	Physics-augmented Deep Learning with Adversarial Domain Adaptation: Applications to STM Image Denoising	Jianxin Xie et.al.	2409.05118	translate	read	null
2024-09-07	Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation	Jiaxin Cheng et.al.	2409.04847	translate	read	link
2024-09-06	VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation	Yecheng Wu et.al.	2409.04429	translate	read	link
2024-09-06	Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation	Zhuoyan Luo et.al.	2409.04410	translate	read	link
2024-09-06	How Fair is Your Diffusion Recommender Model?	Daniele Malitesta et.al.	2409.04339	translate	read	null
2024-09-06	Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks	Hangcheng Cao et.al.	2409.04133	translate	read	null
2024-09-06	Bi-modality Images Transfer with a Discrete Process Matching Method	Zhe Xiong et.al.	2409.03977	translate	read	null
2024-09-05	Generating High Dimensional User-Specific Wireless Channels using Diffusion Models	Taekyun Lee et.al.	2409.03924	translate	read	null
2024-09-05	ArtiFade: Learning to Generate High-quality Subject from Blemished Images	Shuya Yang et.al.	2409.03745	translate	read	null
2024-09-05	Unsupervised Anomaly Detection and Localization with Generative Adversarial Networks	Khouloud Abdelli et.al.	2409.03657	translate	read	null
2024-09-05	RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images	Benzhi Wang et.al.	2409.03644	translate	read	null
2024-09-05	VFLGAN-TS: Vertical Federated Learning-based Generative Adversarial Networks for Publication of Vertically Partitioned Time-Series Data	Xun Yuan et.al.	2409.03612	translate	read	null
2024-09-05	TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces	Bernardo Biesseck et.al.	2409.03600	translate	read	link
2024-09-05	Blended Latent Diffusion under Attention Control for Real-World Video Editing	Deyin Liu et.al.	2409.03514	translate	read	null
2024-09-05	Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks	Akshay Jain et.al.	2409.03458	translate	read	link
2024-09-05	Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities	Wei Lu et.al.	2409.03444	translate	read	link
2024-09-05	RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning	Lawrence Yunliang Chen et.al.	2409.03403	translate	read	null
2024-09-05	Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images	Shaohua You et.al.	2409.03265	translate	read	null
2024-09-04	HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts	Xinyu Liu et.al.	2409.02919	translate	read	link
2024-09-04	Independence Constrained Disentangled Representation Learning from Epistemological Perspective	Ruoyu Wang et.al.	2409.02672	translate	read	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	translate	read	null
2024-09-04	StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models	Wen Li et.al.	2409.02543	translate	read	link
2024-09-04	A Learnable Color Correction Matrix for RAW Reconstruction	Anqi Liu et.al.	2409.02497	translate	read	null
2024-09-04	Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis	Aishwarya Agarwal et.al.	2409.02429	translate	read	null
2024-09-04	Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing	Siyi Chen et.al.	2409.02374	translate	read	link
2024-09-03	QID $^2$ : An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data	Zijian Chen et.al.	2409.02309	translate	read	null
2024-09-03	FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation	Takuhiro Kaneko et.al.	2409.02245	translate	read	null
2024-09-03	LSTM-QGAN: Scalable NISQ Generative Adversarial Network	Cheng Chu et.al.	2409.02212	translate	read	null
2024-09-02	Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis	Theodoros Kouzelis et.al.	2408.16845	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)