Image Generation - 2025-04 | Paper Arxiv Daily

Image Generation - 2025-04

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-04-30	3D Stylization via Large Reconstruction Model	Ipek Oztas et.al.	2504.21836	translate	read	null
2025-04-30	Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields	Yixin Gao et.al.	2504.21814	translate	read	null
2025-04-30	Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication	Zehao Chen et.al.	2504.21577	translate	read	null
2025-04-30	Wasserstein-Aitchison GAN for angular measures of multivariate extremes	Stéphane Lhaut et.al.	2504.21438	translate	read	null
2025-04-30	Sparse-to-Sparse Training of Diffusion Models	Inês Cardoso Oliveira et.al.	2504.21380	translate	read	null
2025-04-30	Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing	Hong Zhang et.al.	2504.21356	translate	read	link
2025-04-30	Text-Conditioned Diffusion Model for High-Fidelity Korean Font Generation	Abdul Sami et.al.	2504.21325	translate	read	null
2025-04-30	AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images	Yunhao Li et.al.	2504.21308	translate	read	null
2025-04-30	Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions	ZiYi Dong et.al.	2504.21292	translate	read	null
2025-04-29	Artificial Intelligence for Personalized Prediction of Alzheimer’s Disease Progression: A Survey of Methods, Data Challenges, and Future Directions	Gulsah Hancerliogullari Koksalmis et.al.	2504.21189	translate	read	null
2025-04-29	YoChameleon: Personalized Vision and Language Generation	Thao Nguyen et.al.	2504.20998	translate	read	link
2025-04-29	Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion	Zesheng Wang et.al.	2504.20685	translate	read	null
2025-04-29	DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models	Sagi Della Torre et.al.	2504.20625	translate	read	null
2025-04-29	Digital Shielding for Cross-Domain Wi-Fi Signal Adaptation using Relativistic Average Generative Adversarial Network	Danilo Avola et.al.	2504.20568	translate	read	null
2025-04-29	Generate more than one child in your co-evolutionary semi-supervised learning GAN	Francisco Sedeño et.al.	2504.20560	translate	read	null
2025-04-29	PixelHacker: Image Inpainting with Structural and Semantic Consistency	Ziyang Xu et.al.	2504.20438	translate	read	link
2025-04-29	Inception: Jailbreak the Memory Mechanism of Text-to-Image Generation Systems	Shiqian Zhao et.al.	2504.20376	translate	read	null
2025-04-29	A Picture is Worth a Thousand Prompts? Efficacy of Iterative Human-Driven Prompt Refinement in Image Regeneration Tasks	Khoi Trinh et.al.	2504.20340	translate	read	null
2025-04-28	Physics-Informed Diffusion Models for SAR Ship Wake Generation from Text Prompts	Kamirul Kamirul et.al.	2504.20241	translate	read	null
2025-04-28	CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition	Quynh Phung et.al.	2504.19894	translate	read	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	translate	read	link
2025-04-28	RepText: Rendering Visual Text via Replicating	Haofan Wang et.al.	2504.19724	translate	read	link
2025-04-28	Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM	Leon Davies et.al.	2504.19654	translate	read	null
2025-04-28	GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM	Leon Davies et.al.	2504.19653	translate	read	null
2025-04-28	Image Generation Method Based on Heat Diffusion Models	Pengfei Zhang et.al.	2504.19600	translate	read	null
2025-04-28	WILD: a new in-the-Wild Image Linkage Dataset for synthetic image attribution	Pietro Bongini et.al.	2504.19595	translate	read	null
2025-04-28	GenPTW: In-Generation Image Watermarking for Provenance Tracing and Tamper Localization	Zhenliang Gan et.al.	2504.19567	translate	read	null
2025-04-28	Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition	Yuki Hirakawa et.al.	2504.19455	translate	read	null
2025-04-27	Flow Along the K-Amplitude for Generative Modeling	Weitao Du et.al.	2504.19353	translate	read	null
2025-04-25	Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks	Han Zhang et.al.	2504.18519	translate	read	null
2025-04-25	HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models	Jens Hooge et.al.	2504.18405	translate	read	null
2025-04-25	COCO-Inpaint: A Benchmark for Image Inpainting Detection and Manipulation Localization	Haozhen Yan et.al.	2504.18361	translate	read	null
2025-04-25	TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation	Shintaro Ozaki et.al.	2504.18269	translate	read	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	translate	read	null
2025-04-25	Diffusion-Driven Universal Model Inversion Attack for Face Recognition	Hanrui Wang et.al.	2504.18015	translate	read	null
2025-04-24	CANet: ChronoAdaptive Network for Enhanced Long-Term Time Series Forecasting under Non-Stationarity	Mert Sonmezer et.al.	2504.17913	translate	read	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	translate	read	null
2025-04-24	Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields	Zhuo He et.al.	2504.17712	translate	read	null
2025-04-24	STCL:Curriculum learning Strategies for deep learning image steganography models	Fengchun Liu et.al.	2504.17609	translate	read	null
2025-04-24	A Machine Learning Approach for Denoising and Upsampling HRTFs	Xuyi Hu et.al.	2504.17586	translate	read	null
2025-04-24	Text-to-Image Alignment in Denoising-Based Models through Step Selection	Paul Grimal et.al.	2504.17525	translate	read	null
2025-04-24	ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting	Junyan Zhang et.al.	2504.17524	translate	read	null
2025-04-24	RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation	Aviv Slobodkin et.al.	2504.17502	translate	read	null
2025-04-24	StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies	Xu Wang et.al.	2504.17401	translate	read	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	translate	read	null
2025-04-24	Physics-based super-resolved simulation of 3D elastic wave propagation adopting scalable Diffusion Transformer	Hugo Gabrielidis et.al.	2504.17308	translate	read	null
2025-04-23	High-Quality Cloud-Free Optical Image Synthesis Using Multi-Temporal SAR and Contaminated Optical Data	Chenxi Duan et.al.	2504.16870	translate	read	null
2025-04-23	A Comprehensive Survey of Synthetic Tabular Data Generation	Ruxue Shi et.al.	2504.16506	translate	read	null
2025-04-23	CLPSTNet: A Progressive Multi-Scale Convolutional Steganography Model Integrating Curriculum Learning	Fengchun Liu et.al.	2504.16364	translate	read	null
2025-04-22	Learning Energy-Based Generative Models via Potential Flow: A Variational Principle Approach to Probability Density Homotopy Matching	Junn Yong Loo et.al.	2504.16262	translate	read	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	translate	read	null
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	translate	read	link
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	translate	read	link
2025-04-22	FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation	Zebin Yao et.al.	2504.15958	translate	read	link
2025-04-22	New Recipe for Semi-supervised Community Detection: Clique Annealing under Crystallization Kinetics	Ling Cheng et.al.	2504.15927	translate	read	null
2025-04-22	DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers	Xuyang Zhong et.al.	2504.15827	translate	read	null
2025-04-22	Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	translate	read	null
2025-04-21	Application of Deep Generative Models for Anomaly Detection in Complex Financial Transactions	Tengda Tang et.al.	2504.15491	translate	read	null
2025-04-21	Emergence and Evolution of Interpretable Concepts in Diffusion Models	Berk Tinaz et.al.	2504.15473	translate	read	null
2025-04-21	StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians	Cailin Zhuang et.al.	2504.15281	translate	read	null
2025-04-22	LACE: Controlled Image Prompting and Iterative Refinement with GenAI for Professional Visual Art Creators	Yenkai Huang et.al.	2504.15189	translate	read	null
2025-04-21	Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration	Junyuan Deng et.al.	2504.15159	translate	read	null
2025-04-21	GIFDL: Generated Image Fluctuation Distortion Learning for Enhancing Steganographic Security	Xiangkun Wang et.al.	2504.15139	translate	read	null
2025-04-21	Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN	Lin Wang et.al.	2504.15099	translate	read	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	translate	read	null
2025-04-21	TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models	Mazharul Islam Rakib et.al.	2504.14933	translate	read	null
2025-04-21	Twin Co-Adaptive Dialogue for Progressive Image Generation	Jianhui Wang et.al.	2504.14868	translate	read	null
2025-04-21	LACE: Exploring Turn-Taking and Parallel Interaction Modes in Human-AI Co-Creation for Iterative Image Generation	YenKai Huang et.al.	2504.14827	translate	read	null
2025-04-21	What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale	Xiaoyong Yuan et.al.	2504.14815	translate	read	null
2025-04-18	Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion	Sandipan Dhar et.al.	2504.13791	translate	read	null
2025-04-18	MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection	Lin Yuan et.al.	2504.13726	translate	read	null
2025-04-18	SupResDiffGAN a new approach for the Super-Resolution task	Dawid Kopeć et.al.	2504.13622	translate	read	null
2025-04-18	U-Shape Mamba: State Space Model for faster diffusion	Alex Ergasti et.al.	2504.13499	translate	read	link
2025-04-18	Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing	Joowon Kim et.al.	2504.13490	translate	read	null
2025-04-18	POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation	Evans Xu Han et.al.	2504.13392	translate	read	null
2025-04-17	SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models	Mingxiao Tu et.al.	2504.13378	translate	read	null
2025-04-17	Personalized Text-to-Image Generation with Auto-Regressive Models	Kaiyue Sun et.al.	2504.13162	translate	read	link
2025-04-17	Science-T2I: Addressing Scientific Illusions in Image Synthesis	Jialuo Li et.al.	2504.13129	translate	read	null
2025-04-17	Hadamard product in deep learning: Introduction, Advances and Challenges	Grigorios G Chrysos et.al.	2504.13112	translate	read	null
2025-04-17	HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation	Wenqi Dong et.al.	2504.13072	translate	read	null
2025-04-17	ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models	Linkang Du et.al.	2504.13061	translate	read	link
2025-04-17	RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins	Yao Mu et.al.	2504.13059	translate	read	null
2025-04-17	High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion	Libo Zhang et.al.	2504.12844	translate	read	null
2025-04-17	Supporting Urban Low-Altitude Economy: Channel Gain Map Inference Based on 3D Conditional GAN	Yonghao Wang et.al.	2504.12794	translate	read	null
2025-04-17	Privacy Protection Against Personalized Text-to-Image Synthesis via Cross-image Consistency Constraints	Guanyu Wang et.al.	2504.12747	translate	read	null
2025-04-17	SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding	Qianqian Sun et.al.	2504.12704	translate	read	link
2025-04-16	Beyond Reconstruction: A Physics Based Neural Deferred Shader for Photo-realistic Rendering	Zhuo He et.al.	2504.12273	translate	read	null
2025-04-16	SIDME: Self-supervised Image Demoiréing via Masked Encoder-Decoder Reconstruction	Xia Wang et.al.	2504.12245	translate	read	null
2025-04-16	Cobra: Efficient Line Art COlorization with BRoAder References	Junhao Zhuang et.al.	2504.12240	translate	read	link
2025-04-16	Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis	Songping Wang et.al.	2504.12129	translate	read	null
2025-04-16	Instruction-augmented Multimodal Alignment for Image-Text and Element Matching	Xinli Yue et.al.	2504.12018	translate	read	null
2025-04-16	Novel-view X-ray Projection Synthesis through Geometry-Integrated Deep Learning	Daiqi Liu et.al.	2504.11953	translate	read	null
2025-04-16	Mind2Matter: Creating 3D Models from EEG Signals	Xia Deng et.al.	2504.11936	translate	read	null
2025-04-16	Synthetic Data for Blood Vessel Network Extraction	Joël Mathys et.al.	2504.11858	translate	read	null
2025-04-16	ACE: Attentional Concept Erasure in Diffusion Models	Finn Carter et.al.	2504.11850	translate	read	null
2025-04-16	HyperKING: Quantum-Classical Generative Adversarial Networks for Hyperspectral Image Restoration	Chia-Hsiang Lin et.al.	2504.11782	translate	read	null
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	translate	read	link
2025-04-15	ADT: Tuning Diffusion Models with Adversarial Supervision	Dazhong Shen et.al.	2504.11423	translate	read	null
2025-04-15	Omni $^2$ : Unifying Omnidirectional Image Generation and Editing in an Omni Model	Liu Yang et.al.	2504.11379	translate	read	link
2025-04-15	Seedream 3.0 Technical Report	Yu Gao et.al.	2504.11346	translate	read	null
2025-04-15	Using LLMs as prompt modifier to avoid biases in AI image generators	René Peinl et.al.	2504.11104	translate	read	null
2025-04-15	UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques	Pedro Diaz-Garcia et.al.	2504.11063	translate	read	null
2025-04-15	AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era	Chenyang Zhu et.al.	2504.11015	translate	read	null
2025-04-15	Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models	Karan Jain et.al.	2504.10883	translate	read	null
2025-04-15	IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism	Janna Bruner et.al.	2504.10822	translate	read	null
2025-04-15	Generative and Explainable AI for High-Dimensional Channel Estimation	Nghia Thinh Nguyen et.al.	2504.10775	translate	read	link
2025-04-14	Art3D: Training-Free 3D Generation from Flat-Colored Illustration	Xiaoyan Cong et.al.	2504.10466	translate	read	null
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	translate	read	null
2025-04-14	InstructEngine: Instruction-driven Text-to-Image Alignment	Xingyu Lu et.al.	2504.10329	translate	read	null
2025-04-14	Trade-offs in Privacy-Preserving Eye Tracking through Iris Obfuscation: A Benchmarking Study	Mengdi Wang et.al.	2504.10267	translate	read	link
2025-04-14	VibrantLeaves: A principled parametric image generator for training deep restoration models	Raphael Achddou et.al.	2504.10201	translate	read	null
2025-04-14	GeoUni: A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions	Jo-Ku Cheng et.al.	2504.10146	translate	read	link
2025-04-14	Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics	Nikolai Röhrich et.al.	2504.10021	translate	read	null
2025-04-14	Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes	Huijie Liu et.al.	2504.09948	translate	read	null
2025-04-14	EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise	Chao Liu et.al.	2504.09789	translate	read	null
2025-04-13	Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training	Lexington Whalen et.al.	2504.09606	translate	read	null
2025-04-11	GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation	Tianwei Xiong et.al.	2504.08736	translate	read	link
2025-04-11	End-to-End Demonstration of Quantum Generative Adversarial Networks for Steel Microstructure Image Augmentation on a Trapped-Ion Quantum Computer	Samwel Sekwao et.al.	2504.08728	translate	read	null
2025-04-11	Generating Fine Details of Entity Interactions	Xinyi Gu et.al.	2504.08714	translate	read	null
2025-04-11	Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Gabriele Lozupone et.al.	2504.08635	translate	read	link
2025-04-11	Discriminator-Free Direct Preference Optimization for Video Diffusion	Haoran Cheng et.al.	2504.08542	translate	read	null
2025-04-11	On the Design of Diffusion-based Neural Speech Codecs	Pietro Foti et.al.	2504.08470	translate	read	null
2025-04-11	Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion	Weiye Chen et.al.	2504.08451	translate	read	null
2025-04-11	MixDiT: Accelerating Image Diffusion Transformer Inference with Mixed-Precision MX Quantization	Daeun Kim et.al.	2504.08398	translate	read	null
2025-04-11	LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs	Jiarui Wang et.al.	2504.08358	translate	read	link
2025-04-11	Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models	Josef Bengtson et.al.	2504.08348	translate	read	null
2025-04-10	PixelFlow: Pixel-Space Generative Models with Flow	Shoufa Chen et.al.	2504.07963	translate	read	link
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	translate	read	link
2025-04-10	DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows	Mashrur M. Morshed et.al.	2504.07894	translate	read	null
2025-04-10	Towards Sustainable Creativity Support: An Exploratory Study on Prompt Based Image Generation	Daniel Hove Paludan et.al.	2504.07879	translate	read	null
2025-04-10	Conformalized Generative Bayesian Imaging: An Uncertainty Quantification Framework for Computational Imaging	Canberk Ekmekci et.al.	2504.07696	translate	read	null
2025-04-10	FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation	Linyan Huang et.al.	2504.07405	translate	read	null
2025-04-10	ID-Booth: Identity-consistent Face Generation with Diffusion Models	Darian Tomašević et.al.	2504.07392	translate	read	link
2025-04-10	Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction	Qingchao Jiang et.al.	2504.07382	translate	read	link
2025-04-09	OmniCaptioner: One Captioner to Rule Them All	Yiting Lu et.al.	2504.07089	translate	read	link
2025-04-09	A Unified Agentic Framework for Evaluating Conditional Image Generation	Jifang Wang et.al.	2504.07046	translate	read	link
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	translate	read	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	translate	read	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	translate	read	null
2025-04-09	A Meaningful Perturbation Metric for Evaluating Explainability Methods	Danielle Cohen et.al.	2504.06800	translate	read	null
2025-04-10	Compass Control: Multi Object Orientation Control for Text-to-Image Generation	Rishubh Parihar et.al.	2504.06752	translate	read	null
2025-04-09	Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception	Ruotian Peng et.al.	2504.06666	translate	read	link
2025-04-09	Collision avoidance from monocular vision trained with novel view synthesis	Valentin Tordjman–Levavasseur et.al.	2504.06651	translate	read	null
2025-04-09	PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering	Yifan Gao et.al.	2504.06632	translate	read	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	translate	read	null
2025-04-08	HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance	Jiazi Bu et.al.	2504.06232	translate	read	link
2025-04-08	A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model	Jihun Park et.al.	2504.06144	translate	read	null
2025-04-08	Explainable AI for building energy retrofitting under data scarcity	Panagiota Rempi et.al.	2504.06055	translate	read	null
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	translate	read	link
2025-04-08	CKGAN: Training Generative Adversarial Networks Using Characteristic Kernel Integral Probability Metrics	Kuntian Zhang et.al.	2504.05945	translate	read	null
2025-04-08	Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking	Junxi Chen et.al.	2504.05838	translate	read	null
2025-04-08	Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models	Jiahao Chen et.al.	2504.05815	translate	read	null
2025-04-08	Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling	Jaskirat Singh et.al.	2504.05800	translate	read	null
2025-04-07	Generative Adversarial Networks with Limited Data: A Survey and Benchmarking	Omar De Mitri et.al.	2504.05456	translate	read	null
2025-04-07	Gaussian Mixture Flow Matching Models	Hansheng Chen et.al.	2504.05304	translate	read	link
2025-04-07	Fine tuning generative adversarial networks with universal force fields: application to two-dimensional topological insulators	Alexander C. Tyner et.al.	2504.04940	translate	read	null
2025-04-07	Imagining the Far East: Exploring Perceived Biases in AI-Generated Images of East Asian Women	Xingyu Lan et.al.	2504.04865	translate	read	null
2025-04-07	AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation	Xiongbo Lu et.al.	2504.04743	translate	read	link
2025-04-07	Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal	Yicheng Leng et.al.	2504.04687	translate	read	null
2025-04-06	Your Image Generator Is Your New Private Dataset	Nicolo Resmini et.al.	2504.04582	translate	read	null
2025-04-06	Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification	Shijian Wang et.al.	2504.04510	translate	read	null
2025-04-06	Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models	Etienne Chassaing et.al.	2504.04448	translate	read	null
2025-04-06	FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency	Shiyan Liu et.al.	2504.04427	translate	read	null
2025-04-06	UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding	Yang Jiao et.al.	2504.04423	translate	read	link
2025-04-04	MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models	Wulin Xie et.al.	2504.03641	translate	read	link
2025-04-04	Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis	Xi Wang et.al.	2504.03471	translate	read	null
2025-04-04	FLAIRBrainSeg: Fine-grained brain segmentation using FLAIR MRI only	Edern Le Bot et.al.	2504.03376	translate	read	null
2025-04-04	QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning	Quanxing Xu et.al.	2504.03337	translate	read	null
2025-04-03	VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning	Xianwei Zhuang et.al.	2504.02949	translate	read	link
2025-04-03	Bias in Large Language Models Across Clinical Applications: A Systematic Review	Thanathip Suenghataiphorn et.al.	2504.02917	translate	read	null
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	translate	read	link
2025-04-03	GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation	Zhiyuan Yan et.al.	2504.02782	translate	read	link
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	translate	read	null
2025-04-03	Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation	Jiwoo Chung et.al.	2504.02612	translate	read	link
2025-04-03	AC-LoRA: Auto Component LoRA for Personalized Artistic Style Image Generation	Zhipu Cui et.al.	2504.02231	translate	read	null
2025-04-02	Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation	Pei-Chi Chen et.al.	2504.02180	translate	read	null
2025-04-02	Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs	Lewis Matheson Creed et.al.	2504.02163	translate	read	null
2025-04-02	Less-to-More Generalization: Unlocking More Controllability by In-Context Generation	Shaojin Wu et.al.	2504.02160	translate	read	link
2025-04-03	ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement	Runhui Huang et.al.	2504.01934	translate	read	link
2025-04-02	FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs	Mothilal Asokan et.al.	2504.01916	translate	read	link
2025-04-02	A $^\text{T}$ A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting	Yizhe Tang et.al.	2504.01603	translate	read	null
2025-04-02	Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology	Lirui Qi et.al.	2504.01577	translate	read	null
2025-04-02	Pro-DG: Procedural Diffusion Guidance for Architectural Facade Generation	Aleksander Plocharski et.al.	2504.01571	translate	read	null
2025-04-02	Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis	Zixuan Wang et.al.	2504.01515	translate	read	null
2025-04-02	High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model	Yiyang Shen et.al.	2504.01512	translate	read	null
2025-04-02	From Easy to Hard: Building a Shortcut for Differentially Private Image Synthesis	Kecen Li et.al.	2504.01395	translate	read	link
2025-04-01	rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator	Banafsheh Adami et.al.	2504.01220	translate	read	null
2025-04-01	Prompting Forgetting: Unlearning in GANs via Textual Guidance	Piyush Nagasubramaniam et.al.	2504.01218	translate	read	null
2025-04-01	Time-Series Forecasting via Topological Information Supervised Framework with Efficient Topological Feature Learning	ZiXin Lin et.al.	2503.23757	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)