Image Generation - 2025-07 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-07-23	Flow Matching Meets Biology and Life Science: A Survey	Zihao Li et.al.	2507.17731	translate	read	null
2025-07-23	Generalized Dual Discriminator GANs	Penukonda Naga Chandana et.al.	2507.17684	translate	read	null
2025-07-23	Attention (as Discrete-Time Markov) Chains	Yotam Erel et.al.	2507.17657	translate	read	null
2025-07-23	Dual-branch Prompting for Multimodal Machine Translation	Jie Wang et.al.	2507.17588	translate	read	null
2025-07-23	An h-space Based Adversarial Attack for Protection Against Few-shot Personalization	Xide Xu et.al.	2507.17554	translate	read	link
2025-07-23	Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease	Hugues Roy et.al.	2507.17486	translate	read	null
2025-07-23	EndoGen: Conditional Autoregressive Endoscopic Video Generation	Xinyu Liu et.al.	2507.17388	translate	read	null
2025-07-23	PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image	Hyeongjin Nam et.al.	2507.17332	translate	read	null
2025-07-23	PolarAnything: Diffusion-based Polarimetric Image Synthesis	Kailong Zhang et.al.	2507.17268	translate	read	null
2025-07-22	Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models	Gaston Gustavo Rios et.al.	2507.17008	translate	read	null
2025-07-22	HarmonPaint: Harmonized Training-Free Diffusion Inpainting	Ying Li et.al.	2507.16732	translate	read	null
2025-07-22	Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis	Xiaojiao Xiao et.al.	2507.16579	translate	read	null
2025-07-22	Learning Text Styles: A Study on Transfer, Attribution, and Verification	Zhiqiang Hu et.al.	2507.16530	translate	read	null
2025-07-22	DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling	Boheng Li et.al.	2507.16329	translate	read	null
2025-07-22	Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning	Boheng Li et.al.	2507.16302	translate	read	null
2025-07-22	Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective	Seunghyeon Kim et.al.	2507.16254	translate	read	null
2025-07-22	Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling	Chao Zhou et.al.	2507.16240	translate	read	null
2025-07-22	A Human-Centered Approach to Identifying Promises, Risks, & Challenges of Text-to-Image Generative AI in Radiology	Katelyn Morrison et.al.	2507.16207	translate	read	null
2025-07-22	LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation	Jyun-Ze Tang et.al.	2507.16154	translate	read	null
2025-07-21	Improving Personalized Image Generation through Social Context Feedback	Parul Gupta et.al.	2507.16095	translate	read	null
2025-07-21	Diffusion models for multivariate subsurface generation and efficient probabilistic inversion	Roberto Miele et.al.	2507.15809	translate	read	null
2025-07-21	Toward an event-level analysis of hadron structure using differential programming	Kevin Braga et.al.	2507.15768	translate	read	null
2025-07-21	A Practical Investigation of Spatially-Controlled Image Generation with Transformers	Guoxuan Xia et.al.	2507.15724	translate	read	null
2025-07-21	SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models	Giordano d’Aloisio et.al.	2507.15663	translate	read	null
2025-07-21	CylinderPlane: Nested Cylinder Representation for 3D-aware Image Generation	Ru Jia et.al.	2507.15606	translate	read	null
2025-07-21	Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification	Vitaly Protasov et.al.	2507.15557	translate	read	null
2025-07-21	Improving Joint Embedding Predictive Architecture with Diffusion Noise	Yuping Qiu et.al.	2507.15216	translate	read	null
2025-07-20	Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR	Peirong Zhang et.al.	2507.15085	translate	read	null
2025-07-20	Deep Generative Models in Condition and Structural Health Monitoring: Opportunities, Limitations and Future Outlook	Xin Yang et.al.	2507.15026	translate	read	null
2025-07-20	Paired Image Generation with Diffusion-Guided Diffusion Models	Haoxuan Zhang et.al.	2507.14833	translate	read	null
2025-07-18	MoDyGAN: Combining Molecular Dynamics With GANs to Investigate Protein Conformational Space	Jingbo Liang et.al.	2507.13950	translate	read	null
2025-07-18	Converting T1-weighted MRI from 3T to 7T quality using deep learning	Malo Gicquel et.al.	2507.13782	translate	read	null
2025-07-18	Tackling fake images in cybersecurity – Interpretation of a StyleGAN and lifting its black-box	Julia Laubmann et.al.	2507.13722	translate	read	null
2025-07-18	PoemTale Diffusion: Minimising Information Loss in Poem to Image Generation with Multi-Stage Prompt Refinement	Sofia Jamil et.al.	2507.13708	translate	read	null
2025-07-18	TexGS-VolVis: Expressive Scene Editing for Volume Visualization via Textured Gaussian Splatting	Kaiyuan Tang et.al.	2507.13586	translate	read	null
2025-07-17	FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization	Chuancheng Shi et.al.	2507.13311	translate	read	null
2025-07-17	Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection	Hongyang Zhao et.al.	2507.13221	translate	read	null
2025-07-17	SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks	Kutub Uddin et.al.	2507.13170	translate	read	null
2025-07-17	Multi-population GAN Training: Analyzing Co-Evolutionary Algorithms	Walter P. Casas et.al.	2507.13157	translate	read	null
2025-07-17	fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting	Alicia Durrer et.al.	2507.13146	translate	read	null
2025-07-17	Adversarial attacks to image classification systems using evolutionary algorithms	Sergio Nesmachnow et.al.	2507.13136	translate	read	null
2025-07-17	Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation	Yi Xin et.al.	2507.13032	translate	read	link
2025-07-17	A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints	Youssef Tawfilis et.al.	2507.12979	translate	read	null
2025-07-17	DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization	Dongyeun Lee et.al.	2507.12933	translate	read	null
2025-07-17	Local Representative Token Guided Merging for Text-to-Image Generation	Min-Jeong Lee et.al.	2507.12771	translate	read	null
2025-07-16	Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models	Samuel Lavoie et.al.	2507.12318	translate	read	null
2025-07-16	FADE: Adversarial Concept Erasure in Flow Models	Zixuan Fu et.al.	2507.12283	translate	read	null
2025-07-16	DeepShade: Enable Shade Simulation by Text-conditioned Image Generation	Longchao Da et.al.	2507.12103	translate	read	null
2025-07-16	FloGAN: Scenario-Based Urban Mobility Flow Generation via Conditional GANs and Dynamic Region Decoupling	Seanglidet Yean et.al.	2507.12053	translate	read	null
2025-07-16	ID-EA: Identity-driven Text Enhancement and Adaptation with Textual Inversion for Personalized Text-to-Image Generation	Hyun-Jun Jin et.al.	2507.11990	translate	read	null
2025-07-16	RaDL: Relation-aware Disentangled Learning for Multi-Instance Text-to-Image Generation	Geon Park et.al.	2507.11947	translate	read	null
2025-07-16	Schrödinger Bridge Consistency Trajectory Models for Speech Enhancement	Shuichiro Nishigori et.al.	2507.11925	translate	read	null
2025-07-16	A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction	Wei Huang et.al.	2507.11812	translate	read	null
2025-07-15	Deep Generative Methods and Tire Architecture Design	Fouad Oubari et.al.	2507.11639	translate	read	null
2025-07-15	CharaConsist: Fine-Grained Consistent Character Generation	Mengyu Wang et.al.	2507.11533	translate	read	link
2025-07-15	CATVis: Context-Aware Thought Visualization	Tariq Mehmood et.al.	2507.11522	translate	read	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	translate	read	link
2025-07-15	MFGDiffusion: Mask-Guided Smoke Synthesis for Enhanced Forest Fire Detection	Guanghao Wu et.al.	2507.11252	translate	read	null
2025-07-15	Latent Space Consistency for Sparse-View CT Reconstruction	Duoyou Chen et.al.	2507.11152	translate	read	null
2025-07-15	Learning from Imperfect Data: Robust Inference of Dynamic Systems using Simulation-based Generative Model	Hyunwoo Cho et.al.	2507.10884	translate	read	null
2025-07-14	Spatial Reasoners for Continuous Variables in Any Domain	Bart Pogodzinski et.al.	2507.10768	translate	read	null
2025-07-15	Text Embedding Knows How to Quantize Text-Guided Diffusion Models	Hongjae Lee et.al.	2507.10340	translate	read	null
2025-07-14	Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks	Ben Hamscher et.al.	2507.10239	translate	read	null
2025-07-14	From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation	Jeongho Kim et.al.	2507.10217	translate	read	null
2025-07-14	Latent Diffusion Models with Masked AutoEncoders	Junho Lee et.al.	2507.09984	translate	read	link
2025-07-14	Counterfactual Visual Explanation via Causally-Guided Adversarial Steering	Yiran Qiao et.al.	2507.09881	translate	read	null
2025-07-13	AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)	Abdul Manaf et.al.	2507.09759	translate	read	null
2025-07-13	Hybrid Quantum-Classical Generative Adversarial Networks with Transfer Learning	Asma Al-Othni et.al.	2507.09706	translate	read	null
2025-07-13	Brain Stroke Detection and Classification Using CT Imaging with Transformer Models and Explainable AI	Shomukh Qari et.al.	2507.09630	translate	read	null
2025-07-13	Demystifying Flux Architecture	Or Greenberg et.al.	2507.09595	translate	read	null
2025-07-13	MENTOR: Efficient Multimodal-Conditioned Tuning for Autoregressive Vision Generation Models	Haozhe Zhao et.al.	2507.09574	translate	read	link
2025-07-11	Image Translation with Kernel Prediction Networks for Semantic Segmentation	Cristina Mata et.al.	2507.08554	translate	read	null
2025-07-11	Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation	Liu He et.al.	2507.08513	translate	read	null
2025-07-11	Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation	Anlin Zheng et.al.	2507.08441	translate	read	null
2025-07-11	RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting	Ji Hyun Seo et.al.	2507.08434	translate	read	null
2025-07-11	Subject-Consistent and Pose-Diverse Text-to-Image Generation	Zhanxin Gao et.al.	2507.08396	translate	read	link
2025-07-11	From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning	Sen Wang et.al.	2507.08380	translate	read	null
2025-07-11	Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training	Aleksei Ilin et.al.	2507.08284	translate	read	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	translate	read	null
2025-07-10	Assessing the Alignment of Audio Representations with Timbre Similarity Ratings	Haokun Tian et.al.	2507.07764	translate	read	null
2025-07-10	Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions	Chang-Hwan Son et.al.	2507.07464	translate	read	null
2025-07-10	Behave Your Motion: Habit-preserved Cross-category Animal Motion Transfer	Zhimin Zhang et.al.	2507.07394	translate	read	null
2025-07-10	Digital Salon: An AI and Physics-Driven Tool for 3D Hair Grooming and Simulation	Chengan He et.al.	2507.07387	translate	read	null
2025-07-09	Scalable and Realistic Virtual Try-on Application for Foundation Makeup with Kubelka-Munk Theory	Hui Pang et.al.	2507.07333	translate	read	null
2025-07-09	Scale leads to compositional generalization	Florian Redhardt et.al.	2507.07207	translate	read	null
2025-07-09	Interpretable EEG-to-Image Generation with Semantic Prompts	Arshak Rezvani et.al.	2507.07157	translate	read	null
2025-07-09	Evaluating Attribute Confusion in Fashion Text-to-Image Generation	Ziyue Liu et.al.	2507.07079	translate	read	null
2025-07-10	Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting	Fei Teng et.al.	2507.06971	translate	read	null
2025-07-09	Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution	Yonghyun Park et.al.	2507.06547	translate	read	null
2025-07-10	Concept Unlearning by Modeling Key Steps of Diffusion Process	Chaoshuo Zhang et.al.	2507.06526	translate	read	null
2025-07-08	FedPhD: Federated Pruning with Hierarchical Learning of Diffusion Models	Qianyu Long et.al.	2507.06449	translate	read	null
2025-07-08	Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques	Yassin Hussein Rassul et.al.	2507.06275	translate	read	null
2025-07-08	NeoBabel: A Multilingual Open Tower for Visual Generation	Mohammad Mahdi Derakhshani et.al.	2507.06137	translate	read	link
2025-07-08	CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations	Xiaohu Li et.al.	2507.06043	translate	read	null
2025-07-08	TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision	Syeda Anshrah Gillani et.al.	2507.06033	translate	read	null
2025-07-08	Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval	Haiwen Li et.al.	2507.05970	translate	read	null
2025-07-08	DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation	Young Hun Kim et.al.	2507.05627	translate	read	null
2025-07-08	AdaptaGen: Domain-Specific Image Generation through Hierarchical Semantic Optimization Framework	Suoxiang Zhang et.al.	2507.05621	translate	read	null
2025-07-08	Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration	Yuyang Hu et.al.	2507.05604	translate	read	null
2025-07-08	Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization	Yuhang Li et.al.	2507.05583	translate	read	null
2025-07-08	SingLoRA: Low Rank Adaptation Using a Single Matrix	David Bensaïd et.al.	2507.05566	translate	read	link
2025-07-07	LoomNet: Enhancing Multi-View Image Generation via Latent Space Weaving	Giulio Federico et.al.	2507.05499	translate	read	null
2025-07-07	SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model	Chun Xie et.al.	2507.05148	translate	read	link
2025-07-07	VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems	Aadi Srivastava et.al.	2507.05146	translate	read	link
2025-07-07	ICAS: Detecting Training Data from Autoregressive Image Generative Models	Hongyao Yu et.al.	2507.05068	translate	read	null
2025-07-07	AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics	Jan Carreras Boada et.al.	2507.05063	translate	read	link
2025-07-07	Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning	Ricardo Cardoso et.al.	2507.05029	translate	read	null
2025-07-07	DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer	Yecheng Wu et.al.	2507.04947	translate	read	null
2025-07-07	Taming the Tri-Space Tension: ARC-Guided Hallucination Modeling and Control for Text-to-Image Generation	Jianjiang Yang et.al.	2507.04946	translate	read	null
2025-07-07	Leveraging Self-Supervised Features for Efficient Flooded Region Identification in UAV Aerial Images	Dibyabha Deb et.al.	2507.04915	translate	read	null
2025-07-07	When do World Models Successfully Learn Dynamical Systems?	Edmund Ross et.al.	2507.04898	translate	read	null
2025-07-07	Efficacy of Image Similarity as a Metric for Augmenting Small Dataset Retinal Image Segmentation	Thomas Wallace et.al.	2507.04862	translate	read	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857	translate	read	link
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792	translate	read	null
2025-07-03	FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models	Yuxuan Wang et.al.	2507.02714	translate	read	null
2025-07-03	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713	translate	read	null
2025-07-03	AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models	Chenhao Xue et.al.	2507.02598	translate	read	null
2025-07-03	Holistic Tokenizer for Autoregressive Image Generation	Anlin Zheng et.al.	2507.02358	translate	read	link
2025-07-03	Transformer-based EEG Decoding: A Survey	Haodong Zhang et.al.	2507.02320	translate	read	null
2025-07-02	Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation	Zhuoyang Zhang et.al.	2507.01957	translate	read	null
2025-07-02	How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks	Rahul Ramachandran et.al.	2507.01955	translate	read	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908	translate	read	null
2025-07-02	Improving GANs by leveraging the quantum noise from real hardware	Hongni Jin et.al.	2507.01886	translate	read	null
2025-07-02	FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization	Peng Zheng et.al.	2507.01792	translate	read	null
2025-07-02	Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis	Peng Zheng et.al.	2507.01756	translate	read	null
2025-07-02	Autoregressive Image Generation with Linear Complexity: A Spatial-Aware Decay Perspective	Yuxin Mao et.al.	2507.01652	translate	read	null
2025-07-02	Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think	Ge Wu et.al.	2507.01467	translate	read	null
2025-07-02	DiffMark: Diffusion-based Robust Watermark Against Deepfakes	Chen Sun et.al.	2507.01428	translate	read	null
2025-07-02	BronchoGAN: Anatomically consistent and domain-agnostic image-to-image translation for video bronchoscopy	Ahmad Soliman et.al.	2507.01387	translate	read	null