Image Generation - 2026-03 | Paper Arxiv Daily

Image Generation - 2026-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2026-03-31	Abstraction in Style	Min Lu et.al.	2603.29924	translate	read	null
2026-03-31	ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation	Yinuo Liu et.al.	2603.29902	translate	read	null
2026-03-31	Accurate Determination of Chemical Abundances near a Supermassive Black Hole	The XRISM collaboration et.al.	2603.29748	translate	read	null
2026-03-31	MacTok: Robust Continuous Tokenization for Image Generation	Hengyu Zeng et.al.	2603.29634	translate	read	null
2026-03-31	Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis	Shuang Chen et.al.	2603.29620	translate	read	null
2026-03-31	FlowID : Enhancing Forensic Identification with Latent Flow-Matching Models	Jules Ripoll et.al.	2603.29591	translate	read	null
2026-03-31	Generating Key Postures of Bharatanatyam Adavus with Pose Estimation	Jagadish Kashinath Kamble et.al.	2603.29570	translate	read	null
2026-03-31	CIPHER: Counterfeit Image Pattern High-level Examination via Representation	Kyeonghun Kim et.al.	2603.29356	translate	read	null
2026-03-31	GazeCLIP: Gaze-Guided CLIP with Adaptive-Enhanced Fine-Grained Language Prompt for Deepfake Attribution and Detection	Yaning Zhang et.al.	2603.29295	translate	read	null
2026-03-31	Semantic Communication for 6G Networks: A Trade-off between Distortion Criticality and Information Representability	Faizan Shafi et.al.	2603.29293	translate	read	null
2026-03-30	Gen-Searcher: Reinforcing Agentic Search for Image Generation	Kaituo Feng et.al.	2603.28767	translate	read	null
2026-03-30	PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models	Lorenza Prospero et.al.	2603.28763	translate	read	null
2026-03-30	DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing	Kailai Feng et.al.	2603.28713	translate	read	null
2026-03-30	TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark	Hannes Mareen et.al.	2603.28613	translate	read	null
2026-03-30	MRI-to-CT synthesis using drifting models	Qing Lyu et.al.	2603.28498	translate	read	null
2026-03-30	EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation	Sravanth Kodavanti et.al.	2603.28405	translate	read	null
2026-03-30	Integrating Multimodal Large Language Model Knowledge into Amodal Completion	Heecheol Yun et.al.	2603.28333	translate	read	null
2026-03-30	LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization	Chutian Meng et.al.	2603.28082	translate	read	null
2026-03-30	SIMR-NO: A Spectrally-Informed Multi-Resolution Neural Operator for Turbulent Flow Super-Resolution	Muhammad Abid et.al.	2603.28073	translate	read	null
2026-03-30	AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation	Zhaohe Liao et.al.	2603.28068	translate	read	null
2026-03-30	MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation	Ruiyao Liu et.al.	2603.27959	translate	read	null
2026-03-25	Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method	Arthur Jacot et.al.	2603.24594	translate	read	null
2026-03-25	Anti-I2V: Safeguarding your photos from malicious image-to-video generation	Duc Vu et.al.	2603.24570	translate	read	null
2026-03-25	ViHOI: Human-Object Interaction Synthesis with Visual Priors	Songjin Cai et.al.	2603.24383	translate	read	null
2026-03-25	Shape-Dependent, Deep-Learning-Assisted Metamaterial Solid Immersion Lens (mSIL) Super-Resolution Imaging	Baidong Wu et.al.	2603.24371	translate	read	null
2026-03-25	ScrollScape: Unlocking 32K Image Generation With Video Diffusion Priors	Haodong Yu et.al.	2603.24270	translate	read	null
2026-03-25	InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment	Zixin Guo et.al.	2603.24240	translate	read	null
2026-03-25	RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution	Yushuai Song et.al.	2603.24198	translate	read	null
2026-03-25	LGTM: Training-Free Light-Guided Text-to-Image Diffusion Model via Initial Noise Manipulation	Ryugo Morita et.al.	2603.24086	translate	read	null
2026-03-25	When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm	Ye Leng et.al.	2603.24079	translate	read	null
2026-03-25	Human Factors in Detecting AI-Generated Portraits: Age, Sex, Device, and Confidence	Sunwhi Kim et.al.	2603.24048	translate	read	null
2026-03-25	HAM: A Training-Free Style Transfer Approach via Heterogeneous Attention Modulation for Diffusion Models	Yeqi He et.al.	2603.24043	translate	read	null
2026-03-25	Transcending Classical Neural Network Boundaries: A Quantum-Classical Synergistic Paradigm for Seismic Data Processing	Zhengyi Yuan et.al.	2603.23984	translate	read	null
2026-03-25	DepthArb: Training-Free Depth-Arbitrated Generation for Occlusion-Robust Image Synthesis	Hongjin Niu et.al.	2603.23924	translate	read	null
2026-03-25	GenMask: Adapting DiT for Segmentation via Direct Mask	Yuhuan Yang et.al.	2603.23906	translate	read	null
2026-03-24	Very sensitive vapor-cell quasi-DC atomic E-field sensor	Amy Damitz et.al.	2603.23751	translate	read	null
2026-03-24	PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning	Tao Liu et.al.	2603.23574	translate	read	null
2026-03-24	UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation	Jie Liu et.al.	2603.23500	translate	read	link
2026-03-24	InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting	Duc Vu et.al.	2603.23463	translate	read	null
2026-03-24	Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning	Konstantinos Barmpounakis et.al.	2603.23295	translate	read	null
2026-03-24	VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution	August Leander Høeg et.al.	2603.23153	translate	read	null
2026-03-24	DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models	Donya Jafari et.al.	2603.23140	translate	read	null
2026-03-24	Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards	Orhun Buğra Baran et.al.	2603.23086	translate	read	null
2026-03-24	AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing	Sarubi Thillainathan et.al.	2603.23069	translate	read	null
2026-03-24	HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling	António Cardoso et.al.	2603.23041	translate	read	null
2026-03-24	Zero-Shot Personalization of Objects via Textual Inversion	Aniket Roy et.al.	2603.23010	translate	read	null
2026-03-24	WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion	Manuel-Andreas Schneider et.al.	2603.22972	translate	read	null
2026-03-24	PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference	Qirui Wang et.al.	2603.22943	translate	read	null
2026-03-24	From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery	Bijay Shakya et.al.	2603.22768	translate	read	null
2026-03-23	Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations	Heejong Kim et.al.	2603.22627	translate	read	null
2026-03-23	PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis	Dinglun He et.al.	2603.22626	translate	read	null
2026-03-23	Latent Style-based Quantum Wasserstein GAN for Drug Design	Julien Baglio et.al.	2603.22399	translate	read	null
2026-03-23	Repurposing Geometric Foundation Models for Multi-view Diffusion	Wooseok Jang et.al.	2603.22275	translate	read	null
2026-03-23	DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution	Zhengyao Lv et.al.	2603.22271	translate	read	null
2026-03-23	SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation	Lucas H. Ueda et.al.	2603.22252	translate	read	null
2026-03-23	SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation	Sashuai Zhou et.al.	2603.22228	translate	read	null
2026-03-23	DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment	Xin Cai et.al.	2603.22125	translate	read	null
2026-03-23	DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation	Binhong Tan et.al.	2603.22041	translate	read	null
2026-03-23	Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model	SII-GAIR et.al.	2603.21986	translate	read	null
2026-03-23	MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation	Wenqing Tian et.al.	2603.21937	translate	read	null
2026-03-23	Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation	Donald Shenaj et.al.	2603.21884	translate	read	null
2026-03-23	SHARP: Spectrum-aware Highly-dynamic Adaptation for Resolution Promotion in Remote Sensing Synthesis	Bingxuan Zhao et.al.	2603.21783	translate	read	null
2026-03-23	OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging	Meilin Liu et.al.	2603.21660	translate	read	null
2026-03-23	Conditional Wasserstein GAN for Simulating Neutrino Event Summaries using Incident Energy of Electron Neutrinos	Dipthi S. et.al.	2603.21599	translate	read	null
2026-03-23	Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability	Jiahui Song et.al.	2603.21510	translate	read	null
2026-03-22	Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution	Yu-Shan Tai et.al.	2603.21348	translate	read	null
2026-03-22	Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis	Tian Xia et.al.	2603.21213	translate	read	null
2026-03-22	MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics	Pengxiang Cai et.al.	2603.21136	translate	read	null
2026-03-22	Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models	Qifan Li et.al.	2603.21085	translate	read	null
2026-03-22	LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction	Shuwei Huang et.al.	2603.21045	translate	read	null
2026-03-21	EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis	Xiefan Guo et.al.	2603.20828	translate	read	null
2026-03-21	CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration	Xiefan Guo et.al.	2603.20741	translate	read	null
2026-03-21	Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation	Zihao Wang et.al.	2603.20725	translate	read	null
2026-03-21	MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution	Ruiqing Wang et.al.	2603.20690	translate	read	null
2026-03-21	ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework	Guanzhou Chen et.al.	2603.20644	translate	read	null
2026-03-21	Interpretable Operator Learning for Inverse Problems via Adaptive Spectral Filtering: Convergence and Discretization Invariance	Hang-Cheng Dong et.al.	2603.20602	translate	read	null
2026-03-20	DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation	Zhuoling Li et.al.	2603.20470	translate	read	null
2026-03-20	Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier	Yujie Zhou et.al.	2603.20382	translate	read	null
2026-03-19	Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges	Hong-Hanh Nguyen-Le et.al.	2603.20304	translate	read	null
2026-03-20	Improving Image-to-Image Translation via a Rectified Flow Reformulation	Satoshi Iizuka et.al.	2603.20186	translate	read	null
2026-03-20	Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives	Wanqi Yuan et.al.	2603.20128	translate	read	null
2026-03-20	Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment	Shiqi Gao et.al.	2603.20086	translate	read	null
2026-03-20	X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving	Chaoda Zheng et.al.	2603.19979	translate	read	null
2026-03-20	Timestep-Aware Block Masking for Efficient Diffusion Model Inference	Haodong He et.al.	2603.19939	translate	read	null
2026-03-20	Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach	Shiqi Gao et.al.	2603.19775	translate	read	null
2026-03-20	WorldAgents: Can Foundation Image Models be Agents for 3D World Models?	Ziya Erkoç et.al.	2603.19708	translate	read	null
2026-03-20	Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits	Angshul Majumdar et.al.	2603.19687	translate	read	null
2026-03-20	Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding	Zhijian Gong et.al.	2603.19667	translate	read	null
2026-03-20	Fixed-Point Delayed Subgradient Methods for Nonsmooth Convex Optimization Problems	Ontima Pankoon et.al.	2603.19604	translate	read	null
2026-03-20	MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation	Kaixin Cai et.al.	2603.19575	translate	read	null
2026-03-19	TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis	Atharva Rege et.al.	2603.19386	translate	read	null
2026-03-19	Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation	Minyoung Kim et.al.	2603.19360	translate	read	null
2026-03-19	RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing	Yue Gong et.al.	2603.19206	translate	read	null
2026-03-19	GenMFSR: Generative Multi-Frame Image Restoration and Super-Resolution	Harshana Weligampola et.al.	2603.19187	translate	read	null
2026-03-19	ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation	Kwanyoung Lee et.al.	2603.19157	translate	read	null
2026-03-19	Unmasking Algorithmic Bias in Predictive Policing: A GAN-Based Simulation Framework with Multi-City Temporal Analysis	Pronob Kumar Barman et.al.	2603.18987	translate	read	null
2026-03-19	Sketch2Topo: Using Hand-Drawn Inputs for Diffusion-Based Topology Optimization	Shuyue Feng et.al.	2603.18960	translate	read	null
2026-03-19	Seasoning Generative Models for a Generalization Aftertaste	Hisham Husain et.al.	2603.18817	translate	read	null
2026-03-19	Enhancing the Parameterization of Reservoir Properties for Data Assimilation Using Deep VAE-GAN	M. A. Sampaio et.al.	2603.18766	translate	read	null
2026-03-19	WeNLEX: Weakly Supervised Natural Language Explanations for Multilabel Chest X-ray Classification	Isabel Rio-Torto et.al.	2603.18752	translate	read	null
2026-03-19	Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation	Ping Chen et.al.	2603.18627	translate	read	null
2026-03-19	SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation	Jialiang Kang et.al.	2603.18599	translate	read	null
2026-03-19	End-to-End QGAN-Based Image Synthesis via Neural Noise Encoding and Intensity Calibration	Xue Yang et.al.	2603.18554	translate	read	null
2026-03-19	CAFlow: Adaptive-Depth Single-Step Flow Matching for Efficient Histopathology Super-Resolution	Elad Yoshai et.al.	2603.18513	translate	read	null
2026-03-19	Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion	Yuqi Yang et.al.	2603.18466	translate	read	null
2026-03-18	Learning to See Sharper: A Physics-Informed Artificial Intelligence Framework for Super-Resolving Galaxy Spectra	Aryana Haghjoo et.al.	2603.18357	translate	read	null
2026-03-18	Epistemic Generative Adversarial Networks	Muhammad Mubashar et.al.	2603.18348	translate	read	null
2026-03-18	Unrolled Reconstruction with Integrated Super-Resolution for Accelerated 3D LGE MRI	Md Hasibul Husain Hisham et.al.	2603.18309	translate	read	null
2026-03-18	EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding	Kai Zou et.al.	2603.18001	translate	read	null
2026-03-18	LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition	Vlad-Constantin Lungu-Stan et.al.	2603.17965	translate	read	null
2026-03-18	ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation	Dmitriy Rivkin et.al.	2603.17812	translate	read	null
2026-03-18	Cache-enabled Generative Joint Source-Channel Coding for Evolving Semantic Communications	Shunpu Tang et.al.	2603.17702	translate	read	null
2026-03-18	DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis	Aleksander Ogonowski et.al.	2603.17637	translate	read	null
2026-03-18	Searching for Molecular Signatures in 14 Transiting Exoplanets with SPIRou	A. Masson et.al.	2603.17574	translate	read	null
2026-03-18	A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awarenes	Xiucheng Wang et.al.	2603.17499	translate	read	null
2026-03-18	UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models	Segyu Lee et.al.	2603.17476	translate	read	null
2026-03-18	Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare	Saikat Maiti et.al.	2603.17419	translate	read	null
2026-03-18	Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression	Xinning Chai et.al.	2603.17408	translate	read	null
2026-03-18	Harnessing the Power of Foundation Models for Accurate Material Classification	Qingran Lin et.al.	2603.17390	translate	read	null
2026-03-17	PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models	Hisayuki Yokomizo et.al.	2603.16958	translate	read	null
2026-03-17	SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation	Jiongze Yu et.al.	2603.16864	translate	read	null
2026-03-16	GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution	Qiaosi Yi et.al.	2603.16769	translate	read	null
2026-03-17	REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models	Yong Zou et.al.	2603.16576	translate	read	null
2026-03-17	CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation	Mahmoud Ibrahim et.al.	2603.16551	translate	read	null
2026-03-17	Unlearning for One-Step Generative Models via Unbalanced Optimal Transport	Hyundo Choi et.al.	2603.16489	translate	read	null
2026-03-17	Fanar 2.0: Arabic Generative AI Stack	FANAR TEAM et.al.	2603.16397	translate	read	null
2026-03-17	DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification	Stathis Galanakis et.al.	2603.16392	translate	read	null
2026-03-17	Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation	Yunpeng Qu et.al.	2603.16373	translate	read	null
2026-03-17	RASLF: Representation-Aware State Space Model for Light Field Super-Resolution	Zeqiang Wei et.al.	2603.16243	translate	read	null
2026-03-16	Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models	Amy Rafferty et.al.	2603.15525	translate	read	null
2026-03-16	RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance	Xianbao Hou et.al.	2603.15484	translate	read	null
2026-03-16	Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models	Junlong Ke et.al.	2603.15271	translate	read	null
2026-03-16	TextOVSR: Text-Guided Real-World Opera Video Super-Resolution	Hua Chang et.al.	2603.15153	translate	read	null
2026-03-16	SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation	Shufan Li et.al.	2603.15150	translate	read	null
2026-03-16	Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods	Omer Ben Hayun et.al.	2603.15026	translate	read	null
2026-03-16	CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models	Xiaojun Shan et.al.	2603.14957	translate	read	null
2026-03-16	Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework	Wenxi Wang et.al.	2603.14936	translate	read	null
2026-03-16	The Super Fine-Grained Detector for the T2K neutrino oscillation experiment	S. Abe et.al.	2603.14921	translate	read	null
2026-03-16	Seismic full-waveform inversion based on a physics-driven generative adversarial network	Xinyi Zhang et.al.	2603.14879	translate	read	null
2026-03-16	AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas	Longhui Yuan et.al.	2603.14770	translate	read	null
2026-03-16	Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments	Anacin et.al.	2603.14767	translate	read	null
2026-03-16	PHAC: Promptable Human Amodal Completion	Seung Young Noh et.al.	2603.14741	translate	read	null
2026-03-15	Comparative Analysis of 3D Convolutional and 2.5D Slice-Conditioned U-Net Architectures for MRI Super-Resolution via Elucidated Diffusion Models	Hendrik Chiche et.al.	2603.14667	translate	read	null
2026-03-15	A Decoupling-based Approach for Signature Estimation of Wideband XL MIMO-FMCW Radars	Chandrashekhar Rai et.al.	2603.14542	translate	read	null
2026-03-15	PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis	Mritula Chandrasekaran et.al.	2603.14409	translate	read	null
2026-03-15	High-Fidelity Compression of Seismic Velocity Models via SIREN Auto-Decoders	Caiyun Liu et.al.	2603.14284	translate	read	null
2026-03-15	FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection	Jie Li et.al.	2603.14220	translate	read	null
2026-03-15	DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution	Axi Niu et.al.	2603.14207	translate	read	null
2026-03-12	The Latent Color Subspace: Emergent Order in High-Dimensional Chaos	Mateusz Pach et.al.	2603.12261	translate	read	null
2026-03-12	Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation	Xiangyu Zhao et.al.	2603.12247	translate	read	null
2026-03-12	EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation	Yan Li et.al.	2603.12108	translate	read	null
2026-03-12	Single Pixel Image Classification using an Ultrafast Digital Light Projector	Aisha Kanwal et.al.	2603.12036	translate	read	null
2026-03-12	Unveiling the biconical geometry of the outflow in the ultraluminous X-ray source NGC 5204 X-1	S. Caserta et.al.	2603.11922	translate	read	null
2026-03-12	A Decade of Generative Adversarial Networks for Porous Material Reconstruction	Ali Sadeghkhani et.al.	2603.11836	translate	read	null
2026-03-12	UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution	Cao Thien Tan et.al.	2603.11680	translate	read	null
2026-03-12	Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices	Rambod Azimi et.al.	2603.11505	translate	read	null
2026-03-11	HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation	Hongji Yang et.al.	2603.10814	translate	read	null
2026-03-11	The Quadratic Geometry of Flow Matching: Semantic Granularity Alignment for Text-to-Image Synthesis	Zhinan Xiong et.al.	2603.10785	translate	read	null
2026-03-11	Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers	Wenhao Sun et.al.	2603.10744	translate	read	null
2026-03-11	HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement	Stefanos Pasios et.al.	2603.10604	translate	read	null
2026-03-11	Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution	Hongsong Wang et.al.	2603.10583	translate	read	null
2026-03-11	Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement	Xin Huang et.al.	2603.10519	translate	read	null
2026-03-11	Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks	Nasim Soltani et.al.	2603.10413	translate	read	null
2026-03-11	StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References	Boyu He et.al.	2603.10354	translate	read	null
2026-03-10	Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation	Zitong Wang et.al.	2603.10210	translate	read	null
2026-03-10	4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video	Jin Lyu et.al.	2603.10125	translate	read	null
2026-03-10	Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective	Erkan Turan et.al.	2603.09936	translate	read	null
2026-03-10	Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation	Rong Zhou et.al.	2603.09931	translate	read	null
2026-03-10	CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy	Su Yan et.al.	2603.09840	translate	read	null
2026-03-10	Prompt-Driven Color Accessibility Evaluation in Diffusion-based Image Generation Models	Xinyao Zhuang et.al.	2603.09832	translate	read	null
2026-03-10	LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control	Mingyu Kang et.al.	2603.09759	translate	read	null
2026-03-10	TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR	Fayaz Ali Dharejo et.al.	2603.09702	translate	read	null
2026-03-10	Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs	Ali Sadeghkhani et.al.	2603.09651	translate	read	null
2026-03-10	Physics-Driven 3D Gaussian Rendering for Zero-Shot MRI Super-Resolution	Shuting Liu et.al.	2603.09621	translate	read	null
2026-03-10	Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization	Ming Nie et.al.	2603.09538	translate	read	null
2026-03-10	A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration	Bunlong Lay et.al.	2603.09508	translate	read	null
2026-03-10	Streaming Autoregressive Video Generation via Diagonal Distillation	Jinxiu Liu et.al.	2603.09488	translate	read	null
2026-03-10	Component-Aware Sketch-to-Image Generation Using Self-Attention Encoding and Coordinate-Preserving Fusion	Ali Zia et.al.	2603.09484	translate	read	null
2026-03-10	ShapeMark: Robust and Diversity-Preserving Watermarking for Diffusion Models	Yuqi Qian et.al.	2603.09454	translate	read	null
2026-03-10	Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework	Engin Deniz Erkan et.al.	2603.09353	translate	read	null
2026-03-10	CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation	Shengqi Dang et.al.	2603.09286	translate	read	null
2026-03-10	Acoustic and Semantic Modeling of Emotion in Spoken Language	Soumya Dutta et.al.	2603.09212	translate	read	null
2026-03-10	Progressive Split Mamba: Effective State Space Modelling for Image Restoration	Mohammed Hassanin et.al.	2603.09171	translate	read	null
2026-03-10	POLISH’ing the Sky: Wide-Field and High-Dynamic Range Interferometric Image Reconstruction with Application to Strong Lens Discovery	Zihui Wu et.al.	2603.09162	translate	read	null
2026-03-10	RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning	Tzu-Heng Huang et.al.	2603.09160	translate	read	null
2026-03-10	Rotation Equivariant Mamba for Vision Tasks	Zhongchen Zhao et.al.	2603.09138	translate	read	null
2026-03-10	QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model	Junjie Yin et.al.	2603.09125	translate	read	null
2026-03-09	The Coupling Within: Flow Matching via Distilled Normalizing Flows	David Berthelot et.al.	2603.09014	translate	read	null
2026-03-09	CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation	Haodong Li et.al.	2603.08652	translate	read	null
2026-03-09	CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing	Yucheng Wang et.al.	2603.08589	translate	read	null
2026-03-09	Cubic maps from the group of order $3$	Vadim Alekseev et.al.	2603.08452	translate	read	null
2026-03-09	Rectified flow-based prediction of post-treatment brain MRI from pre-radiotherapy priors for patients with glioma	Selena Huisman et.al.	2603.08385	translate	read	null
2026-03-09	Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation	Daniele Molino et.al.	2603.08305	translate	read	null
2026-03-09	Prototype-Guided Concept Erasure in Diffusion Models	Yuze Cai et.al.	2603.08271	translate	read	null
2026-03-09	WaDi: Weight Direction-aware Distillation for One-step Image Synthesis	Lei Wang et.al.	2603.08258	translate	read	null
2026-03-09	FlowTouch: View-Invariant Visuo-Tactile Prediction	Seongjin Bien et.al.	2603.08255	translate	read	null
2026-03-09	Fourier Transform Infrared microspectroscopy-based super-resolution virtual staining of unlabeled tissues by pixel Diffusion Transformer	Yudong Tian et.al.	2603.08143	translate	read	null
2026-03-09	DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation	Zhenyu Hu et.al.	2603.08090	translate	read	null
2026-03-09	Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models	Xuesong Wang et.al.	2603.08069	translate	read	null
2026-03-09	Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis	Ethan Young et.al.	2603.07936	translate	read	null
2026-03-09	Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning	Yingkai Zhang et.al.	2603.07918	translate	read	null
2026-03-08	Parameterized Brushstroke Style Transfer	Uma Meleti et.al.	2603.07776	translate	read	null
2026-03-08	Compressed-Domain-Aware Online Video Super-Resolution	Yuhang Wang et.al.	2603.07694	translate	read	null
2026-03-08	GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module	Niccolò Ferrari et.al.	2603.07566	translate	read	null
2026-03-08	CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization	Anh-Duy Le et.al.	2603.07543	translate	read	null
2026-03-08	How Long Can Unified Multimodal Models Generate Images Reliably? Taming Long-Horizon Interleaved Image Generation via Context Curation	Haoyu Chen et.al.	2603.07540	translate	read	null
2026-03-08	Image Generation Models: A Technical History	Rouzbeh Shirvani et.al.	2603.07455	translate	read	null
2026-03-08	Disentangled Textual Priors for Diffusion-based Image Super-Resolution	Lei Jiang et.al.	2603.07430	translate	read	null
2026-03-08	Fluctuation imaging of disorder in monolayer semiconductors	Tom T. C. Sistermans et.al.	2603.07418	translate	read	null
2026-03-08	QdaVPR: A novel query-based domain-agnostic model for visual place recognition	Shanshan Wan et.al.	2603.07414	translate	read	null
2026-03-07	Variational Flow Maps: Make Some Noise for One-Step Conditional Generation	Abbas Mammadov et.al.	2603.07276	translate	read	null
2026-03-07	Single Image Super-Resolution via Bivariate `A Trous Wavelet Diffusion	Heidari Maryam et.al.	2603.07234	translate	read	null
2026-03-07	AdaGen: Learning Adaptive Policy for Image Synthesis	Zanlin Ni et.al.	2603.06993	translate	read	null
2026-03-06	Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks	Saadet Müzehher Eren et.al.	2603.06755	translate	read	null
2026-03-06	EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track	Zhenyuan Chen et.al.	2603.06753	translate	read	null
2026-03-06	Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention	Dongheon Lee et.al.	2603.06738	translate	read	null
2026-03-04	One step further with Monte-Carlo sampler to guide diffusion better	Minsi Ren et.al.	2603.06685	translate	read	null
2026-03-06	Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion	Lijiang Li et.al.	2603.06577	translate	read	null
2026-03-06	NEGATE: Constrained Semantic Guidance for Linguistic Negation in Text-to-Video Diffusion	Taewon Kang et.al.	2603.06533	translate	read	null
2026-03-06	Pinterest Canvas: Large-Scale Image Generation at Pinterest	Yu Wang et.al.	2603.06453	translate	read	null
2026-03-06	Toward Generative Quantum Utility via Correlation-Complexity Map	Chen-Yu Liu et.al.	2603.06440	translate	read	null
2026-03-06	The Art That Poses Back: Assessing AI Pastiches after Contemporary Artworks	Anca Dinu et.al.	2603.06324	translate	read	null
2026-03-06	3D CBCT Artefact Removal Using Perpendicular Score-Based Diffusion Models	Susanne Schaub et.al.	2603.06300	translate	read	null
2026-03-06	Spectral and Trajectory Regularization for Diffusion Transformer Super-Resolution	Jingkai Wang et.al.	2603.06275	translate	read	null
2026-03-06	Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning	Yueying Tian et.al.	2603.06173	translate	read	null
2026-03-06	Reflective Flow Sampling Enhancement	Zikai Zhou et.al.	2603.06165	translate	read	null
2026-03-06	Longitudinal NSCLC Treatment Progression via Multimodal Generative Models	Massimiliano Mantegna et.al.	2603.06147	translate	read	null
2026-03-06	FontUse: A Data-Centric Approach to Style- and Use-Case-Conditioned In-Image Typography	Xia Xin et.al.	2603.06038	translate	read	null
2026-03-06	StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision	Yuanhuiyi Lyu et.al.	2603.06032	translate	read	null
2026-03-06	LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution	Song Fei et.al.	2603.05947	translate	read	null
2026-03-06	StreamWise: Serving Multi-Modal Generation in Real-Time at Scale	Haoran Qiu et.al.	2603.05800	translate	read	null
2026-03-06	Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers	Ruidong Chen et.al.	2603.05769	translate	read	null
2026-03-05	Limited-Angle CT Reconstruction Using Multi-Volume Latent Consistency Model	Hinako Isogai et.al.	2603.05183	translate	read	null
2026-03-05	Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search	Zongfang Liu et.al.	2603.05105	translate	read	null
2026-03-05	CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection	Zhaonian Kuang et.al.	2603.05042	translate	read	null
2026-03-05	Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination	Hyuntae Park et.al.	2603.05040	translate	read	null
2026-03-05	A Simple Baseline for Unifying Understanding, Generation, and Editing via Vanilla Next-token Prediction	Jie Zhu et.al.	2603.04980	translate	read	null
2026-03-05	MWA tied-array processing V: Super-resolved localisation via amplitude-only maximum likelihood direction finding	Bradley W. Meyers et.al.	2603.04961	translate	read	null
2026-03-05	An Efficient Stochastic First-Order Algorithm for Nonconvex-Strongly Concave Minimax Optimization beyond Lipschitz Smoothness	Yan Gao et.al.	2603.04940	translate	read	null
2026-03-05	Stochastic inner workings of subdiffraction laser writing	Julia M. Mikhailova et.al.	2603.04853	translate	read	null
2026-03-05	DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction	Shiyu Zhang et.al.	2603.04770	translate	read	null
2026-03-05	Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset	Yang Zou et.al.	2603.04745	translate	read	null
2026-03-04	sFRC for assessing hallucinations in medical image restoration	Prabhat Kc et.al.	2603.04673	translate	read	null
2026-03-04	Mask-aware inference with State-Space Models	Ignasi Mas et.al.	2603.04568	translate	read	null
2026-03-04	Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion	Xuan Xu et.al.	2603.04565	translate	read	null
2026-03-04	Enhancing Authorship Attribution with Synthetic Paintings	Clarissa Loures et.al.	2603.04343	translate	read	null
2026-03-04	Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study	Madhura Edirisooriya et.al.	2603.04340	translate	read	null
2026-03-04	CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video	Lingen Li et.al.	2603.04291	translate	read	null
2026-03-04	LikeThis! Empowering App Users to Submit UI Improvement Suggestions Instead of Complaints	Jialiang Wei et.al.	2603.04245	translate	read	null
2026-03-04	Semi-Supervised Generative Learning via Latent Space Distribution Matching	Kwong Yu Chong et.al.	2603.04223	translate	read	null
2026-03-04	FastWave: Optimized Diffusion Model for Audio Super-Resolution	Nikita Kuznetsov et.al.	2603.04122	translate	read	null
2026-03-04	MLOps-Assisted Anomalous Reflector Metasurfaces Design Based on Red Hat OpenShift AI	Wael Elshennawy et.al.	2603.03981	translate	read	null
2026-03-04	Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction	Soochul Park et.al.	2603.03973	translate	read	null
2026-03-04	Plug-and-Play blind super-resolution of real MRI images for improved multiple sclerosis diagnosis	Matteo Cannas et.al.	2603.03876	translate	read	null
2026-03-04	Order Is Not Layout: Order-to-Space Bias in Image Generation	Yongkang Zhang et.al.	2603.03714	translate	read	null
2026-03-04	Machine Pareidolia: Protecting Facial Image with Emotional Editing	Binh M. Le et.al.	2603.03665	translate	read	null
2026-03-03	CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance	Hanyang Wang et.al.	2603.03281	translate	read	null
2026-03-03	AWDiff: An a trous wavelet diffusion model for lung ultrasound image synthesis	Maryam Heidari et.al.	2603.03125	translate	read	null
2026-03-03	Complementarity between atmospheric and super-beam neutrinos at ESSnuSB	ESSnuSB et.al.	2603.02836	translate	read	null
2026-03-03	Structure-Aware Text Recognition for Ancient Greek Critical Editions	Nicolas Angleraud et.al.	2603.02803	translate	read	null
2026-03-03	From “What” to “How”: Constrained Reasoning for Autoregressive Image Generation	Ruxue Yan et.al.	2603.02712	translate	read	null
2026-03-03	FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution	Aro Kim et.al.	2603.02692	translate	read	null
2026-03-03	DREAM: Where Visual Understanding Meets Text-to-Image Generation	Chao Li et.al.	2603.02667	translate	read	null
2026-03-03	ATD: Improved Transformer with Adaptive Token Dictionary for Image Restoration	Leheng Zhang et.al.	2603.02581	translate	read	null
2026-03-02	Ground-based Atmospheric Characterization of Super-Earth L 98-59 d at High Spectral Resolution	Connor J. Cheverall et.al.	2603.02209	translate	read	null
2026-03-02	Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance	Yiqi Lin et.al.	2603.02175	translate	read	null
2026-03-02	GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis	Srikumar Sastry et.al.	2603.02172	translate	read	null
2026-03-02	ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks	Joël Küchler et.al.	2603.02063	translate	read	null
2026-03-02	Latent attention on masked patches for flow reconstruction	Ben Eze et.al.	2603.02028	translate	read	null
2026-03-02	Tensor-network methodology for super-moiré excitons beyond one billion sites	Anouar Moustaj et.al.	2603.02011	translate	read	null
2026-03-02	Plug-and-play forward backward algorithm to restore Landsat images: A preliminary step to uncover the history of surface waters	Pierre Audisio et.al.	2603.01868	translate	read	null
2026-03-02	Block-coordinate Plug-And-Play Methods with Armijo-like line-search for Image Restoration	Federica Porta et.al.	2603.01734	translate	read	null
2026-03-02	DiffusionXRay: A Diffusion and GAN-Based Approach for Enhancing Digitally Reconstructed Chest Radiographs	Aryan Goyal et.al.	2603.01686	translate	read	null
2026-03-02	SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis	Chuqiao Wu et.al.	2603.01579	translate	read	null
2026-03-02	Align-cDAE: Alzheimer’s Disease Progression Modeling with Attention-Aligned Conditional Diffusion Auto-Encoder	Ayantika Das et.al.	2603.01552	translate	read	null
2026-03-02	RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry	Xinchang Wang et.al.	2603.01544	translate	read	null
2026-03-02	Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing	Zijin Yin et.al.	2603.01535	translate	read	null
2026-03-02	Revisiting Global Token Mixing in Task-Dependent MRI Restoration: Insights from Minimal Gated CNN Baselines	Xiangjian Hou et.al.	2603.01449	translate	read	null
2026-03-02	ALMA High-J CO Spectroscopy of High-Redshift Galaxies. II. 0.03” Resolution CO Kinematics Reveal Super-Eddington Accretion in a Dust-Obscured Galaxy at z=3.111	Ken-ichi Tadaki et.al.	2603.01352	translate	read	null
2026-03-01	Teacher-Guided Causal Interventions for Image Denoising: Orthogonal Content-Noise Disentanglement in Vision Transformers	Kuai Jiang et.al.	2603.01140	translate	read	null
2026-03-01	Super-resolution of turbulent reacting flows on complex meshes using graph neural networks	Priyabrat Dash et.al.	2603.01080	translate	read	null
2026-03-01	LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model	Zebin You et.al.	2603.01068	translate	read	link
2026-03-01	Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery	Yangyang Xu et.al.	2603.01034	translate	read	null
2026-03-01	Fully-analog array signal processor using 3D aperture engineering	Sheng Gao et.al.	2603.00995	translate	read	null
2026-03-01	Spectral Super-Resolution via Adversarial Unfolding and Data-Driven Spectrum Regularization: From Multispectral Satellite Data to NASA Hyperspectral Image	Si-Sheng Young et.al.	2603.00920	translate	read	null
2026-03-01	Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards	Seungwook Kim et.al.	2603.00918	translate	read	null
2026-03-01	Solving a Nonlinear Blind Inverse Problem for Tagged MRI with Physics and Deep Generative Priors	Zhangxing Bian et.al.	2603.00882	translate	read	null
2026-03-01	Neural Discrimination-Prompted Transformers for Efficient UHD Image Restoration and Enhancement	Cong Wang et.al.	2603.00853	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)