Image Generation - 2026-03

Publish Date Title Authors PDF Translate Read Code
2026-03-31 Abstraction in Style Min Lu et.al. 2603.29924 translate read null
2026-03-31 ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation Yinuo Liu et.al. 2603.29902 translate read null
2026-03-31 Accurate Determination of Chemical Abundances near a Supermassive Black Hole The XRISM collaboration et.al. 2603.29748 translate read null
2026-03-31 MacTok: Robust Continuous Tokenization for Image Generation Hengyu Zeng et.al. 2603.29634 translate read null
2026-03-31 Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Shuang Chen et.al. 2603.29620 translate read null
2026-03-31 FlowID : Enhancing Forensic Identification with Latent Flow-Matching Models Jules Ripoll et.al. 2603.29591 translate read null
2026-03-31 Generating Key Postures of Bharatanatyam Adavus with Pose Estimation Jagadish Kashinath Kamble et.al. 2603.29570 translate read null
2026-03-31 CIPHER: Counterfeit Image Pattern High-level Examination via Representation Kyeonghun Kim et.al. 2603.29356 translate read null
2026-03-31 GazeCLIP: Gaze-Guided CLIP with Adaptive-Enhanced Fine-Grained Language Prompt for Deepfake Attribution and Detection Yaning Zhang et.al. 2603.29295 translate read null
2026-03-31 Semantic Communication for 6G Networks: A Trade-off between Distortion Criticality and Information Representability Faizan Shafi et.al. 2603.29293 translate read null
2026-03-30 Gen-Searcher: Reinforcing Agentic Search for Image Generation Kaituo Feng et.al. 2603.28767 translate read null
2026-03-30 PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models Lorenza Prospero et.al. 2603.28763 translate read null
2026-03-30 DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Kailai Feng et.al. 2603.28713 translate read null
2026-03-30 TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark Hannes Mareen et.al. 2603.28613 translate read null
2026-03-30 MRI-to-CT synthesis using drifting models Qing Lyu et.al. 2603.28498 translate read null
2026-03-30 EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation Sravanth Kodavanti et.al. 2603.28405 translate read null
2026-03-30 Integrating Multimodal Large Language Model Knowledge into Amodal Completion Heecheol Yun et.al. 2603.28333 translate read null
2026-03-30 LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization Chutian Meng et.al. 2603.28082 translate read null
2026-03-30 SIMR-NO: A Spectrally-Informed Multi-Resolution Neural Operator for Turbulent Flow Super-Resolution Muhammad Abid et.al. 2603.28073 translate read null
2026-03-30 AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation Zhaohe Liao et.al. 2603.28068 translate read null
2026-03-30 MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation Ruiyao Liu et.al. 2603.27959 translate read null
2026-03-25 Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method Arthur Jacot et.al. 2603.24594 translate read null
2026-03-25 Anti-I2V: Safeguarding your photos from malicious image-to-video generation Duc Vu et.al. 2603.24570 translate read null
2026-03-25 ViHOI: Human-Object Interaction Synthesis with Visual Priors Songjin Cai et.al. 2603.24383 translate read null
2026-03-25 Shape-Dependent, Deep-Learning-Assisted Metamaterial Solid Immersion Lens (mSIL) Super-Resolution Imaging Baidong Wu et.al. 2603.24371 translate read null
2026-03-25 ScrollScape: Unlocking 32K Image Generation With Video Diffusion Priors Haodong Yu et.al. 2603.24270 translate read null
2026-03-25 InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment Zixin Guo et.al. 2603.24240 translate read null
2026-03-25 RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution Yushuai Song et.al. 2603.24198 translate read null
2026-03-25 LGTM: Training-Free Light-Guided Text-to-Image Diffusion Model via Initial Noise Manipulation Ryugo Morita et.al. 2603.24086 translate read null
2026-03-25 When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm Ye Leng et.al. 2603.24079 translate read null
2026-03-25 Human Factors in Detecting AI-Generated Portraits: Age, Sex, Device, and Confidence Sunwhi Kim et.al. 2603.24048 translate read null
2026-03-25 HAM: A Training-Free Style Transfer Approach via Heterogeneous Attention Modulation for Diffusion Models Yeqi He et.al. 2603.24043 translate read null
2026-03-25 Transcending Classical Neural Network Boundaries: A Quantum-Classical Synergistic Paradigm for Seismic Data Processing Zhengyi Yuan et.al. 2603.23984 translate read null
2026-03-25 DepthArb: Training-Free Depth-Arbitrated Generation for Occlusion-Robust Image Synthesis Hongjin Niu et.al. 2603.23924 translate read null
2026-03-25 GenMask: Adapting DiT for Segmentation via Direct Mask Yuhuan Yang et.al. 2603.23906 translate read null
2026-03-24 Very sensitive vapor-cell quasi-DC atomic E-field sensor Amy Damitz et.al. 2603.23751 translate read null
2026-03-24 PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning Tao Liu et.al. 2603.23574 translate read null
2026-03-24 UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation Jie Liu et.al. 2603.23500 translate read link
2026-03-24 InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting Duc Vu et.al. 2603.23463 translate read null
2026-03-24 Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning Konstantinos Barmpounakis et.al. 2603.23295 translate read null
2026-03-24 VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution August Leander Høeg et.al. 2603.23153 translate read null
2026-03-24 DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models Donya Jafari et.al. 2603.23140 translate read null
2026-03-24 Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards Orhun Buğra Baran et.al. 2603.23086 translate read null
2026-03-24 AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing Sarubi Thillainathan et.al. 2603.23069 translate read null
2026-03-24 HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling António Cardoso et.al. 2603.23041 translate read null
2026-03-24 Zero-Shot Personalization of Objects via Textual Inversion Aniket Roy et.al. 2603.23010 translate read null
2026-03-24 WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion Manuel-Andreas Schneider et.al. 2603.22972 translate read null
2026-03-24 PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference Qirui Wang et.al. 2603.22943 translate read null
2026-03-24 From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery Bijay Shakya et.al. 2603.22768 translate read null
2026-03-23 Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations Heejong Kim et.al. 2603.22627 translate read null
2026-03-23 PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis Dinglun He et.al. 2603.22626 translate read null
2026-03-23 Latent Style-based Quantum Wasserstein GAN for Drug Design Julien Baglio et.al. 2603.22399 translate read null
2026-03-23 Repurposing Geometric Foundation Models for Multi-view Diffusion Wooseok Jang et.al. 2603.22275 translate read null
2026-03-23 DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution Zhengyao Lv et.al. 2603.22271 translate read null
2026-03-23 SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation Lucas H. Ueda et.al. 2603.22252 translate read null
2026-03-23 SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation Sashuai Zhou et.al. 2603.22228 translate read null
2026-03-23 DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment Xin Cai et.al. 2603.22125 translate read null
2026-03-23 DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation Binhong Tan et.al. 2603.22041 translate read null
2026-03-23 Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model SII-GAIR et.al. 2603.21986 translate read null
2026-03-23 MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation Wenqing Tian et.al. 2603.21937 translate read null
2026-03-23 Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation Donald Shenaj et.al. 2603.21884 translate read null
2026-03-23 SHARP: Spectrum-aware Highly-dynamic Adaptation for Resolution Promotion in Remote Sensing Synthesis Bingxuan Zhao et.al. 2603.21783 translate read null
2026-03-23 OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging Meilin Liu et.al. 2603.21660 translate read null
2026-03-23 Conditional Wasserstein GAN for Simulating Neutrino Event Summaries using Incident Energy of Electron Neutrinos Dipthi S. et.al. 2603.21599 translate read null
2026-03-23 Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability Jiahui Song et.al. 2603.21510 translate read null
2026-03-22 Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution Yu-Shan Tai et.al. 2603.21348 translate read null
2026-03-22 Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis Tian Xia et.al. 2603.21213 translate read null
2026-03-22 MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics Pengxiang Cai et.al. 2603.21136 translate read null
2026-03-22 Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models Qifan Li et.al. 2603.21085 translate read null
2026-03-22 LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction Shuwei Huang et.al. 2603.21045 translate read null
2026-03-21 EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis Xiefan Guo et.al. 2603.20828 translate read null
2026-03-21 CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration Xiefan Guo et.al. 2603.20741 translate read null
2026-03-21 Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation Zihao Wang et.al. 2603.20725 translate read null
2026-03-21 MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution Ruiqing Wang et.al. 2603.20690 translate read null
2026-03-21 ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework Guanzhou Chen et.al. 2603.20644 translate read null
2026-03-21 Interpretable Operator Learning for Inverse Problems via Adaptive Spectral Filtering: Convergence and Discretization Invariance Hang-Cheng Dong et.al. 2603.20602 translate read null
2026-03-20 DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation Zhuoling Li et.al. 2603.20470 translate read null
2026-03-20 Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier Yujie Zhou et.al. 2603.20382 translate read null
2026-03-19 Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges Hong-Hanh Nguyen-Le et.al. 2603.20304 translate read null
2026-03-20 Improving Image-to-Image Translation via a Rectified Flow Reformulation Satoshi Iizuka et.al. 2603.20186 translate read null
2026-03-20 Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives Wanqi Yuan et.al. 2603.20128 translate read null
2026-03-20 Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment Shiqi Gao et.al. 2603.20086 translate read null
2026-03-20 X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving Chaoda Zheng et.al. 2603.19979 translate read null
2026-03-20 Timestep-Aware Block Masking for Efficient Diffusion Model Inference Haodong He et.al. 2603.19939 translate read null
2026-03-20 Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach Shiqi Gao et.al. 2603.19775 translate read null
2026-03-20 WorldAgents: Can Foundation Image Models be Agents for 3D World Models? Ziya Erkoç et.al. 2603.19708 translate read null
2026-03-20 Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits Angshul Majumdar et.al. 2603.19687 translate read null
2026-03-20 Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding Zhijian Gong et.al. 2603.19667 translate read null
2026-03-20 Fixed-Point Delayed Subgradient Methods for Nonsmooth Convex Optimization Problems Ontima Pankoon et.al. 2603.19604 translate read null
2026-03-20 MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation Kaixin Cai et.al. 2603.19575 translate read null
2026-03-19 TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis Atharva Rege et.al. 2603.19386 translate read null
2026-03-19 Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation Minyoung Kim et.al. 2603.19360 translate read null
2026-03-19 RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing Yue Gong et.al. 2603.19206 translate read null
2026-03-19 GenMFSR: Generative Multi-Frame Image Restoration and Super-Resolution Harshana Weligampola et.al. 2603.19187 translate read null
2026-03-19 ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation Kwanyoung Lee et.al. 2603.19157 translate read null
2026-03-19 Unmasking Algorithmic Bias in Predictive Policing: A GAN-Based Simulation Framework with Multi-City Temporal Analysis Pronob Kumar Barman et.al. 2603.18987 translate read null
2026-03-19 Sketch2Topo: Using Hand-Drawn Inputs for Diffusion-Based Topology Optimization Shuyue Feng et.al. 2603.18960 translate read null
2026-03-19 Seasoning Generative Models for a Generalization Aftertaste Hisham Husain et.al. 2603.18817 translate read null
2026-03-19 Enhancing the Parameterization of Reservoir Properties for Data Assimilation Using Deep VAE-GAN M. A. Sampaio et.al. 2603.18766 translate read null
2026-03-19 WeNLEX: Weakly Supervised Natural Language Explanations for Multilabel Chest X-ray Classification Isabel Rio-Torto et.al. 2603.18752 translate read null
2026-03-19 Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation Ping Chen et.al. 2603.18627 translate read null
2026-03-19 SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation Jialiang Kang et.al. 2603.18599 translate read null
2026-03-19 End-to-End QGAN-Based Image Synthesis via Neural Noise Encoding and Intensity Calibration Xue Yang et.al. 2603.18554 translate read null
2026-03-19 CAFlow: Adaptive-Depth Single-Step Flow Matching for Efficient Histopathology Super-Resolution Elad Yoshai et.al. 2603.18513 translate read null
2026-03-19 Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion Yuqi Yang et.al. 2603.18466 translate read null
2026-03-18 Learning to See Sharper: A Physics-Informed Artificial Intelligence Framework for Super-Resolving Galaxy Spectra Aryana Haghjoo et.al. 2603.18357 translate read null
2026-03-18 Epistemic Generative Adversarial Networks Muhammad Mubashar et.al. 2603.18348 translate read null
2026-03-18 Unrolled Reconstruction with Integrated Super-Resolution for Accelerated 3D LGE MRI Md Hasibul Husain Hisham et.al. 2603.18309 translate read null
2026-03-18 EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding Kai Zou et.al. 2603.18001 translate read null
2026-03-18 LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition Vlad-Constantin Lungu-Stan et.al. 2603.17965 translate read null
2026-03-18 ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation Dmitriy Rivkin et.al. 2603.17812 translate read null
2026-03-18 Cache-enabled Generative Joint Source-Channel Coding for Evolving Semantic Communications Shunpu Tang et.al. 2603.17702 translate read null
2026-03-18 DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis Aleksander Ogonowski et.al. 2603.17637 translate read null
2026-03-18 Searching for Molecular Signatures in 14 Transiting Exoplanets with SPIRou A. Masson et.al. 2603.17574 translate read null
2026-03-18 A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awarenes Xiucheng Wang et.al. 2603.17499 translate read null
2026-03-18 UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models Segyu Lee et.al. 2603.17476 translate read null
2026-03-18 Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare Saikat Maiti et.al. 2603.17419 translate read null
2026-03-18 Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression Xinning Chai et.al. 2603.17408 translate read null
2026-03-18 Harnessing the Power of Foundation Models for Accurate Material Classification Qingran Lin et.al. 2603.17390 translate read null
2026-03-17 PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models Hisayuki Yokomizo et.al. 2603.16958 translate read null
2026-03-17 SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation Jiongze Yu et.al. 2603.16864 translate read null
2026-03-16 GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution Qiaosi Yi et.al. 2603.16769 translate read null
2026-03-17 REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models Yong Zou et.al. 2603.16576 translate read null
2026-03-17 CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation Mahmoud Ibrahim et.al. 2603.16551 translate read null
2026-03-17 Unlearning for One-Step Generative Models via Unbalanced Optimal Transport Hyundo Choi et.al. 2603.16489 translate read null
2026-03-17 Fanar 2.0: Arabic Generative AI Stack FANAR TEAM et.al. 2603.16397 translate read null
2026-03-17 DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification Stathis Galanakis et.al. 2603.16392 translate read null
2026-03-17 Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation Yunpeng Qu et.al. 2603.16373 translate read null
2026-03-17 RASLF: Representation-Aware State Space Model for Light Field Super-Resolution Zeqiang Wei et.al. 2603.16243 translate read null
2026-03-16 Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models Amy Rafferty et.al. 2603.15525 translate read null
2026-03-16 RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance Xianbao Hou et.al. 2603.15484 translate read null
2026-03-16 Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models Junlong Ke et.al. 2603.15271 translate read null
2026-03-16 TextOVSR: Text-Guided Real-World Opera Video Super-Resolution Hua Chang et.al. 2603.15153 translate read null
2026-03-16 SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation Shufan Li et.al. 2603.15150 translate read null
2026-03-16 Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods Omer Ben Hayun et.al. 2603.15026 translate read null
2026-03-16 CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models Xiaojun Shan et.al. 2603.14957 translate read null
2026-03-16 Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework Wenxi Wang et.al. 2603.14936 translate read null
2026-03-16 The Super Fine-Grained Detector for the T2K neutrino oscillation experiment S. Abe et.al. 2603.14921 translate read null
2026-03-16 Seismic full-waveform inversion based on a physics-driven generative adversarial network Xinyi Zhang et.al. 2603.14879 translate read null
2026-03-16 AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas Longhui Yuan et.al. 2603.14770 translate read null
2026-03-16 Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments Anacin et.al. 2603.14767 translate read null
2026-03-16 PHAC: Promptable Human Amodal Completion Seung Young Noh et.al. 2603.14741 translate read null
2026-03-15 Comparative Analysis of 3D Convolutional and 2.5D Slice-Conditioned U-Net Architectures for MRI Super-Resolution via Elucidated Diffusion Models Hendrik Chiche et.al. 2603.14667 translate read null
2026-03-15 A Decoupling-based Approach for Signature Estimation of Wideband XL MIMO-FMCW Radars Chandrashekhar Rai et.al. 2603.14542 translate read null
2026-03-15 PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis Mritula Chandrasekaran et.al. 2603.14409 translate read null
2026-03-15 High-Fidelity Compression of Seismic Velocity Models via SIREN Auto-Decoders Caiyun Liu et.al. 2603.14284 translate read null
2026-03-15 FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection Jie Li et.al. 2603.14220 translate read null
2026-03-15 DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution Axi Niu et.al. 2603.14207 translate read null
2026-03-12 The Latent Color Subspace: Emergent Order in High-Dimensional Chaos Mateusz Pach et.al. 2603.12261 translate read null
2026-03-12 Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Xiangyu Zhao et.al. 2603.12247 translate read null
2026-03-12 EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation Yan Li et.al. 2603.12108 translate read null
2026-03-12 Single Pixel Image Classification using an Ultrafast Digital Light Projector Aisha Kanwal et.al. 2603.12036 translate read null
2026-03-12 Unveiling the biconical geometry of the outflow in the ultraluminous X-ray source NGC 5204 X-1 S. Caserta et.al. 2603.11922 translate read null
2026-03-12 A Decade of Generative Adversarial Networks for Porous Material Reconstruction Ali Sadeghkhani et.al. 2603.11836 translate read null
2026-03-12 UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution Cao Thien Tan et.al. 2603.11680 translate read null
2026-03-12 Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices Rambod Azimi et.al. 2603.11505 translate read null
2026-03-11 HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation Hongji Yang et.al. 2603.10814 translate read null
2026-03-11 The Quadratic Geometry of Flow Matching: Semantic Granularity Alignment for Text-to-Image Synthesis Zhinan Xiong et.al. 2603.10785 translate read null
2026-03-11 Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers Wenhao Sun et.al. 2603.10744 translate read null
2026-03-11 HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement Stefanos Pasios et.al. 2603.10604 translate read null
2026-03-11 Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution Hongsong Wang et.al. 2603.10583 translate read null
2026-03-11 Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement Xin Huang et.al. 2603.10519 translate read null
2026-03-11 Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks Nasim Soltani et.al. 2603.10413 translate read null
2026-03-11 StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References Boyu He et.al. 2603.10354 translate read null
2026-03-10 Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation Zitong Wang et.al. 2603.10210 translate read null
2026-03-10 4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video Jin Lyu et.al. 2603.10125 translate read null
2026-03-10 Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective Erkan Turan et.al. 2603.09936 translate read null
2026-03-10 Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation Rong Zhou et.al. 2603.09931 translate read null
2026-03-10 CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy Su Yan et.al. 2603.09840 translate read null
2026-03-10 Prompt-Driven Color Accessibility Evaluation in Diffusion-based Image Generation Models Xinyao Zhuang et.al. 2603.09832 translate read null
2026-03-10 LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control Mingyu Kang et.al. 2603.09759 translate read null
2026-03-10 TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR Fayaz Ali Dharejo et.al. 2603.09702 translate read null
2026-03-10 Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs Ali Sadeghkhani et.al. 2603.09651 translate read null
2026-03-10 Physics-Driven 3D Gaussian Rendering for Zero-Shot MRI Super-Resolution Shuting Liu et.al. 2603.09621 translate read null
2026-03-10 Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization Ming Nie et.al. 2603.09538 translate read null
2026-03-10 A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration Bunlong Lay et.al. 2603.09508 translate read null
2026-03-10 Streaming Autoregressive Video Generation via Diagonal Distillation Jinxiu Liu et.al. 2603.09488 translate read null
2026-03-10 Component-Aware Sketch-to-Image Generation Using Self-Attention Encoding and Coordinate-Preserving Fusion Ali Zia et.al. 2603.09484 translate read null
2026-03-10 ShapeMark: Robust and Diversity-Preserving Watermarking for Diffusion Models Yuqi Qian et.al. 2603.09454 translate read null
2026-03-10 Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework Engin Deniz Erkan et.al. 2603.09353 translate read null
2026-03-10 CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation Shengqi Dang et.al. 2603.09286 translate read null
2026-03-10 Acoustic and Semantic Modeling of Emotion in Spoken Language Soumya Dutta et.al. 2603.09212 translate read null
2026-03-10 Progressive Split Mamba: Effective State Space Modelling for Image Restoration Mohammed Hassanin et.al. 2603.09171 translate read null
2026-03-10 POLISH’ing the Sky: Wide-Field and High-Dynamic Range Interferometric Image Reconstruction with Application to Strong Lens Discovery Zihui Wu et.al. 2603.09162 translate read null
2026-03-10 RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning Tzu-Heng Huang et.al. 2603.09160 translate read null
2026-03-10 Rotation Equivariant Mamba for Vision Tasks Zhongchen Zhao et.al. 2603.09138 translate read null
2026-03-10 QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model Junjie Yin et.al. 2603.09125 translate read null
2026-03-09 The Coupling Within: Flow Matching via Distilled Normalizing Flows David Berthelot et.al. 2603.09014 translate read null
2026-03-09 CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation Haodong Li et.al. 2603.08652 translate read null
2026-03-09 CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing Yucheng Wang et.al. 2603.08589 translate read null
2026-03-09 Cubic maps from the group of order $3$ Vadim Alekseev et.al. 2603.08452 translate read null
2026-03-09 Rectified flow-based prediction of post-treatment brain MRI from pre-radiotherapy priors for patients with glioma Selena Huisman et.al. 2603.08385 translate read null
2026-03-09 Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation Daniele Molino et.al. 2603.08305 translate read null
2026-03-09 Prototype-Guided Concept Erasure in Diffusion Models Yuze Cai et.al. 2603.08271 translate read null
2026-03-09 WaDi: Weight Direction-aware Distillation for One-step Image Synthesis Lei Wang et.al. 2603.08258 translate read null
2026-03-09 FlowTouch: View-Invariant Visuo-Tactile Prediction Seongjin Bien et.al. 2603.08255 translate read null
2026-03-09 Fourier Transform Infrared microspectroscopy-based super-resolution virtual staining of unlabeled tissues by pixel Diffusion Transformer Yudong Tian et.al. 2603.08143 translate read null
2026-03-09 DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation Zhenyu Hu et.al. 2603.08090 translate read null
2026-03-09 Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models Xuesong Wang et.al. 2603.08069 translate read null
2026-03-09 Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis Ethan Young et.al. 2603.07936 translate read null
2026-03-09 Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning Yingkai Zhang et.al. 2603.07918 translate read null
2026-03-08 Parameterized Brushstroke Style Transfer Uma Meleti et.al. 2603.07776 translate read null
2026-03-08 Compressed-Domain-Aware Online Video Super-Resolution Yuhang Wang et.al. 2603.07694 translate read null
2026-03-08 GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module Niccolò Ferrari et.al. 2603.07566 translate read null
2026-03-08 CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization Anh-Duy Le et.al. 2603.07543 translate read null
2026-03-08 How Long Can Unified Multimodal Models Generate Images Reliably? Taming Long-Horizon Interleaved Image Generation via Context Curation Haoyu Chen et.al. 2603.07540 translate read null
2026-03-08 Image Generation Models: A Technical History Rouzbeh Shirvani et.al. 2603.07455 translate read null
2026-03-08 Disentangled Textual Priors for Diffusion-based Image Super-Resolution Lei Jiang et.al. 2603.07430 translate read null
2026-03-08 Fluctuation imaging of disorder in monolayer semiconductors Tom T. C. Sistermans et.al. 2603.07418 translate read null
2026-03-08 QdaVPR: A novel query-based domain-agnostic model for visual place recognition Shanshan Wan et.al. 2603.07414 translate read null
2026-03-07 Variational Flow Maps: Make Some Noise for One-Step Conditional Generation Abbas Mammadov et.al. 2603.07276 translate read null
2026-03-07 Single Image Super-Resolution via Bivariate `A Trous Wavelet Diffusion Heidari Maryam et.al. 2603.07234 translate read null
2026-03-07 AdaGen: Learning Adaptive Policy for Image Synthesis Zanlin Ni et.al. 2603.06993 translate read null
2026-03-06 Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks Saadet Müzehher Eren et.al. 2603.06755 translate read null
2026-03-06 EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track Zhenyuan Chen et.al. 2603.06753 translate read null
2026-03-06 Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention Dongheon Lee et.al. 2603.06738 translate read null
2026-03-04 One step further with Monte-Carlo sampler to guide diffusion better Minsi Ren et.al. 2603.06685 translate read null
2026-03-06 Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion Lijiang Li et.al. 2603.06577 translate read null
2026-03-06 NEGATE: Constrained Semantic Guidance for Linguistic Negation in Text-to-Video Diffusion Taewon Kang et.al. 2603.06533 translate read null
2026-03-06 Pinterest Canvas: Large-Scale Image Generation at Pinterest Yu Wang et.al. 2603.06453 translate read null
2026-03-06 Toward Generative Quantum Utility via Correlation-Complexity Map Chen-Yu Liu et.al. 2603.06440 translate read null
2026-03-06 The Art That Poses Back: Assessing AI Pastiches after Contemporary Artworks Anca Dinu et.al. 2603.06324 translate read null
2026-03-06 3D CBCT Artefact Removal Using Perpendicular Score-Based Diffusion Models Susanne Schaub et.al. 2603.06300 translate read null
2026-03-06 Spectral and Trajectory Regularization for Diffusion Transformer Super-Resolution Jingkai Wang et.al. 2603.06275 translate read null
2026-03-06 Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning Yueying Tian et.al. 2603.06173 translate read null
2026-03-06 Reflective Flow Sampling Enhancement Zikai Zhou et.al. 2603.06165 translate read null
2026-03-06 Longitudinal NSCLC Treatment Progression via Multimodal Generative Models Massimiliano Mantegna et.al. 2603.06147 translate read null
2026-03-06 FontUse: A Data-Centric Approach to Style- and Use-Case-Conditioned In-Image Typography Xia Xin et.al. 2603.06038 translate read null
2026-03-06 StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision Yuanhuiyi Lyu et.al. 2603.06032 translate read null
2026-03-06 LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution Song Fei et.al. 2603.05947 translate read null
2026-03-06 StreamWise: Serving Multi-Modal Generation in Real-Time at Scale Haoran Qiu et.al. 2603.05800 translate read null
2026-03-06 Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers Ruidong Chen et.al. 2603.05769 translate read null
2026-03-05 Limited-Angle CT Reconstruction Using Multi-Volume Latent Consistency Model Hinako Isogai et.al. 2603.05183 translate read null
2026-03-05 Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search Zongfang Liu et.al. 2603.05105 translate read null
2026-03-05 CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection Zhaonian Kuang et.al. 2603.05042 translate read null
2026-03-05 Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination Hyuntae Park et.al. 2603.05040 translate read null
2026-03-05 A Simple Baseline for Unifying Understanding, Generation, and Editing via Vanilla Next-token Prediction Jie Zhu et.al. 2603.04980 translate read null
2026-03-05 MWA tied-array processing V: Super-resolved localisation via amplitude-only maximum likelihood direction finding Bradley W. Meyers et.al. 2603.04961 translate read null
2026-03-05 An Efficient Stochastic First-Order Algorithm for Nonconvex-Strongly Concave Minimax Optimization beyond Lipschitz Smoothness Yan Gao et.al. 2603.04940 translate read null
2026-03-05 Stochastic inner workings of subdiffraction laser writing Julia M. Mikhailova et.al. 2603.04853 translate read null
2026-03-05 DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction Shiyu Zhang et.al. 2603.04770 translate read null
2026-03-05 Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset Yang Zou et.al. 2603.04745 translate read null
2026-03-04 sFRC for assessing hallucinations in medical image restoration Prabhat Kc et.al. 2603.04673 translate read null
2026-03-04 Mask-aware inference with State-Space Models Ignasi Mas et.al. 2603.04568 translate read null
2026-03-04 Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion Xuan Xu et.al. 2603.04565 translate read null
2026-03-04 Enhancing Authorship Attribution with Synthetic Paintings Clarissa Loures et.al. 2603.04343 translate read null
2026-03-04 Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study Madhura Edirisooriya et.al. 2603.04340 translate read null
2026-03-04 CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video Lingen Li et.al. 2603.04291 translate read null
2026-03-04 LikeThis! Empowering App Users to Submit UI Improvement Suggestions Instead of Complaints Jialiang Wei et.al. 2603.04245 translate read null
2026-03-04 Semi-Supervised Generative Learning via Latent Space Distribution Matching Kwong Yu Chong et.al. 2603.04223 translate read null
2026-03-04 FastWave: Optimized Diffusion Model for Audio Super-Resolution Nikita Kuznetsov et.al. 2603.04122 translate read null
2026-03-04 MLOps-Assisted Anomalous Reflector Metasurfaces Design Based on Red Hat OpenShift AI Wael Elshennawy et.al. 2603.03981 translate read null
2026-03-04 Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction Soochul Park et.al. 2603.03973 translate read null
2026-03-04 Plug-and-Play blind super-resolution of real MRI images for improved multiple sclerosis diagnosis Matteo Cannas et.al. 2603.03876 translate read null
2026-03-04 Order Is Not Layout: Order-to-Space Bias in Image Generation Yongkang Zhang et.al. 2603.03714 translate read null
2026-03-04 Machine Pareidolia: Protecting Facial Image with Emotional Editing Binh M. Le et.al. 2603.03665 translate read null
2026-03-03 CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance Hanyang Wang et.al. 2603.03281 translate read null
2026-03-03 AWDiff: An a trous wavelet diffusion model for lung ultrasound image synthesis Maryam Heidari et.al. 2603.03125 translate read null
2026-03-03 Complementarity between atmospheric and super-beam neutrinos at ESSnuSB ESSnuSB et.al. 2603.02836 translate read null
2026-03-03 Structure-Aware Text Recognition for Ancient Greek Critical Editions Nicolas Angleraud et.al. 2603.02803 translate read null
2026-03-03 From “What” to “How”: Constrained Reasoning for Autoregressive Image Generation Ruxue Yan et.al. 2603.02712 translate read null
2026-03-03 FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution Aro Kim et.al. 2603.02692 translate read null
2026-03-03 DREAM: Where Visual Understanding Meets Text-to-Image Generation Chao Li et.al. 2603.02667 translate read null
2026-03-03 ATD: Improved Transformer with Adaptive Token Dictionary for Image Restoration Leheng Zhang et.al. 2603.02581 translate read null
2026-03-02 Ground-based Atmospheric Characterization of Super-Earth L 98-59 d at High Spectral Resolution Connor J. Cheverall et.al. 2603.02209 translate read null
2026-03-02 Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance Yiqi Lin et.al. 2603.02175 translate read null
2026-03-02 GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis Srikumar Sastry et.al. 2603.02172 translate read null
2026-03-02 ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks Joël Küchler et.al. 2603.02063 translate read null
2026-03-02 Latent attention on masked patches for flow reconstruction Ben Eze et.al. 2603.02028 translate read null
2026-03-02 Tensor-network methodology for super-moiré excitons beyond one billion sites Anouar Moustaj et.al. 2603.02011 translate read null
2026-03-02 Plug-and-play forward backward algorithm to restore Landsat images: A preliminary step to uncover the history of surface waters Pierre Audisio et.al. 2603.01868 translate read null
2026-03-02 Block-coordinate Plug-And-Play Methods with Armijo-like line-search for Image Restoration Federica Porta et.al. 2603.01734 translate read null
2026-03-02 DiffusionXRay: A Diffusion and GAN-Based Approach for Enhancing Digitally Reconstructed Chest Radiographs Aryan Goyal et.al. 2603.01686 translate read null
2026-03-02 SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis Chuqiao Wu et.al. 2603.01579 translate read null
2026-03-02 Align-cDAE: Alzheimer’s Disease Progression Modeling with Attention-Aligned Conditional Diffusion Auto-Encoder Ayantika Das et.al. 2603.01552 translate read null
2026-03-02 RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry Xinchang Wang et.al. 2603.01544 translate read null
2026-03-02 Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing Zijin Yin et.al. 2603.01535 translate read null
2026-03-02 Revisiting Global Token Mixing in Task-Dependent MRI Restoration: Insights from Minimal Gated CNN Baselines Xiangjian Hou et.al. 2603.01449 translate read null
2026-03-02 ALMA High-J CO Spectroscopy of High-Redshift Galaxies. II. 0.03” Resolution CO Kinematics Reveal Super-Eddington Accretion in a Dust-Obscured Galaxy at z=3.111 Ken-ichi Tadaki et.al. 2603.01352 translate read null
2026-03-01 Teacher-Guided Causal Interventions for Image Denoising: Orthogonal Content-Noise Disentanglement in Vision Transformers Kuai Jiang et.al. 2603.01140 translate read null
2026-03-01 Super-resolution of turbulent reacting flows on complex meshes using graph neural networks Priyabrat Dash et.al. 2603.01080 translate read null
2026-03-01 LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Zebin You et.al. 2603.01068 translate read link
2026-03-01 Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery Yangyang Xu et.al. 2603.01034 translate read null
2026-03-01 Fully-analog array signal processor using 3D aperture engineering Sheng Gao et.al. 2603.00995 translate read null
2026-03-01 Spectral Super-Resolution via Adversarial Unfolding and Data-Driven Spectrum Regularization: From Multispectral Satellite Data to NASA Hyperspectral Image Si-Sheng Young et.al. 2603.00920 translate read null
2026-03-01 Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards Seungwook Kim et.al. 2603.00918 translate read null
2026-03-01 Solving a Nonlinear Blind Inverse Problem for Tagged MRI with Physics and Deep Generative Priors Zhangxing Bian et.al. 2603.00882 translate read null
2026-03-01 Neural Discrimination-Prompted Transformers for Efficient UHD Image Restoration and Enhancement Cong Wang et.al. 2603.00853 translate read null

(<a href=../Image_Generation.md>back to Image Generation</a>)