Image Generation - 2026-03
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2026-03-31 | Abstraction in Style | Min Lu et.al. | 2603.29924 | translate | read | null |
| 2026-03-31 | ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation | Yinuo Liu et.al. | 2603.29902 | translate | read | null |
| 2026-03-31 | Accurate Determination of Chemical Abundances near a Supermassive Black Hole | The XRISM collaboration et.al. | 2603.29748 | translate | read | null |
| 2026-03-31 | MacTok: Robust Continuous Tokenization for Image Generation | Hengyu Zeng et.al. | 2603.29634 | translate | read | null |
| 2026-03-31 | Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis | Shuang Chen et.al. | 2603.29620 | translate | read | null |
| 2026-03-31 | FlowID : Enhancing Forensic Identification with Latent Flow-Matching Models | Jules Ripoll et.al. | 2603.29591 | translate | read | null |
| 2026-03-31 | Generating Key Postures of Bharatanatyam Adavus with Pose Estimation | Jagadish Kashinath Kamble et.al. | 2603.29570 | translate | read | null |
| 2026-03-31 | CIPHER: Counterfeit Image Pattern High-level Examination via Representation | Kyeonghun Kim et.al. | 2603.29356 | translate | read | null |
| 2026-03-31 | GazeCLIP: Gaze-Guided CLIP with Adaptive-Enhanced Fine-Grained Language Prompt for Deepfake Attribution and Detection | Yaning Zhang et.al. | 2603.29295 | translate | read | null |
| 2026-03-31 | Semantic Communication for 6G Networks: A Trade-off between Distortion Criticality and Information Representability | Faizan Shafi et.al. | 2603.29293 | translate | read | null |
| 2026-03-30 | Gen-Searcher: Reinforcing Agentic Search for Image Generation | Kaituo Feng et.al. | 2603.28767 | translate | read | null |
| 2026-03-30 | PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models | Lorenza Prospero et.al. | 2603.28763 | translate | read | null |
| 2026-03-30 | DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing | Kailai Feng et.al. | 2603.28713 | translate | read | null |
| 2026-03-30 | TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark | Hannes Mareen et.al. | 2603.28613 | translate | read | null |
| 2026-03-30 | MRI-to-CT synthesis using drifting models | Qing Lyu et.al. | 2603.28498 | translate | read | null |
| 2026-03-30 | EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation | Sravanth Kodavanti et.al. | 2603.28405 | translate | read | null |
| 2026-03-30 | Integrating Multimodal Large Language Model Knowledge into Amodal Completion | Heecheol Yun et.al. | 2603.28333 | translate | read | null |
| 2026-03-30 | LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization | Chutian Meng et.al. | 2603.28082 | translate | read | null |
| 2026-03-30 | SIMR-NO: A Spectrally-Informed Multi-Resolution Neural Operator for Turbulent Flow Super-Resolution | Muhammad Abid et.al. | 2603.28073 | translate | read | null |
| 2026-03-30 | AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation | Zhaohe Liao et.al. | 2603.28068 | translate | read | null |
| 2026-03-30 | MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation | Ruiyao Liu et.al. | 2603.27959 | translate | read | null |
| 2026-03-25 | Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method | Arthur Jacot et.al. | 2603.24594 | translate | read | null |
| 2026-03-25 | Anti-I2V: Safeguarding your photos from malicious image-to-video generation | Duc Vu et.al. | 2603.24570 | translate | read | null |
| 2026-03-25 | ViHOI: Human-Object Interaction Synthesis with Visual Priors | Songjin Cai et.al. | 2603.24383 | translate | read | null |
| 2026-03-25 | Shape-Dependent, Deep-Learning-Assisted Metamaterial Solid Immersion Lens (mSIL) Super-Resolution Imaging | Baidong Wu et.al. | 2603.24371 | translate | read | null |
| 2026-03-25 | ScrollScape: Unlocking 32K Image Generation With Video Diffusion Priors | Haodong Yu et.al. | 2603.24270 | translate | read | null |
| 2026-03-25 | InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment | Zixin Guo et.al. | 2603.24240 | translate | read | null |
| 2026-03-25 | RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution | Yushuai Song et.al. | 2603.24198 | translate | read | null |
| 2026-03-25 | LGTM: Training-Free Light-Guided Text-to-Image Diffusion Model via Initial Noise Manipulation | Ryugo Morita et.al. | 2603.24086 | translate | read | null |
| 2026-03-25 | When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm | Ye Leng et.al. | 2603.24079 | translate | read | null |
| 2026-03-25 | Human Factors in Detecting AI-Generated Portraits: Age, Sex, Device, and Confidence | Sunwhi Kim et.al. | 2603.24048 | translate | read | null |
| 2026-03-25 | HAM: A Training-Free Style Transfer Approach via Heterogeneous Attention Modulation for Diffusion Models | Yeqi He et.al. | 2603.24043 | translate | read | null |
| 2026-03-25 | Transcending Classical Neural Network Boundaries: A Quantum-Classical Synergistic Paradigm for Seismic Data Processing | Zhengyi Yuan et.al. | 2603.23984 | translate | read | null |
| 2026-03-25 | DepthArb: Training-Free Depth-Arbitrated Generation for Occlusion-Robust Image Synthesis | Hongjin Niu et.al. | 2603.23924 | translate | read | null |
| 2026-03-25 | GenMask: Adapting DiT for Segmentation via Direct Mask | Yuhuan Yang et.al. | 2603.23906 | translate | read | null |
| 2026-03-24 | Very sensitive vapor-cell quasi-DC atomic E-field sensor | Amy Damitz et.al. | 2603.23751 | translate | read | null |
| 2026-03-24 | PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning | Tao Liu et.al. | 2603.23574 | translate | read | null |
| 2026-03-24 | UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation | Jie Liu et.al. | 2603.23500 | translate | read | link |
| 2026-03-24 | InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting | Duc Vu et.al. | 2603.23463 | translate | read | null |
| 2026-03-24 | Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning | Konstantinos Barmpounakis et.al. | 2603.23295 | translate | read | null |
| 2026-03-24 | VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution | August Leander Høeg et.al. | 2603.23153 | translate | read | null |
| 2026-03-24 | DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models | Donya Jafari et.al. | 2603.23140 | translate | read | null |
| 2026-03-24 | Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards | Orhun Buğra Baran et.al. | 2603.23086 | translate | read | null |
| 2026-03-24 | AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing | Sarubi Thillainathan et.al. | 2603.23069 | translate | read | null |
| 2026-03-24 | HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling | António Cardoso et.al. | 2603.23041 | translate | read | null |
| 2026-03-24 | Zero-Shot Personalization of Objects via Textual Inversion | Aniket Roy et.al. | 2603.23010 | translate | read | null |
| 2026-03-24 | WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion | Manuel-Andreas Schneider et.al. | 2603.22972 | translate | read | null |
| 2026-03-24 | PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference | Qirui Wang et.al. | 2603.22943 | translate | read | null |
| 2026-03-24 | From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery | Bijay Shakya et.al. | 2603.22768 | translate | read | null |
| 2026-03-23 | Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations | Heejong Kim et.al. | 2603.22627 | translate | read | null |
| 2026-03-23 | PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis | Dinglun He et.al. | 2603.22626 | translate | read | null |
| 2026-03-23 | Latent Style-based Quantum Wasserstein GAN for Drug Design | Julien Baglio et.al. | 2603.22399 | translate | read | null |
| 2026-03-23 | Repurposing Geometric Foundation Models for Multi-view Diffusion | Wooseok Jang et.al. | 2603.22275 | translate | read | null |
| 2026-03-23 | DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution | Zhengyao Lv et.al. | 2603.22271 | translate | read | null |
| 2026-03-23 | SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation | Lucas H. Ueda et.al. | 2603.22252 | translate | read | null |
| 2026-03-23 | SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation | Sashuai Zhou et.al. | 2603.22228 | translate | read | null |
| 2026-03-23 | DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment | Xin Cai et.al. | 2603.22125 | translate | read | null |
| 2026-03-23 | DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation | Binhong Tan et.al. | 2603.22041 | translate | read | null |
| 2026-03-23 | Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model | SII-GAIR et.al. | 2603.21986 | translate | read | null |
| 2026-03-23 | MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation | Wenqing Tian et.al. | 2603.21937 | translate | read | null |
| 2026-03-23 | Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation | Donald Shenaj et.al. | 2603.21884 | translate | read | null |
| 2026-03-23 | SHARP: Spectrum-aware Highly-dynamic Adaptation for Resolution Promotion in Remote Sensing Synthesis | Bingxuan Zhao et.al. | 2603.21783 | translate | read | null |
| 2026-03-23 | OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging | Meilin Liu et.al. | 2603.21660 | translate | read | null |
| 2026-03-23 | Conditional Wasserstein GAN for Simulating Neutrino Event Summaries using Incident Energy of Electron Neutrinos | Dipthi S. et.al. | 2603.21599 | translate | read | null |
| 2026-03-23 | Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability | Jiahui Song et.al. | 2603.21510 | translate | read | null |
| 2026-03-22 | Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution | Yu-Shan Tai et.al. | 2603.21348 | translate | read | null |
| 2026-03-22 | Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis | Tian Xia et.al. | 2603.21213 | translate | read | null |
| 2026-03-22 | MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics | Pengxiang Cai et.al. | 2603.21136 | translate | read | null |
| 2026-03-22 | Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models | Qifan Li et.al. | 2603.21085 | translate | read | null |
| 2026-03-22 | LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction | Shuwei Huang et.al. | 2603.21045 | translate | read | null |
| 2026-03-21 | EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis | Xiefan Guo et.al. | 2603.20828 | translate | read | null |
| 2026-03-21 | CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration | Xiefan Guo et.al. | 2603.20741 | translate | read | null |
| 2026-03-21 | Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation | Zihao Wang et.al. | 2603.20725 | translate | read | null |
| 2026-03-21 | MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution | Ruiqing Wang et.al. | 2603.20690 | translate | read | null |
| 2026-03-21 | ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework | Guanzhou Chen et.al. | 2603.20644 | translate | read | null |
| 2026-03-21 | Interpretable Operator Learning for Inverse Problems via Adaptive Spectral Filtering: Convergence and Discretization Invariance | Hang-Cheng Dong et.al. | 2603.20602 | translate | read | null |
| 2026-03-20 | DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation | Zhuoling Li et.al. | 2603.20470 | translate | read | null |
| 2026-03-20 | Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier | Yujie Zhou et.al. | 2603.20382 | translate | read | null |
| 2026-03-19 | Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges | Hong-Hanh Nguyen-Le et.al. | 2603.20304 | translate | read | null |
| 2026-03-20 | Improving Image-to-Image Translation via a Rectified Flow Reformulation | Satoshi Iizuka et.al. | 2603.20186 | translate | read | null |
| 2026-03-20 | Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives | Wanqi Yuan et.al. | 2603.20128 | translate | read | null |
| 2026-03-20 | Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment | Shiqi Gao et.al. | 2603.20086 | translate | read | null |
| 2026-03-20 | X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving | Chaoda Zheng et.al. | 2603.19979 | translate | read | null |
| 2026-03-20 | Timestep-Aware Block Masking for Efficient Diffusion Model Inference | Haodong He et.al. | 2603.19939 | translate | read | null |
| 2026-03-20 | Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach | Shiqi Gao et.al. | 2603.19775 | translate | read | null |
| 2026-03-20 | WorldAgents: Can Foundation Image Models be Agents for 3D World Models? | Ziya Erkoç et.al. | 2603.19708 | translate | read | null |
| 2026-03-20 | Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits | Angshul Majumdar et.al. | 2603.19687 | translate | read | null |
| 2026-03-20 | Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding | Zhijian Gong et.al. | 2603.19667 | translate | read | null |
| 2026-03-20 | Fixed-Point Delayed Subgradient Methods for Nonsmooth Convex Optimization Problems | Ontima Pankoon et.al. | 2603.19604 | translate | read | null |
| 2026-03-20 | MagicSeg: Open-World Segmentation Pretraining via Counterfactual Diffusion-Based Auto-Generation | Kaixin Cai et.al. | 2603.19575 | translate | read | null |
| 2026-03-19 | TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis | Atharva Rege et.al. | 2603.19386 | translate | read | null |
| 2026-03-19 | Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation | Minyoung Kim et.al. | 2603.19360 | translate | read | null |
| 2026-03-19 | RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing | Yue Gong et.al. | 2603.19206 | translate | read | null |
| 2026-03-19 | GenMFSR: Generative Multi-Frame Image Restoration and Super-Resolution | Harshana Weligampola et.al. | 2603.19187 | translate | read | null |
| 2026-03-19 | ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation | Kwanyoung Lee et.al. | 2603.19157 | translate | read | null |
| 2026-03-19 | Unmasking Algorithmic Bias in Predictive Policing: A GAN-Based Simulation Framework with Multi-City Temporal Analysis | Pronob Kumar Barman et.al. | 2603.18987 | translate | read | null |
| 2026-03-19 | Sketch2Topo: Using Hand-Drawn Inputs for Diffusion-Based Topology Optimization | Shuyue Feng et.al. | 2603.18960 | translate | read | null |
| 2026-03-19 | Seasoning Generative Models for a Generalization Aftertaste | Hisham Husain et.al. | 2603.18817 | translate | read | null |
| 2026-03-19 | Enhancing the Parameterization of Reservoir Properties for Data Assimilation Using Deep VAE-GAN | M. A. Sampaio et.al. | 2603.18766 | translate | read | null |
| 2026-03-19 | WeNLEX: Weakly Supervised Natural Language Explanations for Multilabel Chest X-ray Classification | Isabel Rio-Torto et.al. | 2603.18752 | translate | read | null |
| 2026-03-19 | Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation | Ping Chen et.al. | 2603.18627 | translate | read | null |
| 2026-03-19 | SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation | Jialiang Kang et.al. | 2603.18599 | translate | read | null |
| 2026-03-19 | End-to-End QGAN-Based Image Synthesis via Neural Noise Encoding and Intensity Calibration | Xue Yang et.al. | 2603.18554 | translate | read | null |
| 2026-03-19 | CAFlow: Adaptive-Depth Single-Step Flow Matching for Efficient Histopathology Super-Resolution | Elad Yoshai et.al. | 2603.18513 | translate | read | null |
| 2026-03-19 | Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion | Yuqi Yang et.al. | 2603.18466 | translate | read | null |
| 2026-03-18 | Learning to See Sharper: A Physics-Informed Artificial Intelligence Framework for Super-Resolving Galaxy Spectra | Aryana Haghjoo et.al. | 2603.18357 | translate | read | null |
| 2026-03-18 | Epistemic Generative Adversarial Networks | Muhammad Mubashar et.al. | 2603.18348 | translate | read | null |
| 2026-03-18 | Unrolled Reconstruction with Integrated Super-Resolution for Accelerated 3D LGE MRI | Md Hasibul Husain Hisham et.al. | 2603.18309 | translate | read | null |
| 2026-03-18 | EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding | Kai Zou et.al. | 2603.18001 | translate | read | null |
| 2026-03-18 | LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition | Vlad-Constantin Lungu-Stan et.al. | 2603.17965 | translate | read | null |
| 2026-03-18 | ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation | Dmitriy Rivkin et.al. | 2603.17812 | translate | read | null |
| 2026-03-18 | Cache-enabled Generative Joint Source-Channel Coding for Evolving Semantic Communications | Shunpu Tang et.al. | 2603.17702 | translate | read | null |
| 2026-03-18 | DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis | Aleksander Ogonowski et.al. | 2603.17637 | translate | read | null |
| 2026-03-18 | Searching for Molecular Signatures in 14 Transiting Exoplanets with SPIRou | A. Masson et.al. | 2603.17574 | translate | read | null |
| 2026-03-18 | A Tutorial on Learning-Based Radio Map Construction: Data, Paradigms, and Physics-Awareness | Xiucheng Wang et.al. | 2603.17499 | translate | read | null |
| 2026-03-18 | UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models | Segyu Lee et.al. | 2603.17476 | translate | read | null |
| 2026-03-18 | Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare | Saikat Maiti et.al. | 2603.17419 | translate | read | null |
| 2026-03-18 | Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression | Xinning Chai et.al. | 2603.17408 | translate | read | null |
| 2026-03-18 | Harnessing the Power of Foundation Models for Accurate Material Classification | Qingran Lin et.al. | 2603.17390 | translate | read | null |
| 2026-03-17 | PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models | Hisayuki Yokomizo et.al. | 2603.16958 | translate | read | null |
| 2026-03-17 | SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation | Jiongze Yu et.al. | 2603.16864 | translate | read | null |
| 2026-03-16 | GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution | Qiaosi Yi et.al. | 2603.16769 | translate | read | null |
| 2026-03-17 | REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models | Yong Zou et.al. | 2603.16576 | translate | read | null |
| 2026-03-17 | CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation | Mahmoud Ibrahim et.al. | 2603.16551 | translate | read | null |
| 2026-03-17 | Unlearning for One-Step Generative Models via Unbalanced Optimal Transport | Hyundo Choi et.al. | 2603.16489 | translate | read | null |
| 2026-03-17 | Fanar 2.0: Arabic Generative AI Stack | FANAR TEAM et.al. | 2603.16397 | translate | read | null |
| 2026-03-17 | DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification | Stathis Galanakis et.al. | 2603.16392 | translate | read | null |
| 2026-03-17 | Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation | Yunpeng Qu et.al. | 2603.16373 | translate | read | null |
| 2026-03-17 | RASLF: Representation-Aware State Space Model for Light Field Super-Resolution | Zeqiang Wei et.al. | 2603.16243 | translate | read | null |
| 2026-03-16 | Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models | Amy Rafferty et.al. | 2603.15525 | translate | read | null |
| 2026-03-16 | RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance | Xianbao Hou et.al. | 2603.15484 | translate | read | null |
| 2026-03-16 | Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models | Junlong Ke et.al. | 2603.15271 | translate | read | null |
| 2026-03-16 | TextOVSR: Text-Guided Real-World Opera Video Super-Resolution | Hua Chang et.al. | 2603.15153 | translate | read | null |
| 2026-03-16 | SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation | Shufan Li et.al. | 2603.15150 | translate | read | null |
| 2026-03-16 | Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods | Omer Ben Hayun et.al. | 2603.15026 | translate | read | null |
| 2026-03-16 | CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models | Xiaojun Shan et.al. | 2603.14957 | translate | read | null |
| 2026-03-16 | Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework | Wenxi Wang et.al. | 2603.14936 | translate | read | null |
| 2026-03-16 | The Super Fine-Grained Detector for the T2K neutrino oscillation experiment | S. Abe et.al. | 2603.14921 | translate | read | null |
| 2026-03-16 | Seismic full-waveform inversion based on a physics-driven generative adversarial network | Xinyi Zhang et.al. | 2603.14879 | translate | read | null |
| 2026-03-16 | AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas | Longhui Yuan et.al. | 2603.14770 | translate | read | null |
| 2026-03-16 | Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments | Anacin et.al. | 2603.14767 | translate | read | null |
| 2026-03-16 | PHAC: Promptable Human Amodal Completion | Seung Young Noh et.al. | 2603.14741 | translate | read | null |
| 2026-03-15 | Comparative Analysis of 3D Convolutional and 2.5D Slice-Conditioned U-Net Architectures for MRI Super-Resolution via Elucidated Diffusion Models | Hendrik Chiche et.al. | 2603.14667 | translate | read | null |
| 2026-03-15 | A Decoupling-based Approach for Signature Estimation of Wideband XL MIMO-FMCW Radars | Chandrashekhar Rai et.al. | 2603.14542 | translate | read | null |
| 2026-03-15 | PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis | Mritula Chandrasekaran et.al. | 2603.14409 | translate | read | null |
| 2026-03-15 | High-Fidelity Compression of Seismic Velocity Models via SIREN Auto-Decoders | Caiyun Liu et.al. | 2603.14284 | translate | read | null |
| 2026-03-15 | FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection | Jie Li et.al. | 2603.14220 | translate | read | null |
| 2026-03-15 | DualTSR: Unified Dual-Diffusion Transformer for Scene Text Image Super-Resolution | Axi Niu et.al. | 2603.14207 | translate | read | null |
| 2026-03-12 | The Latent Color Subspace: Emergent Order in High-Dimensional Chaos | Mateusz Pach et.al. | 2603.12261 | translate | read | null |
| 2026-03-12 | Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation | Xiangyu Zhao et.al. | 2603.12247 | translate | read | null |
| 2026-03-12 | EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation | Yan Li et.al. | 2603.12108 | translate | read | null |
| 2026-03-12 | Single Pixel Image Classification using an Ultrafast Digital Light Projector | Aisha Kanwal et.al. | 2603.12036 | translate | read | null |
| 2026-03-12 | Unveiling the biconical geometry of the outflow in the ultraluminous X-ray source NGC 5204 X-1 | S. Caserta et.al. | 2603.11922 | translate | read | null |
| 2026-03-12 | A Decade of Generative Adversarial Networks for Porous Material Reconstruction | Ali Sadeghkhani et.al. | 2603.11836 | translate | read | null |
| 2026-03-12 | UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution | Cao Thien Tan et.al. | 2603.11680 | translate | read | null |
| 2026-03-12 | Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices | Rambod Azimi et.al. | 2603.11505 | translate | read | null |
| 2026-03-11 | HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation | Hongji Yang et.al. | 2603.10814 | translate | read | null |
| 2026-03-11 | The Quadratic Geometry of Flow Matching: Semantic Granularity Alignment for Text-to-Image Synthesis | Zhinan Xiong et.al. | 2603.10785 | translate | read | null |
| 2026-03-11 | Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers | Wenhao Sun et.al. | 2603.10744 | translate | read | null |
| 2026-03-11 | HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement | Stefanos Pasios et.al. | 2603.10604 | translate | read | null |
| 2026-03-11 | Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution | Hongsong Wang et.al. | 2603.10583 | translate | read | null |
| 2026-03-11 | Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement | Xin Huang et.al. | 2603.10519 | translate | read | null |
| 2026-03-11 | Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks | Nasim Soltani et.al. | 2603.10413 | translate | read | null |
| 2026-03-11 | StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References | Boyu He et.al. | 2603.10354 | translate | read | null |
| 2026-03-10 | Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation | Zitong Wang et.al. | 2603.10210 | translate | read | null |
| 2026-03-10 | 4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video | Jin Lyu et.al. | 2603.10125 | translate | read | null |
| 2026-03-10 | Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective | Erkan Turan et.al. | 2603.09936 | translate | read | null |
| 2026-03-10 | Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation | Rong Zhou et.al. | 2603.09931 | translate | read | null |
| 2026-03-10 | CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy | Su Yan et.al. | 2603.09840 | translate | read | null |
| 2026-03-10 | Prompt-Driven Color Accessibility Evaluation in Diffusion-based Image Generation Models | Xinyao Zhuang et.al. | 2603.09832 | translate | read | null |
| 2026-03-10 | LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control | Mingyu Kang et.al. | 2603.09759 | translate | read | null |
| 2026-03-10 | TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR | Fayaz Ali Dharejo et.al. | 2603.09702 | translate | read | null |
| 2026-03-10 | Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs | Ali Sadeghkhani et.al. | 2603.09651 | translate | read | null |
| 2026-03-10 | Physics-Driven 3D Gaussian Rendering for Zero-Shot MRI Super-Resolution | Shuting Liu et.al. | 2603.09621 | translate | read | null |
| 2026-03-10 | Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization | Ming Nie et.al. | 2603.09538 | translate | read | null |
| 2026-03-10 | A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration | Bunlong Lay et.al. | 2603.09508 | translate | read | null |
| 2026-03-10 | Streaming Autoregressive Video Generation via Diagonal Distillation | Jinxiu Liu et.al. | 2603.09488 | translate | read | null |
| 2026-03-10 | Component-Aware Sketch-to-Image Generation Using Self-Attention Encoding and Coordinate-Preserving Fusion | Ali Zia et.al. | 2603.09484 | translate | read | null |
| 2026-03-10 | ShapeMark: Robust and Diversity-Preserving Watermarking for Diffusion Models | Yuqi Qian et.al. | 2603.09454 | translate | read | null |
| 2026-03-10 | Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework | Engin Deniz Erkan et.al. | 2603.09353 | translate | read | null |
| 2026-03-10 | CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation | Shengqi Dang et.al. | 2603.09286 | translate | read | null |
| 2026-03-10 | Acoustic and Semantic Modeling of Emotion in Spoken Language | Soumya Dutta et.al. | 2603.09212 | translate | read | null |
| 2026-03-10 | Progressive Split Mamba: Effective State Space Modelling for Image Restoration | Mohammed Hassanin et.al. | 2603.09171 | translate | read | null |
| 2026-03-10 | POLISH’ing the Sky: Wide-Field and High-Dynamic Range Interferometric Image Reconstruction with Application to Strong Lens Discovery | Zihui Wu et.al. | 2603.09162 | translate | read | null |
| 2026-03-10 | RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning | Tzu-Heng Huang et.al. | 2603.09160 | translate | read | null |
| 2026-03-10 | Rotation Equivariant Mamba for Vision Tasks | Zhongchen Zhao et.al. | 2603.09138 | translate | read | null |
| 2026-03-10 | QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model | Junjie Yin et.al. | 2603.09125 | translate | read | null |
| 2026-03-09 | The Coupling Within: Flow Matching via Distilled Normalizing Flows | David Berthelot et.al. | 2603.09014 | translate | read | null |
| 2026-03-09 | CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation | Haodong Li et.al. | 2603.08652 | translate | read | null |
| 2026-03-09 | CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing | Yucheng Wang et.al. | 2603.08589 | translate | read | null |
| 2026-03-09 | Cubic maps from the group of order $3$ | Vadim Alekseev et.al. | 2603.08452 | translate | read | null |
| 2026-03-09 | Rectified flow-based prediction of post-treatment brain MRI from pre-radiotherapy priors for patients with glioma | Selena Huisman et.al. | 2603.08385 | translate | read | null |
| 2026-03-09 | Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation | Daniele Molino et.al. | 2603.08305 | translate | read | null |
| 2026-03-09 | Prototype-Guided Concept Erasure in Diffusion Models | Yuze Cai et.al. | 2603.08271 | translate | read | null |
| 2026-03-09 | WaDi: Weight Direction-aware Distillation for One-step Image Synthesis | Lei Wang et.al. | 2603.08258 | translate | read | null |
| 2026-03-09 | FlowTouch: View-Invariant Visuo-Tactile Prediction | Seongjin Bien et.al. | 2603.08255 | translate | read | null |
| 2026-03-09 | Fourier Transform Infrared microspectroscopy-based super-resolution virtual staining of unlabeled tissues by pixel Diffusion Transformer | Yudong Tian et.al. | 2603.08143 | translate | read | null |
| 2026-03-09 | DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation | Zhenyu Hu et.al. | 2603.08090 | translate | read | null |
| 2026-03-09 | Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models | Xuesong Wang et.al. | 2603.08069 | translate | read | null |
| 2026-03-09 | Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis | Ethan Young et.al. | 2603.07936 | translate | read | null |
| 2026-03-09 | Enhancing Unregistered Hyperspectral Image Super-Resolution via Unmixing-based Abundance Fusion Learning | Yingkai Zhang et.al. | 2603.07918 | translate | read | null |
| 2026-03-08 | Parameterized Brushstroke Style Transfer | Uma Meleti et.al. | 2603.07776 | translate | read | null |
| 2026-03-08 | Compressed-Domain-Aware Online Video Super-Resolution | Yuhang Wang et.al. | 2603.07694 | translate | read | null |
| 2026-03-08 | GRD-Net: Generative-Reconstructive-Discriminative Anomaly Detection with Region of Interest Attention Module | Niccolò Ferrari et.al. | 2603.07566 | translate | read | null |
| 2026-03-08 | CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization | Anh-Duy Le et.al. | 2603.07543 | translate | read | null |
| 2026-03-08 | How Long Can Unified Multimodal Models Generate Images Reliably? Taming Long-Horizon Interleaved Image Generation via Context Curation | Haoyu Chen et.al. | 2603.07540 | translate | read | null |
| 2026-03-08 | Image Generation Models: A Technical History | Rouzbeh Shirvani et.al. | 2603.07455 | translate | read | null |
| 2026-03-08 | Disentangled Textual Priors for Diffusion-based Image Super-Resolution | Lei Jiang et.al. | 2603.07430 | translate | read | null |
| 2026-03-08 | Fluctuation imaging of disorder in monolayer semiconductors | Tom T. C. Sistermans et.al. | 2603.07418 | translate | read | null |
| 2026-03-08 | QdaVPR: A novel query-based domain-agnostic model for visual place recognition | Shanshan Wan et.al. | 2603.07414 | translate | read | null |
| 2026-03-07 | Variational Flow Maps: Make Some Noise for One-Step Conditional Generation | Abbas Mammadov et.al. | 2603.07276 | translate | read | null |
| 2026-03-07 | Single Image Super-Resolution via Bivariate À Trous Wavelet Diffusion | Heidari Maryam et.al. | 2603.07234 | translate | read | null |
| 2026-03-07 | AdaGen: Learning Adaptive Policy for Image Synthesis | Zanlin Ni et.al. | 2603.06993 | translate | read | null |
| 2026-03-06 | Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks | Saadet Müzehher Eren et.al. | 2603.06755 | translate | read | null |
| 2026-03-06 | EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track | Zhenyuan Chen et.al. | 2603.06753 | translate | read | null |
| 2026-03-06 | Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention | Dongheon Lee et.al. | 2603.06738 | translate | read | null |
| 2026-03-04 | One step further with Monte-Carlo sampler to guide diffusion better | Minsi Ren et.al. | 2603.06685 | translate | read | null |
| 2026-03-06 | Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion | Lijiang Li et.al. | 2603.06577 | translate | read | null |
| 2026-03-06 | NEGATE: Constrained Semantic Guidance for Linguistic Negation in Text-to-Video Diffusion | Taewon Kang et.al. | 2603.06533 | translate | read | null |
| 2026-03-06 | Pinterest Canvas: Large-Scale Image Generation at Pinterest | Yu Wang et.al. | 2603.06453 | translate | read | null |
| 2026-03-06 | Toward Generative Quantum Utility via Correlation-Complexity Map | Chen-Yu Liu et.al. | 2603.06440 | translate | read | null |
| 2026-03-06 | The Art That Poses Back: Assessing AI Pastiches after Contemporary Artworks | Anca Dinu et.al. | 2603.06324 | translate | read | null |
| 2026-03-06 | 3D CBCT Artefact Removal Using Perpendicular Score-Based Diffusion Models | Susanne Schaub et.al. | 2603.06300 | translate | read | null |
| 2026-03-06 | Spectral and Trajectory Regularization for Diffusion Transformer Super-Resolution | Jingkai Wang et.al. | 2603.06275 | translate | read | null |
| 2026-03-06 | Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning | Yueying Tian et.al. | 2603.06173 | translate | read | null |
| 2026-03-06 | Reflective Flow Sampling Enhancement | Zikai Zhou et.al. | 2603.06165 | translate | read | null |
| 2026-03-06 | Longitudinal NSCLC Treatment Progression via Multimodal Generative Models | Massimiliano Mantegna et.al. | 2603.06147 | translate | read | null |
| 2026-03-06 | FontUse: A Data-Centric Approach to Style- and Use-Case-Conditioned In-Image Typography | Xia Xin et.al. | 2603.06038 | translate | read | null |
| 2026-03-06 | StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision | Yuanhuiyi Lyu et.al. | 2603.06032 | translate | read | null |
| 2026-03-06 | LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution | Song Fei et.al. | 2603.05947 | translate | read | null |
| 2026-03-06 | StreamWise: Serving Multi-Modal Generation in Real-Time at Scale | Haoran Qiu et.al. | 2603.05800 | translate | read | null |
| 2026-03-06 | Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers | Ruidong Chen et.al. | 2603.05769 | translate | read | null |
| 2026-03-05 | Limited-Angle CT Reconstruction Using Multi-Volume Latent Consistency Model | Hinako Isogai et.al. | 2603.05183 | translate | read | null |
| 2026-03-05 | Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search | Zongfang Liu et.al. | 2603.05105 | translate | read | null |
| 2026-03-05 | CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection | Zhaonian Kuang et.al. | 2603.05042 | translate | read | null |
| 2026-03-05 | Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination | Hyuntae Park et.al. | 2603.05040 | translate | read | null |
| 2026-03-05 | A Simple Baseline for Unifying Understanding, Generation, and Editing via Vanilla Next-token Prediction | Jie Zhu et.al. | 2603.04980 | translate | read | null |
| 2026-03-05 | MWA tied-array processing V: Super-resolved localisation via amplitude-only maximum likelihood direction finding | Bradley W. Meyers et.al. | 2603.04961 | translate | read | null |
| 2026-03-05 | An Efficient Stochastic First-Order Algorithm for Nonconvex-Strongly Concave Minimax Optimization beyond Lipschitz Smoothness | Yan Gao et.al. | 2603.04940 | translate | read | null |
| 2026-03-05 | Stochastic inner workings of subdiffraction laser writing | Julia M. Mikhailova et.al. | 2603.04853 | translate | read | null |
| 2026-03-05 | DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction | Shiyu Zhang et.al. | 2603.04770 | translate | read | null |
| 2026-03-05 | Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset | Yang Zou et.al. | 2603.04745 | translate | read | null |
| 2026-03-04 | sFRC for assessing hallucinations in medical image restoration | Prabhat Kc et.al. | 2603.04673 | translate | read | null |
| 2026-03-04 | Mask-aware inference with State-Space Models | Ignasi Mas et.al. | 2603.04568 | translate | read | null |
| 2026-03-04 | Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion | Xuan Xu et.al. | 2603.04565 | translate | read | null |
| 2026-03-04 | Enhancing Authorship Attribution with Synthetic Paintings | Clarissa Loures et.al. | 2603.04343 | translate | read | null |
| 2026-03-04 | Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study | Madhura Edirisooriya et.al. | 2603.04340 | translate | read | null |
| 2026-03-04 | CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video | Lingen Li et.al. | 2603.04291 | translate | read | null |
| 2026-03-04 | LikeThis! Empowering App Users to Submit UI Improvement Suggestions Instead of Complaints | Jialiang Wei et.al. | 2603.04245 | translate | read | null |
| 2026-03-04 | Semi-Supervised Generative Learning via Latent Space Distribution Matching | Kwong Yu Chong et.al. | 2603.04223 | translate | read | null |
| 2026-03-04 | FastWave: Optimized Diffusion Model for Audio Super-Resolution | Nikita Kuznetsov et.al. | 2603.04122 | translate | read | null |
| 2026-03-04 | MLOps-Assisted Anomalous Reflector Metasurfaces Design Based on Red Hat OpenShift AI | Wael Elshennawy et.al. | 2603.03981 | translate | read | null |
| 2026-03-04 | Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction | Soochul Park et.al. | 2603.03973 | translate | read | null |
| 2026-03-04 | Plug-and-Play blind super-resolution of real MRI images for improved multiple sclerosis diagnosis | Matteo Cannas et.al. | 2603.03876 | translate | read | null |
| 2026-03-04 | Order Is Not Layout: Order-to-Space Bias in Image Generation | Yongkang Zhang et.al. | 2603.03714 | translate | read | null |
| 2026-03-04 | Machine Pareidolia: Protecting Facial Image with Emotional Editing | Binh M. Le et.al. | 2603.03665 | translate | read | null |
| 2026-03-03 | CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance | Hanyang Wang et.al. | 2603.03281 | translate | read | null |
| 2026-03-03 | AWDiff: An à trous wavelet diffusion model for lung ultrasound image synthesis | Maryam Heidari et.al. | 2603.03125 | translate | read | null |
| 2026-03-03 | Complementarity between atmospheric and super-beam neutrinos at ESSnuSB | ESSnuSB et.al. | 2603.02836 | translate | read | null |
| 2026-03-03 | Structure-Aware Text Recognition for Ancient Greek Critical Editions | Nicolas Angleraud et.al. | 2603.02803 | translate | read | null |
| 2026-03-03 | From “What” to “How”: Constrained Reasoning for Autoregressive Image Generation | Ruxue Yan et.al. | 2603.02712 | translate | read | null |
| 2026-03-03 | FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution | Aro Kim et.al. | 2603.02692 | translate | read | null |
| 2026-03-03 | DREAM: Where Visual Understanding Meets Text-to-Image Generation | Chao Li et.al. | 2603.02667 | translate | read | null |
| 2026-03-03 | ATD: Improved Transformer with Adaptive Token Dictionary for Image Restoration | Leheng Zhang et.al. | 2603.02581 | translate | read | null |
| 2026-03-02 | Ground-based Atmospheric Characterization of Super-Earth L 98-59 d at High Spectral Resolution | Connor J. Cheverall et.al. | 2603.02209 | translate | read | null |
| 2026-03-02 | Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance | Yiqi Lin et.al. | 2603.02175 | translate | read | null |
| 2026-03-02 | GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis | Srikumar Sastry et.al. | 2603.02172 | translate | read | null |
| 2026-03-02 | ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks | Joël Küchler et.al. | 2603.02063 | translate | read | null |
| 2026-03-02 | Latent attention on masked patches for flow reconstruction | Ben Eze et.al. | 2603.02028 | translate | read | null |
| 2026-03-02 | Tensor-network methodology for super-moiré excitons beyond one billion sites | Anouar Moustaj et.al. | 2603.02011 | translate | read | null |
| 2026-03-02 | Plug-and-play forward backward algorithm to restore Landsat images: A preliminary step to uncover the history of surface waters | Pierre Audisio et.al. | 2603.01868 | translate | read | null |
| 2026-03-02 | Block-coordinate Plug-And-Play Methods with Armijo-like line-search for Image Restoration | Federica Porta et.al. | 2603.01734 | translate | read | null |
| 2026-03-02 | DiffusionXRay: A Diffusion and GAN-Based Approach for Enhancing Digitally Reconstructed Chest Radiographs | Aryan Goyal et.al. | 2603.01686 | translate | read | null |
| 2026-03-02 | SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis | Chuqiao Wu et.al. | 2603.01579 | translate | read | null |
| 2026-03-02 | Align-cDAE: Alzheimer’s Disease Progression Modeling with Attention-Aligned Conditional Diffusion Auto-Encoder | Ayantika Das et.al. | 2603.01552 | translate | read | null |
| 2026-03-02 | RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry | Xinchang Wang et.al. | 2603.01544 | translate | read | null |
| 2026-03-02 | Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing | Zijin Yin et.al. | 2603.01535 | translate | read | null |
| 2026-03-02 | Revisiting Global Token Mixing in Task-Dependent MRI Restoration: Insights from Minimal Gated CNN Baselines | Xiangjian Hou et.al. | 2603.01449 | translate | read | null |
| 2026-03-02 | ALMA High-J CO Spectroscopy of High-Redshift Galaxies. II. 0.03″ Resolution CO Kinematics Reveal Super-Eddington Accretion in a Dust-Obscured Galaxy at z=3.111 | Ken-ichi Tadaki et.al. | 2603.01352 | translate | read | null |
| 2026-03-01 | Teacher-Guided Causal Interventions for Image Denoising: Orthogonal Content-Noise Disentanglement in Vision Transformers | Kuai Jiang et.al. | 2603.01140 | translate | read | null |
| 2026-03-01 | Super-resolution of turbulent reacting flows on complex meshes using graph neural networks | Priyabrat Dash et.al. | 2603.01080 | translate | read | null |
| 2026-03-01 | LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model | Zebin You et.al. | 2603.01068 | translate | read | link |
| 2026-03-01 | Reparameterized Tensor Ring Functional Decomposition for Multi-Dimensional Data Recovery | Yangyang Xu et.al. | 2603.01034 | translate | read | null |
| 2026-03-01 | Fully-analog array signal processor using 3D aperture engineering | Sheng Gao et.al. | 2603.00995 | translate | read | null |
| 2026-03-01 | Spectral Super-Resolution via Adversarial Unfolding and Data-Driven Spectrum Regularization: From Multispectral Satellite Data to NASA Hyperspectral Image | Si-Sheng Young et.al. | 2603.00920 | translate | read | null |
| 2026-03-01 | Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards | Seungwook Kim et.al. | 2603.00918 | translate | read | null |
| 2026-03-01 | Solving a Nonlinear Blind Inverse Problem for Tagged MRI with Physics and Deep Generative Priors | Zhangxing Bian et.al. | 2603.00882 | translate | read | null |
| 2026-03-01 | Neural Discrimination-Prompted Transformers for Efficient UHD Image Restoration and Enhancement | Cong Wang et.al. | 2603.00853 | translate | read | null |