Image Generation - 2026-01 | Paper Arxiv Daily

Image Generation - 2026-01

Publish Date	Title	Authors	PDF	Translate	Read	Code
2026-01-27	A General-Purpose Diversified 2D Seismic Image Dataset from NAMSS	Lucas de Magalhães Araujo et.al.	2602.04890	translate	read	null
2026-01-31	Trajectory Consistency for One-Step Generation on Euler Mean Flows	Zhiqi Li et.al.	2602.02571	translate	read	null
2026-01-30	Super-résolution non supervisée d’images hyperspectrales de télédétection utilisant un entraînement entièrement synthétique	Xinxin Xu et.al.	2602.02552	translate	read	null
2026-01-31	DIAMOND: Directed Inference for Artifact Mitigation in Flow Matching Models	Alicja Polowczyk et.al.	2602.00883	translate	read	null
2026-01-31	Lightweight Super Resolution-enabled Coding Model for the JPEG Pleno Learning-based Point Cloud Coding Standard	André F. R. Guarda et.al.	2602.00863	translate	read	null
2026-01-31	RMFlow: Refined Mean Flow by a Noise-Injection Step for Multimodal Generation	Yuhao Huang et.al.	2602.00849	translate	read	null
2026-01-31	Super-resolution Imaging of Limited-size Objects	Taeyong Chang et.al.	2602.00719	translate	read	null
2026-01-31	Improving Neuropathological Reconstruction Fidelity via AI Slice Imputation	Marina Crespo Aguirre et.al.	2602.00669	translate	read	null
2026-01-31	FaceSnap: Enhanced ID-fidelity Network for Tuning-free Portrait Customization	Benxiang Zhai et.al.	2602.00627	translate	read	null
2026-01-31	Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting	Yian Zhao et.al.	2602.00618	translate	read	null
2026-01-31	Inference-Only Prompt Projection for Safe Text-to-Image Generation with TV Guarantees	Minhyuk Lee et.al.	2602.00616	translate	read	null
2026-01-31	Bridging Degradation Discrimination and Generation for Universal Image Restoration	JiaKui Hu et.al.	2602.00579	translate	read	null
2026-01-31	Toward Autonomous Laboratory Safety Monitoring with Vision Language Models: Learning to See Hazards Through Scene Structure	Trishna Chakraborty et.al.	2602.00414	translate	read	null
2026-01-31	Alignment of Diffusion Model and Flow Matching for Text-to-Image Generation	Yidong Ouyang et.al.	2602.00413	translate	read	null
2026-01-30	PLACID: Identity-Preserving Multi-Object Compositing via Video Diffusion with Synthetic Trajectories	Gemma Canet Tarrés et.al.	2602.00267	translate	read	null
2026-01-30	MapDream: Task-Driven Map Learning for Vision-Language Navigation	Guoxin Lian et.al.	2602.00222	translate	read	null
2026-01-30	Benchmarking Vanilla GAN, DCGAN, and WGAN Architectures for MRI Reconstruction: A Quantitative Analysis	Humaira Mehwish et.al.	2602.00221	translate	read	null
2026-01-30	Stabilizing Diffusion Posterior Sampling by Noise–Frequency Continuation	Feng Tian et.al.	2602.00176	translate	read	null
2026-01-30	PaperBanana: Automating Academic Illustration for AI Scientists	Dawei Zhu et.al.	2601.23265	translate	read	null
2026-01-30	How well do generative models solve inverse problems? A benchmark study	Patrick Krüger et.al.	2601.23238	translate	read	null
2026-01-30	Solving Inverse Problems with Flow-based Models via Model Predictive Control	George Webber et.al.	2601.23231	translate	read	null
2026-01-30	Scale-Cascaded Diffusion Models for Super-Resolution in Medical Imaging	Darshan Thaker et.al.	2601.23201	translate	read	null
2026-01-30	Adaptive Edge Learning for Density-Aware Graph Generation	Seyedeh Ava Razi Razavi et.al.	2601.23052	translate	read	null
2026-01-30	Improving Supervised Machine Learning Performance in Optical Quality Control via Generative AI for Dataset Expansion	Dennis Sprute et.al.	2601.22961	translate	read	null
2026-01-30	MoVE: Mixture of Value Embeddings – A New Axis for Scaling Parametric Memory in Autoregressive Models	Yangyan Li et.al.	2601.22887	translate	read	null
2026-01-30	Synthetic Time Series Generation via Complex Networks	Jaime Vale et.al.	2601.22879	translate	read	null
2026-01-30	NativeTok: Native Visual Tokenization for Improved Image Generation	Bin Wu et.al.	2601.22837	translate	read	null
2026-01-30	Synthetic Abundance Maps for Unsupervised Super-Resolution of Hyperspectral Remote Sensing Images	Xinxin Xu et.al.	2601.22755	translate	read	null
2026-01-30	Stabilizing Consistency Training: A Flow Map Analysis and Self-Distillation	Youngjoong Kim et.al.	2601.22679	translate	read	null
2026-01-30	LINA: Linear Autoregressive Image Generative Models with Continuous Tokens	Jiahao Wang et.al.	2601.22630	translate	read	null
2026-01-30	Time-Annealed Perturbation Sampling: Diverse Generation for Diffusion Language Models	Jingxuan Wu et.al.	2601.22629	translate	read	null
2026-01-30	Corrected Samplers for Discrete Flow Models	Zhengyan Wan et.al.	2601.22519	translate	read	null
2026-01-30	DreamVAR: Taming Reinforced Visual Autoregressive Model for High-Fidelity Subject-Driven Image Generation	Xin Jiang et.al.	2601.22507	translate	read	null
2026-01-30	ScribbleSense: Generative Scribble-Based Texture Editing with Intent Prediction	Yudi Zhang et.al.	2601.22455	translate	read	null
2026-01-29	Jailbreaks on Vision Language Model via Multimodal Reasoning	Aarush Noheria et.al.	2601.22398	translate	read	null
2026-01-29	Little Red Dots on FIRE: The Ability of Bursty Galaxies to Host an Abundant Population of High-Redshift AGN	Andrew Marszewski et.al.	2601.22213	translate	read	null
2026-01-29	One-step Latent-free Image Generation with Pixel Mean Flows	Yiyang Lu et.al.	2601.22158	translate	read	null
2026-01-29	Creative Image Generation with Diffusion Models	Kunpeng Song et.al.	2601.22125	translate	read	null
2026-01-29	RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation	Hanzhuo Huang et.al.	2601.22094	translate	read	null
2026-01-29	Investigating Associational Biases in Inter-Model Communication of Large Generative Models	Fethiye Irmak Dogan et.al.	2601.22093	translate	read	null
2026-01-29	MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources	Baorui Ma et.al.	2601.22054	translate	read	null
2026-01-29	SmartMeterFM: Unifying Smart Meter Data Generative Tasks Using Flow Matching Models	Nan Lin et.al.	2601.21706	translate	read	null
2026-01-29	Noise as a Probe: Membership Inference Attacks on Diffusion Models Leveraging Initial Noise	Puwei Lian et.al.	2601.21628	translate	read	null
2026-01-29	SimGraph: A Unified Framework for Scene Graph-Based Image Generation and Editing	Thanh-Nhan Vo et.al.	2601.21498	translate	read	null
2026-01-29	Revisiting Diffusion Model Predictions Through Dimensionality	Qing Jin et.al.	2601.21419	translate	read	null
2026-01-29	SR $^{2}$ -Net: A General Plug-and-Play Model for Spectral Refinement in Hyperspectral Image Super-Resolution	Ji-Xuan He et.al.	2601.21338	translate	read	null
2026-01-29	Optimization and Mobile Deployment for Anthropocene Neural Style Transfer	Po-Hsun Chen et.al.	2601.21141	translate	read	null
2026-01-28	Shape of Thought: Progressive Object Assembly via Visual Chain-of-Thought	Yu Huo et.al.	2601.21081	translate	read	null
2026-01-28	CompSRT: Quantization and Pruning for Image Super Resolution Transformers	Dorsa Zeinali et.al.	2601.21069	translate	read	null
2026-01-28	Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs	Haochen Zhang et.al.	2601.20911	translate	read	null
2026-01-28	End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting	Jamie Hathaway et.al.	2601.20846	translate	read	null
2026-01-28	Compressible Turbulence as a Source of Particle Beams and Ion Bernstein Waves in Collisionless Plasmas	Chuanpeng Hou et.al.	2601.20842	translate	read	null
2026-01-28	Detecting and Mitigating Memorization in Diffusion Models through Anisotropy of the Log-Probability	Rohan Asthana et.al.	2601.20642	translate	read	null
2026-01-28	CM-GAI: Continuum Mechanistic Generative Artificial Intelligence Theory for Data Dynamics	Shan Tang et.al.	2601.20462	translate	read	null
2026-01-28	Exploiting the Final Component of Generator Architectures for AI-Generated Image Detection	Yanzhu Liu et.al.	2601.20461	translate	read	null
2026-01-28	OSDEnhancer: Taming Real-World Space-Time Video Super-Resolution with One-Step Diffusion	Shuoyan Wei et.al.	2601.20308	translate	read	null
2026-01-28	Reversible Efficient Diffusion for Image Fusion	Xingxin Xu et.al.	2601.20260	translate	read	null
2026-01-28	BLENDER: Blended Text Embeddings and Diffusion Residuals for Intra-Class Image Synthesis in Deep Metric Learning	Jan Niklas Kolf et.al.	2601.20246	translate	read	null
2026-01-28	DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment	Haoyou Deng et.al.	2601.20218	translate	read	null
2026-01-28	TeleStyle: Content-Preserving Style Transfer in Images and Videos	Shiwen Zhang et.al.	2601.20175	translate	read	null
2026-01-27	DiffStyle3D: Consistent 3D Gaussian Stylization via Attention Optimization	Yitong Yang et.al.	2601.19717	translate	read	null
2026-01-27	Cortex-Grounded Diffusion Models for Brain Image Generation	Fabian Bongratz et.al.	2601.19498	translate	read	null
2026-01-27	Engineering Quantum Emission with Mie Voids	Yuchao Fu et.al.	2601.19420	translate	read	null
2026-01-27	LightSBB-M: Bridging Schrödinger and Bass for Generative Diffusion Modeling	Alexandre Alouadi et.al.	2601.19312	translate	read	null
2026-01-27	A sixth-order compact time-splitting Fourier pseudospectral method	Weiguo Gao et.al.	2601.19172	translate	read	null
2026-01-27	GTFMN: Guided Texture and Feature Modulation Network for Low-Light Image Enhancement and Super-Resolution	Yongsong Huang et.al.	2601.19157	translate	read	null
2026-01-27	CLIP-Guided Unsupervised Semantic-Aware Exposure Correction	Puzhen Wu et.al.	2601.19129	translate	read	null
2026-01-27	FBSDiff++: Improved Frequency Band Substitution of Diffusion Features for Efficient and Highly Controllable Text-Driven Image-to-Image Translation	Xiang Gao et.al.	2601.19115	translate	read	null
2026-01-26	Pay Attention to Where You Look	Alex Beriand et.al.	2601.18970	translate	read	null
2026-01-26	Advances in Diffusion-Based Generative Compression	Yibo Yang et.al.	2601.18932	translate	read	null
2026-01-26	SelfieAvatar: Real-time Head Avatar reenactment from a Selfie Video	Wei Liang et.al.	2601.18851	translate	read	null
2026-01-26	OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays	Stephan Naunheim et.al.	2601.18780	translate	read	null
2026-01-26	GimmBO: Interactive Generative Image Model Merging via Bayesian Optimization	Chenxi Liu et.al.	2601.18585	translate	read	null
2026-01-26	GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning	Kaixun Jiang et.al.	2601.18543	translate	read	null
2026-01-23	SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation	Carolin Holtermann et.al.	2601.16803	translate	read	null
2026-01-23	A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal	Rafiul Islam et.al.	2601.16793	translate	read	null
2026-01-23	Evaluating Generative AI in the Lab: Methodological Challenges and Guidelines	Hyerim Park et.al.	2601.16740	translate	read	null
2026-01-23	Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation	Lucía Güitta-López et.al.	2601.16677	translate	read	null
2026-01-23	Fast, faithful and photorealistic diffusion-based image super-resolution with enhanced Flow Map models	Maxence Noble et.al.	2601.16660	translate	read	null
2026-01-23	Edge-Aware Image Manipulation via Diffusion Models with a Novel Structure-Preservation Loss	Minsu Gong et.al.	2601.16645	translate	read	link
2026-01-23	Unsupervised Super-Resolution of Hyperspectral Remote Sensing Images Using Fully Synthetic Training	Xinxin Xu et.al.	2601.16602	translate	read	null
2026-01-23	DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses	Wei Song et.al.	2601.16473	translate	read	null
2026-01-23	Secure Intellicise Wireless Network: Agentic AI for Coverless Semantic Steganography Communication	Rui Meng et.al.	2601.16472	translate	read	null
2026-01-23	Mode Conversion of Hyperbolic Phonon Polaritons in van der Waals terraces	Byung-Il Noh et.al.	2601.16465	translate	read	null
2026-01-23	A Cosine Network for Image Super-Resolution	Chunwei Tian et.al.	2601.16413	translate	read	null
2026-01-22	ProGiDiff: Prompt-Guided Diffusion-Based Medical Image Segmentation	Yuan Lin et.al.	2601.16060	translate	read	null
2026-01-22	Understanding the Transfer Limits of Vision Foundation Models	Shiqi Huang et.al.	2601.15888	translate	read	null
2026-01-22	PMPBench: A Paired Multi-Modal Pan-Cancer Benchmark for Medical Image Synthesis	Yifan Chen et.al.	2601.15884	translate	read	null
2026-01-22	Uncertainty-guided Generation of Dark-field Radiographs	Lina Felsner et.al.	2601.15859	translate	read	null
2026-01-22	TinySense: Effective CSI Compression for Scalable and Accurate Wi-Fi Sensing	Toan Gian et.al.	2601.15838	translate	read	null
2026-01-22	Diffusion Model-Based Data Augmentation for Enhanced Neuron Segmentation	Liuyun Jiang et.al.	2601.15779	translate	read	null
2026-01-22	Beyond Visual Safety: Jailbreaking Multimodal Large Language Models for Harmful Image Generation via Semantic-Agnostic Inputs	Mingyu Yu et.al.	2601.15698	translate	read	null
2026-01-22	Consistency-Regularized GAN for Few-Shot SAR Target Recognition	Yikui Zhai et.al.	2601.15681	translate	read	null
2026-01-22	Relative Classification Accuracy: A Calibrated Metric for Identity Consistency in Fine-Grained K-pop Face Generation	Sylvey Lin et.al.	2601.15560	translate	read	null
2026-01-21	Controllable Layered Image Generation for Real-World Editing	Jinrui Yang et.al.	2601.15507	translate	read	null
2026-01-21	Hybrid Vision Transformer_GAN Attribute Neutralizer for Mitigating Bias in Chest X_Ray Diagnosis	Jobeal Solomon et.al.	2601.15490	translate	read	null
2026-01-21	Ambient Dataloops: Generative Models for Dataset Refinement	Adrián Rodríguez-Muñoz et.al.	2601.15417	translate	read	null
2026-01-21	GeMM-GAN: A Multimodal Generative Model Conditioned on Histopathology Images and Clinical Descriptions for Gene Expression Profile Generation	Francesca Pia Panaccione et.al.	2601.15392	translate	read	link
2026-01-21	OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation	Letian Zhang et.al.	2601.15369	translate	read	null
2026-01-21	Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency	Yikai Wang et.al.	2601.15368	translate	read	null
2026-01-21	Iterative Refinement Improves Compositional Image Generation	Shantanu Jaiswal et.al.	2601.15286	translate	read	null
2026-01-21	Field-Space Autoencoder for Scalable Climate Emulators	Johannes Meuer et.al.	2601.15102	translate	read	null
2026-01-21	The Pictorial Cortex: Zero-Shot Cross-Subject fMRI-to-Image Reconstruction via Compositional Latent Modeling	Jingyang Huo et.al.	2601.15071	translate	read	null
2026-01-21	Differential Privacy Image Generation with Reconstruction Loss and Noise Injection Using an Error Feedback SGD	Qiwei Ma et.al.	2601.15061	translate	read	null
2026-01-21	HyperNet-Adaptation for Diffusion-Based Test Case Generation	Oliver Weißl et.al.	2601.15041	translate	read	null
2026-01-21	TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models	Carolin Holtermann et.al.	2601.14951	translate	read	null
2026-01-21	Synthetic Data Augmentation for Multi-Task Chinese Porcelain Classification: A Stable Diffusion Approach	Ziyao Ling et.al.	2601.14791	translate	read	null
2026-01-21	Semantic-Guided Unsupervised Video Summarization	Haizhou Liu et.al.	2601.14773	translate	read	null
2026-01-21	Enhancing Text-to-Image Generation via End-Edge Collaborative Hybrid Super-Resolution	Chongbin Yi et.al.	2601.14741	translate	read	null
2026-01-21	Mirai: Autoregressive Visual Generation Needs Foresight	Yonghao Yu et.al.	2601.14671	translate	read	null
2026-01-21	3D Space as a Scratchpad for Editable Text-to-Image Generation	Oindrila Saha et.al.	2601.14602	translate	read	null
2026-01-20	Quantum Super-resolution by Adaptive Non-local Observables	Hsin-Yi Lin et.al.	2601.14433	translate	read	null
2026-01-20	When Generative AI Is Intimate, Sexy, and Violent: Examining Not-Safe-For-Work (NSFW) Chatbots on FlowGPT	Xian Li et.al.	2601.14324	translate	read	null
2026-01-20	Implicit Neural Representation Facilitates Unified Universal Vision Encoding	Matthew Gwilliam et.al.	2601.14256	translate	read	link
2026-01-20	Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic	Saad Mankarious et.al.	2601.14124	translate	read	null
2026-01-20	Likelihood-Separable Diffusion Inference for Multi-Image MRI Super-Resolution	Samuel W. Remedios et.al.	2601.14030	translate	read	null
2026-01-20	SHARE: A Fully Unsupervised Framework for Single Hyperspectral Image Restoration	Jiangwei Xie et.al.	2601.13987	translate	read	null
2026-01-20	Asymmetric regularization mechanism for GAN training with Variational Inequalities	Spyridon C. Giagtzoglou et.al.	2601.13920	translate	read	null
2026-01-20	Prospecting MeerKAT Continuum Data for Enigmatic Radio Sources with Unsupervised Vector-Quantised Variational Autoencoders	Fernando L. Ventura et.al.	2601.13901	translate	read	null
2026-01-20	Generative Adversarial Networks for Resource State Generation	Shahbaz Shaik et.al.	2601.13708	translate	read	null
2026-01-20	Dynamic Differential Linear Attention: Enhancing Linear Diffusion Transformer for High-Quality Image Generation	Boyuan Cao et.al.	2601.13683	translate	read	null
2026-01-19	SpatialBench-UC: Uncertainty-Aware Evaluation of Spatial Prompt Following in Text-to-Image Generation	Amine Rostane et.al.	2601.13462	translate	read	null
2026-01-19	Diffusion Representations for Fine-Grained Image Classification: A Marine Plankton Case Study	A. Nieto Juscafresa et.al.	2601.13416	translate	read	null
2026-01-19	StyMam: A Mamba-Based Generator for Artistic Style Transfer	Zhou Hong et.al.	2601.12954	translate	read	null
2026-01-19	AI-generated data contamination erodes pathological variability and diagnostic reliability	Hongyu He et.al.	2601.12946	translate	read	null
2026-01-19	Generalizable and Animatable 3D Full-Head Gaussian Avatar from a Single Image	Shuling Zhao et.al.	2601.12770	translate	read	null
2026-01-19	SSPFormer: Self-Supervised Pretrained Transformer for MRI Images	Jingkai Li et.al.	2601.12747	translate	read	null
2026-01-16	ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models	Linqing Zhong et.al.	2601.11404	translate	read	null
2026-01-16	Shape-morphing programming of soft materials on complex geometries via neural operator	Lu Chen et.al.	2601.11126	translate	read	null
2026-01-16	Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset	Kaito Urata et.al.	2601.11085	translate	read	null
2026-01-15	The BigBite Calorimeter for the Super Bigbite Spectrometer Program at Jefferson Lab	Provakar Datta et.al.	2601.10799	translate	read	null
2026-01-15	A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5	Xingjun Ma et.al.	2601.10527	translate	read	link
2026-01-15	Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders	Siqi Kou et.al.	2601.10332	translate	read	link
2026-01-15	Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text	Piyush Singh Pasi et.al.	2601.10096	translate	read	link
2026-01-15	Thinking Like Van Gogh: Structure-Aware Style Transfer via Flow-Guided 3D Gaussian Splatting	Zhendong Wang et.al.	2601.10075	translate	read	null
2026-01-15	CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation	Chengzhuo Tong et.al.	2601.10061	translate	read	link
2026-01-15	Resistive Memory based Efficient Machine Unlearning and Continual Learning	Ning Lin et.al.	2601.10037	translate	read	null
2026-01-14	VibrantSR: Sub-Meter Canopy Height Models from Sentinel-2 Using Generative Flow Matching	Kiarie Ndegwa et.al.	2601.09866	translate	read	null
2026-01-14	NanoSD: Edge Efficient Foundation Model for Real Time Image Restoration	Subhajit Sanyal et.al.	2601.09823	translate	read	null
2026-01-14	Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning	Dongjie Cheng et.al.	2601.09536	translate	read	link
2026-01-14	Detail Loss in Super-Resolution Models Based on the Laplacian Pyramid and Repeated Upscaling and Downscaling Process	Sangjun Han et.al.	2601.09410	translate	read	null
2026-01-14	PhyRPR: Training-Free Physics-Constrained Video Generation	Yibo Zhao et.al.	2601.09255	translate	read	null
2026-01-14	Knowledge-Embedded and Hypernetwork-Guided Few-Shot Substation Meter Defect Image Generation Method	Jackie Alex et.al.	2601.09238	translate	read	null
2026-01-14	SpikeVAEDiff: Neural Spike-based Natural Visual Scene Reconstruction via VD-VAE and Versatile Diffusion	Jialu Li et.al.	2601.09213	translate	read	null
2026-01-14	Annealed Relaxation of Speculative Decoding for Faster Autoregressive Image Generation	Xingyao Li et.al.	2601.09212	translate	read	null
2026-01-14	Architecture inside the mirage: evaluating generative image models on architectural style, elements, and typologies	Jamie Magrill et.al.	2601.09169	translate	read	null
2026-01-14	How Many Human Judgments Are Enough? Feasibility Limits of Human Preference Evaluation	Wilson Y. Lee et.al.	2601.09084	translate	read	null
2026-01-12	TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts	Yu Xu et.al.	2601.08881	translate	read	null
2026-01-13	S3-CLIP: Video Super Resolution for Person-ReID	Tamas Endrei et.al.	2601.08807	translate	read	null
2026-01-13	Aggregating Diverse Cue Experts for AI-Generated Image Detection	Lei Tan et.al.	2601.08790	translate	read	null
2026-01-13	Translating Light-Sheet Microscopy Images to Virtual H&E Using CycleGAN	Yanhua Zhao et.al.	2601.08776	translate	read	null
2026-01-13	SafeRedir: Prompt Embedding Redirection for Robust Unlearning in Image Generation Models	Renyang Liu et.al.	2601.08623	translate	read	null
2026-01-13	MASH: Evading Black-Box AI-Generated Text Detectors via Style Humanization	Yongtong Gu et.al.	2601.08564	translate	read	null
2026-01-13	From Local Windows to Adaptive Candidates via Individualized Exploratory: Rethinking Attention for Image Super-Resolution	Chunyu Meng et.al.	2601.08341	translate	read	null
2026-01-13	IGAN: A New Inception-based Model for Stable and High-Fidelity Image Synthesis Using Generative Adversarial Networks	Ahmed A. Hashim et.al.	2601.08332	translate	read	null
2026-01-13	UM-Text: A Unified Multimodal Model for Image Understanding	Lichen Ma et.al.	2601.08321	translate	read	null
2026-01-13	SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices	Dongting Hu et.al.	2601.08303	translate	read	null
2026-01-13	A Usable GAN-Based Tool for Synthetic ECG Generation in Cardiac Amyloidosis Research	Francesco Speziale et.al.	2601.08260	translate	read	null
2026-01-12	MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head	Kewei Zhang et.al.	2601.07832	translate	read	link
2026-01-12	Evaluating the encoding competence of visual language models using uncommon actions	Chen Ling et.al.	2601.07737	translate	read	null
2026-01-12	Sub-Pixel Electron Beam Alignment for Machine Learning Characterization of Hybrid Pixel Detectors	Emiliya Poghosyan et.al.	2601.07682	translate	read	null
2026-01-12	Advancing Multinational License Plate Recognition Through Synthetic and Real Data Fusion: A Comprehensive Evaluation	Rayson Laroca et.al.	2601.07671	translate	read	null
2026-01-12	GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection	Chen Min et.al.	2601.07273	translate	read	null
2026-01-12	Language-Grounded Multi-Domain Image Translation via Semantic Difference Guidance	Jongwon Ryu et.al.	2601.07221	translate	read	null
2026-01-11	When Humans Judge Irises: Pupil Size Normalization as an Aid and Synthetic Irises as a Challenge	Mahsa Mitcheff et.al.	2601.06725	translate	read	null
2026-01-10	Boosting Overlapping Organoid Instance Segmentation Using Pseudo-Label Unmixing and Synthesis-Assisted Learning	Gui Huang et.al.	2601.06642	translate	read	null
2026-01-10	Sissi: Zero-shot Style-guided Image Synthesis via Semantic-style Integration	Yingying Deng et.al.	2601.06605	translate	read	null
2026-01-10	UMLoc: Uncertainty-Aware Map-Constrained Inertial Localization with Quantified Bounds	Mohammed S. Alharbi et.al.	2601.06602	translate	read	null
2026-01-10	APEX: Learning Adaptive Priorities for Multi-Objective Alignment in Vision-Language Generation	Dongliang Chen et.al.	2601.06574	translate	read	null
2026-01-10	A novel RF-enabled Non-Destructive Inspection Method through Machine Learning and Programmable Wireless Environments	Stavros Tsimpoukis et.al.	2601.06512	translate	read	null
2026-01-10	GlobalPaint: Spatiotemporal Coherent Video Outpainting with Global Feature Guidance	Yueming Pan et.al.	2601.06413	translate	read	null
2026-01-10	From Easy to Hard++: Promoting Differentially Private Image Synthesis Through Spatial-Frequency Curriculum	Chen Gong et.al.	2601.06368	translate	read	null
2026-01-09	Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers	Binxu Wang et.al.	2601.06338	translate	read	null
2026-01-08	QwenStyle: Content-Preserving Style Transfer with Qwen-Image-Edit	Shiwen Zhang et.al.	2601.06202	translate	read	null
2026-01-07	Think Bright, Diffuse Nice: Enhancing T2I-ICL via Inductive-Bias Hint Instruction and Query Contrastive Decoding	Zhiyong Ma et.al.	2601.06169	translate	read	null
2026-01-09	Multi-Modal Style Transfer-based Prompt Tuning for Efficient Federated Domain Generalization	Yuliang Chen et.al.	2601.05955	translate	read	null
2026-01-09	Kidney Cancer Detection Using 3D-Based Latent Diffusion Models	Jen Dusseljee et.al.	2601.05852	translate	read	null
2026-01-09	GenCtrl – A Formal Controllability Toolkit for Generative Models	Emily Cheng et.al.	2601.05637	translate	read	link
2026-01-09	Text Detoxification in isiXhosa and Yorùbá: A Cross-Lingual Machine Learning Approach for Low-Resource African Languages	Abayomi O. Agbeyangi et.al.	2601.05624	translate	read	null
2026-01-09	MoGen: A Unified Collaborative Framework for Controllable Multi-Object Image Generation	Yanfeng Li et.al.	2601.05546	translate	read	null
2026-01-09	Multi-Image Super Resolution Framework for Detection and Analysis of Plant Roots	Shubham Agarwal et.al.	2601.05482	translate	read	null
2026-01-08	Coding the Visual World: From Image to Simulation Using Vision Language Models	Sagi Eppel et.al.	2601.05344	translate	read	null
2026-01-08	Multi-Scale Local Speculative Decoding for Image Generation	Elia Peruzzo et.al.	2601.05149	translate	read	null
2026-01-08	Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing	Runze He et.al.	2601.05124	translate	read	link
2026-01-08	From Rays to Projections: Better Inputs for Feed-Forward View Synthesis	Zirui Wu et.al.	2601.05116	translate	read	null
2026-01-08	Exponential capacity scaling of classical GANs compared to hybrid latent style-based quantum GANs	Milan Liepelt et.al.	2601.05036	translate	read	null
2026-01-08	OnomaCompass: A Texture Exploration Interface that Shuttles between Words and Images	Miki Okamura et.al.	2601.04915	translate	read	null
2026-01-08	Illumination Angular Spectrum Encoding for Controlling the Functionality of Diffractive Networks	Matan Kleiner et.al.	2601.04825	translate	read	null
2026-01-08	SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning	Xihe Qiu et.al.	2601.04785	translate	read	null
2026-01-08	Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models	Yanbing Zeng et.al.	2601.04706	translate	read	link
2026-01-08	HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution	Yang Zou et.al.	2601.04682	translate	read	null
2026-01-08	HyperAlign: Hyperbolic Entailment Cones for Adaptive Text-to-Image Alignment Assessment	Wenzhi Chen et.al.	2601.04614	translate	read	null
2026-01-08	3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks	Yusri Al-Sanaani et.al.	2601.04588	translate	read	null
2026-01-08	Studies in Astronomical Time Series Analysis: The Double Lomb-Scargle Periodogram and Super Resolution	Jeffrey D. Scargle et.al.	2601.04552	translate	read	null
2026-01-08	FaceRefiner: High-Fidelity Facial Texture Refinement with Differentiable Rendering-based Style Transfer	Chengyang Li et.al.	2601.04520	translate	read	null
2026-01-07	UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving	Zhexiao Xiong et.al.	2601.04453	translate	read	null
2026-01-07	Modifications to Image Phase Alignment Super-sampling Produce up to 4.4 times Increased Image Resolution	James N. Caron et.al.	2601.04391	translate	read	null
2026-01-07	Unified Text-Image Generation with Weakness-Targeted Post-Training	Jiahui Chen et.al.	2601.04339	translate	read	null
2026-01-07	Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion	Yuanfeng Xu et.al.	2601.04056	translate	read	null
2026-01-07	Padé Neurons for Efficient Neural Models	Onur Keleş et.al.	2601.04005	translate	read	null
2026-01-07	ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation	Xu Zhang et.al.	2601.03955	translate	read	link
2026-01-07	Local Interpolation via Low-Rank Tensor Trains	Siddhartha E. Guzman et.al.	2601.03885	translate	read	null
2026-01-07	FLNet: Flood-Induced Agriculture Damage Assessment using Super Resolution of Satellite Images	Sanidhya Ghosal et.al.	2601.03884	translate	read	null
2026-01-07	Logic Tensor Network-Enhanced Generative Adversarial Network	Nijesh Upreti et.al.	2601.03839	translate	read	null
2026-01-07	Zak-OTFS ISAC with Bistatic Sensing via Semi-Blind Atomic Norm Denoising Scheme	Kecheng Zhang et.al.	2601.03639	translate	read	null
2026-01-07	Physics-Constrained Cross-Resolution Enhancement Network for Optics-Guided Thermal UAV Image Super-Resolution	Zhicheng Zhao et.al.	2601.03526	translate	read	null
2026-01-07	GeoDiff-SAR: A Geometric Prior Guided Diffusion Model for SAR Image Generation	Fan Zhang et.al.	2601.03499	translate	read	null
2026-01-06	Understanding Reward Hacking in Text-to-Image Reinforcement Learning	Yunqi Hong et.al.	2601.03468	translate	read	null
2026-01-06	ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing	Hengjia Li et.al.	2601.03467	translate	read	null
2026-01-06	Discriminating real and synthetic super-resolved audio samples using embedding-based classifiers	Mikhail Silaev et.al.	2601.03443	translate	read	null
2026-01-06	Edit2Restore:Few-Shot Image Restoration via Parameter-Efficient Adaptation of Pre-trained Editing Models	M. Akın Yılmaz et.al.	2601.03391	translate	read	link
2026-01-06	Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training	Hexiao Lu et.al.	2601.03256	translate	read	link
2026-01-06	UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision	Ruiyan Han et.al.	2601.03193	translate	read	link
2026-01-06	Unified Thinker: A General Reasoning Modular Core for Image Generation	Sashuai Zhou et.al.	2601.03127	translate	read	null
2026-01-06	Stroke Patches: Customizable Artistic Image Styling Using Regression	Ian Jaffray et.al.	2601.03114	translate	read	null
2026-01-06	LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing	Wingwa Fu et.al.	2601.02987	translate	read	null
2026-01-06	VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on	Xinyi Wei et.al.	2601.02945	translate	read	null
2026-01-06	GRRE: Leveraging G-Channel Removed Reconstruction Error for Robust Detection of AI-Generated Images	Shuman He et.al.	2601.02709	translate	read	null
2026-01-05	Annealed Langevin Posterior Sampling (ALPS): A Rapid Algorithm for Image Restoration with Multiscale Energy Models	Jyothi Rikhab Chand et.al.	2601.02594	translate	read	null
2026-01-05	VIBE: Visual Instruction Based Editor	Grigorii Alekseenko et.al.	2601.02242	translate	read	link
2026-01-05	Beam-Brainstorm: A Generative Site-Specific Beamforming Approach	Zihao Zhou et.al.	2601.02219	translate	read	null
2026-01-05	Unraveling MMDiT Blocks: Training-free Analysis and Enhancement of Text-conditioned Diffusion	Binglei Li et.al.	2601.02211	translate	read	null
2026-01-05	Seeing the Unseen: Zooming in the Dark with Event Cameras	Dachun Kai et.al.	2601.02206	translate	read	null
2026-01-05	Agentic Retoucher for Text-To-Image Generation	Shaocheng Shen et.al.	2601.02046	translate	read	null
2026-01-05	AlignVTOFF: Texture-Spatial Feature Alignment for High-Fidelity Virtual Try-Off	Yihan Zhu et.al.	2601.02038	translate	read	null
2026-01-05	SerpentFlow: Generative Unpaired Domain Alignment via Shared-Structure Decomposition	Julie Keisler et.al.	2601.01979	translate	read	null
2026-01-05	Score-based diffusion models for accurate crystal-structure inpainting and reconstruction of hydrogen positions	Timo Reents et.al.	2601.01959	translate	read	null
2026-01-04	Guiding Token-Sparse Diffusion Models	Felix Krause et.al.	2601.01608	translate	read	null
2026-01-04	Improving Flexible Image Tokenizers for Autoregressive Image Generation	Zixuan Fu et.al.	2601.01535	translate	read	null
2026-01-04	Domain Adaptation of Carotid Ultrasound Images using Generative Adversarial Network	Mohd Usama et.al.	2601.01460	translate	read	null
2026-01-04	Image Synthesis Using Spintronic Deep Convolutional Generative Adversarial Network	Saumya Gupta et.al.	2601.01441	translate	read	null
2026-01-04	SwinIFS: Landmark Guided Swin Transformer For Identity Preserving Face Super Resolution	Habiba Kausar et.al.	2601.01406	translate	read	null
2026-01-04	Poisson Centralisers and Polynomial Superintegrability for Magnetic Geodesic Flows on Reductive Homogeneous Spaces	Kai Jiang et.al.	2601.01369	translate	read	null
2026-01-03	Diffusion Timbre Transfer Via Mutual Information Guided Inpainting	Ching Ho Lee et.al.	2601.01294	translate	read	null
2026-01-03	Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment	Bac Nguyen et.al.	2601.01224	translate	read	null
2026-01-03	RefSR-Adv: Adversarial Attack on Reference-based Image Super-Resolution Models	Jiazhu Dai et.al.	2601.01202	translate	read	null
2026-01-03	Comparative Evaluation of VAE, GAN, and SMOTE for Tor Detection in Encrypted Network Traffic	Saravanan A et.al.	2601.01183	translate	read	null
2026-01-03	Neural Networks on Symmetric Spaces of Noncompact Type	Xuan Son Nguyen et.al.	2601.01097	translate	read	null
2026-01-03	Unsupervised Text Style Transfer for Controllable Intensity	Shuhuan Gu et.al.	2601.01060	translate	read	null
2026-01-03	Out-of-Band Power Side-Channel Detection for Semiconductor Supply Chain Integrity at Scale	Rajiv Thummala et.al.	2601.01054	translate	read	null
2026-01-02	Contractive Diffusion Policies: Robust Action Diffusion via Contractive Score-Based Sampling with Differential Equations	Amin Abyaneh et.al.	2601.01003	translate	read	null
2026-01-02	Peak-Nadir Encoding for Efficient CGM Data Compression and High-Fidelity Reconstruction	Clara Bender et.al.	2601.00608	translate	read	null
2026-01-01	Application Research of a Deep Learning Model Integrating CycleGAN and YOLO in PCB Infrared Defect Detection	Chao Yang et.al.	2601.00237	translate	read	null
2026-01-01	Unknown Aware AI-Generated Content Attribution	Ellie Thieu et.al.	2601.00218	translate	read	null
2026-01-01	MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing	Xiaokun Sun et.al.	2601.00204	translate	read	null
2026-01-01	DichroGAN: Towards Restoration of in-air Colours of Seafloor from Satellite Imagery	Salma Gonzalez-Sabbagh et.al.	2601.00194	translate	read	null
2026-01-01	SSI-GAN: Semi-Supervised Swin-Inspired Generative Adversarial Networks for Neuronal Spike Classification	Danial Sharifrazi et.al.	2601.00189	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)