Image Generation - 2025-11
Image Generation - 2025-11
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-11-28 | Breaking Scale Anchoring: Frequency Representation Learning for Accurate High-Resolution Inference from Low-Resolution Training | Wenshuo Wang et.al. | 2512.05132 | translate | read | null |
| 2025-11-30 | ChatGPT-5 in Secondary Education: A Mixed-Methods Analysis of Student Attitudes, AI Anxiety, and Hallucination-Aware Use | Tryfon Sivenas et.al. | 2512.04109 | translate | read | null |
| 2025-11-29 | Dispersion Outperforms Absorption: EIT-Enhanced Atomic Localization and Gradient Sensing with Super-Gaussian Beams | Mahboob Ul Haq et.al. | 2512.02063 | translate | read | null |
| 2025-11-30 | Accelerating Inference of Masked Image Generators via Reinforcement Learning | Pranav Subbaraman et.al. | 2512.01094 | translate | read | null |
| 2025-11-30 | Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model | Jing He et.al. | 2512.01030 | translate | read | null |
| 2025-11-30 | MM-ACT: Learn from Multimodal Parallel Generation to Act | Haotian Liang et.al. | 2512.00975 | translate | read | null |
| 2025-11-30 | Less is More: Resource-Efficient Low-Rank Adaptation | Chunlin Tian et.al. | 2512.00878 | translate | read | null |
| 2025-11-30 | BioPro: On Difference-Aware Gender Fairness for Vision-Language Models | Yujie Lin et.al. | 2512.00807 | translate | read | null |
| 2025-11-30 | Charts Are Not Images: On the Challenges of Scientific Chart Editing | Shawn Li et.al. | 2512.00752 | translate | read | null |
| 2025-11-30 | Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards | Qiang Lyu et.al. | 2512.00743 | translate | read | null |
| 2025-11-29 | XAI-Driven Skin Disease Classification: Leveraging GANs to Augment ResNet-50 Performance | Kim Gerard A. Villanueva et.al. | 2512.00626 | translate | read | null |
| 2025-11-29 | NeuroVolve: Evolving Visual Stimuli toward Programmable Neural Objectives | Haomiao Chen et.al. | 2512.00557 | translate | read | null |
| 2025-11-29 | SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning | Yongkang Hu et.al. | 2512.00539 | translate | read | null |
| 2025-11-29 | Image Generation as a Visual Planner for Robotic Manipulation | Ye Pang et.al. | 2512.00532 | translate | read | null |
| 2025-11-29 | RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards | Junyan Ye et.al. | 2512.00473 | translate | read | null |
| 2025-11-29 | FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal | Hang Xu et.al. | 2512.00438 | translate | read | null |
| 2025-11-29 | Recognizing Pneumonia in Real-World Chest X-rays with a Classifier Trained with Images Synthetically Generated by Nano Banana | Jiachuan Peng et.al. | 2512.00428 | translate | read | null |
| 2025-11-29 | HIMOSA: Efficient Remote Sensing Image Super-Resolution with Hierarchical Mixture of Sparse Attention | Yi Liu et.al. | 2512.00275 | translate | read | null |
| 2025-11-29 | USB: Unified Synthetic Brain Framework for Bidirectional Pathology-Healthy Generation and Editing | Jun Wang et.al. | 2512.00269 | translate | read | null |
| 2025-11-28 | SD-CGAN: Conditional Sinkhorn Divergence GAN for DDoS Anomaly Detection in IoT Networks | Henry Onyeka et.al. | 2512.00251 | translate | read | null |
| 2025-11-28 | Near-Field Channel Estimation and Joint Angle-Range Recovery in XL-MIMO Systems: A Gridless Super-Resolution Approach | Feng Xi et.al. | 2511.23187 | translate | read | null |
| 2025-11-28 | Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation | Felipe Akio Matsuoka et.al. | 2511.23066 | translate | read | null |
| 2025-11-28 | MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation | Yuta Oshima et.al. | 2511.22989 | translate | read | null |
| 2025-11-28 | TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE | Jiawen Wei et.al. | 2511.22853 | translate | read | null |
| 2025-11-05 | Diffusion-Based Image Editing: An Unforeseen Adversary to Robust Invisible Watermarks | Wenkai Fu et.al. | 2511.05598 | translate | read | null |
| 2025-11-06 | Sublinear iterations can suffice even for DDPMs | Matthew S. Zhang et.al. | 2511.04844 | translate | read | null |
| 2025-11-06 | Prompt-Based Safety Guidance Is Ineffective for Unlearned Text-to-Image Diffusion Models | Jiwoo Shin et.al. | 2511.04834 | translate | read | null |
| 2025-11-06 | Quantifying the Climate Risk of Generative AI: Region-Aware Carbon Accounting with G-TRACE and the AI Sustainability Pyramid | Zahida Kausar et.al. | 2511.04776 | translate | read | null |
| 2025-11-06 | CPO: Condition Preference Optimization for Controllable Image Generation | Zonglin Lyu et.al. | 2511.04753 | translate | read | null |
| 2025-11-06 | ForecastGAN: A Decomposition-Based Adversarial Framework for Multi-Horizon Time Series Forecasting | Syeda Sitara Wishal Fatima et.al. | 2511.04445 | translate | read | null |
| 2025-11-06 | AStF: Motion Style Transfer via Adaptive Statistics Fusor | Hanmo Chen et.al. | 2511.04192 | translate | read | null |
| 2025-11-06 | Text to Sketch Generation with Multi-Styles | Tengjie Li et.al. | 2511.04123 | translate | read | null |
| 2025-11-06 | Tortoise and Hare Guidance: Accelerating Diffusion Model Inference with Multirate Integration | Yunghee Lee et.al. | 2511.04117 | translate | read | null |
| 2025-11-06 | SpatialLock: Precise Spatial Control in Text-to-Image Synthesis | Biao Liu et.al. | 2511.04112 | translate | read | null |
| 2025-11-05 | Evolutionary Optimization Trumps Adam Optimization on Embedding Space Exploration | Domício Pereira Neto et.al. | 2511.03913 | translate | read | null |
| 2025-11-04 | Attention-based ROI Discovery in 3D Tissue Images | Hossein Fathollahian et.al. | 2511.03751 | translate | read | null |
| 2025-11-05 | SHIELD: Securing Healthcare IoT with Efficient Machine Learning Techniques for Anomaly Detection | Mahek Desai et.al. | 2511.03661 | translate | read | null |
| 2025-11-05 | Seeing What You Say: Expressive Image Generation from Speech | Jiyoung Lee et.al. | 2511.03423 | translate | read | null |
| 2025-11-05 | Finetuning-Free Personalization of Text to Image Generation via Hypernetworks | Sagar Shrestha et.al. | 2511.03156 | translate | read | null |
| 2025-11-04 | Inference-Time Personalized Alignment with a Few User Preference Queries | Victor-Alexandru Pădurean et.al. | 2511.02966 | translate | read | null |
| 2025-11-04 | Diffusion Models are Robust Pretrainers | Mika Yagoda et.al. | 2511.02793 | translate | read | null |
| 2025-11-04 | A Non-Adversarial Approach to Idempotent Generative Modelling | Mohammed Al-Jaff et.al. | 2511.02614 | translate | read | null |
| 2025-11-04 | Generalizable super-resolution turbulence reconstruction from minimal training data | Wu Haokai et.al. | 2511.02604 | translate | read | null |
| 2025-11-04 | TAUE: Training-free Noise Transplant and Cultivation Diffusion Model | Daichi Nagai et.al. | 2511.02580 | translate | read | null |
| 2025-11-04 | Implementation and Evaluation of Stable Diffusion on a General-Purpose CGLA Accelerator | Takuto Ando et.al. | 2511.02530 | translate | read | null |
| 2025-11-04 | DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding | Zixuan Liu et.al. | 2511.02495 | translate | read | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | translate | read | null |
| 2025-11-04 | KAO: Kernel-Adaptive Optimization in Diffusion for Satellite Image | Teerapong Panboonyuen et.al. | 2511.02462 | translate | read | null |
| 2025-11-04 | Synthetic Crop-Weed Image Generation and its Impact on Model Generalization | Garen Boyadjian et.al. | 2511.02417 | translate | read | null |
| 2025-11-04 | LiveSecBench: A Dynamic and Culturally-Relevant AI Safety Benchmark for LLMs in Chinese Context | Yudong Li et.al. | 2511.02366 | translate | read | null |
| 2025-11-02 | Deciphering Personalization: Towards Fine-Grained Explainability in Natural Language for Personalized Image Generation Models | Haoming Wang et.al. | 2511.01932 | translate | read | null |
| 2025-11-03 | Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process | Jiayi Chen et.al. | 2511.01718 | translate | read | null |
| 2025-11-03 | Generative Adversarial Synthesis and Deep Feature Discrimination of Brain Tumor MRI Images | Md Sumon Ali et.al. | 2511.01574 | translate | read | null |
| 2025-11-03 | NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation | Serkan Ozturk et.al. | 2511.01517 | translate | read | null |
| 2025-11-03 | Diffusion Transformer meets Multi-level Wavelet Spectrum for Single Image Super-Resolution | Peng Du et.al. | 2511.01175 | translate | read | null |
| 2025-11-03 | Conditional Diffusion Model-Enabled Scenario-Specific Neural Receivers for Superimposed Pilot Schemes | Xingyu Zhou et.al. | 2511.01173 | translate | read | null |
| 2025-11-03 | ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation | Yongyuan Liang et.al. | 2511.01163 | translate | read | null |
| 2025-11-02 | MedEqualizer: A Framework Investigating Bias in Synthetic Medical Data and Mitigation via Augmentation | Sama Salarian et.al. | 2511.01054 | translate | read | null |
| 2025-11-02 | Deep Generative Models for Enhanced Vitreous OCT Imaging | Simone Sarrocco et.al. | 2511.00881 | translate | read | null |
| 2025-11-02 | Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials | Yifan Pu et.al. | 2511.00833 | translate | read | null |
| 2025-11-02 | EraseFlow: Learning Concept Erasure Policies via GFlowNet-Driven Alignment | Abhiram Kusumba et.al. | 2511.00804 | translate | read | null |
| 2025-11-02 | Erasing ‘Ugly’ from the Internet: Propagation of the Beauty Myth in Text-Image Models | Tanvi Dinkar et.al. | 2511.00749 | translate | read | null |
| 2025-11-01 | Evolve to Inspire: Novelty Search for Diverse Image Generation | Alex Inch et.al. | 2511.00686 | translate | read | null |
| 2025-11-01 | Enhancing Frequency Forgery Clues for Diffusion-Generated Image Detection | Daichi Zhang et.al. | 2511.00429 | translate | read | null |
| 2025-11-01 | Exploiting Latent Space Discontinuities for Building Universal LLM Jailbreaks and Data Extraction Attacks | Kayua Oleques Paim et.al. | 2511.00346 | translate | read | null |
| 2025-11-01 | OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data | Amir Ziashahabi et.al. | 2511.00345 | translate | read | null |
(<a href=../Image_Generation.md>back to Image Generation</a>)