Image Generation - 2025-10
Image Generation - 2025-10
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-10-26 | EEGReXferNet: A Lightweight Gen-AI Framework for EEG Subspace Reconstruction via Cross-Subject Transfer Learning and Channel-Aware Embedding | Shantanu Sarkar et.al. | 2511.02848 | translate | read | null |
| 2025-10-31 | Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models | Sai Niranjan Ramachandran et.al. | 2511.00124 | translate | read | null |
| 2025-10-31 | End-to-End Framework Integrating Generative AI and Deep Reinforcement Learning for Autonomous Ultrasound Scanning | Hanae Elmekki et.al. | 2511.00114 | translate | read | null |
| 2025-10-30 | Chain of Time: In-Context Physical Simulation with Image Generation Models | YingQiao Wang et.al. | 2511.00110 | translate | read | null |
| 2025-10-30 | A generative adversarial network optimization method for damage detection and digital twinning by deep AI fault learning: Z24 Bridge structural health monitoring benchmark validation | Marios Impraimakis et.al. | 2511.00099 | translate | read | null |
| 2025-10-26 | Gen AI in Automotive: Applications, Challenges, and Opportunities with a Case study on In-Vehicle Experience | Chaitanya Shinde et.al. | 2511.00026 | translate | read | null |
| 2025-10-31 | From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration | Jianwen Sun et.al. | 2510.27452 | translate | read | null |
| 2025-10-31 | A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection | Sales Aribe Jr et.al. | 2510.27392 | translate | read | null |
| 2025-10-31 | Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis | Weiming Chen et.al. | 2510.27324 | translate | read | null |
| 2025-10-31 | H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models | Mingyu Sung et.al. | 2510.27171 | translate | read | null |
| 2025-10-31 | E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources | Tong Shen et.al. | 2510.27135 | translate | read | null |
| 2025-10-31 | A Hierarchical Deep Learning Model for Predicting Pedestrian-Level Urban Winds | Reda Snaiki et.al. | 2510.27101 | translate | read | null |
| 2025-10-30 | BI-DCGAN: A Theoretically Grounded Bayesian Framework for Efficient and Diverse GANs | Mahsa Valizadeh et.al. | 2510.26892 | translate | read | null |
| 2025-10-29 | Beyond Data Scarcity Optimizing R3GAN for Medical Image Generation from Small Datasets | Tsung-Wei Pan et.al. | 2510.26828 | translate | read | null |
| 2025-10-30 | ResMatching: Noise-Resilient Computational Super-Resolution via Guided Conditional Flow Matching | Anirban Ray et.al. | 2510.26601 | translate | read | null |
| 2025-10-30 | Emu3.5: Native Multimodal Models are World Learners | Yufeng Cui et.al. | 2510.26583 | translate | read | null |
| 2025-10-30 | Quantum Gated Recurrent GAN with Gaussian Uncertainty for Network Anomaly Detection | Wajdi Hammami et.al. | 2510.26487 | translate | read | null |
| 2025-10-30 | EEG-Driven Image Reconstruction with Saliency-Guided Diffusion Models | Igor Abramov et.al. | 2510.26391 | translate | read | null |
| 2025-10-30 | Generative Artificial Intelligence for Air Shower Simulation | C. Bozza et.al. | 2510.26316 | translate | read | null |
| 2025-10-30 | Security Risk of Misalignment between Text and Image in Multi-modal Model | Xiaosen Wang et.al. | 2510.26105 | translate | read | null |
| 2025-10-30 | New Money: A Systematic Review of Synthetic Data Generation for Finance | James Meldrum et.al. | 2510.26076 | translate | read | null |
| 2025-10-29 | SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing | Sung-Hoon Yoon et.al. | 2510.25970 | translate | read | null |
| 2025-10-29 | MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency | Nicolas Dufour et.al. | 2510.25897 | translate | read | null |
| 2025-10-29 | ScaleDiff: Higher-Resolution Image Synthesis via Efficient and Model-Agnostic Diffusion | Sungho Koh et.al. | 2510.25818 | translate | read | null |
| 2025-10-29 | DiagramEval: Evaluating LLM-Generated Diagrams via Graphs | Chumeng Liang et.al. | 2510.25761 | translate | read | null |
| 2025-10-29 | Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation | Zhi-Kai Chen et.al. | 2510.25739 | translate | read | null |
| 2025-10-29 | BOLT-GAN: Bayes-Optimal Loss for Stable GAN Training | Mohammadreza Tavasoli Naeini et.al. | 2510.25609 | translate | read | null |
| 2025-10-29 | Target-Guided Bayesian Flow Networks for Quantitatively Constrained CAD Generation | Wenhao Zheng et.al. | 2510.25163 | translate | read | null |
| 2025-10-29 | PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes | Xiang liu et.al. | 2510.25084 | translate | read | null |
| 2025-10-28 | Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation | Inclusion AI et.al. | 2510.24821 | translate | read | null |
| 2025-10-28 | SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing | Ruiyang Zhang et.al. | 2510.24820 | translate | read | null |
| 2025-10-28 | CT-Less Attenuation Correction Using Multiview Ensemble Conditional Diffusion Model on High-Resolution Uncorrected PET Images | Alexandre St-Georges et.al. | 2510.24805 | translate | read | null |
| 2025-10-28 | Uniform Discrete Diffusion with Metric Path for Video Generation | Haoge Deng et.al. | 2510.24717 | translate | read | null |
| 2025-10-28 | A Dual-Branch CNN for Robust Detection of AI-Generated Facial Forgeries | Xin Zhang et.al. | 2510.24640 | translate | read | null |
| 2025-10-28 | A Comprehensive Evaluation Framework for Synthetic Trip Data Generation in Public Transport | Yuanyuan Wu et.al. | 2510.24375 | translate | read | null |
| 2025-10-28 | A Domain Adaptive Position Reconstruction Method for Time Projection Chamber based on Deep Neural Network | Xiaoran Guo et.al. | 2510.24329 | translate | read | null |
| 2025-10-28 | Training-free Source Attribution of AI-generated Images via Resynthesis | Pietro Bongini et.al. | 2510.24278 | translate | read | null |
| 2025-10-28 | MC-SJD : Maximal Coupling Speculative Jacobi Decoding for Autoregressive Visual Generation Acceleration | Junhyuk So et.al. | 2510.24211 | translate | read | null |
| 2025-10-28 | Compositional Image Synthesis with Inference-Time Scaling | Minsuk Ji et.al. | 2510.24133 | translate | read | null |
| 2025-10-28 | Causal-Aware Generative Adversarial Networks with Reinforcement Learning | Tu Anh Hoang Nguyen et.al. | 2510.24046 | translate | read | null |
| 2025-10-27 | Galactic Alchemy: Deep Learning Map-to-Map Translation in Hydrodynamical Simulations | Philipp Denzel et.al. | 2510.23768 | translate | read | null |
| 2025-10-27 | FARMER: Flow AutoRegressive Transformer over Pixels | Guangting Zheng et.al. | 2510.23588 | translate | read | null |
| 2025-10-27 | More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models | Hongkai Lin et.al. | 2510.23574 | translate | read | null |
| 2025-10-27 | FreeFuse: Multi-Subject LoRA Fusion via Auto Masking at Test Time | Yaoli Liu et.al. | 2510.23515 | translate | read | null |
| 2025-10-27 | Privacy-Preserving Semantic Communication over Wiretap Channels with Learnable Differential Privacy | Weixuan Chen et.al. | 2510.23274 | translate | read | null |
| 2025-10-27 | Autoregressive Styled Text Image Generation, but Make it Reliable | Carmine Zaccagnino et.al. | 2510.23240 | translate | read | null |
| 2025-10-27 | Nested AutoRegressive Models | Hongyu Wu et.al. | 2510.23028 | translate | read | null |
| 2025-10-27 | UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization | Huixuan Zhang et.al. | 2510.23023 | translate | read | null |
| 2025-10-27 | M $^{3}$ T2IBench: A Large-Scale Multi-Category, Multi-Instance, Multi-Relation Text-to-Image Benchmark | Huixuan Zhang et.al. | 2510.23020 | translate | read | null |
| 2025-10-27 | SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency | Quanjian Song et.al. | 2510.22994 | translate | read | null |
| 2025-10-27 | LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation | Zeyu Wang et.al. | 2510.22946 | translate | read | null |
| 2025-10-26 | Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models | Lexiang Xiong et.al. | 2510.22851 | translate | read | null |
| 2025-10-26 | Cross-view Localization and Synthesis – Datasets, Challenges and Opportunities | Ningli Xu et.al. | 2510.22736 | translate | read | null |
| 2025-10-26 | Self-Attention Decomposition For Training Free Diffusion Editing | Tharun Anand et.al. | 2510.22650 | translate | read | null |
| 2025-10-26 | Open Multimodal Retrieval-Augmented Factual Image Generation | Yang Tian et.al. | 2510.22521 | translate | read | link |
| 2025-10-26 | CANDI: Hybrid Discrete-Continuous Diffusion Models | Patrick Pynadath et.al. | 2510.22510 | translate | read | null |
| 2025-10-25 | T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models | Jindong Yang et.al. | 2510.22366 | translate | read | null |
| 2025-10-25 | GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation | Phillip Mueller et.al. | 2510.22337 | translate | read | null |
| 2025-10-25 | Scaling Non-Parametric Sampling with Representation | Vincent Lu et.al. | 2510.22196 | translate | read | null |
| 2025-10-25 | Discovering Latent Graphs with GFlowNets for Diverse Conditional Image Generation | Bailey Trang et.al. | 2510.22107 | translate | read | null |
| 2025-10-24 | FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing | Or Ronai et.al. | 2510.22010 | translate | read | null |
| 2025-10-24 | LiteDiff | Ruchir Namjoshi et.al. | 2510.22004 | translate | read | null |
| 2025-10-23 | Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications | Shamim Yazdani et.al. | 2510.21887 | translate | read | null |
| 2025-10-24 | Visual Diffusion Models are Geometric Solvers | Nir Goren et.al. | 2510.21697 | translate | read | null |
| 2025-10-24 | Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation | Yifu Luo et.al. | 2510.21583 | translate | read | null |
| 2025-10-24 | VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance | Ming Xie et.al. | 2510.21461 | translate | read | null |
| 2025-10-24 | TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation | Datao Tang et.al. | 2510.21391 | translate | read | null |
| 2025-10-24 | FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models | Zihao Fu et.al. | 2510.21363 | translate | read | null |
| 2025-10-24 | Generative Federated Learning for Smart Prediction and Recommendation Applications | Anwesha Mukherjee et.al. | 2510.21183 | translate | read | null |
| 2025-10-24 | In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain | Ethan Hwang et.al. | 2510.21142 | translate | read | null |
| 2025-10-24 | Digital Contrast CT Pulmonary Angiography Synthesis from Non-contrast CT for Pulmonary Vascular Disease | Ying Ming et.al. | 2510.21140 | translate | read | null |
| 2025-10-24 | SafetyPairs: Isolating Safety Critical Image Features with Counterfactual Image Generation | Alec Helbling et.al. | 2510.21120 | translate | read | null |
| 2025-10-23 | Preventing Shortcuts in Adapter Training via Providing the Shortcuts | Anujraaj Argo Goyal et.al. | 2510.20887 | translate | read | null |
| 2025-10-23 | LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas | Guocheng Gordon Qian et.al. | 2510.20820 | translate | read | null |
| 2025-10-23 | ARGenSeg: Image Segmentation with Autoregressive Image Generation Model | Xiaolong Wang et.al. | 2510.20803 | translate | read | null |
(<a href=../Image_Generation.md>back to Image Generation</a>)