Image Generation - 2025-02
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-02-28 | How far can we go with ImageNet for Text-to-Image generation? | L. Degeorge et.al. | 2502.21318 | translate | read | link |
| 2025-02-28 | A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images | Zineb Sordo et.al. | 2502.21151 | translate | read | null |
| 2025-02-28 | Deep learning-based filtering of cross-spectral matrices using generative adversarial networks | Christof Puhle et.al. | 2502.21097 | translate | read | null |
| 2025-02-28 | Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport | Jingru Fu et.al. | 2502.21049 | translate | read | null |
| 2025-02-28 | Synthesizing Tabular Data Using Selectivity Enhanced Generative Adversarial Networks | Youran Zhou et.al. | 2502.21034 | translate | read | null |
| 2025-02-28 | DiffBrush: Just Painting the Art by Your Hands | Jiaming Chu et.al. | 2502.20904 | translate | read | null |
| 2025-02-28 | Diffusion Restoration Adapter for Real-World Image Restoration | Hanbang Liang et.al. | 2502.20679 | translate | read | null |
| 2025-02-28 | Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA | Ojonugwa Oluwafemi Ejiga Peter et.al. | 2502.20667 | translate | read | null |
| 2025-02-28 | Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models | Yu Pan et.al. | 2502.20650 | translate | read | link |
| 2025-02-27 | FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction | Siyu Jiao et.al. | 2502.20313 | translate | read | link |
| 2025-02-27 | Attention Distillation: A Unified Approach to Visual Characteristics Transfer | Yang Zhou et.al. | 2502.20235 | translate | read | link |
| 2025-02-27 | Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think | Liang Chen et.al. | 2502.20172 | translate | read | link |
| 2025-02-27 | FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute | Sotiris Anagnostidis et.al. | 2502.20126 | translate | read | null |
| 2025-02-27 | New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration | Xuzheng Yang et.al. | 2502.20104 | translate | read | null |
| 2025-02-27 | Analyzing CLIP’s Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study | Reza Abbasi et.al. | 2502.19828 | translate | read | null |
| 2025-02-27 | MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery | Lianping Yang et.al. | 2502.19797 | translate | read | null |
| 2025-02-27 | The erasure of intensive livestock farming in text-to-image generative AI | Kehan Sheng et.al. | 2502.19771 | translate | read | null |
| 2025-02-27 | Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network | Xingyu Qiu et.al. | 2502.19754 | translate | read | null |
| 2025-02-27 | Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | Yimin Zhu et.al. | 2502.19700 | translate | read | null |
| 2025-02-26 | Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis | Minjoo Lim et.al. | 2502.19390 | translate | read | null |
| 2025-02-26 | Reimagining Personal Data: Unlocking the Potential of AI-Generated Images in Personal Data Meaning-Making | Soobin Park et.al. | 2502.18853 | translate | read | null |
| 2025-02-26 | Optimal Stochastic Trace Estimation in Generative Modeling | Xinyang Liu et.al. | 2502.18808 | translate | read | null |
| 2025-02-26 | AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools | Nathalie Riche et.al. | 2502.18736 | translate | read | null |
| 2025-02-25 | Investigating Youth AI Auditing | Jaemarie Solyst et.al. | 2502.18576 | translate | read | null |
| 2025-02-25 | ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation | Yifan Pu et.al. | 2502.18364 | translate | read | link |
| 2025-02-25 | LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Pengzhi Li et.al. | 2502.18302 | translate | read | link |
| 2025-02-25 | Training Consistency Models with Variational Noise Coupling | Gianluigi Silvestri et.al. | 2502.18197 | translate | read | link |
| 2025-02-25 | Inverse Materials Design by Large Language Model-Assisted Generative Framework | Yun Hao et.al. | 2502.18127 | translate | read | null |
| 2025-02-26 | Bayesian Optimization for Controlled Image Editing via LLMs | Chengkun Cai et.al. | 2502.18116 | translate | read | null |
| 2025-02-25 | Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models | Jia Yu et.al. | 2502.17951 | translate | read | null |
| 2025-02-25 | A Survey: Spatiotemporal Consistency in Video Generation | Zhiyu Yin et.al. | 2502.17863 | translate | read | null |
| 2025-02-25 | TagGAN: A Generative Model for Data Tagging | Muhammad Nawaz et.al. | 2502.17836 | translate | read | null |
| 2025-02-25 | FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Tanawan Premsri et.al. | 2502.17775 | translate | read | null |
| 2025-02-24 | IBURD: Image Blending for Underwater Robotic Detection | Jungseok Hong et.al. | 2502.17706 | translate | read | null |
| 2025-02-24 | Fractal Generative Models | Tianhong Li et.al. | 2502.17437 | translate | read | link |
| 2025-02-24 | RELICT: A Replica Detection Framework for Medical Image Generation | Orhun Utku Aydin et.al. | 2502.17360 | translate | read | null |
| 2025-02-24 | A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis | Yuli Wu et.al. | 2502.17160 | translate | read | null |
| 2025-02-24 | DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Canyu Zhao et.al. | 2502.17157 | translate | read | link |
| 2025-02-24 | Conditional Generative Adversarial Networks for Channel Estimation in RIS-Assisted ISAC Systems | Alice Faisal et.al. | 2502.17128 | translate | read | null |
| 2025-02-24 | Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions | Zhong Li et.al. | 2502.17119 | translate | read | link |
| 2025-02-24 | Generative Models in Decision Making: A Survey | Yinchuan Li et.al. | 2502.17100 | translate | read | null |
| 2025-02-24 | Distributional Vision-Language Alignment by Cauchy-Schwarz Divergence | Wenzhe Yin et.al. | 2502.17028 | translate | read | null |
| 2025-02-24 | Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | Jaskaran Singh Walia et.al. | 2502.17011 | translate | read | null |
| 2025-02-24 | PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | Eleftherios Ioannou et.al. | 2502.16996 | translate | read | null |
| 2025-02-21 | One-step Diffusion Models with $f$-Divergence Distribution Matching | Yilun Xu et.al. | 2502.15681 | translate | read | null |
| 2025-02-21 | Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks | Tianyou Jiang et.al. | 2502.15286 | translate | read | null |
| 2025-02-21 | Unsettling the Hegemony of Intention: Agonistic Image Generation | Andre Ye et.al. | 2502.15242 | translate | read | null |
| 2025-02-21 | Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis | Yifan Jiang et.al. | 2502.15204 | translate | read | null |
| 2025-02-21 | FlipConcept: Tuning-Free Multi-Concept Personalization for Text-to-Image Generation | Young Beom Woo et.al. | 2502.15203 | translate | read | null |
| 2025-02-21 | Methods and Trends in Detecting Generated Images: A Comprehensive Review | Arpan Mahara et.al. | 2502.15176 | translate | read | null |
| 2025-02-21 | mStyleDistance: Multilingual Style Embeddings and their Evaluation | Justin Qiu et.al. | 2502.15168 | translate | read | null |
| 2025-02-20 | A Meta-Evaluation of Style and Attribute Transfer Metrics | Amalie Brogaard Pauli et.al. | 2502.15022 | translate | read | null |
| 2025-02-20 | Generative Modeling of Individual Behavior at Scale | Nabil Omi et.al. | 2502.14998 | translate | read | null |
| 2025-02-20 | Improving the Diffusability of Autoencoders | Ivan Skorokhodov et.al. | 2502.14831 | translate | read | null |
| 2025-02-20 | DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models | Hongji Yang et.al. | 2502.14779 | translate | read | null |
| 2025-02-20 | AIdeation: Designing a Human-AI Collaborative Ideation System for Concept Designers | Wen-Fan Wang et.al. | 2502.14747 | translate | read | null |
| 2025-02-20 | Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation | Austin A. Barr et.al. | 2502.14523 | translate | read | null |
| 2025-02-20 | PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data | Shijie Huang et.al. | 2502.14397 | translate | read | null |
| 2025-02-20 | Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation | Jiayu Yang et.al. | 2502.14247 | translate | read | null |
| 2025-02-19 | A Racing Dataset and Baseline Model for Track Detection in Autonomous Racing | Shreya Ghosh et.al. | 2502.14068 | translate | read | null |
| 2025-02-19 | FlexTok: Resampling Images into 1D Token Sequences of Flexible Length | Roman Bachmann et.al. | 2502.13967 | translate | read | null |
| 2025-02-19 | IP-Composer: Semantic Composition of Visual Concepts | Sara Dorfman et.al. | 2502.13951 | translate | read | null |
| 2025-02-19 | MagicGeo: Training-Free Text-Guided Geometric Diagram Generation | Junxiao Wang et.al. | 2502.13855 | translate | read | null |
| 2025-02-19 | Flow-based generative models as iterative algorithms in probability space | Yao Xie et.al. | 2502.13394 | translate | read | null |
| 2025-02-18 | Breaking the bonds of generative artificial intelligence by minimizing the maximum entropy | Mattia Miotto et.al. | 2502.13287 | translate | read | null |
| 2025-02-18 | Personalized Image Generation with Deep Generative Models: A Decade Survey | Yuxiang Wei et.al. | 2502.13081 | translate | read | null |
| 2025-02-18 | Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs | Zixiao Wang et.al. | 2502.12988 | translate | read | null |
| 2025-02-18 | Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options | Lakshmi Nair et.al. | 2502.12929 | translate | read | null |
| 2025-02-18 | 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces | Fabian Bongratz et.al. | 2502.12742 | translate | read | null |
| 2025-02-19 | Spherical Dense Text-to-Image Synthesis | Timon Winter et.al. | 2502.12691 | translate | read | null |
| 2025-02-18 | CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation | Minghao Fu et.al. | 2502.12579 | translate | read | null |
| 2025-02-18 | DeltaDiff: A Residual-Guided Diffusion Model for Enhanced Image Super-Resolution | Chao Yang et.al. | 2502.12567 | translate | read | null |
| 2025-02-18 | Multi Image Super Resolution Modeling for Earth System Models | Ehsan Zeraatkar et.al. | 2502.12427 | translate | read | null |
| 2025-02-17 | A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond | Shreya Shukla et.al. | 2502.12048 | translate | read | null |
| 2025-02-17 | Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images | Negar Kamali et.al. | 2502.11989 | translate | read | null |
| 2025-02-17 | Image Inversion: A Survey from GANs to Diffusion and Beyond | Yinan Chen et.al. | 2502.11974 | translate | read | null |
| 2025-02-17 | Evaluation of machine learning techniques for conditional generative adversarial networks in inverse design | Timo Gahlmann et.al. | 2502.11934 | translate | read | null |
| 2025-02-17 | GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs | Yi Fang et.al. | 2502.11925 | translate | read | null |
| 2025-02-17 | Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation | Taeyoung Yun et.al. | 2502.11477 | translate | read | null |
| 2025-02-17 | MARS: Mesh AutoRegressive Model for 3D Shape Detailization | Jingnan Gao et.al. | 2502.11390 | translate | read | null |
| 2025-02-15 | Preconditioned Inexact Stochastic ADMM for Deep Model | Shenglong Zhou et.al. | 2502.10784 | translate | read | null |
| 2025-02-15 | Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features | Kafi Anan et.al. | 2502.10682 | translate | read | null |
| 2025-02-15 | REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation | Ran Li et.al. | 2502.10663 | translate | read | null |
| 2025-02-14 | Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network | Arun Kunwar et.al. | 2502.10334 | translate | read | null |
| 2025-02-14 | ManiTrend: Bridging Future Generation and Action Prediction with 3D Flow for Robotic Manipulation | Yuxin He et.al. | 2502.10028 | translate | read | null |
| 2025-02-13 | CellFlow: Simulating Cellular Morphology Changes via Flow Matching | Yuhui Zhang et.al. | 2502.09775 | translate | read | null |
| 2025-02-13 | Designing a Conditional Prior Distribution for Flow-Based Generative Models | Noam Issachar et.al. | 2502.09611 | translate | read | null |
| 2025-02-13 | Zero-shot generation of synthetic neurosurgical data with large language models | Austin A. Barr et.al. | 2502.09566 | translate | read | link |
| 2025-02-14 | EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling | Theodoros Kouzelis et.al. | 2502.09509 | translate | read | null |
| 2025-02-13 | DiffRenderGAN: Addressing Training Data Scarcity in Deep Segmentation Networks for Quantitative Nanomaterial Analysis through Differentiable Rendering and Generative Modelling | Dennis Possart et.al. | 2502.09477 | translate | read | null |
| 2025-02-13 | Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models | Xiaoliu Guan et.al. | 2502.09434 | translate | read | link |
| 2025-02-13 | ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation | Rotem Shalev-Arkushin et.al. | 2502.09411 | translate | read | null |
| 2025-02-13 | When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models | Samuel Joseph Amouyal et.al. | 2502.09307 | translate | read | null |
| 2025-02-13 | Joint Attention Mechanism Learning to Facilitate Opto-physiological Monitoring during Physical Activity | Xiaoyu Zheng et.al. | 2502.09291 | translate | read | null |
| 2025-02-13 | Sequential Covariance Fitting for InSAR Phase Linking | Dana El Hajjar et.al. | 2502.09248 | translate | read | null |
| 2025-02-13 | Dynamic watermarks in images generated by diffusion models | Yunzhuo Chen et.al. | 2502.08927 | translate | read | null |
| 2025-02-12 | Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio | Khaled Kahouli et.al. | 2502.08598 | translate | read | null |
| 2025-02-12 | Ultrasound Image Generation using Latent Diffusion Models | Benoit Freiche et.al. | 2502.08580 | translate | read | null |
| 2025-02-12 | BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation | Ao Liu et.al. | 2502.08528 | translate | read | null |
| 2025-02-12 | A Survey on Pre-Trained Diffusion Model Distillations | Xuhui Fan et.al. | 2502.08364 | translate | read | null |
| 2025-02-12 | Learning Human Skill Generators at Key-Step Levels | Yilu Wu et.al. | 2502.08234 | translate | read | null |
| 2025-02-12 | PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation | Ziyan Wang et.al. | 2502.08106 | translate | read | null |
| 2025-02-12 | ID-Cloak: Crafting Identity-Specific Cloaks Against Personalized Text-to-Image Generation | Qianrui Teng et.al. | 2502.08097 | translate | read | null |
| 2025-02-12 | Rapid prediction of organisation in engineered corneal, glial and fibroblast tissues using machine learning and biophysical models | Allison E. Andrews et.al. | 2502.08062 | translate | read | null |
| 2025-02-11 | Training-Free Safe Denoisers for Safe Use of Diffusion Models | Mingyu Kim et.al. | 2502.08011 | translate | read | null |
| 2025-02-11 | SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion | Yannik Frisch et.al. | 2502.07945 | translate | read | null |
| 2025-02-11 | Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models | Stanislav Fort et.al. | 2502.07753 | translate | read | null |
| 2025-02-11 | CausalGeD: Blending Causality and Diffusion for Spatial Gene Expression Generation | Rabeya Tus Sadia et.al. | 2502.07751 | translate | read | null |
| 2025-02-11 | Magic 1-For-1: Generating One Minute Video Clips within One Minute | Hongwei Yi et.al. | 2502.07701 | translate | read | null |
| 2025-02-11 | SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches | Haichuan Lin et.al. | 2502.07556 | translate | read | null |
| 2025-02-11 | Towards THz-based Obstacle Sensing: A Generative Radio Environment Awareness Framework | Tianyu Hu et.al. | 2502.07504 | translate | read | null |
| 2025-02-11 | Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models | Lin Zhu et.al. | 2502.07466 | translate | read | null |
| 2025-02-11 | RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation | Viacheslav Vasilev et.al. | 2502.07455 | translate | read | null |
| 2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | translate | read | null |
| 2025-02-11 | Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Aditya Vora et.al. | 2502.07278 | translate | read | null |
| 2025-02-11 | Exploring Active Data Selection Strategies for Continuous Training in Deepfake Detection | Yoshihiko Furuhashi et.al. | 2502.07269 | translate | read | null |
| 2025-02-10 | A Large-scale AI-generated Image Inpainting Benchmark | Paschalis Giakoumoglou et.al. | 2502.06593 | translate | read | null |
| 2025-02-10 | CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | D. She et.al. | 2502.06527 | translate | read | null |
| 2025-02-10 | Universal Approximation of Visual Autoregressive Transformers | Yifang Chen et.al. | 2502.06167 | translate | read | null |
| 2025-02-10 | Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models | Ce Zhang et.al. | 2502.06130 | translate | read | null |
| 2025-02-09 | Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization | Jiajun Fan et.al. | 2502.06061 | translate | read | null |
| 2025-02-09 | Make the Fastest Faster: Importance Mask for Interactive Volume Visualization using Reconstruction Neural Networks | Jianxin Sun et.al. | 2502.06053 | translate | read | null |
| 2025-02-09 | A Conditional Tabular GAN-Enhanced Intrusion Detection System for Rare Attacks in IoT Networks | Safaa Menssouri et.al. | 2502.06031 | translate | read | null |
| 2025-02-09 | A Semi-Supervised Text Generation Framework Combining a Deep Transformer and a GAN | Shengquan Wang et.al. | 2502.05937 | translate | read | null |
| 2025-02-09 | Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation | Vera Soboleva et.al. | 2502.05895 | translate | read | null |
| 2025-02-09 | Understanding Design Fixation in Generative AI | Liuqing Chen et.al. | 2502.05870 | translate | read | null |
| 2025-02-07 | QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation | Yue Zhao et.al. | 2502.05178 | translate | read | null |
| 2025-02-07 | Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment | Minh-Quan Le et.al. | 2502.05153 | translate | read | null |
| 2025-02-07 | Investigating the impact of kernel harmonization and deformable registration on inspiratory and expiratory chest CT images for people with COPD | Aravind R. Krishnan et.al. | 2502.05119 | translate | read | null |
| 2025-02-07 | C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features | Chenxing Sun et.al. | 2502.04991 | translate | read | null |
| 2025-02-07 | Cached Multi-Lora Composition for Multi-Concept Image Generation | Xiandong Zou et.al. | 2502.04923 | translate | read | null |
| 2025-02-07 | ARTInp: CBCT-to-CT Image Inpainting and Image Translation in Radiotherapy | Ricardo Coimbra Brioso et.al. | 2502.04898 | translate | read | null |
| 2025-02-07 | Goku: Flow Based Video Generative Foundation Models | Shoufa Chen et.al. | 2502.04896 | translate | read | null |
| 2025-02-07 | Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics? | Sourabrata Mukherjee et.al. | 2502.04718 | translate | read | null |
| 2025-02-07 | G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models | Mengdi Liu et.al. | 2502.04684 | translate | read | null |
| 2025-02-07 | Multiscale style transfer based on a Laplacian pyramid for traditional Chinese painting | Kunxiao Liu et.al. | 2502.04597 | translate | read | null |
| 2025-02-06 | HOG-Diff: Higher-Order Guided Diffusion for Graph Generation | Yiming Huang et.al. | 2502.04308 | translate | read | null |
| 2025-02-06 | Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention | Ayush K. Varshney et.al. | 2502.04260 | translate | read | null |
| 2025-02-06 | Multi-fidelity emulator for large-scale 21 cm lightcone images: a few-shot transfer learning approach with generative adversarial network | Kangning Diao et.al. | 2502.04246 | translate | read | null |
| 2025-02-06 | Generative Adversarial Networks Bridging Art and Machine Intelligence | Junhao Song et.al. | 2502.04116 | translate | read | null |
| 2025-02-06 | FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing | Jinya Sakurai et.al. | 2502.03826 | translate | read | null |
| 2025-02-06 | DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models | Lingshun Kong et.al. | 2502.03810 | translate | read | null |
| 2025-02-06 | DICE: Distilling Classifier-Free Guidance into Text Embeddings | Zhenyu Zhou et.al. | 2502.03726 | translate | read | null |
| 2025-02-06 | Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free | Gian Mario Favero et.al. | 2502.03687 | translate | read | null |
| 2025-02-05 | YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment | Amitava Das et.al. | 2502.03512 | translate | read | null |
| 2025-02-05 | Masked Autoencoders Are Effective Tokenizers for Diffusion Models | Hao Chen et.al. | 2502.03444 | translate | read | null |
| 2025-02-05 | On Fairness of Unified Multimodal Large Language Model for Image Generation | Ming Liu et.al. | 2502.03429 | translate | read | null |
| 2025-02-05 | TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer | Zhihong Xu et.al. | 2502.03426 | translate | read | null |
| 2025-02-05 | Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation | Alexey A. Novikov et.al. | 2502.03420 | translate | read | null |
| 2025-02-05 | Poisson Flow Joint Model for Multiphase contrast-enhanced CT | Rongjun Ge et.al. | 2502.03079 | translate | read | null |
| 2025-02-05 | Optimal control of the fidelity coefficient in a Cahn-Hilliard image inpainting model | Elena Beretta et.al. | 2502.03025 | translate | read | null |
| 2025-02-05 | A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges | Lei Ding et.al. | 2502.02835 | translate | read | null |
| 2025-02-04 | When are Diffusion Priors Helpful in Sparse Reconstruction? A Study with Sparse-view CT | Matt Y. Cheung et.al. | 2502.02771 | translate | read | null |
| 2025-02-04 | Controllable Video Generation with Provable Disentanglement | Yifan Shen et.al. | 2502.02690 | translate | read | null |
| 2025-02-05 | AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis | Divya Bharti et.al. | 2502.02555 | translate | read | null |
| 2025-02-04 | Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography | Emir Ahmed et.al. | 2502.02475 | translate | read | null |
| 2025-02-04 | Towards Consistent and Controllable Image Synthesis for Face Editing | Mengting Wei et.al. | 2502.02465 | translate | read | null |
| 2025-02-04 | On the Guidance of Flow Matching | Ruiqi Feng et.al. | 2502.02150 | translate | read | null |
| 2025-02-04 | Layer Separation: Adjustable Joint Space Width Images Synthesis in Conventional Radiography | Haolin Wang et.al. | 2502.01972 | translate | read | null |
| 2025-02-03 | Texture Image Synthesis Using Spatial GAN Based on Vision Transformers | Elahe Salari et.al. | 2502.01842 | translate | read | null |
| 2025-02-03 | Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling | Kang Yang et.al. | 2502.01826 | translate | read | null |
| 2025-02-03 | MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation | Yiren Song et.al. | 2502.01572 | translate | read | null |
| 2025-02-03 | BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled Learning | Junhao Cheng et.al. | 2502.01522 | translate | read | null |
| 2025-02-03 | End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings | Yeruru Asrar Ahmed et.al. | 2502.01507 | translate | read | null |