Image Generation - 2024-11
Image Generation - 2024-11
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-11-29 | Enhanced anomaly detection in well log data through the application of ensemble GANs | Abdulrahman Al-Fakih et.al. | 2411.19875 | translate | read | link |
| 2024-11-29 | JetFormer: An Autoregressive Generative Model of Raw Images and Text | Michael Tschannen et.al. | 2411.19722 | translate | read | null |
| 2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | translate | read | link |
| 2024-11-29 | Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing | Wenyi Mo et.al. | 2411.19652 | translate | read | link |
| 2024-11-29 | QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain | Wenfang Sun et.al. | 2411.19534 | translate | read | null |
| 2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | translate | read | null |
| 2024-11-29 | Achromatic single-layer hologram | Zhi Li et.al. | 2411.19445 | translate | read | null |
| 2024-11-28 | AMO Sampler: Enhancing Text Rendering with Overshooting | Xixi Hu et.al. | 2411.19415 | translate | read | link |
| 2024-11-28 | DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models | Shwetha Ram et.al. | 2411.19390 | translate | read | null |
| 2024-11-28 | 3D Wasserstein generative adversarial network with dense U-Net based discriminator for preclinical fMRI denoising | Sima Soltanpour et.al. | 2411.19345 | translate | read | null |
| 2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | translate | read | link |
| 2024-11-27 | FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion | Haosen Yang et.al. | 2411.18552 | translate | read | null |
| 2024-11-27 | TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models | Riza Velioglu et.al. | 2411.18350 | translate | read | link |
| 2024-11-27 | HiFiVFS: High Fidelity Video Face Swapping | Xu Chen et.al. | 2411.18293 | translate | read | null |
| 2024-11-27 | Towards Lensless Image Deblurring with Prior-Embedded Implicit Neural Representations in the Low-Data Regime | Abeer Banerjee et.al. | 2411.18189 | translate | read | null |
| 2024-11-27 | Prediction with Action: Visual Policy Learning via Joint Denoising Process | Yanjiang Guo et.al. | 2411.18179 | translate | read | null |
| 2024-11-27 | Type-R: Automatically Retouching Typos for Text-to-Image Generation | Wataru Shimoda et.al. | 2411.18159 | translate | read | null |
| 2024-11-27 | Music2Fail: Transfer Music to Failed Recorder Style | Chon In Leong et.al. | 2411.18075 | translate | read | null |
| 2024-11-27 | PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion | Gwanghyun Kim et.al. | 2411.18068 | translate | read | null |
| 2024-11-27 | Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models | Shuyang Hao et.al. | 2411.18000 | translate | read | null |
| 2024-11-26 | An Ensemble Approach for Brain Tumor Segmentation and Synthesis | Juampablo E. Heras Rivera et.al. | 2411.17617 | translate | read | null |
| 2024-11-26 | Accelerating Vision Diffusion Transformers with Skip Branches | Guanjie Chen et.al. | 2411.17616 | translate | read | link |
| 2024-11-26 | IMPROVE: Improving Medical Plausibility without Reliance on HumanValidation – An Enhanced Prototype-Guided Diffusion Framework | Anurag Shandilya et.al. | 2411.17535 | translate | read | null |
| 2024-11-26 | Image Generation with Multimodule Semantic Feature-Aided Selection for Semantic Communications | Chengyang Liang et.al. | 2411.17428 | translate | read | null |
| 2024-11-26 | Cross-modal Medical Image Generation Based on Pyramid Convolutional Attention Network | Fuyou Mao et.al. | 2411.17420 | translate | read | null |
| 2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | translate | read | null |
| 2024-11-26 | From Graph Diffusion to Graph Classification | Jia Jun Cheng Xian et.al. | 2411.17236 | translate | read | null |
| 2024-11-26 | DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting | Yicheng Yang et.al. | 2411.17223 | translate | read | link |
| 2024-11-26 | cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis | Paul Friedrich et.al. | 2411.17203 | translate | read | link |
| 2024-11-26 | The Role of Urban Designers in the Era of AIGC: An Experimental Study Based on Public Participation | Di Mo et.al. | 2411.17194 | translate | read | null |
| 2024-11-25 | Factorized Visual Tokenization and Generation | Zechen Bai et.al. | 2411.16681 | translate | read | null |
| 2024-11-25 | Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches | Yinqiu Feng et.al. | 2411.16567 | translate | read | null |
| 2024-11-25 | Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis | Boming Miao et.al. | 2411.16503 | translate | read | null |
| 2024-11-25 | Unsupervised Event Outlier Detection in Continuous Time | Somjit Nath et.al. | 2411.16427 | translate | read | null |
| 2024-11-25 | Comparison of Generative Learning Methods for Turbulence Modeling | Claudia Drygala et.al. | 2411.16417 | translate | read | null |
| 2024-11-25 | Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN | Elona Shatri et.al. | 2411.16405 | translate | read | null |
| 2024-11-25 | CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain | Jingchao Peng et.al. | 2411.16327 | translate | read | null |
| 2024-11-25 | One Diffusion to Generate Them All | Duong H. Le et.al. | 2411.16318 | translate | read | link |
| 2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | translate | read | link |
| 2024-11-25 | BadSFL: Backdoor Attack against Scaffold Federated Learning | Xingshuo Han et.al. | 2411.16167 | translate | read | null |
| 2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | translate | read | null |
| 2024-11-22 | OminiControl: Minimal and Universal Control for Diffusion Transformer | Zhenxiong Tan et.al. | 2411.15098 | translate | read | link |
| 2024-11-22 | Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation | Lakshmikar R. Polamreddy et.al. | 2411.15084 | translate | read | link |
| 2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | translate | read | null |
| 2024-11-22 | Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation | Dingyuan Shi et.al. | 2411.14871 | translate | read | null |
| 2024-11-22 | Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation | Jeongsol Kim et.al. | 2411.14863 | translate | read | null |
| 2024-11-22 | Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering | Haoyuan Li et.al. | 2411.14816 | translate | read | null |
| 2024-11-22 | High-Resolution Image Synthesis via Next-Token Prediction | Dengsheng Chen et.al. | 2411.14808 | translate | read | null |
| 2024-11-22 | Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding | Junde Liu et.al. | 2411.14781 | translate | read | null |
| 2024-11-22 | FairAdapter: Detecting AI-generated Images with Improved Fairness | Feng Ding et.al. | 2411.14755 | translate | read | link |
| 2024-11-21 | Multimodal 3D Brain Tumor Segmentation with Adversarial Training and Conditional Random Field | Lan Jiang et.al. | 2411.14418 | translate | read | null |
| 2024-11-21 | Landing Trajectory Prediction for UAS Based on Generative Adversarial Network | Jun Xiang et.al. | 2411.14403 | translate | read | null |
| 2024-11-21 | ComfyGI: Automatic Improvement of Image Generation Workflows | Dominik Sobania et.al. | 2411.14193 | translate | read | null |
| 2024-11-21 | MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective | Hailang Huang et.al. | 2411.14062 | translate | read | link |
| 2024-11-21 | Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction | Jordan Vice et.al. | 2411.13982 | translate | read | null |
| 2024-11-21 | On the Fairness, Diversity and Reliability of Text-to-Image Generative Models | Jordan Vice et.al. | 2411.13981 | translate | read | null |
| 2024-11-21 | Zero-Shot Low-Light Image Enhancement via Joint Frequency Domain Priors Guided Diffusion | Jinhong He et.al. | 2411.13961 | translate | read | link |
| 2024-11-21 | iHQGAN: A Lightweight Invertible Hybrid Quantum-Classical Generative Adversarial Network for Unsupervised Image-to-Image Translation | Xue Yang et.al. | 2411.13920 | translate | read | null |
| 2024-11-21 | Dealing with Synthetic Data Contamination in Online Continual Learning | Maorong Wang et.al. | 2411.13852 | translate | read | link |
| 2024-11-21 | Detecting Human Artifacts from Text-to-Image Models | Kaihong Wang et.al. | 2411.13842 | translate | read | link |
| 2024-11-20 | VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models | Ziqi Huang et.al. | 2411.13503 | translate | read | link |
| 2024-11-20 | From Prompt Engineering to Prompt Craft | Joseph Lindley et.al. | 2411.13422 | translate | read | null |
| 2024-11-20 | On the Way to LLM Personalization: Learning to Remember User Conversations | Lucie Charlotte Magister et.al. | 2411.13405 | translate | read | null |
| 2024-11-20 | RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Christoph Reinders et.al. | 2411.13150 | translate | read | null |
| 2024-11-20 | CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models | Naen Xu et.al. | 2411.13144 | translate | read | null |
| 2024-11-19 | From Text to Pose to Image: Improving Diffusion Model Control and Quality | Clément Bonnett et.al. | 2411.12872 | translate | read | link |
| 2024-11-19 | HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation | Abdul Basit Anees et.al. | 2411.12832 | translate | read | null |
| 2024-11-19 | Stylecodes: Encoding Stylistic Information For Image Generation | Ciara Rowles et.al. | 2411.12811 | translate | read | link |
| 2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | translate | read | null |
| 2024-11-19 | Enhancing Blind Source Separation with Dissociative Principal Component Analysis | Muhammad Usman Khalid et.al. | 2411.12321 | translate | read | null |
| 2024-11-19 | CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis | Yifan Xie et.al. | 2411.12198 | translate | read | null |
| 2024-11-19 | Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models | Shuntaro Okada et.al. | 2411.12188 | translate | read | null |
| 2024-11-19 | Enhancing Low Dose Computed Tomography Images Using Consistency Training Techniques | Mahmut S. Gokmen et.al. | 2411.12181 | translate | read | null |
| 2024-11-18 | Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Brian B. Moser et.al. | 2411.12072 | translate | read | link |
| 2024-11-18 | Analyzing and Improving the Skin Tone Consistency and Bias in Implicit 3D Relightable Face Generators | Libing Zeng et.al. | 2411.12002 | translate | read | null |
| 2024-11-18 | Parallelly Tempered Generative Adversarial Networks | Jinwon Sohn et.al. | 2411.11786 | translate | read | null |
| 2024-11-18 | Conceptwm: A Diffusion Model Watermark for Concept Protection | Liangqi Lei et.al. | 2411.11688 | translate | read | null |
| 2024-11-19 | Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Rüveyda Yilmaz et.al. | 2411.11515 | translate | read | null |
| 2024-11-18 | A Modular Open Source Framework for Genomic Variant Calling | Ankita Vaishnobi Bisoi et.al. | 2411.11513 | translate | read | null |
| 2024-11-18 | MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion | Dongseok Shim et.al. | 2411.11475 | translate | read | null |
| 2024-11-18 | BeautyBank: Encoding Facial Makeup in Latent Space | Qianwen Lu et.al. | 2411.11231 | translate | read | null |
| 2024-11-17 | Enhanced Anime Image Generation Using USE-CMHSA-GAN | J. Lu et.al. | 2411.11179 | translate | read | null |
| 2024-11-17 | Time Step Generating: A Universal Synthesized Deepfake Image Detector | Ziyue Zeng et.al. | 2411.11016 | translate | read | link |
| 2024-11-17 | SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2411.10958 | translate | read | link |
| 2024-11-16 | Test-time Conditional Text-to-Image Synthesis Using Diffusion Models | Tripti Shukla et.al. | 2411.10800 | translate | read | null |
| 2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | translate | read | null |
| 2024-11-15 | Mechanisms of Generative Image-to-Image Translation Networks | Guangzong Chen et.al. | 2411.10368 | translate | read | null |
| 2024-11-15 | Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding | Huming Qiu et.al. | 2411.10329 | translate | read | null |
| 2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | translate | read | null |
| 2024-11-15 | Visual question answering based evaluation metrics for text-to-image generation | Mizuki Miyamoto et.al. | 2411.10183 | translate | read | null |
| 2024-11-15 | CART: Compositional Auto-Regressive Transformer for Image Generation | Siddharth Roheda et.al. | 2411.10180 | translate | read | null |
| 2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | translate | read | null |
| 2024-11-15 | Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training | Myunsoo Kim et.al. | 2411.09998 | translate | read | null |
| 2024-11-15 | Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era | Thanh Tam Nguyen et.al. | 2411.09955 | translate | read | null |
| 2024-11-15 | Content-Aware Preserving Image Generation | Giang H. Le et.al. | 2411.09871 | translate | read | null |
| 2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | translate | read | null |
| 2024-11-14 | Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models | Chutian Meng et.al. | 2411.09449 | translate | read | null |
| 2024-11-12 | Mediffusion: Joint Diffusion for Self-Explainable Semi-Supervised Classification and Medical Image Generation | Joanna Kaleta et.al. | 2411.09434 | translate | read | null |
| 2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | translate | read | null |
| 2024-11-13 | A Survey on Vision Autoregressive Model | Kai Jiang et.al. | 2411.08666 | translate | read | null |
| 2024-11-13 | Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models | Chengdong Dong et.al. | 2411.08642 | translate | read | null |
| 2024-11-13 | I Can Embrace and Avoid Vagueness Myself: Supporting the Design Process by Balancing Vagueness through Text-to-Image Generative AI | Myungjin Kim et.al. | 2411.08588 | translate | read | null |
| 2024-11-13 | Physics Informed Distillation for Diffusion Models | Joshua Tian Jin Tee et.al. | 2411.08378 | translate | read | link |
| 2024-11-12 | Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing | Zitao Shuai et.al. | 2411.08196 | translate | read | null |
| 2024-11-12 | TIPO: Text to Image with Text Presampling for Prompt Optimization | Shih-Ying Yeh et.al. | 2411.08127 | translate | read | null |
| 2024-11-12 | Artistic Neural Style Transfer Algorithms with Activation Smoothing | Xiangtian Li et.al. | 2411.08014 | translate | read | null |
| 2024-11-12 | Markov Processes for Enhanced Deepfake Generation and Detection | Jyoti Bhadana et.al. | 2411.07993 | translate | read | null |
| 2024-11-12 | DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks | Zhaoxi Zhang et.al. | 2411.07941 | translate | read | null |
| 2024-11-12 | Emotion Classification of Children Expressions | Sanchayan Vivekananthan et.al. | 2411.07708 | translate | read | null |
| 2024-11-12 | Evaluating the Generation of Spatial Relations in Text and Image Generative Models | Shang Hong Sim et.al. | 2411.07664 | translate | read | null |
| 2024-11-12 | Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion | Kaiyu Song et.al. | 2411.07627 | translate | read | null |
| 2024-11-12 | Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer | F. Qi et.al. | 2411.07539 | translate | read | null |
| 2024-11-12 | GUS-IR: Gaussian Splatting with Unified Shading for Inverse Rendering | Zhihao Liang et.al. | 2411.07478 | translate | read | null |
| 2024-11-12 | Tracing the Roots: Leveraging Temporal Dynamics in Diffusion Trajectories for Origin Attribution | Andreas Floros et.al. | 2411.07449 | translate | read | null |
| 2024-11-11 | Instance Performance Difference: A Metric to Measure the Sim-To-Real Gap in Camera Simulation | Bo-Hsun Chen et.al. | 2411.07375 | translate | read | null |
| 2024-11-11 | Learning from Limited and Imperfect Data | Harsh Rangwani et.al. | 2411.07229 | translate | read | null |
| 2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | translate | read | link |
| 2024-11-11 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | translate | read | link |
| 2024-11-11 | Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis | Taihang Hu et.al. | 2411.07132 | translate | read | link |
| 2024-11-11 | Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | NVIDIA et.al. | 2411.07126 | translate | read | null |
| 2024-11-11 | Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models | Yanchen Wang et.al. | 2411.07121 | translate | read | link |
| 2024-11-11 | An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter | Dominik Eckert et.al. | 2411.07072 | translate | read | null |
| 2024-11-11 | ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis | Zanlin Ni et.al. | 2411.06959 | translate | read | link |
| 2024-11-11 | Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model | Guandong Li et.al. | 2411.06692 | translate | read | null |
| 2024-11-11 | SeedEdit: Align Image Re-Generation to Image Editing | Yichun Shi et.al. | 2411.06686 | translate | read | null |
| 2024-11-08 | Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | Jia-Hong Huang et.al. | 2411.05706 | translate | read | null |
| 2024-11-08 | Image inpainting enhancement by replacing the original mask with a self-attended region from the input image | Kourosh Kiani et.al. | 2411.05705 | translate | read | null |
| 2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | translate | read | null |
| 2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | translate | read | null |
| 2024-11-07 | Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model | Sheng Cheng et.al. | 2411.05079 | translate | read | link |
| 2024-11-07 | Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models | Shuhong Zheng et.al. | 2411.05005 | translate | read | null |
| 2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | translate | read | null |
| 2024-11-07 | AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation | Anil Kag et.al. | 2411.04967 | translate | read | null |
| 2024-11-07 | End-to-end Inception-Unet based Generative Adversarial Networks for Snow and Rain Removals | Ibrahim Kajo et.al. | 2411.04821 | translate | read | null |
| 2024-11-07 | Taming Rectified Flow for Inversion and Editing | Jiangshan Wang et.al. | 2411.04746 | translate | read | link |
| 2024-11-07 | DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning | Yuxuan Duan et.al. | 2411.04571 | translate | read | link |
| 2024-11-07 | BendVLM: Test-Time Debiasing of Vision-Language Embeddings | Walter Gerych et.al. | 2411.04420 | translate | read | link |
| 2024-11-07 | Image Understanding Makes for A Good Tokenizer for Image Generation | Luting Wang et.al. | 2411.04406 | translate | read | null |
| 2024-11-06 | DiMSUM: Diffusion Mamba – A Scalable and Unified Spatial-Frequency Method for Image Generation | Hao Phung et.al. | 2411.04168 | translate | read | null |
| 2024-11-06 | ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks | Ziji Shi et.al. | 2411.03999 | translate | read | null |
| 2024-11-06 | Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation | Chihaya Matsuhira et.al. | 2411.03595 | translate | read | null |
| 2024-11-05 | Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation | Zhiling Yue et.al. | 2411.03551 | translate | read | null |
| 2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | translate | read | link |
| 2024-11-05 | Rainfall regression from C-band Synthetic Aperture Radar using Multi-Task Generative Adversarial Networks | Aurélien Colin et.al. | 2411.03480 | translate | read | null |
| 2024-11-05 | DiT4Edit: Diffusion Transformer for Image Editing | Kunyu Feng et.al. | 2411.03286 | translate | read | null |
| 2024-11-05 | On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Tariq Berrada Ifriqi et.al. | 2411.03177 | translate | read | null |
| 2024-11-05 | Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting | Adrian B. Chłopowiec et.al. | 2411.03098 | translate | read | null |
| 2024-11-05 | Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising | Tao Huang et.al. | 2411.03053 | translate | read | null |
| 2024-11-05 | Textual Aesthetics in Large Language Models | Lingjie Jiang et.al. | 2411.02930 | translate | read | link |
| 2024-11-05 | BrainBits: How Much of the Brain are Generative Reconstruction Methods Using? | David Mayo et.al. | 2411.02783 | translate | read | null |
| 2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | translate | read | null |
| 2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | translate | read | link |
| 2024-11-05 | Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models | Anjith George et.al. | 2411.02188 | translate | read | null |
| 2024-11-03 | DreamPolish: Domain Score Distillation With Progressive Geometry Generation | Yean Cheng et.al. | 2411.01602 | translate | read | null |
| 2024-11-03 | Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach | Qihe Pan et.al. | 2411.01545 | translate | read | null |
| 2024-11-03 | DPCL-Diff: The Temporal Knowledge Graph Reasoning based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning | Yukun Cao et.al. | 2411.01477 | translate | read | null |
| 2024-11-03 | Privacy-Preserving Customer Churn Prediction Model in the Context of Telecommunication Industry | Joydeb Kumar Sana et.al. | 2411.01447 | translate | read | null |
| 2024-11-03 | TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2411.01403 | translate | read | null |
| 2024-11-02 | Guided Synthesis of Labeled Brain MRI Data Using Latent Diffusion Models for Segmentation of Enlarged Ventricles | Tim Ruschke et.al. | 2411.01351 | translate | read | null |
| 2024-11-02 | AquaFuse: Waterbody Fusion for Physics Guided View Synthesis of Underwater Scenes | Md Abu Bakr Siddique et.al. | 2411.01119 | translate | read | null |
| 2024-11-01 | Evaluation Metric for Quality Control and Generative Models in Histopathology Images | Pranav Jeevan et.al. | 2411.01034 | translate | read | null |
| 2024-11-01 | In-Context LoRA for Diffusion Transformers | Lianghua Huang et.al. | 2410.23775 | translate | read | link |
(<a href=../Image_Generation.md>back to Image Generation</a>)