Image Generation - 2024-03
Image Generation - 2024-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-03-29 | Benchmarking Counterfactual Image Generation | Thomas Melistas et.al. | 2403.20287 | translate | read | link |
| 2024-03-29 | FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models | Barbara Toniella Corradini et.al. | 2403.20105 | translate | read | null |
| 2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | translate | read | link |
| 2024-03-29 | FairRAG: Fair Human Generation via Fair Retrieval Augmentation | Robik Shrestha et.al. | 2403.19964 | translate | read | null |
| 2024-03-28 | Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks | Pooria Ashrafian et.al. | 2403.19880 | translate | read | link |
| 2024-03-28 | Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization | Yuhang Li et.al. | 2403.19866 | translate | read | null |
| 2024-03-28 | CLoRA: A Contrastive Approach to Compose Multiple LoRA Models | Tuna Han Salih Meral et.al. | 2403.19776 | translate | read | null |
| 2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653 | translate | read | link |
| 2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645 | translate | read | null |
| 2024-03-28 | Lane-Change in Dense Traffic with Model Predictive Control and Neural Networks | Sangjae Bae et.al. | 2403.19633 | translate | read | link |
| 2024-03-28 | Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models | Ole Hall et.al. | 2403.19620 | translate | read | null |
| 2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | translate | read | link |
| 2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593 | translate | read | null |
| 2024-03-28 | Locate, Assign, Refine: Taming Customized Image Inpainting with Text-Subject Guidance | Yulin Pan et.al. | 2403.19534 | translate | read | null |
| 2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | translate | read | null |
| 2024-03-28 | QNCD: Quantization Noise Correction for Diffusion Models | Huanpeng Chu et.al. | 2403.19140 | translate | read | link |
| 2024-03-28 | Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs | John R. McNulty et.al. | 2403.19107 | translate | read | null |
| 2024-03-27 | Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching | Jannis Chemseddine et.al. | 2403.18705 | translate | read | null |
| 2024-03-27 | Attention Calibration for Disentangled Text-to-Image Personalization | Yanbing Zhang et.al. | 2403.18551 | translate | read | link |
| 2024-03-27 | DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis | Zhongxi Chen et.al. | 2403.18471 | translate | read | link |
| 2024-03-27 | DiffStyler: Diffusion-based Localized Image Style Transfer | Shaoxu Li et.al. | 2403.18461 | translate | read | null |
| 2024-03-27 | U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models | Ilias Mitsouras et.al. | 2403.18425 | translate | read | null |
| 2024-03-27 | ECNet: Effective Controllable Text-to-Image Diffusion Models | Sicheng Li et.al. | 2403.18417 | translate | read | null |
| 2024-03-27 | Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks | Srinitish Srinivasan et.al. | 2403.18397 | translate | read | link |
| 2024-03-27 | Ship in Sight: Diffusion Models for Ship-Image Super Resolution | Luigi Sigillo et.al. | 2403.18370 | translate | read | link |
| 2024-03-27 | DSF-GAN: DownStream Feedback Generative Adversarial Network | Oriel Perets et.al. | 2403.18267 | translate | read | link |
| 2024-03-27 | Don’t Look into the Dark: Latent Codes for Pluralistic Image Inpainting | Haiwei Chen et.al. | 2403.18186 | translate | read | null |
| 2024-03-26 | Boosting Diffusion Models with Moving Average Sampling in Frequency Domain | Yurui Qian et.al. | 2403.17870 | translate | read | null |
| 2024-03-26 | CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation | Yongrui Yu et.al. | 2403.17770 | translate | read | null |
| 2024-03-26 | FaultGuard: A Generative Approach to Resilient Fault Prediction in Smart Electrical Grids | Emad Efatinasab et.al. | 2403.17494 | translate | read | null |
| 2024-03-26 | LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection | Yunpeng Luo et.al. | 2403.17465 | translate | read | null |
| 2024-03-26 | An inexact proximal MM method for a class of nonconvex composite image reconstruction models | Bujin Li et.al. | 2403.17450 | translate | read | null |
| 2024-03-25 | DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment | Stella Bounareli et.al. | 2403.17217 | translate | read | null |
| 2024-03-25 | FlashFace: Human Image Personalization with High-fidelity Identity Preservation | Shilong Zhang et.al. | 2403.17008 | translate | read | null |
| 2024-03-25 | SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer | Rui Zhu et.al. | 2403.17004 | translate | read | null |
| 2024-03-25 | Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation | Omer Dahary et.al. | 2403.16990 | translate | read | null |
| 2024-03-25 | Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance | Jingyuan Zhu et.al. | 2403.16954 | translate | read | null |
| 2024-03-25 | Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise | Dilum Fernando et.al. | 2403.16790 | translate | read | null |
| 2024-03-25 | Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases | Sophie Starck et.al. | 2403.16776 | translate | read | null |
| 2024-03-25 | Multi-Scale Texture Loss for CT denoising with GANs | Francesco Di Feola et.al. | 2403.16640 | translate | read | link |
| 2024-03-25 | SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Yuda Song et.al. | 2403.16627 | translate | read | null |
| 2024-03-25 | Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network | Yijin Zhou et.al. | 2403.16540 | translate | read | null |
| 2024-03-25 | An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models | Zizhao Hu et.al. | 2403.16530 | translate | read | null |
| 2024-03-25 | Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator | Takuhiro Kaneko et.al. | 2403.16464 | translate | read | null |
| 2024-03-25 | Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation | Sanyam Lakhanpal et.al. | 2403.16422 | translate | read | null |
| 2024-03-25 | Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation | Yingshan Chang et.al. | 2403.16394 | translate | read | null |
| 2024-03-25 | Illuminating Systematic Trends in Nuclear Data with Generative Machine Learning Models | Jordan M. R. Fox et.al. | 2403.16389 | translate | read | null |
| 2024-03-25 | FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models | Lin Zhao et.al. | 2403.16379 | translate | read | null |
| 2024-03-24 | Fill in the __ (a Diffusion-based Image Inpainting Pipeline) | Eyoel Gebre et.al. | 2403.16016 | translate | read | null |
| 2024-03-22 | DragAPart: Learning a Part-Level Motion Prior for Articulated Objects | Ruining Li et.al. | 2403.15382 | translate | read | null |
| 2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | translate | read | null |
| 2024-03-22 | A Wasserstein perspective of Vanilla GANs | Lea Kunkel et.al. | 2403.15312 | translate | read | null |
| 2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | translate | read | null |
| 2024-03-22 | Robust Utility Optimization via a GAN Approach | Florian Krach et.al. | 2403.15243 | translate | read | null |
| 2024-03-22 | A Multimodal Approach for Cross-Domain Image Retrieval | Lucas Iijima et.al. | 2403.15152 | translate | read | null |
| 2024-03-22 | MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration | Zhichao Wei et.al. | 2403.15059 | translate | read | null |
| 2024-03-22 | Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning | Bumsoo Kim et.al. | 2403.15048 | translate | read | null |
| 2024-03-22 | Generative Active Learning for Image Synthesis Personalization | Xulu Zhang et.al. | 2403.14987 | translate | read | null |
| 2024-03-22 | CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model | Seungdae Han et.al. | 2403.14944 | translate | read | null |
| 2024-03-21 | Implicit Style-Content Separation using B-LoRA | Yarden Frenkel et.al. | 2403.14572 | translate | read | null |
| 2024-03-21 | DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing | Yueru Jia et.al. | 2403.14487 | translate | read | null |
| 2024-03-21 | AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks | Max Ku et.al. | 2403.14468 | translate | read | null |
| 2024-03-21 | Analysing Diffusion Segmentation for Medical Images | Mathias Öttl et.al. | 2403.14440 | translate | read | null |
| 2024-03-21 | Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl et.al. | 2403.14429 | translate | read | null |
| 2024-03-21 | HySim: An Efficient Hybrid Similarity Measure for Patch Matching in Image Inpainting | Saad Noufel et.al. | 2403.14292 | translate | read | null |
| 2024-03-21 | Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón et.al. | 2403.14291 | translate | read | link |
| 2024-03-21 | Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations | Xun Lin et.al. | 2403.14250 | translate | read | null |
| 2024-03-21 | StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN | Jongwoo Choi et.al. | 2403.14186 | translate | read | null |
| 2024-03-21 | QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping | Zhuang Xiong et.al. | 2403.14070 | translate | read | null |
| 2024-03-20 | Learning from Models and Data for Visual Grounding | Ruozhen He et.al. | 2403.13804 | translate | read | null |
| 2024-03-20 | Step-Calibrated Diffusion for Biomedical Optical Image Restoration | Yiwei Lyu et.al. | 2403.13680 | translate | read | null |
| 2024-03-20 | ReGround: Improving Textual and Spatial Grounding at No Cost | Yuseung Lee et.al. | 2403.13589 | translate | read | null |
| 2024-03-20 | Diversity-aware Channel Pruning for StyleGAN Compression | Jiwoo Chung et.al. | 2403.13548 | translate | read | link |
| 2024-03-20 | IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models | Siying Cui et.al. | 2403.13535 | translate | read | null |
| 2024-03-20 | Deepfake Detection without Deepfakes: Generalization via Synthetic Frequency Patterns Injection | Davide Alessandro Coccomini et.al. | 2403.13479 | translate | read | null |
| 2024-03-20 | S2DM: Sector-Shaped Diffusion Models for Video Generation | Haoran Lang et.al. | 2403.13408 | translate | read | null |
| 2024-03-20 | IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis | Feng Liu et.al. | 2403.13378 | translate | read | null |
| 2024-03-20 | AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation | Jingkun An et.al. | 2403.13352 | translate | read | null |
| 2024-03-20 | TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation | Santosh Sanjeev et.al. | 2403.13343 | translate | read | null |
| 2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963 | translate | read | link |
| 2024-03-19 | Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties | Efrain Torres-Lomas et.al. | 2403.12935 | translate | read | null |
| 2024-03-19 | You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs | Yihong Luo et.al. | 2403.12931 | translate | read | link |
| 2024-03-19 | Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model | Jiajie Yang et.al. | 2403.12915 | translate | read | link |
| 2024-03-19 | Generative Enhancement for 3D Medical Images | Lingting Zhu et.al. | 2403.12852 | translate | read | link |
| 2024-03-19 | How Spammers and Scammers Leverage AI-Generated Images on Facebook for Audience Growth | Renee DiResta et.al. | 2403.12838 | translate | read | null |
| 2024-03-19 | Total Disentanglement of Font Images into Style and Character Class Features | Daichi Haraguchi et.al. | 2403.12784 | translate | read | null |
| 2024-03-19 | Towards Controllable Face Generation with Semantic Latent Diffusion Models | Alex Ergasti et.al. | 2403.12743 | translate | read | link |
| 2024-03-19 | Tuning-Free Image Customization with Image and Text Guidance | Pengzhi Li et.al. | 2403.12658 | translate | read | null |
| 2024-03-19 | NSGAN: A Non-Dominant Sorting Optimisation-Based Generative Adversarial Design Framework for Alloy Discovery | Zhipeng Li et.al. | 2403.12495 | translate | read | null |
| 2024-03-18 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697 | translate | read | null |
| 2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667 | translate | read | null |
| 2024-03-18 | LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | Yuxin Cao et.al. | 2403.11656 | translate | read | null |
| 2024-03-18 | QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation | Zhizhen Zhou et.al. | 2403.11626 | translate | read | null |
| 2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | translate | read | null |
| 2024-03-18 | VmambaIR: Visual State Space Model for Image Restoration | Yuan Shi et.al. | 2403.11423 | translate | read | link |
| 2024-03-17 | StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining | Tushar Kataria et.al. | 2403.11340 | translate | read | null |
| 2024-03-17 | Fast Personalized Text-to-Image Syntheses With Attention Injection | Yuxuan Zhang et.al. | 2403.11284 | translate | read | null |
| 2024-03-17 | Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation | Silvia Corbara et.al. | 2403.11265 | translate | read | null |
| 2024-03-17 | Understanding Diffusion Models by Feynman’s Path Integral | Yuji Hirono et.al. | 2403.11262 | translate | read | null |
| 2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638 | translate | read | null |
| 2024-03-14 | Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering | Zeyu Liu et.al. | 2403.09622 | translate | read | null |
| 2024-03-14 | PrompTHis: Visualizing the Process and Influence of Prompt Editing during Text-to-Image Creation | Yuhan Guo et.al. | 2403.09615 | translate | read | null |
| 2024-03-14 | Counterfactual contrastive learning: robust representations via causal image synthesis | Melanie Roschewitz et.al. | 2403.09605 | translate | read | link |
| 2024-03-14 | Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang et.al. | 2403.09468 | translate | read | link |
| 2024-03-14 | Mitigating attribute amplification in counterfactual image generation | Tian Xia et.al. | 2403.09422 | translate | read | null |
| 2024-03-14 | Machine Learning Processes as Sources of Ambiguity: Insights from AI Art | Christian Sivertsen et.al. | 2403.09374 | translate | read | null |
| 2024-03-14 | Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction | Hanyu Chen et.al. | 2403.09355 | translate | read | null |
| 2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | translate | read | link |
| 2024-03-14 | Noise Dimension of GAN: An Image Compression Perspective | Ziran Zhu et.al. | 2403.09196 | translate | read | null |
| 2024-03-13 | Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data | Asad Aali et.al. | 2403.08728 | translate | read | link |
| 2024-03-13 | HAIFIT: Human-Centered AI for Fashion Image Translation | Jianan Jiang et.al. | 2403.08651 | translate | read | link |
| 2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | translate | read | null |
| 2024-03-13 | An Analysis of Human Alignment of Latent Diffusion Models | Lorenz Linhardt et.al. | 2403.08469 | translate | read | null |
| 2024-03-13 | Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report | Evi M. C. Huijben et.al. | 2403.08447 | translate | read | null |
| 2024-03-13 | Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification | Shuhan Li et.al. | 2403.08407 | translate | read | null |
| 2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | translate | read | null |
| 2024-03-13 | Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation | Tianyi Chu et.al. | 2403.08294 | translate | read | null |
| 2024-03-13 | VIGFace: Virtual Identity Generation Model for Face Image Synthesis | Minsoo Kim et.al. | 2403.08277 | translate | read | null |
| 2024-03-13 | CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | Aman Kumar et.al. | 2403.08261 | translate | read | null |
| 2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | translate | read | link |
| 2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842 | translate | read | null |
| 2024-03-12 | StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting | Kunhao Liu et.al. | 2403.07807 | translate | read | null |
| 2024-03-12 | BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives | Ivo M. Baltruschat et.al. | 2403.07800 | translate | read | null |
| 2024-03-12 | Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Yuxuan Zhang et.al. | 2403.07764 | translate | read | null |
| 2024-03-12 | Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings | Sahand Sharifzadeh et.al. | 2403.07750 | translate | read | null |
| 2024-03-12 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721 | translate | read | link |
| 2024-03-12 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711 | translate | read | link |
| 2024-03-12 | Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation | Di Mi et.al. | 2403.07673 | translate | read | null |
| 2024-03-12 | Gender-ambiguous voice generation through feminine speaking style transfer in male voices | Maria Koutsogiannaki et.al. | 2403.07661 | translate | read | null |
| 2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | translate | read | null |
| 2024-03-11 | Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs | Áron Samuel Kovács et.al. | 2403.06855 | translate | read | null |
| 2024-03-11 | Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting | Wenting Chen et.al. | 2403.06835 | translate | read | null |
| 2024-03-11 | Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection | Chuangchuang Tan et.al. | 2403.06803 | translate | read | link |
| 2024-03-11 | FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation | Pengchong Qiao et.al. | 2403.06775 | translate | read | link |
| 2024-03-11 | Distribution-Aware Data Expansion with Diffusion Models | Haowei Zhu et.al. | 2403.06741 | translate | read | link |
| 2024-03-11 | Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback | Adarsh N L et.al. | 2403.06735 | translate | read | null |
| 2024-03-11 | Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5 | Takatoshi Shibuya et.al. | 2403.06729 | translate | read | null |
| 2024-03-11 | FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder | Yang Chen et.al. | 2403.06576 | translate | read | null |
| 2024-03-11 | Active Generation for Image Classification | Tao Huang et.al. | 2403.06517 | translate | read | null |
| 2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | Yijiang Li et.al. | 2403.05523 | translate | read | null |
| 2024-03-08 | A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN | Cristiana Tiago et.al. | 2403.05384 | translate | read | null |
| 2024-03-08 | Federated Learning Method for Preserving Privacy in Face Recognition System | Enoch Solomon et.al. | 2403.05344 | translate | read | null |
| 2024-03-08 | Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation | Juan I. Pisula et.al. | 2403.05325 | translate | read | null |
| 2024-03-08 | GAN-based Massive MIMO Channel Model Trained on Measured Data | Florian Euchner et.al. | 2403.05321 | translate | read | null |
| 2024-03-08 | An Efficient Quasi-Random Sampling for Copulas | Sumin Wang et.al. | 2403.05281 | translate | read | null |
| 2024-03-08 | Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation | Junyan Wang et.al. | 2403.05239 | translate | read | null |
| 2024-03-08 | Synthetic Privileged Information Enhances Medical Image Representation Learning | Lucas Farndale et.al. | 2403.05220 | translate | read | null |
| 2024-03-08 | Denoising Autoregressive Representation Learning | Yazhe Li et.al. | 2403.05196 | translate | read | null |
| 2024-03-08 | Robust Semantic Communications for Speech-to-Text Translation | Zhenzi Weng et.al. | 2403.05187 | translate | read | null |
| 2024-03-07 | Photonic probabilistic machine learning using quantum vacuum noise | Seou Choi et.al. | 2403.04731 | translate | read | null |
| 2024-03-07 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | translate | read | null |
| 2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | translate | read | null |
| 2024-03-07 | Discriminative Probing and Tuning for Text-to-Image Generation | Leigang Qu et.al. | 2403.04321 | translate | read | null |
| 2024-03-06 | PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement | Zhijie Wang et.al. | 2403.04014 | translate | read | link |
| 2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | translate | read | null |
| 2024-03-06 | Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving | He Li et.al. | 2403.03541 | translate | read | null |
| 2024-03-06 | NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging | Takahiro Shirakawa et.al. | 2403.03485 | translate | read | link |
| 2024-03-06 | FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion | Hao Wang et.al. | 2403.03463 | translate | read | null |
| 2024-03-07 | DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network | Xiangquan Gui et.al. | 2403.03456 | translate | read | null |
| 2024-03-06 | Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing | Bingyan Liu et.al. | 2403.03431 | translate | read | null |
| 2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206 | translate | read | null |
| 2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | translate | read | link |
| 2024-03-05 | Doubly Abductive Counterfactual Inference for Text-based Image Editing | Xue Song et.al. | 2403.02981 | translate | read | null |
| 2024-03-05 | Bias in Generative AI | Mi Zhou et.al. | 2403.02726 | translate | read | null |
| 2024-03-05 | Time Weaver: A Conditional Time Series Generation Model | Sai Shankar Narasimhan et.al. | 2403.02682 | translate | read | null |
| 2024-03-04 | Transformer for Times Series: an Application to the S&P500 | Pierre Brugiere et.al. | 2403.02523 | translate | read | null |
| 2024-03-04 | NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function | Abdullah Nazhat Abdullah et.al. | 2403.02411 | translate | read | link |
| 2024-03-04 | ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models | Jiaxiang Cheng et.al. | 2403.02084 | translate | read | link |
| 2024-03-05 | Matrix Completion with Convex Optimization and Column Subset Selection | Antonina Krajewska et.al. | 2403.01919 | translate | read | link |
| 2024-03-04 | PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis | Zhengyao Lv et.al. | 2403.01852 | translate | read | link |
| 2024-03-02 | Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models | Neta Shaul et.al. | 2403.01329 | translate | read | null |
| 2024-03-02 | TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion | Salaheldin Mohamed et.al. | 2403.01212 | translate | read | null |
| 2024-03-02 | A Hybrid Model for Traffic Incident Detection based on Generative Adversarial Networks and Transformer Model | Xinying Lu et.al. | 2403.01147 | translate | read | null |
| 2024-03-02 | Distilling Text Style Transfer With Self-Explanation From LLMs | Chiyu Zhang et.al. | 2403.01106 | translate | read | null |
| 2024-03-01 | BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Sean Wellington et.al. | 2403.01008 | translate | read | null |
| 2024-03-01 | Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks | Kawana Stalin et.al. | 2403.00890 | translate | read | null |
| 2024-03-01 | Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks | Yuhao Liu et.al. | 2403.00644 | translate | read | null |
| 2024-03-01 | Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset | Ander Salaberria et.al. | 2403.00587 | translate | read | link |
| 2024-03-01 | Rethinking cluster-conditioned diffusion models | Nikolas Adaloglou et.al. | 2403.00570 | translate | read | null |
| 2024-03-01 | VisionLLaMA: A Unified LLaMA Interface for Vision Tasks | Xiangxiang Chu et.al. | 2403.00522 | translate | read | link |
(<a href=../Image_Generation.md>back to Image Generation</a>)