Image Generation - 2024-03

Publish Date Title Authors PDF Translate Read Code
2024-03-29 Benchmarking Counterfactual Image Generation Thomas Melistas et.al. 2403.20287 translate read link
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105 translate read null
2024-03-29 SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image Yunhao Li et.al. 2403.20018 translate read link
2024-03-29 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 translate read null
2024-03-28 Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks Pooria Ashrafian et.al. 2403.19880 translate read link
2024-03-28 Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization Yuhang Li et.al. 2403.19866 translate read null
2024-03-28 CLoRA: A Contrastive Approach to Compose Multiple LoRA Models Tuna Han Salih Meral et.al. 2403.19776 translate read null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653 translate read link
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645 translate read null
2024-03-28 Lane-Change in Dense Traffic with Model Predictive Control and Neural Networks Sangjae Bae et.al. 2403.19633 translate read link
2024-03-28 Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models Ole Hall et.al. 2403.19620 translate read null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 translate read link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 translate read null
2024-03-28 Locate, Assign, Refine: Taming Customized Image Inpainting with Text-Subject Guidance Yulin Pan et.al. 2403.19534 translate read null
2024-03-28 Imperceptible Protection against Style Imitation from Diffusion Models Namhyuk Ahn et.al. 2403.19254 translate read null
2024-03-28 QNCD: Quantization Noise Correction for Diffusion Models Huanpeng Chu et.al. 2403.19140 translate read link
2024-03-28 Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs John R. McNulty et.al. 2403.19107 translate read null
2024-03-27 Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching Jannis Chemseddine et.al. 2403.18705 translate read null
2024-03-27 Attention Calibration for Disentangled Text-to-Image Personalization Yanbing Zhang et.al. 2403.18551 translate read link
2024-03-27 DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis Zhongxi Chen et.al. 2403.18471 translate read link
2024-03-27 DiffStyler: Diffusion-based Localized Image Style Transfer Shaoxu Li et.al. 2403.18461 translate read null
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425 translate read null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417 translate read null
2024-03-27 Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks Srinitish Srinivasan et.al. 2403.18397 translate read link
2024-03-27 Ship in Sight: Diffusion Models for Ship-Image Super Resolution Luigi Sigillo et.al. 2403.18370 translate read link
2024-03-27 DSF-GAN: DownStream Feedback Generative Adversarial Network Oriel Perets et.al. 2403.18267 translate read link
2024-03-27 Don’t Look into the Dark: Latent Codes for Pluralistic Image Inpainting Haiwei Chen et.al. 2403.18186 translate read null
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870 translate read null
2024-03-26 CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation Yongrui Yu et.al. 2403.17770 translate read null
2024-03-26 FaultGuard: A Generative Approach to Resilient Fault Prediction in Smart Electrical Grids Emad Efatinasab et.al. 2403.17494 translate read null
2024-03-26 LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection Yunpeng Luo et.al. 2403.17465 translate read null
2024-03-26 An inexact proximal MM method for a class of nonconvex composite image reconstruction models Bujin Li et.al. 2403.17450 translate read null
2024-03-25 DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment Stella Bounareli et.al. 2403.17217 translate read null
2024-03-25 FlashFace: Human Image Personalization with High-fidelity Identity Preservation Shilong Zhang et.al. 2403.17008 translate read null
2024-03-25 SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer Rui Zhu et.al. 2403.17004 translate read null
2024-03-25 Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation Omer Dahary et.al. 2403.16990 translate read null
2024-03-25 Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance Jingyuan Zhu et.al. 2403.16954 translate read null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790 translate read null
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776 translate read null
2024-03-25 Multi-Scale Texture Loss for CT denoising with GANs Francesco Di Feola et.al. 2403.16640 translate read link
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627 translate read null
2024-03-25 Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network Yijin Zhou et.al. 2403.16540 translate read null
2024-03-25 An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models Zizhao Hu et.al. 2403.16530 translate read null
2024-03-25 Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator Takuhiro Kaneko et.al. 2403.16464 translate read null
2024-03-25 Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Sanyam Lakhanpal et.al. 2403.16422 translate read null
2024-03-25 Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation Yingshan Chang et.al. 2403.16394 translate read null
2024-03-25 Illuminating Systematic Trends in Nuclear Data with Generative Machine Learning Models Jordan M. R. Fox et.al. 2403.16389 translate read null
2024-03-25 FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models Lin Zhao et.al. 2403.16379 translate read null
2024-03-24 Fill in the __ (a Diffusion-based Image Inpainting Pipeline) Eyoel Gebre et.al. 2403.16016 translate read null
2024-03-22 DragAPart: Learning a Part-Level Motion Prior for Articulated Objects Ruining Li et.al. 2403.15382 translate read null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 translate read null
2024-03-22 A Wasserstein perspective of Vanilla GANs Lea Kunkel et.al. 2403.15312 translate read null
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309 translate read null
2024-03-22 Robust Utility Optimization via a GAN Approach Florian Krach et.al. 2403.15243 translate read null
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 translate read null
2024-03-22 MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration Zhichao Wei et.al. 2403.15059 translate read null
2024-03-22 Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning Bumsoo Kim et.al. 2403.15048 translate read null
2024-03-22 Generative Active Learning for Image Synthesis Personalization Xulu Zhang et.al. 2403.14987 translate read null
2024-03-22 CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model Seungdae Han et.al. 2403.14944 translate read null
2024-03-21 Implicit Style-Content Separation using B-LoRA Yarden Frenkel et.al. 2403.14572 translate read null
2024-03-21 DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Yueru Jia et.al. 2403.14487 translate read null
2024-03-21 AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks Max Ku et.al. 2403.14468 translate read null
2024-03-21 Analysing Diffusion Segmentation for Medical Images Mathias Öttl et.al. 2403.14440 translate read null
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429 translate read null
2024-03-21 HySim: An Efficient Hybrid Similarity Measure for Patch Matching in Image Inpainting Saad Noufel et.al. 2403.14292 translate read null
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 translate read link
2024-03-21 Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations Xun Lin et.al. 2403.14250 translate read null
2024-03-21 StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN Jongwoo Choi et.al. 2403.14186 translate read null
2024-03-21 QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping Zhuang Xiong et.al. 2403.14070 translate read null
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804 translate read null
2024-03-20 Step-Calibrated Diffusion for Biomedical Optical Image Restoration Yiwei Lyu et.al. 2403.13680 translate read null
2024-03-20 ReGround: Improving Textual and Spatial Grounding at No Cost Yuseung Lee et.al. 2403.13589 translate read null
2024-03-20 Diversity-aware Channel Pruning for StyleGAN Compression Jiwoo Chung et.al. 2403.13548 translate read link
2024-03-20 IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Siying Cui et.al. 2403.13535 translate read null
2024-03-20 Deepfake Detection without Deepfakes: Generalization via Synthetic Frequency Patterns Injection Davide Alessandro Coccomini et.al. 2403.13479 translate read null
2024-03-20 S2DM: Sector-Shaped Diffusion Models for Video Generation Haoran Lang et.al. 2403.13408 translate read null
2024-03-20 IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis Feng Liu et.al. 2403.13378 translate read null
2024-03-20 AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Jingkun An et.al. 2403.13352 translate read null
2024-03-20 TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation Santosh Sanjeev et.al. 2403.13343 translate read null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963 translate read link
2024-03-19 Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties Efrain Torres-Lomas et.al. 2403.12935 translate read null
2024-03-19 You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs Yihong Luo et.al. 2403.12931 translate read link
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915 translate read link
2024-03-19 Generative Enhancement for 3D Medical Images Lingting Zhu et.al. 2403.12852 translate read link
2024-03-19 How Spammers and Scammers Leverage AI-Generated Images on Facebook for Audience Growth Renee DiResta et.al. 2403.12838 translate read null
2024-03-19 Total Disentanglement of Font Images into Style and Character Class Features Daichi Haraguchi et.al. 2403.12784 translate read null
2024-03-19 Towards Controllable Face Generation with Semantic Latent Diffusion Models Alex Ergasti et.al. 2403.12743 translate read link
2024-03-19 Tuning-Free Image Customization with Image and Text Guidance Pengzhi Li et.al. 2403.12658 translate read null
2024-03-19 NSGAN: A Non-Dominant Sorting Optimisation-Based Generative Adversarial Design Framework for Alloy Discovery Zhipeng Li et.al. 2403.12495 translate read null
2024-03-18 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697 translate read null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667 translate read null
2024-03-18 LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model Yuxin Cao et.al. 2403.11656 translate read null
2024-03-18 QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation Zhizhen Zhou et.al. 2403.11626 translate read null
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614 translate read null
2024-03-18 VmambaIR: Visual State Space Model for Image Restoration Yuan Shi et.al. 2403.11423 translate read link
2024-03-17 StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining Tushar Kataria et.al. 2403.11340 translate read null
2024-03-17 Fast Personalized Text-to-Image Syntheses With Attention Injection Yuxuan Zhang et.al. 2403.11284 translate read null
2024-03-17 Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation Silvia Corbara et.al. 2403.11265 translate read null
2024-03-17 Understanding Diffusion Models by Feynman’s Path Integral Yuji Hirono et.al. 2403.11262 translate read null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638 translate read null
2024-03-14 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Zeyu Liu et.al. 2403.09622 translate read null
2024-03-14 PrompTHis: Visualizing the Process and Influence of Prompt Editing during Text-to-Image Creation Yuhan Guo et.al. 2403.09615 translate read null
2024-03-14 Counterfactual contrastive learning: robust representations via causal image synthesis Melanie Roschewitz et.al. 2403.09605 translate read link
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468 translate read link
2024-03-14 Mitigating attribute amplification in counterfactual image generation Tian Xia et.al. 2403.09422 translate read null
2024-03-14 Machine Learning Processes as Sources of Ambiguity: Insights from AI Art Christian Sivertsen et.al. 2403.09374 translate read null
2024-03-14 Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction Hanyu Chen et.al. 2403.09355 translate read null
2024-03-14 StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images Robert Jewsbury et.al. 2403.09302 translate read link
2024-03-14 Noise Dimension of GAN: An Image Compression Perspective Ziran Zhu et.al. 2403.09196 translate read null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728 translate read link
2024-03-13 HAIFIT: Human-Centered AI for Fashion Image Translation Jianan Jiang et.al. 2403.08651 translate read link
2024-03-13 Gaussian Splatting in Style Abhishek Saroha et.al. 2403.08498 translate read null
2024-03-13 An Analysis of Human Alignment of Latent Diffusion Models Lorenz Linhardt et.al. 2403.08469 translate read null
2024-03-13 Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report Evi M. C. Huijben et.al. 2403.08447 translate read null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407 translate read null
2024-03-13 StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields Hongbin Xu et.al. 2403.08310 translate read null
2024-03-13 Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation Tianyi Chu et.al. 2403.08294 translate read null
2024-03-13 VIGFace: Virtual Identity Generation Model for Face Image Synthesis Minsoo Kim et.al. 2403.08277 translate read null
2024-03-13 CoroNetGAN: Controlled Pruning of GANs via Hypernetworks Aman Kumar et.al. 2403.08261 translate read null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860 translate read link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842 translate read null
2024-03-12 StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting Kunhao Liu et.al. 2403.07807 translate read null
2024-03-12 BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives Ivo M. Baltruschat et.al. 2403.07800 translate read null
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764 translate read null
2024-03-12 Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings Sahand Sharifzadeh et.al. 2403.07750 translate read null
2024-03-12 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721 translate read link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711 translate read link
2024-03-12 Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation Di Mi et.al. 2403.07673 translate read null
2024-03-12 Gender-ambiguous voice generation through feminine speaking style transfer in male voices Maria Koutsogiannaki et.al. 2403.07661 translate read null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976 translate read null
2024-03-11 Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs Áron Samuel Kovács et.al. 2403.06855 translate read null
2024-03-11 Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting Wenting Chen et.al. 2403.06835 translate read null
2024-03-11 Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection Chuangchuang Tan et.al. 2403.06803 translate read link
2024-03-11 FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation Pengchong Qiao et.al. 2403.06775 translate read link
2024-03-11 Distribution-Aware Data Expansion with Diffusion Models Haowei Zhu et.al. 2403.06741 translate read link
2024-03-11 Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback Adarsh N L et.al. 2403.06735 translate read null
2024-03-11 Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5 Takatoshi Shibuya et.al. 2403.06729 translate read null
2024-03-11 FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder Yang Chen et.al. 2403.06576 translate read null
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517 translate read null
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523 translate read null
2024-03-08 A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN Cristiana Tiago et.al. 2403.05384 translate read null
2024-03-08 Federated Learning Method for Preserving Privacy in Face Recognition System Enoch Solomon et.al. 2403.05344 translate read null
2024-03-08 Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation Juan I. Pisula et.al. 2403.05325 translate read null
2024-03-08 GAN-based Massive MIMO Channel Model Trained on Measured Data Florian Euchner et.al. 2403.05321 translate read null
2024-03-08 An Efficient Quasi-Random Sampling for Copulas Sumin Wang et.al. 2403.05281 translate read null
2024-03-08 Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang et.al. 2403.05239 translate read null
2024-03-08 Synthetic Privileged Information Enhances Medical Image Representation Learning Lucas Farndale et.al. 2403.05220 translate read null
2024-03-08 Denoising Autoregressive Representation Learning Yazhe Li et.al. 2403.05196 translate read null
2024-03-08 Robust Semantic Communications for Speech-to-Text Translation Zhenzi Weng et.al. 2403.05187 translate read null
2024-03-07 Photonic probabilistic machine learning using quantum vacuum noise Seou Choi et.al. 2403.04731 translate read null
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 translate read null
2024-03-07 A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images Cristiana Tiago et.al. 2403.04612 translate read null
2024-03-07 Discriminative Probing and Tuning for Text-to-Image Generation Leigang Qu et.al. 2403.04321 translate read null
2024-03-06 PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement Zhijie Wang et.al. 2403.04014 translate read link
2024-03-06 Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer Naifu Xue et.al. 2403.03736 translate read null
2024-03-06 Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving He Li et.al. 2403.03541 translate read null
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485 translate read link
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463 translate read null
2024-03-07 DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network Xiangquan Gui et.al. 2403.03456 translate read null
2024-03-06 Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing Bingyan Liu et.al. 2403.03431 translate read null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206 translate read null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181 translate read link
2024-03-05 Doubly Abductive Counterfactual Inference for Text-based Image Editing Xue Song et.al. 2403.02981 translate read null
2024-03-05 Bias in Generative AI Mi Zhou et.al. 2403.02726 translate read null
2024-03-05 Time Weaver: A Conditional Time Series Generation Model Sai Shankar Narasimhan et.al. 2403.02682 translate read null
2024-03-04 Transformer for Times Series: an Application to the S&P500 Pierre Brugiere et.al. 2403.02523 translate read null
2024-03-04 NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function Abdullah Nazhat Abdullah et.al. 2403.02411 translate read link
2024-03-04 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Jiaxiang Cheng et.al. 2403.02084 translate read link
2024-03-05 Matrix Completion with Convex Optimization and Column Subset Selection Antonina Krajewska et.al. 2403.01919 translate read link
2024-03-04 PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis Zhengyao Lv et.al. 2403.01852 translate read link
2024-03-02 Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models Neta Shaul et.al. 2403.01329 translate read null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212 translate read null
2024-03-02 A Hybrid Model for Traffic Incident Detection based on Generative Adversarial Networks and Transformer Model Xinying Lu et.al. 2403.01147 translate read null
2024-03-02 Distilling Text Style Transfer With Self-Explanation From LLMs Chiyu Zhang et.al. 2403.01106 translate read null
2024-03-01 BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) Sean Wellington et.al. 2403.01008 translate read null
2024-03-01 Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks Kawana Stalin et.al. 2403.00890 translate read null
2024-03-01 Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks Yuhao Liu et.al. 2403.00644 translate read null
2024-03-01 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Ander Salaberria et.al. 2403.00587 translate read link
2024-03-01 Rethinking cluster-conditioned diffusion models Nikolas Adaloglou et.al. 2403.00570 translate read null
2024-03-01 VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Xiangxiang Chu et.al. 2403.00522 translate read link

(<a href=../Image_Generation.md>back to Image Generation</a>)