Image Generation - 2024-03 | Paper Arxiv Daily

Image Generation - 2024-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-03-29	Benchmarking Counterfactual Image Generation	Thomas Melistas et.al.	2403.20287	translate	read	link
2024-03-29	FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models	Barbara Toniella Corradini et.al.	2403.20105	translate	read	null
2024-03-29	SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image	Yunhao Li et.al.	2403.20018	translate	read	link
2024-03-29	FairRAG: Fair Human Generation via Fair Retrieval Augmentation	Robik Shrestha et.al.	2403.19964	translate	read	null
2024-03-28	Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks	Pooria Ashrafian et.al.	2403.19880	translate	read	link
2024-03-28	Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization	Yuhang Li et.al.	2403.19866	translate	read	null
2024-03-28	CLoRA: A Contrastive Approach to Compose Multiple LoRA Models	Tuna Han Salih Meral et.al.	2403.19776	translate	read	null
2024-03-28	Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond	Katherine Xu et.al.	2403.19653	translate	read	link
2024-03-28	GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models	Yusuf Dalva et.al.	2403.19645	translate	read	null
2024-03-28	Lane-Change in Dense Traffic with Model Predictive Control and Neural Networks	Sangjae Bae et.al.	2403.19633	translate	read	link
2024-03-28	Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models	Ole Hall et.al.	2403.19620	translate	read	null
2024-03-28	Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model	Zhicai Wang et.al.	2403.19600	translate	read	link
2024-03-28	Frame by Familiar Frame: Understanding Replication in Video Diffusion Models	Aimon Rahman et.al.	2403.19593	translate	read	null
2024-03-28	Locate, Assign, Refine: Taming Customized Image Inpainting with Text-Subject Guidance	Yulin Pan et.al.	2403.19534	translate	read	null
2024-03-28	Imperceptible Protection against Style Imitation from Diffusion Models	Namhyuk Ahn et.al.	2403.19254	translate	read	null
2024-03-28	QNCD: Quantization Noise Correction for Diffusion Models	Huanpeng Chu et.al.	2403.19140	translate	read	link
2024-03-28	Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs	John R. McNulty et.al.	2403.19107	translate	read	null
2024-03-27	Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching	Jannis Chemseddine et.al.	2403.18705	translate	read	null
2024-03-27	Attention Calibration for Disentangled Text-to-Image Personalization	Yanbing Zhang et.al.	2403.18551	translate	read	link
2024-03-27	DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis	Zhongxi Chen et.al.	2403.18471	translate	read	link
2024-03-27	DiffStyler: Diffusion-based Localized Image Style Transfer	Shaoxu Li et.al.	2403.18461	translate	read	null
2024-03-27	U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models	Ilias Mitsouras et.al.	2403.18425	translate	read	null
2024-03-27	ECNet: Effective Controllable Text-to-Image Diffusion Models	Sicheng Li et.al.	2403.18417	translate	read	null
2024-03-27	Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks	Srinitish Srinivasan et.al.	2403.18397	translate	read	link
2024-03-27	Ship in Sight: Diffusion Models for Ship-Image Super Resolution	Luigi Sigillo et.al.	2403.18370	translate	read	link
2024-03-27	DSF-GAN: DownStream Feedback Generative Adversarial Network	Oriel Perets et.al.	2403.18267	translate	read	link
2024-03-27	Don’t Look into the Dark: Latent Codes for Pluralistic Image Inpainting	Haiwei Chen et.al.	2403.18186	translate	read	null
2024-03-26	Boosting Diffusion Models with Moving Average Sampling in Frequency Domain	Yurui Qian et.al.	2403.17870	translate	read	null
2024-03-26	CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation	Yongrui Yu et.al.	2403.17770	translate	read	null
2024-03-26	FaultGuard: A Generative Approach to Resilient Fault Prediction in Smart Electrical Grids	Emad Efatinasab et.al.	2403.17494	translate	read	null
2024-03-26	LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection	Yunpeng Luo et.al.	2403.17465	translate	read	null
2024-03-26	An inexact proximal MM method for a class of nonconvex composite image reconstruction models	Bujin Li et.al.	2403.17450	translate	read	null
2024-03-25	DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment	Stella Bounareli et.al.	2403.17217	translate	read	null
2024-03-25	FlashFace: Human Image Personalization with High-fidelity Identity Preservation	Shilong Zhang et.al.	2403.17008	translate	read	null
2024-03-25	SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer	Rui Zhu et.al.	2403.17004	translate	read	null
2024-03-25	Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation	Omer Dahary et.al.	2403.16990	translate	read	null
2024-03-25	Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance	Jingyuan Zhu et.al.	2403.16954	translate	read	null
2024-03-25	Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise	Dilum Fernando et.al.	2403.16790	translate	read	null
2024-03-25	Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases	Sophie Starck et.al.	2403.16776	translate	read	null
2024-03-25	Multi-Scale Texture Loss for CT denoising with GANs	Francesco Di Feola et.al.	2403.16640	translate	read	link
2024-03-25	SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions	Yuda Song et.al.	2403.16627	translate	read	null
2024-03-25	Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network	Yijin Zhou et.al.	2403.16540	translate	read	null
2024-03-25	An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models	Zizhao Hu et.al.	2403.16530	translate	read	null
2024-03-25	Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator	Takuhiro Kaneko et.al.	2403.16464	translate	read	null
2024-03-25	Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation	Sanyam Lakhanpal et.al.	2403.16422	translate	read	null
2024-03-25	Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation	Yingshan Chang et.al.	2403.16394	translate	read	null
2024-03-25	Illuminating Systematic Trends in Nuclear Data with Generative Machine Learning Models	Jordan M. R. Fox et.al.	2403.16389	translate	read	null
2024-03-25	FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models	Lin Zhao et.al.	2403.16379	translate	read	null
2024-03-24	Fill in the __ (a Diffusion-based Image Inpainting Pipeline)	Eyoel Gebre et.al.	2403.16016	translate	read	null
2024-03-22	DragAPart: Learning a Part-Level Motion Prior for Articulated Objects	Ruining Li et.al.	2403.15382	translate	read	null
2024-03-22	Long-CLIP: Unlocking the Long-Text Capability of CLIP	Beichen Zhang et.al.	2403.15378	translate	read	null
2024-03-22	A Wasserstein perspective of Vanilla GANs	Lea Kunkel et.al.	2403.15312	translate	read	null
2024-03-22	Controlled Training Data Generation with Diffusion Models	Teresa Yeo et.al.	2403.15309	translate	read	null
2024-03-22	Robust Utility Optimization via a GAN Approach	Florian Krach et.al.	2403.15243	translate	read	null
2024-03-22	A Multimodal Approach for Cross-Domain Image Retrieval	Lucas Iijima et.al.	2403.15152	translate	read	null
2024-03-22	MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration	Zhichao Wei et.al.	2403.15059	translate	read	null
2024-03-22	Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning	Bumsoo Kim et.al.	2403.15048	translate	read	null
2024-03-22	Generative Active Learning for Image Synthesis Personalization	Xulu Zhang et.al.	2403.14987	translate	read	null
2024-03-22	CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model	Seungdae Han et.al.	2403.14944	translate	read	null
2024-03-21	Implicit Style-Content Separation using B-LoRA	Yarden Frenkel et.al.	2403.14572	translate	read	null
2024-03-21	DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing	Yueru Jia et.al.	2403.14487	translate	read	null
2024-03-21	AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks	Max Ku et.al.	2403.14468	translate	read	null
2024-03-21	Analysing Diffusion Segmentation for Medical Images	Mathias Öttl et.al.	2403.14440	translate	read	null
2024-03-21	Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation	Mathias Öttl et.al.	2403.14429	translate	read	null
2024-03-21	HySim: An Efficient Hybrid Similarity Measure for Patch Matching in Image Inpainting	Saad Noufel et.al.	2403.14292	translate	read	null
2024-03-21	Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models	Pablo Marcos-Manchón et.al.	2403.14291	translate	read	link
2024-03-21	Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations	Xun Lin et.al.	2403.14250	translate	read	null
2024-03-21	StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN	Jongwoo Choi et.al.	2403.14186	translate	read	null
2024-03-21	QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping	Zhuang Xiong et.al.	2403.14070	translate	read	null
2024-03-20	Learning from Models and Data for Visual Grounding	Ruozhen He et.al.	2403.13804	translate	read	null
2024-03-20	Step-Calibrated Diffusion for Biomedical Optical Image Restoration	Yiwei Lyu et.al.	2403.13680	translate	read	null
2024-03-20	ReGround: Improving Textual and Spatial Grounding at No Cost	Yuseung Lee et.al.	2403.13589	translate	read	null
2024-03-20	Diversity-aware Channel Pruning for StyleGAN Compression	Jiwoo Chung et.al.	2403.13548	translate	read	link
2024-03-20	IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models	Siying Cui et.al.	2403.13535	translate	read	null
2024-03-20	Deepfake Detection without Deepfakes: Generalization via Synthetic Frequency Patterns Injection	Davide Alessandro Coccomini et.al.	2403.13479	translate	read	null
2024-03-20	S2DM: Sector-Shaped Diffusion Models for Video Generation	Haoran Lang et.al.	2403.13408	translate	read	null
2024-03-20	IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis	Feng Liu et.al.	2403.13378	translate	read	null
2024-03-20	AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation	Jingkun An et.al.	2403.13352	translate	read	null
2024-03-20	TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation	Santosh Sanjeev et.al.	2403.13343	translate	read	null
2024-03-19	FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis	Linjiang Huang et.al.	2403.12963	translate	read	link
2024-03-19	Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties	Efrain Torres-Lomas et.al.	2403.12935	translate	read	null
2024-03-19	You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs	Yihong Luo et.al.	2403.12931	translate	read	link
2024-03-19	Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model	Jiajie Yang et.al.	2403.12915	translate	read	link
2024-03-19	Generative Enhancement for 3D Medical Images	Lingting Zhu et.al.	2403.12852	translate	read	link
2024-03-19	How Spammers and Scammers Leverage AI-Generated Images on Facebook for Audience Growth	Renee DiResta et.al.	2403.12838	translate	read	null
2024-03-19	Total Disentanglement of Font Images into Style and Character Class Features	Daichi Haraguchi et.al.	2403.12784	translate	read	null
2024-03-19	Towards Controllable Face Generation with Semantic Latent Diffusion Models	Alex Ergasti et.al.	2403.12743	translate	read	link
2024-03-19	Tuning-Free Image Customization with Image and Text Guidance	Pengzhi Li et.al.	2403.12658	translate	read	null
2024-03-19	NSGAN: A Non-Dominant Sorting Optimisation-Based Generative Adversarial Design Framework for Alloy Discovery	Zhipeng Li et.al.	2403.12495	translate	read	null
2024-03-18	Urban Scene Diffusion through Semantic Occupancy Map	Junge Zhang et.al.	2403.11697	translate	read	null
2024-03-18	Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection	Julia Wolleb et.al.	2403.11667	translate	read	null
2024-03-18	LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model	Yuxin Cao et.al.	2403.11656	translate	read	null
2024-03-18	QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation	Zhizhen Zhou et.al.	2403.11626	translate	read	null
2024-03-18	CRS-Diff: Controllable Generative Remote Sensing Foundation Model	Datao Tang et.al.	2403.11614	translate	read	null
2024-03-18	VmambaIR: Visual State Space Model for Image Restoration	Yuan Shi et.al.	2403.11423	translate	read	link
2024-03-17	StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining	Tushar Kataria et.al.	2403.11340	translate	read	null
2024-03-17	Fast Personalized Text-to-Image Syntheses With Attention Injection	Yuxuan Zhang et.al.	2403.11284	translate	read	null
2024-03-17	Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation	Silvia Corbara et.al.	2403.11265	translate	read	null
2024-03-17	Understanding Diffusion Models by Feynman’s Path Integral	Yuji Hirono et.al.	2403.11262	translate	read	null
2024-03-14	SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior	Huan-ang Gao et.al.	2403.09638	translate	read	null
2024-03-14	Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering	Zeyu Liu et.al.	2403.09622	translate	read	null
2024-03-14	PrompTHis: Visualizing the Process and Influence of Prompt Editing during Text-to-Image Creation	Yuhan Guo et.al.	2403.09615	translate	read	null
2024-03-14	Counterfactual contrastive learning: robust representations via causal image synthesis	Melanie Roschewitz et.al.	2403.09605	translate	read	link
2024-03-14	Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing	Wonjun Kang et.al.	2403.09468	translate	read	link
2024-03-14	Mitigating attribute amplification in counterfactual image generation	Tian Xia et.al.	2403.09422	translate	read	null
2024-03-14	Machine Learning Processes as Sources of Ambiguity: Insights from AI Art	Christian Sivertsen et.al.	2403.09374	translate	read	null
2024-03-14	Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction	Hanyu Chen et.al.	2403.09355	translate	read	null
2024-03-14	StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images	Robert Jewsbury et.al.	2403.09302	translate	read	link
2024-03-14	Noise Dimension of GAN: An Image Compression Perspective	Ziran Zhu et.al.	2403.09196	translate	read	null
2024-03-13	Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data	Asad Aali et.al.	2403.08728	translate	read	link
2024-03-13	HAIFIT: Human-Centered AI for Fashion Image Translation	Jianan Jiang et.al.	2403.08651	translate	read	link
2024-03-13	Gaussian Splatting in Style	Abhishek Saroha et.al.	2403.08498	translate	read	null
2024-03-13	An Analysis of Human Alignment of Latent Diffusion Models	Lorenz Linhardt et.al.	2403.08469	translate	read	null
2024-03-13	Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report	Evi M. C. Huijben et.al.	2403.08447	translate	read	null
2024-03-13	Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification	Shuhan Li et.al.	2403.08407	translate	read	null
2024-03-13	StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields	Hongbin Xu et.al.	2403.08310	translate	read	null
2024-03-13	Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation	Tianyi Chu et.al.	2403.08294	translate	read	null
2024-03-13	VIGFace: Virtual Identity Generation Model for Face Image Synthesis	Minsoo Kim et.al.	2403.08277	translate	read	null
2024-03-13	CoroNetGAN: Controlled Pruning of GANs via Hypernetworks	Aman Kumar et.al.	2403.08261	translate	read	null
2024-03-12	Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation	Shihao Zhao et.al.	2403.07860	translate	read	link
2024-03-12	Quantifying and Mitigating Privacy Risks for Tabular Generative Models	Chaoyi Zhu et.al.	2403.07842	translate	read	null
2024-03-12	StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting	Kunhao Liu et.al.	2403.07807	translate	read	null
2024-03-12	BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives	Ivo M. Baltruschat et.al.	2403.07800	translate	read	null
2024-03-12	Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model	Yuxuan Zhang et.al.	2403.07764	translate	read	null
2024-03-12	Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings	Sahand Sharifzadeh et.al.	2403.07750	translate	read	null
2024-03-12	Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion	Dongyang Li et.al.	2403.07721	translate	read	link
2024-03-12	SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces	Yuta Oshima et.al.	2403.07711	translate	read	link
2024-03-12	Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation	Di Mi et.al.	2403.07673	translate	read	null
2024-03-12	Gender-ambiguous voice generation through feminine speaking style transfer in male voices	Maria Koutsogiannaki et.al.	2403.07661	translate	read	null
2024-03-11	BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion	Xuan Ju et.al.	2403.06976	translate	read	null
2024-03-11	Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs	Áron Samuel Kovács et.al.	2403.06855	translate	read	null
2024-03-11	Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting	Wenting Chen et.al.	2403.06835	translate	read	null
2024-03-11	Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection	Chuangchuang Tan et.al.	2403.06803	translate	read	link
2024-03-11	FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation	Pengchong Qiao et.al.	2403.06775	translate	read	link
2024-03-11	Distribution-Aware Data Expansion with Diffusion Models	Haowei Zhu et.al.	2403.06741	translate	read	link
2024-03-11	Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback	Adarsh N L et.al.	2403.06735	translate	read	null
2024-03-11	Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5	Takatoshi Shibuya et.al.	2403.06729	translate	read	null
2024-03-11	FFAD: A Novel Metric for Assessing Generated Time Series Data Utilizing Fourier Transform and Auto-encoder	Yang Chen et.al.	2403.06576	translate	read	null
2024-03-11	Active Generation for Image Classification	Tao Huang et.al.	2403.06517	translate	read	null
2024-03-08	Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola	Yijiang Li et.al.	2403.05523	translate	read	null
2024-03-08	A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN	Cristiana Tiago et.al.	2403.05384	translate	read	null
2024-03-08	Federated Learning Method for Preserving Privacy in Face Recognition System	Enoch Solomon et.al.	2403.05344	translate	read	null
2024-03-08	Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation	Juan I. Pisula et.al.	2403.05325	translate	read	null
2024-03-08	GAN-based Massive MIMO Channel Model Trained on Measured Data	Florian Euchner et.al.	2403.05321	translate	read	null
2024-03-08	An Efficient Quasi-Random Sampling for Copulas	Sumin Wang et.al.	2403.05281	translate	read	null
2024-03-08	Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation	Junyan Wang et.al.	2403.05239	translate	read	null
2024-03-08	Synthetic Privileged Information Enhances Medical Image Representation Learning	Lucas Farndale et.al.	2403.05220	translate	read	null
2024-03-08	Denoising Autoregressive Representation Learning	Yazhe Li et.al.	2403.05196	translate	read	null
2024-03-08	Robust Semantic Communications for Speech-to-Text Translation	Zhenzi Weng et.al.	2403.05187	translate	read	null
2024-03-07	Photonic probabilistic machine learning using quantum vacuum noise	Seou Choi et.al.	2403.04731	translate	read	null
2024-03-07	PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation	Junsong Chen et.al.	2403.04692	translate	read	null
2024-03-07	A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images	Cristiana Tiago et.al.	2403.04612	translate	read	null
2024-03-07	Discriminative Probing and Tuning for Text-to-Image Generation	Leigang Qu et.al.	2403.04321	translate	read	null
2024-03-06	PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement	Zhijie Wang et.al.	2403.04014	translate	read	link
2024-03-06	Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer	Naifu Xue et.al.	2403.03736	translate	read	null
2024-03-06	Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving	He Li et.al.	2403.03541	translate	read	null
2024-03-06	NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging	Takahiro Shirakawa et.al.	2403.03485	translate	read	link
2024-03-06	FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion	Hao Wang et.al.	2403.03463	translate	read	null
2024-03-07	DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network	Xiangquan Gui et.al.	2403.03456	translate	read	null
2024-03-06	Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing	Bingyan Liu et.al.	2403.03431	translate	read	null
2024-03-05	Scaling Rectified Flow Transformers for High-Resolution Image Synthesis	Patrick Esser et.al.	2403.03206	translate	read	null
2024-03-05	Behavior Generation with Latent Actions	Seungjae Lee et.al.	2403.03181	translate	read	link
2024-03-05	Doubly Abductive Counterfactual Inference for Text-based Image Editing	Xue Song et.al.	2403.02981	translate	read	null
2024-03-05	Bias in Generative AI	Mi Zhou et.al.	2403.02726	translate	read	null
2024-03-05	Time Weaver: A Conditional Time Series Generation Model	Sai Shankar Narasimhan et.al.	2403.02682	translate	read	null
2024-03-04	Transformer for Times Series: an Application to the S&P500	Pierre Brugiere et.al.	2403.02523	translate	read	null
2024-03-04	NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function	Abdullah Nazhat Abdullah et.al.	2403.02411	translate	read	link
2024-03-04	ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models	Jiaxiang Cheng et.al.	2403.02084	translate	read	link
2024-03-05	Matrix Completion with Convex Optimization and Column Subset Selection	Antonina Krajewska et.al.	2403.01919	translate	read	link
2024-03-04	PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis	Zhengyao Lv et.al.	2403.01852	translate	read	link
2024-03-02	Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models	Neta Shaul et.al.	2403.01329	translate	read	null
2024-03-02	TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion	Salaheldin Mohamed et.al.	2403.01212	translate	read	null
2024-03-02	A Hybrid Model for Traffic Incident Detection based on Generative Adversarial Networks and Transformer Model	Xinying Lu et.al.	2403.01147	translate	read	null
2024-03-02	Distilling Text Style Transfer With Self-Explanation From LLMs	Chiyu Zhang et.al.	2403.01106	translate	read	null
2024-03-01	BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs)	Sean Wellington et.al.	2403.01008	translate	read	null
2024-03-01	Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks	Kawana Stalin et.al.	2403.00890	translate	read	null
2024-03-01	Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks	Yuhao Liu et.al.	2403.00644	translate	read	null
2024-03-01	Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset	Ander Salaberria et.al.	2403.00587	translate	read	link
2024-03-01	Rethinking cluster-conditioned diffusion models	Nikolas Adaloglou et.al.	2403.00570	translate	read	null
2024-03-01	VisionLLaMA: A Unified LLaMA Interface for Vision Tasks	Xiangxiang Chu et.al.	2403.00522	translate	read	link

(<a href=../Image_Generation.md>back to Image Generation</a>)