Image Generation - 2024-04 | Paper Arxiv Daily

Image Generation - 2024-04

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-04-30	IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images	Shadab Ahamed et.al.	2405.00239	translate	read	link
2024-04-30	DOCCI: Descriptions of Connected and Contrasting Images	Yasumasa Onoe et.al.	2404.19753	translate	read	null
2024-04-30	Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation	Yunhao Ge et.al.	2404.19752	translate	read	null
2024-04-30	SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration	Yuto Nakashima et.al.	2404.19693	translate	read	null
2024-04-30	Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model	Denys Godwin et.al.	2404.19609	translate	read	null
2024-04-30	TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models	Teng Zhou et.al.	2404.19475	translate	read	null
2024-04-30	InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation	Chanran Kim et.al.	2404.19427	translate	read	null
2024-04-30	NeRF-Insert: 3D Local Editing with Multimodal Control Signals	Benet Oriol Sabat et.al.	2404.19204	translate	read	null
2024-04-29	DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing	Minghao Chen et.al.	2404.18929	translate	read	null
2024-04-29	TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation	Junhao Cheng et.al.	2404.18919	translate	read	null
2024-04-29	Hide and Seek: How Does Watermarking Impact Face Recognition?	Yuguang Yao et.al.	2404.18890	translate	read	null
2024-04-29	Learning Mixtures of Gaussians Using Diffusion Models	Khashayar Gatmiry et.al.	2404.18869	translate	read	null
2024-04-29	Socially Adaptive Path Planning Based on Generative Adversarial Network	Yao Wang et.al.	2404.18687	translate	read	null
2024-04-29	FlexiFilm: Long Video Generation with Flexible Conditions	Yichen Ouyang et.al.	2404.18620	translate	read	link
2024-04-29	Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting	Tianyidan Xie et.al.	2404.18598	translate	read	null
2024-04-29	SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection Methods	Manos Schinas et.al.	2404.18552	translate	read	link
2024-04-29	Towards Image Synthesis with Photon Counting Stellar Intensity Interferometry	Alessia Spolon et.al.	2404.18507	translate	read	null
2024-04-29	Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology	Luzhe Huang et.al.	2404.18458	translate	read	null
2024-04-26	Federated Transfer Component Analysis Towards Effective VNF Profiling	Xunzheng ZhangB et.al.	2404.17553	translate	read	null
2024-04-26	Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement	Zishu Yao et.al.	2404.17400	translate	read	null
2024-04-26	Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection	Jiawei Song et.al.	2404.17254	translate	read	null
2024-04-26	ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion	Ziyue Zhang et.al.	2404.17230	translate	read	link
2024-04-26	DPGAN: A Dual-Path Generative Adversarial Network for Missing Data Imputation in Graphs	Xindi Zheng et.al.	2404.17164	translate	read	null
2024-04-26	An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder	Yicheng Gu et.al.	2404.17161	translate	read	null
2024-04-26	Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis	Shivangi Yadav et.al.	2404.17105	translate	read	null
2024-04-25	Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks	Yaqi Hu et.al.	2404.17069	translate	read	null
2024-04-25	DE-CGAN: Boosting rTMS Treatment Prediction with Diversity Enhancing Conditional Generative Adversarial Networks	Matthew Squires et.al.	2404.16913	translate	read	null
2024-04-25	REBEL: Reinforcement Learning via Regressing Relative Rewards	Zhaolin Gao et.al.	2404.16767	translate	read	null
2024-04-25	Denoising: from classical methods to deep CNNs	Jean-Eric Campagne et.al.	2404.16617	translate	read	link
2024-04-25	MuseumMaker: Continual Style Customization without Catastrophic Forgetting	Chenxi Liu et.al.	2404.16612	translate	read	null
2024-04-25	Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models	Parul Gupta et.al.	2404.16556	translate	read	null
2024-04-25	OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images	Ye Mao et.al.	2404.16538	translate	read	null
2024-04-25	Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series	Aimi Okabayashi et.al.	2404.16409	translate	read	link
2024-04-24	Guardians of the Quantum GAN	Archisman Ghosh et.al.	2404.16156	translate	read	null
2024-04-24	Quantitative Characterization of Retinal Features in Translated OCTA	Rashadul Hasan Badhon et.al.	2404.16133	translate	read	null
2024-04-24	Spinning solar jets explained through the interplay between plasma sheets and vortex columns	Sahel Dey et.al.	2404.16096	translate	read	null
2024-04-24	PuLID: Pure and Lightning ID Customization via Contrastive Alignment	Zinan Guo et.al.	2404.16022	translate	read	null
2024-04-24	Security Analysis of WiFi-based Sensing Systems: Threats from Perturbation Attacks	Hangcheng Cao et.al.	2404.15587	translate	read	null
2024-04-23	Multi-scale Intervention Planning based on Generative Design	Ioannis Kavouras et.al.	2404.15492	translate	read	null
2024-04-23	ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning	Weifeng Chen et.al.	2404.15449	translate	read	null
2024-04-23	GLoD: Composing Global Contexts and Local Details in Image Generation	Moyuru Yamada et.al.	2404.15447	translate	read	null
2024-04-23	From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation	Zehuan Huang et.al.	2404.15267	translate	read	null
2024-04-23	Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment	Tianwei Zhou et.al.	2404.15163	translate	read	null
2024-04-23	Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation	Xun Wu et.al.	2404.15100	translate	read	null
2024-04-23	CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields	Deheng Zhang et.al.	2404.14967	translate	read	null
2024-04-23	Music Style Transfer With Diffusion Model	Hong Huang et.al.	2404.14771	translate	read	null
2024-04-23	SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models	Bo Lin et.al.	2404.14755	translate	read	null
2024-04-23	Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning	Yuchao Liao et.al.	2404.14754	translate	read	null
2024-04-23	FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction	Hang Hua et.al.	2404.14715	translate	read	null
2024-04-22	The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking	Yuying Li et.al.	2404.14581	translate	read	null
2024-04-22	GeoDiffuser: Geometry-Based Image Editing with Diffusion Models	Rahul Sajnani et.al.	2404.14403	translate	read	null
2024-04-22	SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation	Yuying Ge et.al.	2404.14396	translate	read	link
2024-04-22	MultiBooth: Towards Generating All Your Concepts in an Image from Text	Chenyang Zhu et.al.	2404.14239	translate	read	link
2024-04-22	RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance	Chengrui Wang et.al.	2404.13984	translate	read	null
2024-04-23	Accelerating Image Generation with Sub-path Linear Approximation Model	Chen Xu et.al.	2404.13903	translate	read	null
2024-04-22	Towards Better Text-to-Image Generation Alignment via Attention Modulation	Yihang Wu et.al.	2404.13899	translate	read	null
2024-04-22	Regional Style and Color Transfer	Zhicheng Ding et.al.	2404.13880	translate	read	null
2024-04-22	Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning	Huan Bao et.al.	2404.13860	translate	read	null
2024-04-22	A Comparative Study on Enhancing Prediction in Social Network Advertisement through Data Augmentation	Qikai Yang et.al.	2404.13812	translate	read	null
2024-04-21	Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation	Jensen Hwa et.al.	2404.13798	translate	read	null
2024-04-19	RadRotator: 3D Rotation of Radiographs with Diffusion Models	Pouria Rouzrokh et.al.	2404.13000	translate	read	null
2024-04-19	Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images	Santosh et.al.	2404.12908	translate	read	link
2024-04-19	Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet	Gazi Hasin Ishrak et.al.	2404.12841	translate	read	null
2024-04-19	Generative Modelling with High-Order Langevin Dynamics	Ziqiang Shi et.al.	2404.12814	translate	read	null
2024-04-19	PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy	Zepeng Jiang et.al.	2404.12730	translate	read	null
2024-04-19	MLSD-GAN – Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement	Aravinda Reddy PN et.al.	2404.12679	translate	read	null
2024-04-19	How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples	Dren Fazlija et.al.	2404.12653	translate	read	null
2024-04-19	F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation	Man M. Ho et.al.	2404.12650	translate	read	null
2024-04-18	Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models	Israel A. Laurensi et.al.	2404.12260	translate	read	null
2024-04-18	First 2D electron density measurements using Coherence Imaging Spectroscopy in the MAST-U Super-X divertor	N. Lonigro et.al.	2404.12021	translate	read	null
2024-04-18	©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model	Chao Zhou et.al.	2404.11962	translate	read	null
2024-04-18	Sketch-guided Image Inpainting with Partial Discrete Diffusion Process	Nakul Sharma et.al.	2404.11949	translate	read	link
2024-04-18	LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights	Thibault Castells et.al.	2404.11936	translate	read	null
2024-04-18	EdgeFusion: On-Device Text-to-Image Generation	Thibault Castells et.al.	2404.11925	translate	read	null
2024-04-18	Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans	Lixing Tan et.al.	2404.11889	translate	read	null
2024-04-18	Generating synthetic electroretinogram waveforms using Artificial Intelligence to improve classification of retinal conditions in under-represented populations	Mikhail Kulyabin et.al.	2404.11842	translate	read	null
2024-04-18	TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation	Tianyi Liang et.al.	2404.11824	translate	read	null
2024-04-18	Tailoring Generative Adversarial Networks for Smooth Airfoil Design	Joyjit Chattoraj et.al.	2404.11816	translate	read	null
2024-04-17	On the Scalability of GNNs for Molecular Graphs	Maciej Sypetkowski et.al.	2404.11568	translate	read	null
2024-04-17	MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation	Kuan-Chieh et.al.	2404.11565	translate	read	null
2024-04-17	SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening	Yu Zhong et.al.	2404.11537	translate	read	null
2024-04-17	Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt	Zhanjie Zhang et.al.	2404.11474	translate	read	link
2024-04-17	What-if Analysis Framework for Digital Twins in 6G Wireless Network Management	Elif Ak et.al.	2404.11394	translate	read	null
2024-04-17	Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks	Eri Hosonuma et.al.	2404.11280	translate	read	null
2024-04-17	Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case	João Gabriel Vinholi et.al.	2404.11243	translate	read	null
2024-04-17	KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections	Chuheng Wei et.al.	2404.11181	translate	read	link
2024-04-17	TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing	Sherry X. Chen et.al.	2404.11120	translate	read	link
2024-04-17	Object Remover Performance Evaluation Methods using Class-wise Object Removal Images	Changsuk Oh et.al.	2404.11104	translate	read	null
2024-04-16	RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting	Ashkan Mirzaei et.al.	2404.10765	translate	read	null
2024-04-16	LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?	Yuchi Wang et.al.	2404.10763	translate	read	link
2024-04-16	AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation	Zexin Li et.al.	2404.10714	translate	read	null
2024-04-16	Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks	Florian Barthel et.al.	2404.10625	translate	read	null
2024-04-16	Adversarial Identity Injection for Semantic Face Image Synthesis	Giuseppe Tarollo et.al.	2404.10408	translate	read	null
2024-04-16	Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery	Payal Varshney et.al.	2404.10356	translate	read	null
2024-04-16	CanvasPic: An Interactive Tool for Freely Generating Facial Images Based on Spatial Layout	Jiafu Wei et.al.	2404.10352	translate	read	null
2024-04-16	OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model	Runyi Li et.al.	2404.10312	translate	read	null
2024-04-16	Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain	Steve Andreas Immanuel et.al.	2404.10307	translate	read	link
2024-04-16	OneActor: Consistent Character Generation via Cluster-Conditioned Guidance	Jiahao Wang et.al.	2404.10267	translate	read	null
2024-04-15	Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models	Ziwei Luo et.al.	2404.09732	translate	read	link
2024-04-15	VFLGAN: Vertical Federated Learning-based Generative Adversarial Network for Vertically Partitioned Data Publication	Xun Yuan et.al.	2404.09722	translate	read	null
2024-04-15	In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation	Han Xue et.al.	2404.09633	translate	read	null
2024-04-15	Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement	Chi Wang et.al.	2404.09540	translate	read	null
2024-04-15	Magic Clothing: Controllable Garment-Driven Image Synthesis	Weifeng Chen et.al.	2404.09512	translate	read	link
2024-04-15	Improved Object-Based Style Transfer with Single Deep Network	Harshmohan Kulkarni et.al.	2404.09461	translate	read	null
2024-04-15	Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models	Peifei Zhu et.al.	2404.09401	translate	read	null
2024-04-14	Counteracting Concept Drift by Learning with Future Malware Predictions	Branislav Bosansky et.al.	2404.09352	translate	read	null
2024-04-14	DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling	Xuening Yuan et.al.	2404.09227	translate	read	null
2024-04-13	InverseVis: Revealing the Hidden with Curved Sphere Tracing	Kai Lawonn et.al.	2404.09092	translate	read	null
2024-04-12	An improved tabular data generator with VAE-GMM integration	Patricia A. Apellániz et.al.	2404.08434	translate	read	null
2024-04-12	Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts	Yang Li et.al.	2404.08341	translate	read	link
2024-04-11	Latent Guard: a Safety Framework for Text-to-image Generation	Runtao Liu et.al.	2404.08031	translate	read	link
2024-04-11	Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models	Mazda Moayeri et.al.	2404.08030	translate	read	null
2024-04-11	OpenBias: Open-set Bias Detection in Text-to-Image Generative Models	Moreno D’Incà et.al.	2404.07990	translate	read	null
2024-04-11	Taming Stable Diffusion for Text to 360° Panorama Image Generation	Cheng Zhang et.al.	2404.07949	translate	read	link
2024-04-11	Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models – Technical Challenges and Implications for Monitoring and Verification	Tuong Vy Nguyen et.al.	2404.07754	translate	read	null
2024-04-11	Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models	Tuomas Kynkäänniemi et.al.	2404.07724	translate	read	null
2024-04-11	Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis	Marc Aubreville et.al.	2404.07676	translate	read	null
2024-04-11	Implicit and Explicit Language Guidance for Diffusion-based Visual Perception	Hefeng Wang et.al.	2404.07600	translate	read	null
2024-04-11	GAN-based iterative motion estimation in HASTE MRI	Mathias S. Feinler et.al.	2404.07576	translate	read	null
2024-04-11	ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation	Stanislav Frolov et.al.	2404.07564	translate	read	null
2024-04-11	CAT: Contrastive Adapter Training for Personalized Image Generation	Jae Wan Park et.al.	2404.07554	translate	read	link
2024-04-11	Enhancing Network Intrusion Detection Performance using Generative Adversarial Networks	Xinxing Zhao et.al.	2404.07464	translate	read	null
2024-04-10	RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion	Jaidev Shriram et.al.	2404.07199	translate	read	null
2024-04-10	A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks	Neel Mishra et.al.	2404.07172	translate	read	link
2024-04-10	Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model	Yijia Chen et.al.	2404.07072	translate	read	link
2024-04-10	Fine color guidance in diffusion models and its application to image compression at extremely low bitrates	Tom Bordin et.al.	2404.06865	translate	read	null
2024-04-10	UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion	Junsheng Zhou et.al.	2404.06851	translate	read	null
2024-04-10	Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer	Yanqi Ge et.al.	2404.06835	translate	read	null
2024-04-10	MedRG: Medical Report Grounding with Multi-modal Large Language Model	Ke Zou et.al.	2404.06798	translate	read	null
2024-04-10	CryinGAN: Design and evaluation of point-cloud-based generative adversarial networks using disordered materials $-$ application to Li$_3$ScCl$_6$-LiCoO$_2$ battery interfaces	Adrian Xiao Bin Yong et.al.	2404.06734	translate	read	null
2024-04-10	Deep Generative Data Assimilation in Multimodal Setting	Yongquan Qu et.al.	2404.06665	translate	read	link
2024-04-09	GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis	Srikumar Sastry et.al.	2404.06637	translate	read	link
2024-04-09	High Noise Scheduling is a Must	Mahmut S. Gokmen et.al.	2404.06353	translate	read	null
2024-04-09	Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures	Arkaprabha Basu et.al.	2404.06294	translate	read	null
2024-04-09	Hyperparameter-Free Medical Image Synthesis for Sharing Data and Improving Site-Specific Segmentation	Alexander Chebykin et.al.	2404.06240	translate	read	link
2024-04-09	DiffHarmony: Latent Diffusion Model Meets Image Harmonization	Pengfei Zhou et.al.	2404.06139	translate	read	null
2024-04-09	Greedy-DiM: Greedy Algorithms for Unreasonably Effective Face Morphs	Zander W. Blasingame et.al.	2404.06025	translate	read	null
2024-04-09	Boosting Digital Safeguards: Blending Cryptography and Steganography	Anamitra Maiti et.al.	2404.05985	translate	read	null
2024-04-09	Tackling Structural Hallucination in Image Translation with Local Diffusion	Seunghoi Kim et.al.	2404.05980	translate	read	null
2024-04-09	StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion	Ming Tao et.al.	2404.05979	translate	read	link
2024-04-09	Quantum Generative Adversarial Networks in a Silicon Photonic Chip with Maximum Expressibility	Haoran Ma et.al.	2404.05921	translate	read	null
2024-04-08	SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing	Jing Gu et.al.	2404.05717	translate	read	null
2024-04-08	Learning 3D-Aware GANs from Unposed Images with Template Feature Field	Xinya Chen et.al.	2404.05705	translate	read	null
2024-04-08	SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation	Heyuan Li et.al.	2404.05680	translate	read	null
2024-04-08	MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Kunpeng Song et.al.	2404.05674	translate	read	null
2024-04-08	Automatic Controllable Colorization via Imagination	Xiaoyan Cong et.al.	2404.05661	translate	read	null
2024-04-08	UniFL: Improve Stable Diffusion via Unified Feedback Learning	Jiacheng Zhang et.al.	2404.05595	translate	read	null
2024-04-08	Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI	Hugo Caselles-Dupré et.al.	2404.05468	translate	read	null
2024-04-08	CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery	Sai Bhargav Rongali et.al.	2404.05366	translate	read	null
2024-04-08	Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt	Zhiqi Huang et.al.	2404.05331	translate	read	null
2024-04-08	MC $^2$ : Multi-concept Guidance for Customized Multi-concept Generation	Jiaxiu Jiang et.al.	2404.05268	translate	read	null
2024-04-04	No “Zero-Shot” Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance	Vishaal Udandarao et.al.	2404.04125	translate	read	link
2024-04-05	3D Facial Expressions through Analysis-by-Neural-Synthesis	George Retsinas et.al.	2404.04104	translate	read	null
2024-04-05	Dynamic Prompt Optimizing for Text-to-Image Generation	Wenyi Mo et.al.	2404.04095	translate	read	link
2024-04-05	Physics-Inspired Synthesized Underwater Image Dataset	Reina Kaneko et.al.	2404.03998	translate	read	null
2024-04-05	Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models	Gihyun Kwon et.al.	2404.03913	translate	read	null
2024-04-04	RaFE: Generative Radiance Fields Restoration	Zhongkai Wu et.al.	2404.03654	translate	read	null
2024-04-04	CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching	Dongzhi Jiang et.al.	2404.03653	translate	read	link
2024-04-04	Reference-Based 3D-Aware Image Editing with Triplane	Bahri Batuhan Bilecen et.al.	2404.03632	translate	read	null
2024-04-04	Robust Concept Erasure Using Task Vectors	Minh Pham et.al.	2404.03631	translate	read	null
2024-04-04	Terrain Point Cloud Inpainting via Signal Decomposition	Yizhou Xie et.al.	2404.03572	translate	read	null
2024-04-04	Integrating Generative AI into Financial Market Prediction for Improved Decision Making	Chang Che et.al.	2404.03523	translate	read	null
2024-04-04	Knowledge Distillation-Based Model Extraction Attack using Private Counterfactual Explanations	Fatima Ezzeddine et.al.	2404.03348	translate	read	null
2024-04-04	Multi Positive Contrastive Learning with Pose-Consistent Generated Images	Sho Inayoshi et.al.	2404.03256	translate	read	null
2024-04-04	Would Deep Generative Models Amplify Bias in Future Models?	Tianwei Chen et.al.	2404.03242	translate	read	null
2024-04-04	Diverse and Tailored Image Generation for Zero-shot Multi-label Classification	Kaixin Zhang et.al.	2404.03144	translate	read	null
2024-04-03	Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction	Keyu Tian et.al.	2404.02905	translate	read	link
2024-04-03	MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment	Duygu Ceylan et.al.	2404.02899	translate	read	null
2024-04-03	On the Scalability of Diffusion-based Text-to-Image Generation	Hao Li et.al.	2404.02883	translate	read	null
2024-04-03	MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation	Petru-Daniel Tudosiu et.al.	2404.02790	translate	read	null
2024-04-03	InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation	Haofan Wang et.al.	2404.02733	translate	read	link
2024-04-03	Model-agnostic Origin Attribution of Generated Images with Few-shot Examples	Fengyuan Liu et.al.	2404.02697	translate	read	null
2024-04-03	Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition	Behrooz Razeghi et.al.	2404.02696	translate	read	null
2024-04-03	Severity Controlled Text-to-Image Generative Model Bias Manipulation	Jordan Vice et.al.	2404.02530	translate	read	null
2024-04-03	Designing a Photonic Physically Unclonable Function Having Resilience to Machine Learning Attacks	Elena R. Henderson et.al.	2404.02440	translate	read	null
2024-04-02	Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models	Zeyu Yang et.al.	2404.02148	translate	read	link
2024-04-02	3D Congealing: 3D-Aware Image Alignment in the Wild	Yunzhi Zhang et.al.	2404.02125	translate	read	null
2024-04-02	Red-Teaming Segment Anything Model	Krzysztof Jankowski et.al.	2404.02067	translate	read	link
2024-04-02	MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages	Daryna Dementieva et.al.	2404.02037	translate	read	null
2024-04-02	Enhancing Portfolio Optimization with Transformer-GAN Integration: A Novel Approach in the Black-Litterman Framework	Enmin Zhu et.al.	2404.02029	translate	read	null
2024-04-02	Bi-LORA: A Vision-Language Approach for Synthetic Image Detection	Mamadou Keita et.al.	2404.01959	translate	read	null
2024-04-02	Real, fake and synthetic faces – does the coin have three sides?	Shahzeb Naeem et.al.	2404.01878	translate	read	null
2024-04-02	Disentangled Pre-training for Human-Object Interaction Detection	Zhuolong Li et.al.	2404.01725	translate	read	null
2024-04-01	PlayFutures: Imagining Civic Futures with AI and Puppets	Supratim Pait et.al.	2404.01527	translate	read	null
2024-04-01	Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data	Matthias Gerstgrasser et.al.	2404.01413	translate	read	null
2024-04-01	Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting	Haipeng Liu et.al.	2403.19898	translate	read	link

(<a href=../Image_Generation.md>back to Image Generation</a>)