Image Generation - 2024-08 | Paper Arxiv Daily

Image Generation - 2024-08

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-08-30	Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution	Yixin Wu et.al.	2408.17285	translate	read	null
2024-08-30	VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers	Juncan Deng et.al.	2408.17131	translate	read	null
2024-08-30	FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition	Chen Hu et.al.	2408.17090	translate	read	link
2024-08-30	Text-to-Image Generation Via Energy-Based CLIP	Roy Ganz et.al.	2408.17046	translate	read	null
2024-08-30	AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding	Yonghui Wang et.al.	2408.16986	translate	read	link
2024-08-30	Contrastive Learning with Synthetic Positives	Dewen Zeng et.al.	2408.16965	translate	read	link
2024-08-29	GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content	Lebin Zhou et.al.	2408.16866	translate	read	null
2024-08-29	STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models	Koushik Srivatsan et.al.	2408.16807	translate	read	link
2024-08-29	CSGO: Content-Style Composition in Text-to-Image Generation	Peng Xing et.al.	2408.16766	translate	read	link
2024-08-29	GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models	Moreno D’Incà et.al.	2408.16700	translate	read	link
2024-08-29	RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model	Zhuan Shi et.al.	2408.16634	translate	read	null
2024-08-29	GRPose: Learning Graph Relations for Human Image Generation with Pose Priors	Xiangchen Yin et.al.	2408.16540	translate	read	null
2024-08-29	Spiking Diffusion Models	Jiahang Cao et.al.	2408.16467	translate	read	link
2024-08-29	ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding	Minghang Zheng et.al.	2408.16314	translate	read	link
2024-08-29	Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation	Yanghao Wang et.al.	2408.16266	translate	read	null
2024-08-29	Enhancing Conditional Image Generation with Explainable Latent Space Manipulation	Kshitij Pathania et.al.	2408.16232	translate	read	link
2024-08-29	Anchor-Controlled Generative Adversarial Network for High-Fidelity Electromagnetic and Structurally Diverse Metasurface Design	Yunhui Zeng et.al.	2408.16231	translate	read	null
2024-08-28	Simulating realistic short tandem repeat capillary electrophoretic signal using a generative adversarial network	Duncan Taylor et.al.	2408.16169	translate	read	null
2024-08-28	CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization	Feize Wu et.al.	2408.15914	translate	read	null
2024-08-28	Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data	Ayodeji Ijishakin et.al.	2408.15890	translate	read	null
2024-08-28	Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas	Fabio Quattrini et.al.	2408.15660	translate	read	link
2024-08-28	GANs Conditioning Methods: A Survey	Anis Bourou et.al.	2408.15640	translate	read	null
2024-08-28	Dissipation-driven quantum generative adversarial networks	He Wang et.al.	2408.15597	translate	read	null
2024-08-28	Hand1000: Generating Realistic Hands from Text with Only 1,000 Images	Haozhuo Zhang et.al.	2408.15461	translate	read	null
2024-08-28	Avoiding Generative Model Writer’s Block With Embedding Nudging	Ali Zand et.al.	2408.15450	translate	read	null
2024-08-27	Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment	Xuan Xu et.al.	2408.15218	translate	read	null
2024-08-27	Automatic 8-tissue Segmentation for 6-month Infant Brains	Yilan Dong et.al.	2408.15198	translate	read	null
2024-08-27	T-FAKE: Synthesizing Thermal Images for Facial Landmarking	Philipp Flotho et.al.	2408.15127	translate	read	link
2024-08-28	User-level Social Multimedia Traffic Anomaly Detection with Meta-Learning	Tongtong Feng et.al.	2408.14884	translate	read	null
2024-08-27	Alfie: Democratising RGBA Image Generation With No $$$	Fabio Quattrini et.al.	2408.14826	translate	read	link
2024-08-27	Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation	Abdelrahman Eldesokey et.al.	2408.14819	translate	read	null
2024-08-27	MaskCycleGAN-based Whisper to Normal Speech Conversion	K. Rohith Gupta et.al.	2408.14797	translate	read	null
2024-08-27	CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis	Weijia Li et.al.	2408.14765	translate	read	null
2024-08-27	Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation	Qiaoxin Li et.al.	2408.14754	translate	read	null
2024-08-27	Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation	Bochao Liu et.al.	2408.14738	translate	read	null
2024-08-26	GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy	Peiyan Li et.al.	2408.14368	translate	read	null
2024-08-26	ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty	Xindi Wu et.al.	2408.14339	translate	read	null
2024-08-26	Efficient Active Flow Control Strategy for Confined Square Cylinder Wake Using Deep Learning-Based Surrogate Model and Reinforcement Learning	Meng Zhang et.al.	2408.14232	translate	read	null
2024-08-26	Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models	Chaohua Shi et.al.	2408.14135	translate	read	null
2024-08-26	Rate-Distortion-Perception Controllable Joint Source-Channel Coding for High-Fidelity Generative Communications	Kailin Tan et.al.	2408.14127	translate	read	null
2024-08-25	Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems	Mohammad Hossein Amini et.al.	2408.13950	translate	read	null
2024-08-25	RT-Attack: Jailbreaking Text-to-Image Models via Random Token	Sensen Gao et.al.	2408.13896	translate	read	null
2024-08-25	Prior Learning in Introspective VAEs	Ioannis Athanasiadis et.al.	2408.13805	translate	read	null
2024-08-25	SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting	Wenrui Li et.al.	2408.13711	translate	read	link
2024-08-27	Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing	Yitong Yang et.al.	2408.13623	translate	read	null
2024-08-23	Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation	Bonan Li et.al.	2408.13149	translate	read	null
2024-08-23	G3FA: Geometry-guided GAN for Face Animation	Alireza Javanmardi et.al.	2408.13049	translate	read	null
2024-08-23	EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation	Cong Wang et.al.	2408.13005	translate	read	null
2024-08-23	What Do You Want? User-centric Prompt Generation for Text-to-image Synthesis via Multi-turn Guidance	Yilun Liu et.al.	2408.12910	translate	read	link
2024-08-22	Unlocking Intrinsic Fairness in Stable Diffusion	Eunji Kim et.al.	2408.12692	translate	read	null
2024-08-22	Enhancing Transferability of Adversarial Attacks with GE-AdvGAN+: A Comprehensive Framework for Gradient Editing	Zhibo Jin et.al.	2408.12673	translate	read	null
2024-08-22	Show-o: One Single Transformer to Unify Multimodal Understanding and Generation	Jinheng Xie et.al.	2408.12528	translate	read	link
2024-08-22	CODE: Confident Ordinary Differential Editing	Bastien van Delft et.al.	2408.12418	translate	read	link
2024-08-22	Dynamic Product Image Generation and Recommendation at Scale for Personalized E-commerce	Ádám Tibor Czapp et.al.	2408.12392	translate	read	null
2024-08-22	Scalable Autoregressive Image Generation with Mamba	Haopeng Li et.al.	2408.12245	translate	read	link
2024-08-22	MedDiT: A Knowledge-Controlled Diffusion Transformer Framework for Dynamic Medical Image Generation in Virtual Simulated Patient	Yanzeng Li et.al.	2408.12236	translate	read	null
2024-08-22	BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking	Hanzheng Wang et.al.	2408.12232	translate	read	null
2024-08-22	DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models	Wuchao Li et.al.	2408.12153	translate	read	null
2024-08-22	Query-Efficient Video Adversarial Attack with Stylized Logo	Duoxun Tang et.al.	2408.12099	translate	read	null
2024-08-22	High-Quality Data Augmentation for Low-Resource NMT: Combining a Translation Memory, a GAN Generator, and Filtering	Hengjie Liu et.al.	2408.12079	translate	read	null
2024-08-21	Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization	Tianyi Lin et.al.	2408.11974	translate	read	null
2024-08-21	Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models	Chun-Yen Shih et.al.	2408.11810	translate	read	link
2024-08-21	Approaching Deep Learning through the Spectral Dynamics of Weights	David Yunis et.al.	2408.11804	translate	read	link
2024-08-21	JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet	Yujia Gu et.al.	2408.11744	translate	read	null
2024-08-21	Iterative Object Count Optimization for Text-to-image Diffusion Models	Oz Zafar et.al.	2408.11721	translate	read	null
2024-08-21	FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting	Liyao Jiang et.al.	2408.11706	translate	read	link
2024-08-21	Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection	Jingwei Sun et.al.	2408.11408	translate	read	null
2024-08-21	Gender Bias Evaluation in Text-to-image Generation: A Survey	Yankun Wu et.al.	2408.11358	translate	read	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305	translate	read	link
2024-08-20	Compress Guidance in Conditional Diffusion Sampling	Anh-Dung Dinh et.al.	2408.11194	translate	read	null
2024-08-20	MS $^3$ D: A RG Flow-Based Regularization for GAN Training with Limited Data	Jian Wang et.al.	2408.11135	translate	read	null
2024-08-20	MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning	Haoning Wu et.al.	2408.11001	translate	read	link
2024-08-20	A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse	Zhongliang Guo et.al.	2408.10901	translate	read	link
2024-08-20	Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Color Imaging Improves Diabetic Retinopathy Stratification	Ruoyu Chen et.al.	2408.10636	translate	read	null
2024-08-20	TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles	Tong Wang et.al.	2408.10623	translate	read	null
2024-08-20	MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration	Yanbo Ding et.al.	2408.10605	translate	read	link
2024-08-20	Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models	Cong Wan et.al.	2408.10571	translate	read	null
2024-08-21	FAGStyle: Feature Augmentation on Geodesic Surface for Zero-shot Text-guided Diffusion Image Style Transfer	Yuexing Han et.al.	2408.10533	translate	read	null
2024-08-19	The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks	Niyar R Barman et.al.	2408.10446	translate	read	null
2024-08-19	Fashion Image-to-Image Translation for Complementary Item Retrieval	Matteo Attimonelli et.al.	2408.09847	translate	read	null
2024-08-19	Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation	Yunxin Li et.al.	2408.09787	translate	read	link
2024-08-19	TraDiffusion: Trajectory-Based Training-Free Image Generation	Mingrui Wu et.al.	2408.09739	translate	read	link
2024-08-19	Diff2CT: Diffusion Learning to Reconstruct Spine CT from Biplanar X-Rays	Zhi Qiao et.al.	2408.09731	translate	read	null
2024-08-19	GANPrompt: Enhancing Robustness in LLM-Based Recommendations with GAN-Enhanced Diversity Prompts	Xinyu Li et.al.	2408.09671	translate	read	null
2024-08-18	AnomalyFactory: Regard Anomaly Generation as Unsupervised Anomaly Localization	Ying Zhao et.al.	2408.09533	translate	read	null
2024-08-18	Deformation-aware GAN for Medical Image Synthesis with Substantially Misaligned Pairs	Bowen Xin et.al.	2408.09432	translate	read	null
2024-08-18	FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model	Ziyu Yao et.al.	2408.09384	translate	read	null
2024-08-17	Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration	Xin Lin et.al.	2408.09241	translate	read	link
2024-08-16	Fire Dynamic Vision: Image Segmentation and Tracking for Multi-Scale Fire and Plume Behavior	Daryn Sagel et.al.	2408.08984	translate	read	null
2024-08-16	PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future	Guangyi Wang et.al.	2408.08822	translate	read	null
2024-08-16	Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion	Sanchayan Vivekananthan et.al.	2408.08751	translate	read	null
2024-08-16	An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation	Peiming Guo et.al.	2408.08650	translate	read	null
2024-08-16	SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis	Xingyue Lin et.al.	2408.08623	translate	read	null
2024-08-16	Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness	Hefei Mei et.al.	2408.08502	translate	read	link
2024-08-16	TEXTOC: Text-driven Object-Centric Style Transfer	Jihun Park et.al.	2408.08461	translate	read	null
2024-08-15	JPEG-LM: LLMs as Image Generators with Canonical Codec Representations	Xiaochuang Han et.al.	2408.08459	translate	read	null
2024-08-15	Can Large Language Models Understand Symbolic Graphics Programs?	Zeju Qiu et.al.	2408.08313	translate	read	null
2024-08-15	Accelerated Image-Aware Generative Diffusion Modeling	Tanmay Asthana et.al.	2408.08306	translate	read	null
2024-08-15	Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding	Xiner Li et.al.	2408.08252	translate	read	link
2024-08-15	The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation	Arpan Mahara et.al.	2408.08216	translate	read	null
2024-08-15	Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Infer Causal Links Between Siamese Images	Zhiyuan Li et.al.	2408.08105	translate	read	link
2024-08-15	Single-image coherent reconstruction of objects and humans	Sarthak Batra et.al.	2408.08086	translate	read	null
2024-08-15	Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation	Seon-Hoon Kim et.al.	2408.07947	translate	read	null
2024-08-15	A Novel Generative Artificial Intelligence Method for Interference Study on Multiplex Brightfield Immunohistochemistry Images	Satarupa Mukherjee et.al.	2408.07860	translate	read	null
2024-08-14	Boosting Unconstrained Face Recognition with Targeted Style Adversary	Mohammad Saeed Ebrahimi Saadabadi et.al.	2408.07642	translate	read	null
2024-08-15	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	translate	read	null
2024-08-14	KIND: Knowledge Integration and Diversion in Diffusion Models	Yucheng Xie et.al.	2408.07337	translate	read	link
2024-08-14	GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models	Lei Kang et.al.	2408.07259	translate	read	link
2024-08-13	SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis	Yuchen Mao et.al.	2408.07196	translate	read	null
2024-08-13	Generative Photomontage	Sean J. Liu et.al.	2408.07116	translate	read	null
2024-08-14	Content and Style Aware Audio-Driven Facial Animation	Qingju Liu et.al.	2408.07005	translate	read	null
2024-08-13	SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis	Saptarshi Neil Sinha et.al.	2408.06975	translate	read	null
2024-08-13	VNet: A GAN-based Multi-Tier Discriminator Network for Speech Synthesis Vocoders	Yubing Cao et.al.	2408.06906	translate	read	null
2024-08-13	Definition of multispectral camera system parameters to model the asteroid 2001 SN263	Gabriela de Carvalho Assis Goulart et.al.	2408.06886	translate	read	null
2024-08-13	A Comprehensive Survey on Synthetic Infrared Image synthesis	Avinash Upadhyay et.al.	2408.06868	translate	read	null
2024-08-13	Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective	Ouxiang Li et.al.	2408.06741	translate	read	link
2024-08-13	DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion	Yujia Wu et.al.	2408.06740	translate	read	null
2024-08-13	DiffSG: A Generative Solver for Network Optimization with Diffusion Model	Ruihuai Liang et.al.	2408.06701	translate	read	null
2024-08-13	Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models	Chenqian Yan et.al.	2408.06646	translate	read	null
2024-08-12	Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers	Joshua Nathaniel Williams et.al.	2408.06502	translate	read	null
2024-08-12	Open-Source Molecular Processing Pipeline for Generating Molecules	Shreyas V et.al.	2408.06261	translate	read	null
2024-08-12	Deep Learning System Boundary Testing through Latent Space Style Mixing	Amr Abdellatif et.al.	2408.06258	translate	read	null
2024-08-12	An Analysis for Image-to-Image Translation and Style Transfer	Xiaoming Yu et.al.	2408.06000	translate	read	null
2024-08-12	A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Taehong Moon et.al.	2408.05927	translate	read	link
2024-08-11	Egocentric Vision Language Planning	Zhirui Fang et.al.	2408.05802	translate	read	null
2024-08-11	SSL: A Self-similarity Loss for Improving Generative Image Super-resolution	Du Chen et.al.	2408.05713	translate	read	null
2024-08-10	Generative Adversarial Networks for Solving Hand-Eye Calibration without Data Correspondence	Ilkwon Hong et.al.	2408.05613	translate	read	null
2024-08-10	ZePo: Zero-Shot Portrait Stylization with Faster Sampling	Jin Liu et.al.	2408.05492	translate	read	link
2024-08-10	Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE	Yiying Yang et.al.	2408.05477	translate	read	null
2024-08-10	Artworks Reimagined: Exploring Human-AI Co-Creation through Body Prompting	Jonas Oppenlaender et.al.	2408.05476	translate	read	null
2024-08-09	Instruction Tuning-free Visual Token Complement for Multimodal LLMs	Dongsheng Wang et.al.	2408.05019	translate	read	null
2024-08-09	DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting	Jihoon Lee et.al.	2408.04962	translate	read	null
2024-08-08	Deep Learning-based Unsupervised Domain Adaptation via a Unified Model for Prostate Lesion Detection Using Multisite Bi-parametric MRI Datasets	Hao Li et.al.	2408.04777	translate	read	null
2024-08-08	Zero-Shot Uncertainty Quantification using Diffusion Probabilistic Models	Dule Shu et.al.	2408.04718	translate	read	null
2024-08-08	Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations	Julen Urain et.al.	2408.04380	translate	read	null
2024-08-08	InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting	Xin-Yi Yu et.al.	2408.04249	translate	read	null
2024-08-08	Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance	Ahmad Arrabi et.al.	2408.04224	translate	read	link
2024-08-08	Artificial Intelligence based Approach for Identification and Mitigation of Cyber-Attacks in Wide-Area Control of Power Systems	Jishnudeep Kar et.al.	2408.04189	translate	read	null
2024-08-07	ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling	William Y. Zhu et.al.	2408.04102	translate	read	null
2024-08-07	Counterfactuals and Uncertainty-Based Explainable Paradigm for the Automated Detection and Segmentation of Renal Cysts in Computed Tomography Images: A Multi-Center Study	Zohaib Salahuddin et.al.	2408.03789	translate	read	null
2024-08-07	Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model	Guoqing Zhu et.al.	2408.03748	translate	read	link
2024-08-07	Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling	Zilyu Ye et.al.	2408.03695	translate	read	link
2024-08-07	Consumer Transactions Simulation through Generative Adversarial Networks	Sergiy Tkachuk et.al.	2408.03655	translate	read	null
2024-08-07	Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis	Zebin Yao et.al.	2408.03632	translate	read	link
2024-08-07	A comparative study of generative adversarial networks for image recognition algorithms based on deep learning and traditional methods	Yihao Zhong et.al.	2408.03568	translate	read	null
2024-08-07	Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning	Zi-Yi Dou et.al.	2408.03567	translate	read	null
2024-08-07	SLRQA: A Sparse Low-Rank Quaternion Model for Color Image Processing with Convergence Analysis	Zhanwang Deng et.al.	2408.03563	translate	read	null
2024-08-07	D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods	Onkar Susladkar et.al.	2408.03558	translate	read	link
2024-08-06	Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey	Vu Tuan Truong et.al.	2408.03400	translate	read	null
2024-08-06	IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts	Ciara Rowles et.al.	2408.03209	translate	read	null
2024-08-06	An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion	Xingguang Yan et.al.	2408.03178	translate	read	null
2024-08-06	Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models	Sho Ozaki et.al.	2408.03156	translate	read	null
2024-08-06	Multitask and Multimodal Neural Tuning for Large Models	Hao Sun et.al.	2408.03001	translate	read	null
2024-08-06	DreamLCM: Towards High-Quality Text-to-3D Generation via Latent Consistency Model	Yiming Zhong et.al.	2408.02993	translate	read	null
2024-08-06	A generative adversarial network for stellar core-collapse gravitational-waves	Tarin Eccleston et.al.	2408.02895	translate	read	null
2024-08-05	Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services	Shaopeng Fu et.al.	2408.02814	translate	read	null
2024-08-05	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Dongyang Liu et.al.	2408.02657	translate	read	null
2024-08-06	ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation	Jack Lu et.al.	2408.02226	translate	read	null
2024-08-05	Dense Feature Interaction Network for Image Inpainting Localization	Ye Yao et.al.	2408.02191	translate	read	null
2024-08-04	PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance	Aoming Liu et.al.	2408.02157	translate	read	null
2024-08-04	View-consistent Object Removal in Radiance Fields	Yiren Lu et.al.	2408.02100	translate	read	null
2024-08-04	LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation	Dwij Mehta et.al.	2408.02078	translate	read	null
2024-08-04	Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation	Jean Yu et.al.	2408.02054	translate	read	null
2024-08-04	Robustness of Watermarking on Text-to-Image Diffusion Models	Xiaodong Wu et.al.	2408.02035	translate	read	null
2024-08-03	Supervised Image Translation from Visible to Infrared Domain for Object Detection	Prahlad Anand et.al.	2408.01843	translate	read	null
2024-08-03	ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification	Mridula Vijendran et.al.	2408.01827	translate	read	null
2024-08-02	Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General Framework	Liuyuan Wen et.al.	2408.01284	translate	read	null
2024-08-02	VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling	Qian Zhang et.al.	2408.01181	translate	read	null
2024-08-02	PINNs for Medical Image Analysis: A Survey	Chayan Banerjee et.al.	2408.01026	translate	read	null
2024-08-02	EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts	Die Chen et.al.	2408.01014	translate	read	null
2024-08-02	FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation	Xiang Gao et.al.	2408.00998	translate	read	null
2024-08-01	Temporal Evolution of Knee Osteoarthritis: A Diffusion-based Morphing Model for X-ray Medical Image Synthesis	Zhe Wang et.al.	2408.00891	translate	read	null
2024-08-01	Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention	Susung Hong et.al.	2408.00760	translate	read	null
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	translate	read	null
2024-08-01	Modeling stochastic eye tracking data: A comparison of quantum generative adversarial networks and Markov models	Shailendra Bhandari et.al.	2408.00673	translate	read	null
2024-08-01	Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer	Michael Baur et.al.	2408.00634	translate	read	null
2024-08-01	A new approach for encoding code and assisting code understanding	Mengdan Fan et.al.	2408.00521	translate	read	null
2024-08-01	Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion	Manuel Kansy et.al.	2408.00458	translate	read	null
2024-08-01	Towards Reliable Advertising Image Generation Using Human Feedback	Zhenbang Du et.al.	2408.00418	translate	read	null
2024-08-01	DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving	Xuemeng Yang et.al.	2408.00415	translate	read	null
2024-08-01	Deepfake Media Forensics: State of the Art and Challenges Ahead	Irene Amerini et.al.	2408.00388	translate	read	null
2024-08-01	On the Limitations and Prospects of Machine Unlearning for Generative AI	Shiji Zhou et.al.	2408.00376	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)