Image Generation - 2024-05 | Paper Arxiv Daily

Image Generation - 2024-05

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-05-31	Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling	Jiatao Gu et.al.	2405.21048	translate	read	null
2024-05-31	You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet	Zhen Qin et.al.	2405.21022	translate	read	null
2024-05-31	Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging	Muhammad Muneeb Saad et.al.	2405.20987	translate	read	null
2024-05-31	Generative Adversarial Networks in Ultrasound Imaging: Extending Field of View Beyond Conventional Limits	Matej Gazda et.al.	2405.20981	translate	read	null
2024-05-31	Amortizing intractable inference in diffusion models for vision, language, and control	Siddarth Venkatraman et.al.	2405.20971	translate	read	link
2024-05-31	MegActor: Harness the Power of Raw Video for Vivid Portrait Animation	Shurong Yang et.al.	2405.20851	translate	read	link
2024-05-31	Multilingual Text Style Transfer: Datasets & Models for Indian Languages	Sourabrata Mukherjee et.al.	2405.20805	translate	read	null
2024-05-31	Information Theoretic Text-to-Image Alignment	Chao Wang et.al.	2405.20759	translate	read	null
2024-05-31	Diffusion Models Are Innate One-Step Generators	Bowen Zheng et.al.	2405.20750	translate	read	link
2024-05-31	GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning	Xiaoyun Gan et.al.	2405.20727	translate	read	null
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	translate	read	link
2024-05-30	ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	Massimo Bini et.al.	2405.20271	translate	read	link
2024-05-30	Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback	Sanghyeon Na et.al.	2405.20216	translate	read	null
2024-05-30	RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection	Zhiyuan He et.al.	2405.20112	translate	read	null
2024-05-30	RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection	Fangyi Chen et.al.	2405.19854	translate	read	null
2024-05-30	Puff-Net: Efficient Style Transfer with Pure Content and Style Feature Fusion Network	Sizhe Zheng et.al.	2405.19775	translate	read	null
2024-05-30	MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile	Wenshuo Yu et.al.	2405.19767	translate	read	null
2024-05-30	Mitigating annotation shift in cancer classification using single image generative models	Marta Buetas Arcas et.al.	2405.19754	translate	read	link
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	translate	read	null
2024-05-29	Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models	Venkat Venkatasubramanian et.al.	2405.19561	translate	read	null
2024-05-29	ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning	Ruchika Chavhan et.al.	2405.19237	translate	read	link
2024-05-29	Going beyond compositional generalization, DDPMs can produce zero-shot interpolation	Justin Deschenaux et.al.	2405.19201	translate	read	link
2024-05-29	The ethical situation of DALL-E 2	Eduard Hogea et.al.	2405.19176	translate	read	null
2024-05-29	Patch-enhanced Mask Encoder Prompt Image Generation	Shusong Xu et.al.	2405.19085	translate	read	null
2024-05-29	EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture	Jiaqi Xu et.al.	2405.18991	translate	read	link
2024-05-29	Topological Perspectives on Optimal Multimodal Embedding Spaces	Abdul Aziz A. B et.al.	2405.18867	translate	read	null
2024-05-29	Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching	Yasi Zhang et.al.	2405.18816	translate	read	null
2024-05-29	SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation	Zhenbei Wu et.al.	2405.18801	translate	read	null
2024-05-29	Inpaint Biases: A Pathway to Accurate and Unbiased Image Generation	Jiyoon Myung et.al.	2405.18762	translate	read	null
2024-05-29	SketchDeco: Decorating B&W Sketches with Colour	Chaitat Utintu et.al.	2405.18716	translate	read	null
2024-05-28	Phased Consistency Model	Fu-Yun Wang et.al.	2405.18407	translate	read	link
2024-05-28	Multi-modal Generation via Cross-Modal In-Context Learning	Amandeep Kumar et.al.	2405.18304	translate	read	link
2024-05-28	Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers?	Zebin You et.al.	2405.18029	translate	read	null
2024-05-28	Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection	Zhengji Li et.al.	2405.17905	translate	read	null
2024-05-27	RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance	Jiaojiao Fan et.al.	2405.17661	translate	read	null
2024-05-27	Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba	Jiahao Huang et.al.	2405.17659	translate	read	null
2024-05-27	EM-GANSim: Real-time and Accurate EM Simulation Using Conditional GANs for 3D Indoor Scenes	Ruichen Wang et.al.	2405.17366	translate	read	null
2024-05-27	Prompt Optimization with Human Feedback	Xiaoqiang Lin et.al.	2405.17346	translate	read	link
2024-05-27	From Text to Blueprint: Leveraging Text-to-Image Tools for Floor Plan Creation	Xiaoyu Li et.al.	2405.17236	translate	read	null
2024-05-27	MCGAN: Enhancing GAN Training with Regression-Based Generator Loss	Baoren Xiao et.al.	2405.17191	translate	read	null
2024-05-27	Training-free Editioning of Text-to-Image Models	Jinqi Wang et.al.	2405.17069	translate	read	null
2024-05-27	The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models	Saravanan Kandasamy et.al.	2405.17068	translate	read	null
2024-05-27	Glauber Generative Model: Discrete Diffusion Models via Binary Classification	Harshit Varma et.al.	2405.17035	translate	read	null
2024-05-27	A Correlation- and Mean-Aware Loss Function and Benchmarking Framework to Improve GAN-based Tabular Data Synthesis	Minh H. Vu et.al.	2405.16971	translate	read	null
2024-05-27	Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation	Liang Shi et.al.	2405.16895	translate	read	null
2024-05-27	Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks	Yunqi Zhang et.al.	2405.16860	translate	read	link
2024-05-24	Learning to Discretize Denoising Diffusion ODEs	Vinh Tong et.al.	2405.15506	translate	read	link
2024-05-24	A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence	Ali Kashefi et.al.	2405.15406	translate	read	null
2024-05-24	Stochastic SR for Gaussian microtextures	Emile Pierret et.al.	2405.15399	translate	read	null
2024-05-24	Challenges and Opportunities in 3D Content Generation	Ke Zhao et.al.	2405.15335	translate	read	null
2024-05-24	Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model	Mingyang Yi et.al.	2405.15330	translate	read	null
2024-05-24	SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance	Guibao Shen et.al.	2405.15321	translate	read	null
2024-05-24	Decaf: Data Distribution Decompose Attack against Federated Learning	Zhiyang Dai et.al.	2405.15316	translate	read	null
2024-05-24	Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient	Yongliang Wu et.al.	2405.15304	translate	read	null
2024-05-24	StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models	Chengming Xu et.al.	2405.15287	translate	read	null
2024-05-24	Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models	Yimeng Zhang et.al.	2405.15234	translate	read	link
2024-05-23	Improved Distribution Matching Distillation for Fast Image Synthesis	Tianwei Yin et.al.	2405.14867	translate	read	link
2024-05-23	Semantica: An Adaptable Image-Conditioned Diffusion Model	Manoj Kumar et.al.	2405.14857	translate	read	null
2024-05-23	TerDiT: Ternary Diffusion Models with Transformers	Xudong Lu et.al.	2405.14854	translate	read	link
2024-05-23	Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models	Katherine Xu et.al.	2405.14828	translate	read	null
2024-05-24	Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation	Hongxu Jiang et.al.	2405.14802	translate	read	null
2024-05-23	Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy	Shengfang Zhai et.al.	2405.14800	translate	read	link
2024-05-23	RetAssist: Facilitating Vocabulary Learners with Generative Images in Story Retelling Practices	Qiaoyi Chen et.al.	2405.14794	translate	read	null
2024-05-23	OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance	Shuheng Ge et.al.	2405.14709	translate	read	null
2024-05-23	Learning Multi-dimensional Human Preference for Text-to-Image Generation	Sixian Zhang et.al.	2405.14705	translate	read	null
2024-05-23	RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance	Zhicheng Sun et.al.	2405.14677	translate	read	link
2024-05-21	Personalized Residuals for Concept-Driven Text-to-Image Generation	Cusuh Ham et.al.	2405.12978	translate	read	null
2024-05-21	An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation	Zhiyu Tan et.al.	2405.12914	translate	read	link
2024-05-21	Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image	Zerui Zhang et.al.	2405.12872	translate	read	null
2024-05-21	A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability	Li-Yang Tseng et.al.	2405.12847	translate	read	link
2024-05-21	Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations	Antoine Legrand et.al.	2405.12728	translate	read	null
2024-05-21	CustomText: Customized Textual Image Generation using Diffusion Models	Shubham Paliwal et.al.	2405.12531	translate	read	null
2024-05-20	Diffusion for World Modeling: Visual Details Matter in Atari	Eloi Alonso et.al.	2405.12399	translate	read	link
2024-05-20	Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI	Di Xu et.al.	2405.12357	translate	read	null
2024-05-20	EGAN: Evolutional GAN for Ransomware Evasion	Daniel Commey et.al.	2405.12266	translate	read	null
2024-05-20	Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices	Nathaniel Cohen et.al.	2405.12211	translate	read	link
2024-05-20	Diffusion Models for Generating Ballistic Spacecraft Trajectories	Tyler Presser et.al.	2405.11738	translate	read	null
2024-05-19	URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images	Zoey Chen et.al.	2405.11656	translate	read	null
2024-05-19	Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation	Sangyeop Yeo et.al.	2405.11614	translate	read	null
2024-05-19	A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure	Wei Sun et.al.	2405.11440	translate	read	null
2024-05-18	UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers	Duo Peng et.al.	2405.11336	translate	read	null
2024-05-18	On the Trajectory Regularity of ODE-based Diffusion Sampling	Defang Chen et.al.	2405.11326	translate	read	null
2024-05-18	Few-Shot API Attack Detection: Overcoming Data Scarcity with GAN-Inspired Learning	Udi Aharon et.al.	2405.11258	translate	read	null
2024-05-18	TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image Generation	Chengcheng Feng et.al.	2405.11236	translate	read	null
2024-05-17	Improving face generation quality and prompt following with synthetic captions	Michail Tarasiou et.al.	2405.10864	translate	read	null
2024-05-17	Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image	Jianshun Zeng et.al.	2405.10504	translate	read	null
2024-05-17	Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers	Rya Sanovar et.al.	2405.10480	translate	read	null
2024-05-16	Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model	Zheng Gu et.al.	2405.10316	translate	read	null
2024-05-16	UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models	Sahel Sharifymoghaddam et.al.	2405.10311	translate	read	link
2024-05-16	VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing	Binghui Chen et.al.	2405.09985	translate	read	null
2024-05-16	KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment	Zhengxu Shi et.al.	2405.09964	translate	read	null
2024-05-16	Chameleon: Mixed-Modal Early-Fusion Foundation Models	Chameleon Team et.al.	2405.09818	translate	read	link
2024-05-16	MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis	Joseph Cho et.al.	2405.09806	translate	read	null
2024-05-16	An Autoencoder and Generative Adversarial Networks Approach for Multi-Omics Data Imbalanced Class Handling and Classification	Ibrahim Al-Hurani et.al.	2405.09756	translate	read	null
2024-05-15	Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer	Weifei Jin et.al.	2405.09470	translate	read	null
2024-05-16	Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images	Memoona Aziz et.al.	2405.09426	translate	read	null
2024-05-15	DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations	Nima Fathi et.al.	2405.09288	translate	read	link
2024-05-15	SOEDiff: Efficient Distillation for Small Object Editing	Qihe Pan et.al.	2405.09114	translate	read	null
2024-05-15	Deep Learning in Earthquake Engineering: A Comprehensive Review	Yazhou Xie et.al.	2405.09021	translate	read	null
2024-05-14	Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding	Zhimin Li et.al.	2405.08748	translate	read	link
2024-05-15	Similarity Metrics for MR Image-To-Image Translation	Melanie Dohmen et.al.	2405.08431	translate	read	null
2024-05-14	Compositional Text-to-Image Generation with Dense Blob Representations	Weili Nie et.al.	2405.08246	translate	read	null
2024-05-13	RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations	Chengde Lin et.al.	2405.08114	translate	read	link
2024-05-13	CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models	Nick Stracke et.al.	2405.07913	translate	read	null
2024-05-13	SAR Image Synthesis with Diffusion Models	Denisa Qosja et.al.	2405.07776	translate	read	null
2024-05-12	Semantic Loss Functions for Neuro-Symbolic Structured Prediction	Kareem Ahmed et.al.	2405.07387	translate	read	null
2024-05-12	Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning	Jiarui Wang et.al.	2405.07346	translate	read	link
2024-05-12	PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification	Mohammad Shafiul Alam et.al.	2405.07332	translate	read	link
2024-05-12	Stable Signature is Unstable: Removing Image Watermark from Diffusion Models	Yuepeng Hu et.al.	2405.07145	translate	read	null
2024-05-12	MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping	Mingyue Yuan et.al.	2405.07131	translate	read	null
2024-05-11	Unsupervised Density Neural Representation for CT Metal Artifact Reduction	Qing Wu et.al.	2405.07047	translate	read	null
2024-05-11	Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior	Ce Wang et.al.	2405.07044	translate	read	link
2024-05-11	Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation	Shengyuan Liu et.al.	2405.06948	translate	read	null
2024-05-10	Controllable Image Generation With Composed Parallel Token Prediction	Jamie Stirling et.al.	2405.06535	translate	read	null
2024-05-10	SketchDream: Sketch-based Text-to-3D Generation and Editing	Feng-Lin Liu et.al.	2405.06461	translate	read	null
2024-05-09	Photonic quantum generative adversarial networks for classical data	Tigran Sedrakyan et.al.	2405.06023	translate	read	null
2024-05-09	Frame Interpolation with Consecutive Brownian Bridge Diffusion	Zonglin Lyu et.al.	2405.05953	translate	read	link
2024-05-09	Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models	Zhe Ma et.al.	2405.05846	translate	read	null
2024-05-10	MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation	Yuxiang Wei et.al.	2405.05806	translate	read	link
2024-05-09	Exploring Text-Guided Single Image Editing for Remote Sensing Images	Fangzhou Han et.al.	2405.05769	translate	read	null
2024-05-09	End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base	Shuling Li et.al.	2405.05738	translate	read	null
2024-05-09	VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis	Zhihan Ju et.al.	2405.05667	translate	read	null
2024-05-09	A Survey on Personalized Content Synthesis with Diffusion Models	Xulu Zhang et.al.	2405.05538	translate	read	null
2024-05-09	Characteristic Learning for Provable One Step Generation	Zhao Ding et.al.	2405.05512	translate	read	link
2024-05-08	Cross-Modality Translation with Generative Adversarial Networks to Unveil Alzheimer’s Disease Biomarkers	Reihaneh Hassanzadeh et.al.	2405.05462	translate	read	null
2024-05-08	DrawL: Understanding the Effects of Non-Mainstream Dialects in Prompted Image Generation	Joshua N. Williams et.al.	2405.05382	translate	read	null
2024-05-08	Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo	Nayantara Mudur et.al.	2405.05255	translate	read	link
2024-05-08	StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer	Zijia Wang et.al.	2405.05027	translate	read	null
2024-05-08	Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI	Keqiang Fan et.al.	2405.04974	translate	read	null
2024-05-08	Improving Long Text Understanding with Knowledge Distilled from Summarization Model	Yan Liu et.al.	2405.04955	translate	read	null
2024-05-08	HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis	Zhihan Ju et.al.	2405.04902	translate	read	null
2024-05-08	FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation	Xuehai He et.al.	2405.04834	translate	read	null
2024-05-07	TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model	Yongming Zhang et.al.	2405.04675	translate	read	null
2024-05-07	ResNCT: A Deep Learning Model for the Synthesis of Nephrographic Phase Images in CT Urography	Syed Jamal Safdar Gardezi et.al.	2405.04629	translate	read	null
2024-05-07	SingIt! Singer Voice Transformation	Amit Eliav et.al.	2405.04627	translate	read	null
2024-05-07	Towards Geographic Inclusion in the Evaluation of Text-to-Image Models	Melissa Hall et.al.	2405.04457	translate	read	null
2024-05-07	Data augmentation experiments with style-based quantum generative adversarial networks on trapped-ion and superconducting-qubit technologies	Julien Baglio et.al.	2405.04401	translate	read	null
2024-05-07	Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation	Jihyun Kim et.al.	2405.04356	translate	read	null
2024-05-07	Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer	Zhuoyi Yang et.al.	2405.04312	translate	read	link
2024-05-07	Improving Offline Reinforcement Learning with Inaccurate Simulators	Yiwen Hou et.al.	2405.04307	translate	read	null
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290	translate	read	null
2024-05-07	Bidirectional Adversarial Autoencoders for the design of Plasmonic Metasurfaces	Yuansan Liu et.al.	2405.04056	translate	read	link
2024-05-07	Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model	Joo Young Choi et.al.	2405.03958	translate	read	null
2024-05-06	Generated Contents Enrichment	Mahdi Naseri et.al.	2405.03650	translate	read	null
2024-05-06	CCDM: Continuous Conditional Diffusion Models for Image Generation	Xin Ding et.al.	2405.03546	translate	read	link
2024-05-06	GLIP: Electromagnetic Field Exposure Map Completion by Deep Generative Networks	Mohammed Mallik et.al.	2405.03384	translate	read	null
2024-05-05	AnoGAN for Tabular Data: A Novel Approach to Anomaly Detection	Aditya Singh et.al.	2405.03075	translate	read	null
2024-05-05	Boundary-aware Decoupled Flow Networks for Realistic Extreme Rescaling	Jinmin Li et.al.	2405.02941	translate	read	null
2024-05-05	Data-Efficient Molecular Generation with Hierarchical Textual Inversion	Seojin Kim et.al.	2405.02845	translate	read	null
2024-05-05	SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion	Ziyun Qian et.al.	2405.02844	translate	read	null
2024-05-05	ImageInWords: Unlocking Hyper-Detailed Image Descriptions	Roopal Garg et.al.	2405.02793	translate	read	link
2024-05-04	U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers	Yuchuan Tian et.al.	2405.02730	translate	read	null
2024-05-03	Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI	Minhui Yu et.al.	2405.02504	translate	read	null
2024-05-03	Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification	Siqi Yin et.al.	2405.02155	translate	read	null
2024-05-03	Reconstructing the mid-infrared spectra of galaxies using ultraviolet to submillimeter photometry and Deep Generative Networks	Agapi Rissaki et.al.	2405.02153	translate	read	null
2024-05-03	Three-Dimensional Amyloid-Beta PET Synthesis from Structural MRI with Conditional Generative Adversarial Networks	Fernando Vega et.al.	2405.02109	translate	read	null
2024-05-03	AI-generated art perceptions with GenFrame – an image-generating picture frame	Peter Kun et.al.	2405.01901	translate	read	null
2024-05-03	Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition	Yichun Tai et.al.	2405.01872	translate	read	null
2024-05-03	Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics	Rucha Deshpande et.al.	2405.01822	translate	read	null
2024-05-02	Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning	Rafael Elberg et.al.	2405.01705	translate	read	link
2024-05-02	Investigation on optimal microstructure of dual-phase steel with high strength and ductility by machine learning	Misato Suzuki et.al.	2405.01689	translate	read	null
2024-05-02	Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance	Kelvin C. K. Chan et.al.	2405.01356	translate	read	null
2024-05-02	Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration	Praveen Kumar Chandaliya et.al.	2405.01273	translate	read	null
2024-05-02	DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines	Ye Tian et.al.	2405.01248	translate	read	null
2024-05-02	On Mechanistic Knowledge Localization in Text-to-Image Generative Models	Samyadeep Basu et.al.	2405.01008	translate	read	null
2024-05-01	SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models	Burak Can Biner et.al.	2405.00878	translate	read	null
2024-05-01	Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers	Palawat Busaranuvong et.al.	2405.00858	translate	read	null
2024-05-01	RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models	Zheng Zeng et.al.	2405.00666	translate	read	null
2024-05-01	UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement	Ruiquan Ge et.al.	2405.00542	translate	read	link
2024-05-01	Compressive Sensing Imaging Using Caustic Lens Mask Generated by Periodic Perturbation in a Ripple Tank	Doğan Tunca Arık et.al.	2405.00407	translate	read	null
2024-05-01	Beamforming Inferring by Conditional WGAN-GP for Holographic Antenna Arrays	Fenghao Zhu et.al.	2405.00391	translate	read	null
2024-05-01	Streamlining Image Editing with Layered Diffusion Brushes	Peyman Gholami et.al.	2405.00313	translate	read	null
2024-05-01	Mapping New Realities: Ground Truth Image Creation with Pix2Pix Image-to-Image Translation	Zhenglin Li et.al.	2404.19265	translate	read	null
2024-05-01	FOTS: A Fast Optical Tactile Simulator for Sim2Real Learning of Tactile-motor Robot Manipulation Skills	Yongqiang Zhao et.al.	2404.19217	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)