Image Generation - 2025-01 | Paper Arxiv Daily

Image Generation - 2025-01

Publish Date	Title	Authors	PDF	Translate	Read	Code
2025-01-31	Application of Generative Adversarial Network (GAN) for Synthetic Training Data Creation to improve performance of ANN Classifier for extracting Built-Up pixels from Landsat Satellite Imagery	Amritendu Mukherjee et.al.	2501.19283	translate	read	null
2025-01-31	Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data	Xichen Xu et.al.	2501.19094	translate	read	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	translate	read	link
2025-01-31	BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics	Yuxuan Liu et.al.	2501.18972	translate	read	null
2025-01-31	Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models	Jaesin Ahn et.al.	2501.18877	translate	read	link
2025-01-31	REG: Rectified Gradient Guidance for Conditional Diffusion Models	Zhengqi Gao et.al.	2501.18865	translate	read	null
2025-01-30	High-Accuracy ECG Image Interpretation using Parameter-Efficient LoRA Fine-Tuning with Multimodal LLaMA 3.2	Nandakishor M et.al.	2501.18670	translate	read	null
2025-01-30	Diffusion Autoencoders are Scalable Image Tokenizers	Yinbo Chen et.al.	2501.18593	translate	read	null
2025-01-30	CGAN-Based Framework for Meson Mass and Width Prediction	S. Rostami et.al.	2501.18562	translate	read	null
2025-01-30	SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer	Enze Xie et.al.	2501.18427	translate	read	null
2025-01-30	MatIR: A Hybrid Mamba-Transformer Image Restoration Model	Juan Wen et.al.	2501.18401	translate	read	null
2025-01-30	Simulation of microstructures and machine learning	Katja Schladitz et.al.	2501.18313	translate	read	null
2025-01-30	LLMs can see and hear without any training	Kumar Ashutosh et.al.	2501.18096	translate	read	link
2025-01-29	Generative AI for Vision: A Comprehensive Study of Frameworks and Applications	Fouad Bousetouane et.al.	2501.18033	translate	read	null
2025-01-29	Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling	Xiaokang Chen et.al.	2501.17811	translate	read	link
2025-01-29	A Framework for Generating Realistic Synthetic Tabular Data in a Randomized Controlled Trial Setting	Niki Z. Petrakos et.al.	2501.17719	translate	read	null
2025-01-29	Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment	Zixue Zeng et.al.	2501.17690	translate	read	null
2025-01-29	Trustworthy image-to-image translation: evaluating uncertainty calibration in unpaired training scenarios	Ciaran Bench et.al.	2501.17570	translate	read	null
2025-01-28	Text-to-Image Generation for Vocabulary Learning Using the Keyword Method	Nuwan T. Attygalle et.al.	2501.17099	translate	read	null
2025-01-28	MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction	Shreyam Gupta et.al.	2501.16997	translate	read	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	translate	read	link
2025-01-29	Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion	Shengyuan Liu et.al.	2501.16679	translate	read	link
2025-01-28	Variational Schrödinger Momentum Diffusion	Kevin Rojas et.al.	2501.16675	translate	read	null
2025-01-27	LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation	Farzad Farhadzadeh et.al.	2501.16559	translate	read	null
2025-01-27	Towards Robust Stability Prediction in Smart Grids: GAN-based Approach under Data Constraints and Adversarial Challenges	Emad Efatinasab et.al.	2501.16490	translate	read	null
2025-01-27	RelightVid: Temporal-Consistent Diffusion Model for Video Relighting	Ye Fang et.al.	2501.16330	translate	read	link
2025-01-27	MetaDecorator: Generating Immersive Virtual Tours through Multimodality	Shuang Xie et.al.	2501.16164	translate	read	null
2025-01-27	Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki	Vanja Falck et.al.	2501.16080	translate	read	null
2025-01-27	Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation	Adil Kaan Akan et.al.	2501.15878	translate	read	null
2025-01-27	Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?	Daniel Panangian et.al.	2501.15847	translate	read	null
2025-01-27	Autonomous Horizon-based Asteroid Navigation With Observability-constrained Maneuvers	Aditya Arjun Anibha et.al.	2501.15806	translate	read	null
2025-01-27	Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?	Yunbo Lyu et.al.	2501.15775	translate	read	null
2025-01-26	Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting	Yuxin Zhang et.al.	2501.15641	translate	read	link
2025-01-26	Comparative clinical evaluation of “memory-efficient” synthetic 3d generative adversarial networks (gan) head-to-head to state of art: results on computed tomography of the chest	Mahshid shiri et.al.	2501.15572	translate	read	null
2025-01-26	SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity	Zichen Fan et.al.	2501.15448	translate	read	null
2025-01-24	Towards Scalable Topological Regularizers	Hiu-Tung Wong et.al.	2501.14641	translate	read	null
2025-01-24	Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*	Ludovica Schaerf et.al.	2501.14524	translate	read	null
2025-01-24	PAID: A Framework of Product-Centric Advertising Image Design	Hongyu Chen et.al.	2501.14316	translate	read	null
2025-01-24	CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image	Xiaojun Tang et.al.	2501.14264	translate	read	null
2025-01-24	VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking	Runyi Hu et.al.	2501.14195	translate	read	link
2025-01-24	Fully Guided Neural Schrödinger bridge for Brain MR image synthesis	Hanyeol Yang et.al.	2501.14171	translate	read	null
2025-01-23	LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps	Andrey Palaev et.al.	2501.14046	translate	read	link
2025-01-23	Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step	Ziyu Guo et.al.	2501.13926	translate	read	link
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	translate	read	link
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	translate	read	null
2025-01-23	Generating Realistic Forehead-Creases for User Verification via Conditioned Piecewise Polynomial Curves	Abhishek Tandon et.al.	2501.13889	translate	read	link
2025-01-23	Everyone-Can-Sing: Zero-Shot Singing Voice Synthesis and Conversion with Speech Reference	Shuqi Dai et.al.	2501.13870	translate	read	null
2025-01-23	The Lock Generative Adversarial Network for Medical Waveform Anomaly Detection	Wenjie Xu et.al.	2501.13858	translate	read	null
2025-01-23	Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing	Hao Zhang et.al.	2501.13831	translate	read	null
2025-01-23	PhotoGAN: Generative Adversarial Neural Network Acceleration with Silicon Photonics	Tharini Suresh et.al.	2501.13828	translate	read	null
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	translate	read	link
2025-01-23	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	translate	read	link
2025-01-22	Accelerate High-Quality Diffusion Models with Inner Loop Feedback	Matthew Gwilliam et.al.	2501.13107	translate	read	null
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	translate	read	null
2025-01-22	On the Use of WGANs for Super Resolution in Dark-Matter Simulations	John Brennan et.al.	2501.13056	translate	read	null
2025-01-22	LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jiahao Wang et.al.	2501.12976	translate	read	null
2025-01-22	PreciseCam: Precise Camera Control for Text-to-Image Generation	Edurne Bernal-Berdun et.al.	2501.12910	translate	read	null
2025-01-22	Certified Guidance for Planning with Deep Generative Models	Francesco Giacomarra et.al.	2501.12815	translate	read	null
2025-01-22	T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation	Lijun Li et.al.	2501.12612	translate	read	link
2025-01-21	Bidirectional Brain Image Translation using Transfer Learning from Generic Pre-trained Models	Fatima Haimour et.al.	2501.12488	translate	read	null
2025-01-22	GPS as a Control Signal for Image Generation	Chao Feng et.al.	2501.12390	translate	read	null
2025-01-21	Parallel Sequence Modeling via Generalized Spatial Propagation Network	Hongjun Wang et.al.	2501.12381	translate	read	null
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	translate	read	link
2025-01-21	VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Xianwei Zhuang et.al.	2501.12327	translate	read	link
2025-01-21	Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement	Christoph Gebhardt et.al.	2501.12289	translate	read	null
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	translate	read	null
2025-01-21	ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions	Shiyue Zhang et.al.	2501.12173	translate	read	link
2025-01-20	Are generative models fair? A study of racial bias in dermatological image generation	Miguel López-Pérez et.al.	2501.11752	translate	read	null
2025-01-20	StAyaL	Multilingual Style Transfer	Karishma Thakrar et.al.	2501.11639	translate	read	null
2025-01-20	StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer	Ruojun Xu et.al.	2501.11319	translate	read	null
2025-01-17	Credit Risk Identification in Supply Chains Using Generative Adversarial Networks	Zizhou Zhang et.al.	2501.10348	translate	read	null
2025-01-17	DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency	Xiaohui Li et.al.	2501.10110	translate	read	null
2025-01-17	HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution	Shengkui Zhao et.al.	2501.10045	translate	read	link
2025-01-17	Spatiotemporal Prediction of Secondary Crashes by Rebalancing Dynamic and Static Data with Generative Adversarial Networks	Junlan Chen et.al.	2501.10041	translate	read	null
2025-01-17	Physics-informed DeepCT: Sinogram Wavelet Decomposition Meets Masked Diffusion	Zekun Zhou et.al.	2501.09935	translate	read	null
2025-01-17	IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment	Shangkun Sun et.al.	2501.09927	translate	read	null
2025-01-16	PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery	Shristi Das Biswas et.al.	2501.09826	translate	read	link
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	translate	read	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	translate	read	null
2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	translate	read	link
2025-01-16	Mining the time axis with TRON. I. Millisecond pulsars in Omega Centauri, Terzan 5 and 47 Tucanae detected through MeerKAT interferometric imaging	Oleg M. Smirnov et.al.	2501.09488	translate	read	null
2025-01-16	Dynamic Neural Style Transfer for Artistic Image Generation using VGG19	Kapil Kashyap et.al.	2501.09420	translate	read	null
2025-01-16	SVIA: A Street View Image Anonymization Framework for Self-Driving Applications	Dongyu Liu et.al.	2501.09393	translate	read	null
2025-01-16	Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse	Guangyuan Liu et.al.	2501.09391	translate	read	null
2025-01-16	SEAL: Entangled White-box Watermarks on Low-Rank Adaptation	Giyeong Oh et.al.	2501.09284	translate	read	link
2025-01-15	Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation	Ahmad Süleyman et.al.	2501.09194	translate	read	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	translate	read	link
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	translate	read	link
2025-01-15	Multimodal LLMs Can Reason about Aesthetics in Zero-Shot	Ruixiang Jiang et.al.	2501.09012	translate	read	link
2025-01-15	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	translate	read	link
2025-01-15	Enhanced Multi-Scale Cross-Attention for Person Image Generation	Hao Tang et.al.	2501.08900	translate	read	null
2025-01-15	XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework	Sida Tian et.al.	2501.08809	translate	read	null
2025-01-15	Investigating Parameter-Efficiency of Hybrid QuGANs Based on Geometric Properties of Generated Sea Route Graphs	Tobias Rohe et.al.	2501.08678	translate	read	null
2025-01-15	StereoGen: High-quality Stereo Image Generation from a Single Image	Xianqi Wang et.al.	2501.08654	translate	read	link
2025-01-15	Joint Learning of Depth and Appearance for Portrait Image Animation	Xinya Ji et.al.	2501.08649	translate	read	null
2025-01-15	Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT)	Krishna Panthi et.al.	2501.08604	translate	read	null
2025-01-15	Stability and convergence of relaxed scalar auxiliary variable schemes for Cahn-Hilliard systems with bounded mass source	Kei Fong Lam et.al.	2501.08543	translate	read	null
2025-01-14	D $^2$ -DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	translate	read	link
2025-01-14	Benchmarking Multimodal Models for Fine-Grained Image Analysis: A Comparative Study Across Diverse Visual Features	Evgenii Evstafev et.al.	2501.08170	translate	read	null
2025-01-14	Prediction Interval Construction Method for Electricity Prices	Xin Lu et.al.	2501.07827	translate	read	null
2025-01-14	On the Statistical Capacity of Deep Generative Models	Edric Tam et.al.	2501.07763	translate	read	link
2025-01-13	Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens	Dongwon Kim et.al.	2501.07730	translate	read	null
2025-01-13	Pedestrian Trajectory Prediction Based on Social Interactions Learning With Random Weights	Jiajia Xie et.al.	2501.07711	translate	read	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	translate	read	null
2025-01-13	Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models	Yongyu Mu et.al.	2501.07086	translate	read	link
2025-01-13	Enhancing Image Generation Fidelity via Progressive Prompts	Zhen Xiong et.al.	2501.07070	translate	read	link
2025-01-13	SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation	Yee-Fan Tan et.al.	2501.07055	translate	read	null
2025-01-13	Detection of AI Deepfake and Fraud in Online Payments Using GAN-Based Models	Zong Ke et.al.	2501.07033	translate	read	null
2025-01-12	Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy	Evgeny Ugolkov et.al.	2501.06939	translate	read	link
2025-01-12	Defect Detection Network In PCB Circuit Devices Based on GAN Enhanced YOLOv11	Jiayi Huang et.al.	2501.06879	translate	read	null
2025-01-12	SuperNeRF-GAN: A Universal 3D-Consistent Super-Resolution Framework for Efficient and Enhanced 3D-Aware Image Synthesis	Peng Zheng et.al.	2501.06770	translate	read	null
2025-01-12	ODPG: Outfitting Diffusion with Pose Guided Condition	Seohyun Lee et.al.	2501.06769	translate	read	null
2025-01-12	Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models	Michael Toker et.al.	2501.06751	translate	read	null
2025-01-10	Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models	Sofia Jamil et.al.	2501.05839	translate	read	link
2025-01-10	EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model	Yi He et.al.	2501.05710	translate	read	null
2025-01-10	HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection	Anant Mehta et.al.	2501.05631	translate	read	link
2025-01-09	Towards Probabilistic Inference of Human Motor Intentions by Assistive Mobile Robots Controlled via a Brain-Computer Interface	Xiaoshan Zhou et.al.	2501.05610	translate	read	null
2025-01-09	Consistent Flow Distillation for Text-to-3D Generation	Runjie Yan et.al.	2501.05445	translate	read	null
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	translate	read	null
2025-01-09	Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation	Darius Petermann et.al.	2501.05413	translate	read	null
2025-01-09	CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models	Junha Park et.al.	2501.05359	translate	read	null
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	translate	read	null
2025-01-09	3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering	Dewei Zhou et.al.	2501.05131	translate	read	link
2025-01-08	LayerMix: Enhanced Data Augmentation through Fractal Integration for Robust Deep Learning	Hafiz Mughees Ahmad et.al.	2501.04861	translate	read	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	translate	read	null
2025-01-08	Optimal Trading of a Charging-Station Company in Auction Markets for Electricity	Farnaz Sohrabi et.al.	2501.04647	translate	read	null
2025-01-08	Accelerated Discovery of Vanadium Oxide Compositions: A WGAN-VAE Framework for Materials Design	Danial Ebrahimzadeh et.al.	2501.04604	translate	read	null
2025-01-08	Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time	Uri Berger et.al.	2501.04513	translate	read	null
2025-01-08	On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis	Yekun Ke et.al.	2501.04377	translate	read	null
2025-01-08	Circuit Complexity Bounds for Visual Autoregressive Model	Yekun Ke et.al.	2501.04299	translate	read	null
2025-01-07	HistoryPalette: Supporting Exploration and Reuse of Past Alternatives in Image Generation and Editing	Karim Benharrak et.al.	2501.04163	translate	read	null
2025-01-07	BiasGuard: Guardrailing Fairness in Machine Learning Production Systems	Nurit Cohen-Inger et.al.	2501.04142	translate	read	link
2025-01-07	ZDySS – Zero-Shot Dynamic Scene Stylization using Gaussian Splatting	Abhishek Saroha et.al.	2501.03875	translate	read	null
2025-01-07	A Value Mapping Virtual Staining Framework for Large-scale Histological Imaging	Junjia Wang et.al.	2501.03592	translate	read	null
2025-01-07	Evaluating Image Caption via Cycle-consistent Text-to-Image Generation	Tianyu Cui et.al.	2501.03567	translate	read	null
2025-01-07	PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models	Lingzhi Yuan et.al.	2501.03544	translate	read	null
2025-01-07	Textualize Visual Prompt for Image Editing via Diffusion Bridge	Pengcheng Xu et.al.	2501.03495	translate	read	null
2025-01-07	SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation	Shang Chai et.al.	2501.03490	translate	read	null
2025-01-07	Physics-Constrained Generative Artificial Intelligence for Rapid Takeoff Trajectory Design	Samuel Sisk et.al.	2501.03445	translate	read	null
2025-01-06	License Plate Images Generation with Diffusion Models	Mariia Shpir et.al.	2501.03374	translate	read	null
2025-01-06	InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models	Kai Wang et.al.	2501.02816	translate	read	null
2025-01-06	DarkFarseer: Inductive Spatio-temporal Kriging via Hidden Style Enhancement and Sparsity-Noise Mitigation	Zhuoxuan Liang et.al.	2501.02808	translate	read	null
2025-01-06	Enhancing Robot Route Optimization in Smart Logistics with Transformer and GNN Integration	Hao Luo et.al.	2501.02749	translate	read	null
2025-01-06	Artificial Intelligence in Creative Industries: Advances Prior to 2025	Nantheera Anantrasirichai et.al.	2501.02725	translate	read	null
2025-01-05	Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks	Leo Franklin et.al.	2501.02527	translate	read	null
2025-01-05	Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation	Dawei Dai et.al.	2501.02523	translate	read	link
2025-01-05	ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling	Chaojie Mao et.al.	2501.02487	translate	read	null
2025-01-05	MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation	Avni Mittal et.al.	2501.02447	translate	read	null
2025-01-04	CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models	Kuan-Hung Liu et.al.	2501.02355	translate	read	link
2025-01-04	Design and Benchmarking of A Multi-Modality Sensor for Robotic Manipulation with GAN-Based Cross-Modality Interpretation	Dandan Zhang et.al.	2501.02303	translate	read	null
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	translate	read	null
2025-01-03	Controlling your Attributes in Voice	Xuyuan Li et.al.	2501.01674	translate	read	null
2025-01-03	Multivariate Time Series Anomaly Detection using DiffGAN Model	Guangqiang Wu et.al.	2501.01591	translate	read	null
2025-01-02	Object-level Visual Prompts for Compositional Image Generation	Gaurav Parmar et.al.	2501.01424	translate	read	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	translate	read	null
2025-01-02	ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer	Xuyin Qi et.al.	2501.01392	translate	read	link
2025-01-02	Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement	Z. Zhang et.al.	2501.01368	translate	read	null
2025-01-02	LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge	Kyoungkook Kang et.al.	2501.01197	translate	read	null
2025-01-02	HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment	Zitong Xu et.al.	2501.01116	translate	read	null
2025-01-02	MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification	Jimin Park et.al.	2501.01110	translate	read	link
2025-01-02	EliGen: Entity-Level Controlled Image Generation with Regional Attention	Hong Zhang et.al.	2501.01097	translate	read	null
2025-01-02	State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects	Harshika Goyal et.al.	2501.01029	translate	read	null
2025-01-01	OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes	Sepehr Dehdashtian et.al.	2501.00962	translate	read	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)