Image Generation - 2024-12 | Paper Arxiv Daily

Image Generation - 2024-12

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-12-30	Quantum Diffusion Model for Quark and Gluon Jet Generation	Mariia Baidachna et.al.	2412.21082	translate	read	link
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	translate	read	link
2024-12-30	Redesign Quantum Circuits on Quantum Hardware Device	Runhong He et.al.	2412.20893	translate	read	null
2024-12-30	Generative Deep Synthesis of MIMO Sensing Waveforms with Desired Transmit Beampattern	Vesa Saarinen et.al.	2412.20883	translate	read	null
2024-12-30	VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control	Shaojin Wu et.al.	2412.20800	translate	read	link
2024-12-30	HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images	Sungik Choi et.al.	2412.20704	translate	read	null
2024-12-30	Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis	Yousef Yeganeh et.al.	2412.20651	translate	read	null
2024-12-29	Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)	Tomer Garber et.al.	2412.20596	translate	read	null
2024-12-29	Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment	Shiyun Chen et.al.	2412.20418	translate	read	null
2024-12-27	StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture	Miaomiao Dai et.al.	2412.19535	translate	read	null
2024-12-27	P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision	Junjie Hu et.al.	2412.19533	translate	read	null
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	translate	read	null
2024-12-27	Generative Adversarial Network on Motion-Blur Image Restoration	Zhengdong Li et.al.	2412.19479	translate	read	null
2024-12-27	Focusing Image Generation to Mitigate Spurious Correlations	Xuewei Li et.al.	2412.19457	translate	read	null
2024-12-26	Multi-Attribute Constraint Satisfaction via Language Model Rewriting	Ashutosh Baheti et.al.	2412.19198	translate	read	null
2024-12-26	Generating Editable Head Avatars with 3D Gaussian GANs	Guohao Li et.al.	2412.19149	translate	read	link
2024-12-25	MGAN-CRCM: A Novel Multiple Generative Adversarial Network and Coarse-Refinement Based Cognizant Method for Image Inpainting	Nafiz Al Asad et.al.	2412.19000	translate	read	null
2024-12-25	Single Trajectory Distillation for Accelerating Image and Video Style Transfer	Sijie Xu et.al.	2412.18945	translate	read	null
2024-12-25	UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation	Lunhao Duan et.al.	2412.18928	translate	read	null
2024-12-24	Efficient Aircraft Design Optimization Using Multi-Fidelity Models and Multi-fidelity Physics Informed Neural Networks	Apurba Sarker et.al.	2412.18564	translate	read	null
2024-12-24	Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models	Qice Qin et.al.	2412.18421	translate	read	null
2024-12-24	Extract Free Dense Misalignment from CLIP	JeongYeon Nam et.al.	2412.18404	translate	read	link
2024-12-24	RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction	Wu Xiaoping et.al.	2412.18390	translate	read	null
2024-12-24	Improved Feature Generating Framework for Transductive Zero-shot Learning	Zihan Ye et.al.	2412.18282	translate	read	null
2024-12-24	TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization	Yucong Luo et.al.	2412.18185	translate	read	null
2024-12-24	EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation	Shuhao Han et.al.	2412.18150	translate	read	null
2024-12-24	Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction	Xiao Guo et.al.	2412.18149	translate	read	null
2024-12-24	Ensuring Consistency for In-Image Translation	Chengpeng Fu et.al.	2412.18139	translate	read	null
2024-12-24	Beyond the Known: Enhancing Open Set Domain Adaptation with Unknown Exploration	Lucas Fernando Alvarenga e Silva et.al.	2412.18105	translate	read	link
2024-12-23	Personalized Large Vision-Language Models	Chau Pham et.al.	2412.17610	translate	read	null
2024-12-23	Discriminative Image Generation with Diffusion Models for Zero-Shot Learning	Dingjie Fu et.al.	2412.17219	translate	read	null
2024-12-22	Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching	Enshu Liu et.al.	2412.17153	translate	read	link
2024-12-22	Style Transfer Dataset: What Makes A Good Stylization?	Victor Kitov et.al.	2412.17139	translate	read	null
2024-12-22	Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images	Dennis Menn et.al.	2412.17109	translate	read	null
2024-12-22	DreamOmni: Unified Image Generation and Editing	Bin Xia et.al.	2412.17098	translate	read	null
2024-12-22	Modular Conversational Agents for Surveys and Interviews	Jiangbo Yu et.al.	2412.17049	translate	read	null
2024-12-22	HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories	Eric Hedlin et.al.	2412.17040	translate	read	null
2024-12-22	DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network	Xiangtian Li et.al.	2412.16948	translate	read	null
2024-12-22	Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation	Quan Dao et.al.	2412.16906	translate	read	link
2024-12-20	Personalized Representation from Personalized Generation	Shobhita Sundaram et.al.	2412.16156	translate	read	link
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141	translate	read	null
2024-12-20	CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up	Songhua Liu et.al.	2412.16112	translate	read	link
2024-12-20	SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation	Jiadong Pan et.al.	2412.16039	translate	read	null
2024-12-20	A Thorough Investigation into the Application of Deep CNN for Enhancing Natural Language Processing Capabilities	Chang Weng et.al.	2412.15900	translate	read	null
2024-12-20	Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation	Kai Brandenbusch et.al.	2412.15853	translate	read	null
2024-12-20	Robustness-enhanced Myoelectric Control with GAN-based Open-set Recognition	Cheng Wang et.al.	2412.15819	translate	read	null
2024-12-20	PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium	Xinzhe Li et.al.	2412.15674	translate	read	link
2024-12-20	BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models	Yifei Sun et.al.	2412.15670	translate	read	link
2024-12-20	SemDP: Semantic-level Differential Privacy Protection for Face Datasets	Xiaoting Zhang et.al.	2412.15590	translate	read	null
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	translate	read	null
2024-12-19	FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching	Sucheng Ren et.al.	2412.15205	translate	read	link
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	translate	read	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	translate	read	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	translate	read	link
2024-12-19	Generative AI for Banks: Benchmarks and Algorithms for Synthetic Financial Transaction Data	Fabian Sven Karst et.al.	2412.14730	translate	read	link
2024-12-19	Qua $^2$ SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models	Keith G. Mills et.al.	2412.14628	translate	read	null
2024-12-19	Dynamic User Interface Generation for Enhanced Human-Computer Interaction Using Variational Autoencoders	Runsheng Zhang et.al.	2412.14521	translate	read	null
2024-12-19	DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On	Wengyi Zhan et.al.	2412.14465	translate	read	null
2024-12-19	LEDiff: Latent Exposure Diffusion for HDR Generation	Chao Wang et.al.	2412.14456	translate	read	null
2024-12-18	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	translate	read	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	translate	read	link
2024-12-18	FashionComposer: Compositional Fashion Image Generation	Sihui Ji et.al.	2412.14168	translate	read	null
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	translate	read	null
2024-12-18	Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations	Ludovico Nista et.al.	2412.14150	translate	read	null
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734	translate	read	null
2024-12-18	Diffusion models and stochastic quantisation in lattice field theory	Gert Aarts et.al.	2412.13704	translate	read	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684	translate	read	null
2024-12-18	Self-control: A Better Conditional Mechanism for Masked Autoregressive Model	Qiaoying Qu et.al.	2412.13635	translate	read	null
2024-12-18	Hybrid Data-Free Knowledge Distillation	Jialiang Tang et.al.	2412.13525	translate	read	link
2024-12-17	F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration	Lu Liu et.al.	2412.13155	translate	read	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081	translate	read	null
2024-12-17	3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation	Haoshen Wang et.al.	2412.13059	translate	read	null
2024-12-17	A New Adversarial Perspective for LiDAR-based 3D Object Detection	Shijun Zheng et.al.	2412.13017	translate	read	null
2024-12-17	Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression	Ruijie Chen et.al.	2412.12982	translate	read	null
2024-12-17	Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance	Wenhao Sun et.al.	2412.12974	translate	read	link
2024-12-17	Unsupervised Region-Based Image Editing of Denoising Diffusion Models	Zixiang Li et.al.	2412.12912	translate	read	null
2024-12-17	ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction	Zhongjie Duan et.al.	2412.12888	translate	read	link
2024-12-17	Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data	Chengzhou Yu et.al.	2412.12778	translate	read	null
2024-12-17	Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation	Shoukun Sun et.al.	2412.12771	translate	read	null
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	translate	read	link
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	translate	read	null
2024-12-16	Ensemble Learning and 3D Pix2Pix for Comprehensive Brain Tumor Analysis in Multimodal MRI	Ramy A. Zeineldin et.al.	2412.11849	translate	read	null
2024-12-16	Multilingual and Explainable Text Detoxification with Parallel Corpora	Daryna Dementieva et.al.	2412.11691	translate	read	link
2024-12-16	IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation	Yiren Song et.al.	2412.11638	translate	read	null
2024-12-16	3D $^2$ -Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling	Zichen Tang et.al.	2412.11599	translate	read	link
2024-12-16	VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis	Zhipeng Chen et.al.	2412.11594	translate	read	link
2024-12-16	LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model	Xi Wang et.al.	2412.11519	translate	read	null
2024-12-16	FedCAR: Cross-client Adaptive Re-weighting for Generative Models in Federated Learning	Minjun Kim et.al.	2412.11463	translate	read	link
2024-12-16	Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models	Namhyuk Ahn et.al.	2412.11423	translate	read	null
2024-12-13	OP-LoRA: The Blessing of Dimensionality	Piotr Teterwak et.al.	2412.10362	translate	read	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	translate	read	link
2024-12-13	Simple Guidance Mechanisms for Discrete Diffusion Models	Yair Schiff et.al.	2412.10193	translate	read	link
2024-12-13	FaceShield: Defending Facial Image against Deepfake Threats	Jaehwan Jeong et.al.	2412.09921	translate	read	null
2024-12-13	ProxyLLM : LLM-Driven Framework for Customer Support Through Text-Style Transfer	Sehyeong Jo et.al.	2412.09916	translate	read	null
2024-12-13	T-GMSI: A transformer-based generative model for spatial interpolation under sparse measurements	Xiangxi Tian et.al.	2412.09886	translate	read	null
2024-12-13	Financial Fine-tuning a Large Time Series Model	Xinghong Fu et.al.	2412.09880	translate	read	link
2024-12-12	Human vs. AI: A Novel Benchmark and a Comparative Study on the Detection of Generated Images and the Impact of Prompts	Philipp Moeßner et.al.	2412.09715	translate	read	link
2024-12-12	Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation	Chun-Mei Feng et.al.	2412.09706	translate	read	link
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	translate	read	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	translate	read	null
2024-12-12	FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers	Yusuf Dalva et.al.	2412.09611	translate	read	null
2024-12-12	Spectral Image Tokenizer	Carlos Esteves et.al.	2412.09607	translate	read	null
2024-12-12	Are Conditional Latent Diffusion Models Effective for Image Restoration?	Yunchen Yuan et.al.	2412.09324	translate	read	null
2024-12-12	Transfer Learning of RSSI to Improve Indoor Localisation Performance	Thanaphon Suwannaphong et.al.	2412.09292	translate	read	link
2024-12-12	DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification	Kunlun Xu et.al.	2412.09224	translate	read	null
2024-12-12	RAD: Region-Aware Diffusion Models for Image Inpainting	Sora Kim et.al.	2412.09191	translate	read	null
2024-12-12	LVMark: Robust Watermark for latent video diffusion models	MinHyuk Jang et.al.	2412.09122	translate	read	null
2024-12-12	ViUniT: Visual Unit Tests for More Robust Visual Programming	Artemis Panagopoulou et.al.	2412.08859	translate	read	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	translate	read	null
2024-12-11	Fast Prompt Alignment for Text-to-Image Generation	Khalil Mrini et.al.	2412.08639	translate	read	link
2024-12-11	Multimodal Latent Language Modeling with Next-Token Diffusion	Yutao Sun et.al.	2412.08635	translate	read	link
2024-12-11	LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations	Zejian Li et.al.	2412.08580	translate	read	link
2024-12-11	StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements	Mingkun Lei et.al.	2412.08503	translate	read	link
2024-12-11	Learning Flow Fields in Attention for Controllable Person Image Generation	Zijian Zhou et.al.	2412.08486	translate	read	link
2024-12-11	InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models	Min Hou et.al.	2412.08480	translate	read	link
2024-12-11	CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis	Mu Zhang et.al.	2412.08464	translate	read	null
2024-12-11	Analyzing and Improving Model Collapse in Rectified Flow Models	Huminhao Zhu et.al.	2412.08175	translate	read	null
2024-12-11	AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting	Zihao Han et.al.	2412.08149	translate	read	null
2024-12-10	UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics	Xi Chen et.al.	2412.07774	translate	read	null
2024-12-10	StyleMaster: Stylize Your Video with Artistic Generation and Translation	Zixuan Ye et.al.	2412.07744	translate	read	link
2024-12-10	FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models	Tong Wu et.al.	2412.07674	translate	read	link
2024-12-10	DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation	Jianzong Wu et.al.	2412.07589	translate	read	link
2024-12-10	StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization	Jinlu Zhang et.al.	2412.07375	translate	read	link
2024-12-10	Fusion Embedding for Pose-Guided Person Image Synthesis with Diffusion Model	Donghwna Lee et.al.	2412.07333	translate	read	link
2024-12-10	A Generative Victim Model for Segmentation	Aixuan Li et.al.	2412.07274	translate	read	null
2024-12-10	Buster: Incorporating Backdoor Attacks into Text Encoder to Mitigate NSFW Content Generation	Xin Zhao et.al.	2412.07249	translate	read	null
2024-12-10	Moderating the Generalization of Score-based Generative Model	Wan Jiang et.al.	2412.07229	translate	read	null
2024-12-10	Fine-grained Text to Image Synthesis	Xu Ouyang et.al.	2412.07196	translate	read	null
2024-12-09	Visual Lexicon: Rich Image Features in Language Space	XuDong Wang et.al.	2412.06774	translate	read	null
2024-12-09	Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty	Meera Hahn et.al.	2412.06771	translate	read	link
2024-12-09	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	translate	read	null
2024-12-09	EMOv2: Pushing 5M Vision Model Frontier	Jiangning Zhang et.al.	2412.06674	translate	read	link
2024-12-09	ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance	Chunwei Wang et.al.	2412.06673	translate	read	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	translate	read	null
2024-12-09	Echocardiography to Cardiac MRI View Transformation for Real-Time Blind Restoration	Ilke Adalioglu et.al.	2412.06445	translate	read	null
2024-12-09	Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs	George Kontogiannis et.al.	2412.06389	translate	read	null
2024-12-09	Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment	Kim Sung-Bin et.al.	2412.06209	translate	read	link
2024-12-09	ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance	Yuming Li et.al.	2412.06163	translate	read	null
2024-12-06	LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation	Donald Shenaj et.al.	2412.05148	translate	read	link
2024-12-06	The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation	Ruoyu Wang et.al.	2412.05101	translate	read	null
2024-12-06	Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors	Yuheng Zhang et.al.	2412.05000	translate	read	null
2024-12-06	Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction	Gaurav Shrivastava et.al.	2412.04929	translate	read	null
2024-12-05	Hidden in the Noise: Two-Stage Robust Watermarking for Images	Kasra Arabi et.al.	2412.04653	translate	read	link
2024-12-05	One Communication Round is All It Needs for Federated Fine-Tuning Foundation Models	Ziyao Wang et.al.	2412.04650	translate	read	null
2024-12-05	LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors	Yusuf Dalva et.al.	2412.04460	translate	read	null
2024-12-05	Learning Artistic Signatures: Symmetry Discovery and Style Transfer	Emma Finn et.al.	2412.04441	translate	read	null
2024-12-05	Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis	Jian Han et.al.	2412.04431	translate	read	link
2024-12-05	Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction	George Webber et.al.	2412.04324	translate	read	null
2024-12-05	The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation	Fredrik Carlsson et.al.	2412.04318	translate	read	null
2024-12-05	T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts	Ziwei Huang et.al.	2412.04300	translate	read	null
2024-12-05	Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation	Jie Bao et.al.	2412.04296	translate	read	null
2024-12-05	AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models	Xinghui Li et.al.	2412.04146	translate	read	link
2024-12-05	D-LORD for Motion Stylization	Meenakshi Gupta et.al.	2412.04097	translate	read	null
2024-12-05	BodyMetric: Evaluating the Realism of HumanBodies in Text-to-Image Generation	Nefeli Andreou et.al.	2412.04086	translate	read	null
2024-12-04	Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation	Bingjie Song et.al.	2412.03571	translate	read	null
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	translate	read	link
2024-12-04	Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective	Neta Shaul et.al.	2412.03487	translate	read	null
2024-12-04	Skel3D: Skeleton Guided Novel View Synthesis	Aron Fóthi et.al.	2412.03407	translate	read	null
2024-12-04	Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment	Feng He et.al.	2412.03400	translate	read	null
2024-12-04	SGSST: Scaling Gaussian Splatting StyleTransfer	Bruno Galerne et.al.	2412.03371	translate	read	null
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	translate	read	null
2024-12-04	Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis	Tao Jun Lin et.al.	2412.03315	translate	read	null
2024-12-04	Is JPEG AI going to change image forensics?	Edoardo Daniele Cannas et.al.	2412.03261	translate	read	null
2024-12-04	DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation	Qingdong He et.al.	2412.03255	translate	read	null
2024-12-03	Taming Scalable Visual Tokenizer for Autoregressive Image Generation	Fengyuan Shi et.al.	2412.02692	translate	read	link
2024-12-03	FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation	Kefan Chen et.al.	2412.02690	translate	read	null
2024-12-03	SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance	Viet Nguyen et.al.	2412.02687	translate	read	null
2024-12-03	WEM-GAN: Wavelet transform based facial expression manipulation	Dongya Sun et.al.	2412.02530	translate	read	null
2024-12-03	ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?	Leixin Zhang et.al.	2412.02368	translate	read	link
2024-12-03	Switchable deep beamformer for high-quality and real-time passive acoustic mapping	Yi Zeng et.al.	2412.02327	translate	read	null
2024-12-03	Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models	Jungwon Park et.al.	2412.02237	translate	read	link
2024-12-03	GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations	Renan A. Rojas-Gomez et.al.	2412.02214	translate	read	null
2024-12-03	An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction	Yaxin Liang et.al.	2412.02211	translate	read	null
2024-12-03	3D representation in 512-Byte:Variational tokenizer is the key for autoregressive 3D generation	Jinzhi Zhang et.al.	2412.02202	translate	read	null

(<a href=../Image_Generation.md>back to Image Generation</a>)