Image Generation - 2024-04
Image Generation - 2024-04
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-04-30 | IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images | Shadab Ahamed et.al. | 2405.00239 | translate | read | link |
| 2024-04-30 | DOCCI: Descriptions of Connected and Contrasting Images | Yasumasa Onoe et.al. | 2404.19753 | translate | read | null |
| 2024-04-30 | Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | Yunhao Ge et.al. | 2404.19752 | translate | read | null |
| 2024-04-30 | SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration | Yuto Nakashima et.al. | 2404.19693 | translate | read | null |
| 2024-04-30 | Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model | Denys Godwin et.al. | 2404.19609 | translate | read | null |
| 2024-04-30 | TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models | Teng Zhou et.al. | 2404.19475 | translate | read | null |
| 2024-04-30 | InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation | Chanran Kim et.al. | 2404.19427 | translate | read | null |
| 2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | translate | read | null |
| 2024-04-29 | DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Minghao Chen et.al. | 2404.18929 | translate | read | null |
| 2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | translate | read | null |
| 2024-04-29 | Hide and Seek: How Does Watermarking Impact Face Recognition? | Yuguang Yao et.al. | 2404.18890 | translate | read | null |
| 2024-04-29 | Learning Mixtures of Gaussians Using Diffusion Models | Khashayar Gatmiry et.al. | 2404.18869 | translate | read | null |
| 2024-04-29 | Socially Adaptive Path Planning Based on Generative Adversarial Network | Yao Wang et.al. | 2404.18687 | translate | read | null |
| 2024-04-29 | FlexiFilm: Long Video Generation with Flexible Conditions | Yichen Ouyang et.al. | 2404.18620 | translate | read | link |
| 2024-04-29 | Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting | Tianyidan Xie et.al. | 2404.18598 | translate | read | null |
| 2024-04-29 | SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection Methods | Manos Schinas et.al. | 2404.18552 | translate | read | link |
| 2024-04-29 | Towards Image Synthesis with Photon Counting Stellar Intensity Interferometry | Alessia Spolon et.al. | 2404.18507 | translate | read | null |
| 2024-04-29 | Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology | Luzhe Huang et.al. | 2404.18458 | translate | read | null |
| 2024-04-26 | Federated Transfer Component Analysis Towards Effective VNF Profiling | Xunzheng ZhangB et.al. | 2404.17553 | translate | read | null |
| 2024-04-26 | Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement | Zishu Yao et.al. | 2404.17400 | translate | read | null |
| 2024-04-26 | Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection | Jiawei Song et.al. | 2404.17254 | translate | read | null |
| 2024-04-26 | ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion | Ziyue Zhang et.al. | 2404.17230 | translate | read | link |
| 2024-04-26 | DPGAN: A Dual-Path Generative Adversarial Network for Missing Data Imputation in Graphs | Xindi Zheng et.al. | 2404.17164 | translate | read | null |
| 2024-04-26 | An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder | Yicheng Gu et.al. | 2404.17161 | translate | read | null |
| 2024-04-26 | Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis | Shivangi Yadav et.al. | 2404.17105 | translate | read | null |
| 2024-04-25 | Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks | Yaqi Hu et.al. | 2404.17069 | translate | read | null |
| 2024-04-25 | DE-CGAN: Boosting rTMS Treatment Prediction with Diversity Enhancing Conditional Generative Adversarial Networks | Matthew Squires et.al. | 2404.16913 | translate | read | null |
| 2024-04-25 | REBEL: Reinforcement Learning via Regressing Relative Rewards | Zhaolin Gao et.al. | 2404.16767 | translate | read | null |
| 2024-04-25 | Denoising: from classical methods to deep CNNs | Jean-Eric Campagne et.al. | 2404.16617 | translate | read | link |
| 2024-04-25 | MuseumMaker: Continual Style Customization without Catastrophic Forgetting | Chenxi Liu et.al. | 2404.16612 | translate | read | null |
| 2024-04-25 | Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models | Parul Gupta et.al. | 2404.16556 | translate | read | null |
| 2024-04-25 | OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images | Ye Mao et.al. | 2404.16538 | translate | read | null |
| 2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | translate | read | link |
| 2024-04-24 | Guardians of the Quantum GAN | Archisman Ghosh et.al. | 2404.16156 | translate | read | null |
| 2024-04-24 | Quantitative Characterization of Retinal Features in Translated OCTA | Rashadul Hasan Badhon et.al. | 2404.16133 | translate | read | null |
| 2024-04-24 | Spinning solar jets explained through the interplay between plasma sheets and vortex columns | Sahel Dey et.al. | 2404.16096 | translate | read | null |
| 2024-04-24 | PuLID: Pure and Lightning ID Customization via Contrastive Alignment | Zinan Guo et.al. | 2404.16022 | translate | read | null |
| 2024-04-24 | Security Analysis of WiFi-based Sensing Systems: Threats from Perturbation Attacks | Hangcheng Cao et.al. | 2404.15587 | translate | read | null |
| 2024-04-23 | Multi-scale Intervention Planning based on Generative Design | Ioannis Kavouras et.al. | 2404.15492 | translate | read | null |
| 2024-04-23 | ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning | Weifeng Chen et.al. | 2404.15449 | translate | read | null |
| 2024-04-23 | GLoD: Composing Global Contexts and Local Details in Image Generation | Moyuru Yamada et.al. | 2404.15447 | translate | read | null |
| 2024-04-23 | From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation | Zehuan Huang et.al. | 2404.15267 | translate | read | null |
| 2024-04-23 | Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment | Tianwei Zhou et.al. | 2404.15163 | translate | read | null |
| 2024-04-23 | Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation | Xun Wu et.al. | 2404.15100 | translate | read | null |
| 2024-04-23 | CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields | Deheng Zhang et.al. | 2404.14967 | translate | read | null |
| 2024-04-23 | Music Style Transfer With Diffusion Model | Hong Huang et.al. | 2404.14771 | translate | read | null |
| 2024-04-23 | SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models | Bo Lin et.al. | 2404.14755 | translate | read | null |
| 2024-04-23 | Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning | Yuchao Liao et.al. | 2404.14754 | translate | read | null |
| 2024-04-23 | FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Hang Hua et.al. | 2404.14715 | translate | read | null |
| 2024-04-22 | The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking | Yuying Li et.al. | 2404.14581 | translate | read | null |
| 2024-04-22 | GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Rahul Sajnani et.al. | 2404.14403 | translate | read | null |
| 2024-04-22 | SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation | Yuying Ge et.al. | 2404.14396 | translate | read | link |
| 2024-04-22 | MultiBooth: Towards Generating All Your Concepts in an Image from Text | Chenyang Zhu et.al. | 2404.14239 | translate | read | link |
| 2024-04-22 | RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance | Chengrui Wang et.al. | 2404.13984 | translate | read | null |
| 2024-04-23 | Accelerating Image Generation with Sub-path Linear Approximation Model | Chen Xu et.al. | 2404.13903 | translate | read | null |
| 2024-04-22 | Towards Better Text-to-Image Generation Alignment via Attention Modulation | Yihang Wu et.al. | 2404.13899 | translate | read | null |
| 2024-04-22 | Regional Style and Color Transfer | Zhicheng Ding et.al. | 2404.13880 | translate | read | null |
| 2024-04-22 | Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning | Huan Bao et.al. | 2404.13860 | translate | read | null |
| 2024-04-22 | A Comparative Study on Enhancing Prediction in Social Network Advertisement through Data Augmentation | Qikai Yang et.al. | 2404.13812 | translate | read | null |
| 2024-04-21 | Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation | Jensen Hwa et.al. | 2404.13798 | translate | read | null |
| 2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | translate | read | null |
| 2024-04-19 | Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images | Santosh et.al. | 2404.12908 | translate | read | link |
| 2024-04-19 | Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet | Gazi Hasin Ishrak et.al. | 2404.12841 | translate | read | null |
| 2024-04-19 | Generative Modelling with High-Order Langevin Dynamics | Ziqiang Shi et.al. | 2404.12814 | translate | read | null |
| 2024-04-19 | PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy | Zepeng Jiang et.al. | 2404.12730 | translate | read | null |
| 2024-04-19 | MLSD-GAN – Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement | Aravinda Reddy PN et.al. | 2404.12679 | translate | read | null |
| 2024-04-19 | How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples | Dren Fazlija et.al. | 2404.12653 | translate | read | null |
| 2024-04-19 | F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation | Man M. Ho et.al. | 2404.12650 | translate | read | null |
| 2024-04-18 | Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models | Israel A. Laurensi et.al. | 2404.12260 | translate | read | null |
| 2024-04-18 | First 2D electron density measurements using Coherence Imaging Spectroscopy in the MAST-U Super-X divertor | N. Lonigro et.al. | 2404.12021 | translate | read | null |
| 2024-04-18 | ©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model | Chao Zhou et.al. | 2404.11962 | translate | read | null |
| 2024-04-18 | Sketch-guided Image Inpainting with Partial Discrete Diffusion Process | Nakul Sharma et.al. | 2404.11949 | translate | read | link |
| 2024-04-18 | LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights | Thibault Castells et.al. | 2404.11936 | translate | read | null |
| 2024-04-18 | EdgeFusion: On-Device Text-to-Image Generation | Thibault Castells et.al. | 2404.11925 | translate | read | null |
| 2024-04-18 | Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans | Lixing Tan et.al. | 2404.11889 | translate | read | null |
| 2024-04-18 | Generating synthetic electroretinogram waveforms using Artificial Intelligence to improve classification of retinal conditions in under-represented populations | Mikhail Kulyabin et.al. | 2404.11842 | translate | read | null |
| 2024-04-18 | TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation | Tianyi Liang et.al. | 2404.11824 | translate | read | null |
| 2024-04-18 | Tailoring Generative Adversarial Networks for Smooth Airfoil Design | Joyjit Chattoraj et.al. | 2404.11816 | translate | read | null |
| 2024-04-17 | On the Scalability of GNNs for Molecular Graphs | Maciej Sypetkowski et.al. | 2404.11568 | translate | read | null |
| 2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | translate | read | null |
| 2024-04-17 | SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Yu Zhong et.al. | 2404.11537 | translate | read | null |
| 2024-04-17 | Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt | Zhanjie Zhang et.al. | 2404.11474 | translate | read | link |
| 2024-04-17 | What-if Analysis Framework for Digital Twins in 6G Wireless Network Management | Elif Ak et.al. | 2404.11394 | translate | read | null |
| 2024-04-17 | Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | Eri Hosonuma et.al. | 2404.11280 | translate | read | null |
| 2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | translate | read | null |
| 2024-04-17 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | translate | read | link |
| 2024-04-17 | TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing | Sherry X. Chen et.al. | 2404.11120 | translate | read | link |
| 2024-04-17 | Object Remover Performance Evaluation Methods using Class-wise Object Removal Images | Changsuk Oh et.al. | 2404.11104 | translate | read | null |
| 2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | translate | read | null |
| 2024-04-16 | LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang et.al. | 2404.10763 | translate | read | link |
| 2024-04-16 | AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation | Zexin Li et.al. | 2404.10714 | translate | read | null |
| 2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | translate | read | null |
| 2024-04-16 | Adversarial Identity Injection for Semantic Face Image Synthesis | Giuseppe Tarollo et.al. | 2404.10408 | translate | read | null |
| 2024-04-16 | Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery | Payal Varshney et.al. | 2404.10356 | translate | read | null |
| 2024-04-16 | CanvasPic: An Interactive Tool for Freely Generating Facial Images Based on Spatial Layout | Jiafu Wei et.al. | 2404.10352 | translate | read | null |
| 2024-04-16 | OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Runyi Li et.al. | 2404.10312 | translate | read | null |
| 2024-04-16 | Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain | Steve Andreas Immanuel et.al. | 2404.10307 | translate | read | link |
| 2024-04-16 | OneActor: Consistent Character Generation via Cluster-Conditioned Guidance | Jiahao Wang et.al. | 2404.10267 | translate | read | null |
| 2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | translate | read | link |
| 2024-04-15 | VFLGAN: Vertical Federated Learning-based Generative Adversarial Network for Vertically Partitioned Data Publication | Xun Yuan et.al. | 2404.09722 | translate | read | null |
| 2024-04-15 | In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation | Han Xue et.al. | 2404.09633 | translate | read | null |
| 2024-04-15 | Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement | Chi Wang et.al. | 2404.09540 | translate | read | null |
| 2024-04-15 | Magic Clothing: Controllable Garment-Driven Image Synthesis | Weifeng Chen et.al. | 2404.09512 | translate | read | link |
| 2024-04-15 | Improved Object-Based Style Transfer with Single Deep Network | Harshmohan Kulkarni et.al. | 2404.09461 | translate | read | null |
| 2024-04-15 | Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models | Peifei Zhu et.al. | 2404.09401 | translate | read | null |
| 2024-04-14 | Counteracting Concept Drift by Learning with Future Malware Predictions | Branislav Bosansky et.al. | 2404.09352 | translate | read | null |
| 2024-04-14 | DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling | Xuening Yuan et.al. | 2404.09227 | translate | read | null |
| 2024-04-13 | InverseVis: Revealing the Hidden with Curved Sphere Tracing | Kai Lawonn et.al. | 2404.09092 | translate | read | null |
| 2024-04-12 | An improved tabular data generator with VAE-GMM integration | Patricia A. Apellániz et.al. | 2404.08434 | translate | read | null |
| 2024-04-12 | Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts | Yang Li et.al. | 2404.08341 | translate | read | link |
| 2024-04-11 | Latent Guard: a Safety Framework for Text-to-image Generation | Runtao Liu et.al. | 2404.08031 | translate | read | link |
| 2024-04-11 | Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models | Mazda Moayeri et.al. | 2404.08030 | translate | read | null |
| 2024-04-11 | OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Moreno D’Incà et.al. | 2404.07990 | translate | read | null |
| 2024-04-11 | Taming Stable Diffusion for Text to 360° Panorama Image Generation | Cheng Zhang et.al. | 2404.07949 | translate | read | link |
| 2024-04-11 | Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models – Technical Challenges and Implications for Monitoring and Verification | Tuong Vy Nguyen et.al. | 2404.07754 | translate | read | null |
| 2024-04-11 | Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models | Tuomas Kynkäänniemi et.al. | 2404.07724 | translate | read | null |
| 2024-04-11 | Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis | Marc Aubreville et.al. | 2404.07676 | translate | read | null |
| 2024-04-11 | Implicit and Explicit Language Guidance for Diffusion-based Visual Perception | Hefeng Wang et.al. | 2404.07600 | translate | read | null |
| 2024-04-11 | GAN-based iterative motion estimation in HASTE MRI | Mathias S. Feinler et.al. | 2404.07576 | translate | read | null |
| 2024-04-11 | ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation | Stanislav Frolov et.al. | 2404.07564 | translate | read | null |
| 2024-04-11 | CAT: Contrastive Adapter Training for Personalized Image Generation | Jae Wan Park et.al. | 2404.07554 | translate | read | link |
| 2024-04-11 | Enhancing Network Intrusion Detection Performance using Generative Adversarial Networks | Xinxing Zhao et.al. | 2404.07464 | translate | read | null |
| 2024-04-10 | RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion | Jaidev Shriram et.al. | 2404.07199 | translate | read | null |
| 2024-04-10 | A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks | Neel Mishra et.al. | 2404.07172 | translate | read | link |
| 2024-04-10 | Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model | Yijia Chen et.al. | 2404.07072 | translate | read | link |
| 2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | translate | read | null |
| 2024-04-10 | UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion | Junsheng Zhou et.al. | 2404.06851 | translate | read | null |
| 2024-04-10 | Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer | Yanqi Ge et.al. | 2404.06835 | translate | read | null |
| 2024-04-10 | MedRG: Medical Report Grounding with Multi-modal Large Language Model | Ke Zou et.al. | 2404.06798 | translate | read | null |
| 2024-04-10 | CryinGAN: Design and evaluation of point-cloud-based generative adversarial networks using disordered materials $-$ application to Li$_3$ScCl$_6$-LiCoO$_2$ battery interfaces | Adrian Xiao Bin Yong et.al. | 2404.06734 | translate | read | null |
| 2024-04-10 | Deep Generative Data Assimilation in Multimodal Setting | Yongquan Qu et.al. | 2404.06665 | translate | read | link |
| 2024-04-09 | GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis | Srikumar Sastry et.al. | 2404.06637 | translate | read | link |
| 2024-04-09 | High Noise Scheduling is a Must | Mahmut S. Gokmen et.al. | 2404.06353 | translate | read | null |
| 2024-04-09 | Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures | Arkaprabha Basu et.al. | 2404.06294 | translate | read | null |
| 2024-04-09 | Hyperparameter-Free Medical Image Synthesis for Sharing Data and Improving Site-Specific Segmentation | Alexander Chebykin et.al. | 2404.06240 | translate | read | link |
| 2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | translate | read | null |
| 2024-04-09 | Greedy-DiM: Greedy Algorithms for Unreasonably Effective Face Morphs | Zander W. Blasingame et.al. | 2404.06025 | translate | read | null |
| 2024-04-09 | Boosting Digital Safeguards: Blending Cryptography and Steganography | Anamitra Maiti et.al. | 2404.05985 | translate | read | null |
| 2024-04-09 | Tackling Structural Hallucination in Image Translation with Local Diffusion | Seunghoi Kim et.al. | 2404.05980 | translate | read | null |
| 2024-04-09 | StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion | Ming Tao et.al. | 2404.05979 | translate | read | link |
| 2024-04-09 | Quantum Generative Adversarial Networks in a Silicon Photonic Chip with Maximum Expressibility | Haoran Ma et.al. | 2404.05921 | translate | read | null |
| 2024-04-08 | SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing | Jing Gu et.al. | 2404.05717 | translate | read | null |
| 2024-04-08 | Learning 3D-Aware GANs from Unposed Images with Template Feature Field | Xinya Chen et.al. | 2404.05705 | translate | read | null |
| 2024-04-08 | SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation | Heyuan Li et.al. | 2404.05680 | translate | read | null |
| 2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | translate | read | null |
| 2024-04-08 | Automatic Controllable Colorization via Imagination | Xiaoyan Cong et.al. | 2404.05661 | translate | read | null |
| 2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | translate | read | null |
| 2024-04-08 | Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI | Hugo Caselles-Dupré et.al. | 2404.05468 | translate | read | null |
| 2024-04-08 | CDAD-Net: Bridging Domain Gaps in Generalized Category Discovery | Sai Bhargav Rongali et.al. | 2404.05366 | translate | read | null |
| 2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | translate | read | null |
| 2024-04-08 | MC $^2$ : Multi-concept Guidance for Customized Multi-concept Generation | Jiaxiu Jiang et.al. | 2404.05268 | translate | read | null |
| 2024-04-04 | No “Zero-Shot” Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Vishaal Udandarao et.al. | 2404.04125 | translate | read | link |
| 2024-04-05 | 3D Facial Expressions through Analysis-by-Neural-Synthesis | George Retsinas et.al. | 2404.04104 | translate | read | null |
| 2024-04-05 | Dynamic Prompt Optimizing for Text-to-Image Generation | Wenyi Mo et.al. | 2404.04095 | translate | read | link |
| 2024-04-05 | Physics-Inspired Synthesized Underwater Image Dataset | Reina Kaneko et.al. | 2404.03998 | translate | read | null |
| 2024-04-05 | Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models | Gihyun Kwon et.al. | 2404.03913 | translate | read | null |
| 2024-04-04 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | translate | read | null |
| 2024-04-04 | CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Dongzhi Jiang et.al. | 2404.03653 | translate | read | link |
| 2024-04-04 | Reference-Based 3D-Aware Image Editing with Triplane | Bahri Batuhan Bilecen et.al. | 2404.03632 | translate | read | null |
| 2024-04-04 | Robust Concept Erasure Using Task Vectors | Minh Pham et.al. | 2404.03631 | translate | read | null |
| 2024-04-04 | Terrain Point Cloud Inpainting via Signal Decomposition | Yizhou Xie et.al. | 2404.03572 | translate | read | null |
| 2024-04-04 | Integrating Generative AI into Financial Market Prediction for Improved Decision Making | Chang Che et.al. | 2404.03523 | translate | read | null |
| 2024-04-04 | Knowledge Distillation-Based Model Extraction Attack using Private Counterfactual Explanations | Fatima Ezzeddine et.al. | 2404.03348 | translate | read | null |
| 2024-04-04 | Multi Positive Contrastive Learning with Pose-Consistent Generated Images | Sho Inayoshi et.al. | 2404.03256 | translate | read | null |
| 2024-04-04 | Would Deep Generative Models Amplify Bias in Future Models? | Tianwei Chen et.al. | 2404.03242 | translate | read | null |
| 2024-04-04 | Diverse and Tailored Image Generation for Zero-shot Multi-label Classification | Kaixin Zhang et.al. | 2404.03144 | translate | read | null |
| 2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905 | translate | read | link |
| 2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899 | translate | read | null |
| 2024-04-03 | On the Scalability of Diffusion-based Text-to-Image Generation | Hao Li et.al. | 2404.02883 | translate | read | null |
| 2024-04-03 | MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation | Petru-Daniel Tudosiu et.al. | 2404.02790 | translate | read | null |
| 2024-04-03 | InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation | Haofan Wang et.al. | 2404.02733 | translate | read | link |
| 2024-04-03 | Model-agnostic Origin Attribution of Generated Images with Few-shot Examples | Fengyuan Liu et.al. | 2404.02697 | translate | read | null |
| 2024-04-03 | Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition | Behrooz Razeghi et.al. | 2404.02696 | translate | read | null |
| 2024-04-03 | Severity Controlled Text-to-Image Generative Model Bias Manipulation | Jordan Vice et.al. | 2404.02530 | translate | read | null |
| 2024-04-03 | Designing a Photonic Physically Unclonable Function Having Resilience to Machine Learning Attacks | Elena R. Henderson et.al. | 2404.02440 | translate | read | null |
| 2024-04-02 | Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models | Zeyu Yang et.al. | 2404.02148 | translate | read | link |
| 2024-04-02 | 3D Congealing: 3D-Aware Image Alignment in the Wild | Yunzhi Zhang et.al. | 2404.02125 | translate | read | null |
| 2024-04-02 | Red-Teaming Segment Anything Model | Krzysztof Jankowski et.al. | 2404.02067 | translate | read | link |
| 2024-04-02 | MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages | Daryna Dementieva et.al. | 2404.02037 | translate | read | null |
| 2024-04-02 | Enhancing Portfolio Optimization with Transformer-GAN Integration: A Novel Approach in the Black-Litterman Framework | Enmin Zhu et.al. | 2404.02029 | translate | read | null |
| 2024-04-02 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959 | translate | read | null |
| 2024-04-02 | Real, fake and synthetic faces – does the coin have three sides? | Shahzeb Naeem et.al. | 2404.01878 | translate | read | null |
| 2024-04-02 | Disentangled Pre-training for Human-Object Interaction Detection | Zhuolong Li et.al. | 2404.01725 | translate | read | null |
| 2024-04-01 | PlayFutures: Imagining Civic Futures with AI and Puppets | Supratim Pait et.al. | 2404.01527 | translate | read | null |
| 2024-04-01 | Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data | Matthias Gerstgrasser et.al. | 2404.01413 | translate | read | null |
| 2024-04-01 | Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting | Haipeng Liu et.al. | 2403.19898 | translate | read | link |
(<a href=../Image_Generation.md>back to Image Generation</a>)