Image Generation - 2024-09
Image Generation - 2024-09
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-09-30 | Inverse Painting: Reconstructing The Painting Process | Bowei Chen et.al. | 2409.20556 | translate | read | null |
| 2024-09-30 | Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images | Bahri Batuhan Bilecen et.al. | 2409.20530 | translate | read | null |
| 2024-09-30 | All-optical autoencoder machine learning framework using diffractive processors | Peijie Feng et.al. | 2409.20346 | translate | read | null |
| 2024-09-30 | Illustrious: an Open Advanced Illustration Model | Sang Hyun Park et.al. | 2409.19946 | translate | read | null |
| 2024-09-30 | MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation | Wenchao Chen et.al. | 2409.19937 | translate | read | null |
| 2024-09-29 | OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines | Daniel Silver et.al. | 2409.19823 | translate | read | null |
| 2024-09-29 | When Molecular GAN Meets Byte-Pair Encoding | Huidong Tang et.al. | 2409.19740 | translate | read | null |
| 2024-09-29 | Simple and Fast Distillation of Diffusion Models | Zhenyu Zhou et.al. | 2409.19681 | translate | read | link |
| 2024-09-29 | Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection | Yuhang Ma et.al. | 2409.19624 | translate | read | null |
| 2024-09-27 | Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis | Songrui Wang et.al. | 2409.18897 | translate | read | null |
| 2024-09-27 | Explainable Artifacts for Synthetic Western Blot Source Attribution | João Phillipe Cardenuto et.al. | 2409.18881 | translate | read | null |
| 2024-09-27 | Simulating Dynamic Tumor Contrast Enhancement in Breast MRI using Conditional Generative Adversarial Networks | Richard Osuala et.al. | 2409.18872 | translate | read | null |
| 2024-09-27 | Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models | Nguyen Gia Bach et.al. | 2409.18476 | translate | read | link |
| 2024-09-27 | Gradient-free Decoder Inversion in Latent Diffusion Models | Seongmin Hong et.al. | 2409.18442 | translate | read | null |
| 2024-09-27 | Adaptive Learning of the Latent Space of Wasserstein Generative Adversarial Networks | Yixuan Qiu et.al. | 2409.18374 | translate | read | null |
| 2024-09-26 | DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning | Hui Lin et.al. | 2409.18340 | translate | read | null |
| 2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | translate | read | link |
| 2024-09-26 | Harnessing Wavelet Transformations for Generalizable Deepfake Forgery Detection | Lalith Bharadwaj Baru et.al. | 2409.18301 | translate | read | link |
| 2024-09-26 | Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study | Qing Lyu et.al. | 2409.18282 | translate | read | null |
| 2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | translate | read | link |
| 2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | translate | read | link |
| 2024-09-26 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | translate | read | null |
| 2024-09-26 | Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Hengrui Gu et.al. | 2409.17928 | translate | read | null |
| 2024-09-26 | Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Qihan Huang et.al. | 2409.17920 | translate | read | link |
| 2024-09-26 | WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians | Dmytro Kotovenko et.al. | 2409.17917 | translate | read | null |
| 2024-09-26 | Text Image Generation for Low-Resource Languages with Dual Translation Learning | Chihiro Noguchi et.al. | 2409.17747 | translate | read | null |
| 2024-09-26 | AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status | Jinghao Zhang et.al. | 2409.17740 | translate | read | null |
| 2024-09-26 | ID $^3$ : Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition | Shen Li et.al. | 2409.17576 | translate | read | null |
| 2024-09-26 | Pixel-Space Post-Training of Latent Diffusion Models | Christina Zhang et.al. | 2409.17565 | translate | read | null |
| 2024-09-25 | GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design | Phillip Mueller et.al. | 2409.17045 | translate | read | null |
| 2024-09-25 | Enhanced Wavelet Scattering Network for image inpainting detection | Barglazan Adrian-Alin et.al. | 2409.17023 | translate | read | null |
| 2024-09-25 | WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks | Alberto Bacchin et.al. | 2409.16999 | translate | read | link |
| 2024-09-25 | Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation | Yulin Wang et.al. | 2409.16818 | translate | read | link |
| 2024-09-25 | Pose-Guided Fine-Grained Sign Language Video Generation | Tongkai Shi et.al. | 2409.16709 | translate | read | null |
| 2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | translate | read | link |
| 2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | translate | read | null |
| 2024-09-25 | ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis | Matthew A. Reyna et.al. | 2409.16612 | translate | read | null |
| 2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | translate | read | link |
| 2024-09-24 | MonoFormer: One Transformer for Both Diffusion and Autoregression | Chuyang Zhao et.al. | 2409.16280 | translate | read | link |
| 2024-09-24 | Label-Augmented Dataset Distillation | Seoungyoon Kang et.al. | 2409.16239 | translate | read | null |
| 2024-09-24 | MaskBit: Embedding-free Image Generation via Bit Tokens | Mark Weber et.al. | 2409.16211 | translate | read | link |
| 2024-09-24 | Machine learning approaches for automatic defect detection in photovoltaic systems | Swayam Rajat Mohanty et.al. | 2409.16069 | translate | read | null |
| 2024-09-24 | Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients | Wanchen Zhao et.al. | 2409.16042 | translate | read | null |
| 2024-09-24 | Deep chroma compression of tone-mapped images | Xenios Milidonis et.al. | 2409.16032 | translate | read | link |
| 2024-09-24 | Improvements to SDXL in NovelAI Diffusion V3 | Juan Ossa et.al. | 2409.15997 | translate | read | null |
| 2024-09-24 | StyleSinger 2: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Yu Zhang et.al. | 2409.15977 | translate | read | link |
| 2024-09-24 | Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI | Liang Zhang et.al. | 2409.15631 | translate | read | null |
| 2024-09-23 | Critic Loss for Image Classification | Brendan Hogan Rappazzo et.al. | 2409.15565 | translate | read | null |
| 2024-09-18 | Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance | Jaehoon Joo et.al. | 2409.12099 | translate | read | null |
| 2024-09-18 | ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation | Peiyu Li et.al. | 2409.12010 | translate | read | link |
| 2024-09-18 | Tracking Any Point with Frame-Event Fusion Network at High Frame Rate | Jiaxiong Liu et.al. | 2409.11953 | translate | read | null |
| 2024-09-18 | Agglomerative Token Clustering | Joakim Bruslund Haurum et.al. | 2409.11923 | translate | read | link |
| 2024-09-18 | Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation | Dimitrios Christodoulou et.al. | 2409.11904 | translate | read | null |
| 2024-09-18 | RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets | Jikai Ye et.al. | 2409.11831 | translate | read | null |
| 2024-09-18 | Latent fingerprint enhancement for accurate minutiae detection | Abdul Wahab et.al. | 2409.11802 | translate | read | null |
| 2024-09-18 | METEOR: Melody-aware Texture-controllable Symbolic Orchestral Music Generation | Dinh-Viet-Toan Le et.al. | 2409.11753 | translate | read | link |
| 2024-09-18 | GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation | Shuowen Liang et.al. | 2409.11689 | translate | read | link |
| 2024-09-17 | Using Physics Informed Generative Adversarial Networks to Model 3D porous media | Zihan Ren et.al. | 2409.11541 | translate | read | null |
| 2024-09-17 | Training Datasets Generation for Machine Learning: Application to Vision Based Navigation | Jérémy Lebreton et.al. | 2409.11383 | translate | read | null |
| 2024-09-17 | Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Gonzalo Martin Garcia et.al. | 2409.11355 | translate | read | link |
| 2024-09-17 | OmniGen: Unified Image Generation | Shitao Xiao et.al. | 2409.11340 | translate | read | link |
| 2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | translate | read | null |
| 2024-09-17 | MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance | Debin Meng et.al. | 2409.11010 | translate | read | link |
| 2024-09-16 | A Missing Data Imputation GAN for Character Sprite Generation | Flávio Coutinho et.al. | 2409.10721 | translate | read | link |
| 2024-09-16 | Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models | Bingchen Liu et.al. | 2409.10695 | translate | read | null |
| 2024-09-16 | Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation | Noah Buchanan et.al. | 2409.10494 | translate | read | null |
| 2024-09-16 | SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing | Qi Qian et.al. | 2409.10476 | translate | read | null |
| 2024-09-16 | Mamba-ST: State Space Model for Efficient Style Transfer | Filippo Botti et.al. | 2409.10385 | translate | read | null |
| 2024-09-16 | Robust image representations with counterfactual contrastive learning | Mélanie Roschewitz et.al. | 2409.10365 | translate | read | link |
| 2024-09-16 | VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation | Aaron Mark Thomas et.al. | 2409.10339 | translate | read | null |
| 2024-09-16 | On Synthetic Texture Datasets: Challenges, Creation, and Curation | Blaine Hoak et.al. | 2409.10297 | translate | read | null |
| 2024-09-16 | MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior | Weijing Tao et.al. | 2409.10090 | translate | read | null |
| 2024-09-16 | Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models | Alexander Koch et.al. | 2409.10089 | translate | read | null |
| 2024-09-16 | 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata et.al. | 2409.09969 | translate | read | link |
| 2024-09-15 | GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion | Vitor Guizilini et.al. | 2409.09896 | translate | read | null |
| 2024-09-13 | InstantDrag: Improving Interactivity in Drag-based Image Editing | Joonghyuk Shin et.al. | 2409.08857 | translate | read | null |
| 2024-09-13 | GroundingBooth: Grounding Text-to-Image Customization | Zhexiao Xiong et.al. | 2409.08520 | translate | read | null |
| 2024-09-13 | Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning | Dixi Yao et.al. | 2409.08503 | translate | read | null |
| 2024-09-13 | Cross-conditioned Diffusion Model for Medical Image to Image Translation | Zhaohu Xing et.al. | 2409.08500 | translate | read | null |
| 2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | translate | read | link |
| 2024-09-12 | Impact of Stain Variation and Color Normalization for Prognostic Predictions in Pathology | Siyu et.al. | 2409.08338 | translate | read | null |
| 2024-09-12 | Click2Mask: Local Editing with Dynamic Mask Generation | Omer Regev et.al. | 2409.08272 | translate | read | link |
| 2024-09-12 | Improving Virtual Try-On with Garment-focused Diffusion Models | Siqi Wan et.al. | 2409.08258 | translate | read | link |
| 2024-09-12 | TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | NaHyeon Park et.al. | 2409.08248 | translate | read | link |
| 2024-09-12 | IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation | Yinwei Wu et.al. | 2409.08240 | translate | read | null |
| 2024-09-12 | High-Frequency Anti-DreamBooth: Robust Defense Against Image Synthesis | Takuto Onikubo et.al. | 2409.08167 | translate | read | null |
| 2024-09-12 | EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance | Zicheng Duan et.al. | 2409.08091 | translate | read | null |
| 2024-09-12 | Scribble-Guided Diffusion for Training-free Text-to-Image Generation | Seonho Lee et.al. | 2409.08026 | translate | read | link |
| 2024-09-12 | FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection | Xinying Lu et.al. | 2409.07839 | translate | read | null |
| 2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | translate | read | link |
| 2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | translate | read | null |
| 2024-09-11 | Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy | Somayeh Pakdelmoez et.al. | 2409.07422 | translate | read | null |
| 2024-09-11 | Some effects of limited wall-sensor availability on flow estimation with 3D-GANs | Antonio Cuéllar et.al. | 2409.07348 | translate | read | null |
| 2024-09-11 | CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals | Weixiang Gao et.al. | 2409.07271 | translate | read | link |
| 2024-09-11 | Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education | Ali Forootani et.al. | 2409.07110 | translate | read | null |
| 2024-09-11 | Fidelity-optimized quantum surface code via GAN decoder and application to quantum teleportation | Jiaxin Li et.al. | 2409.06984 | translate | read | null |
| 2024-09-10 | DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images | Taslim Murad et.al. | 2409.06694 | translate | read | null |
| 2024-09-10 | Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements | Antonio Cuéllar et.al. | 2409.06548 | translate | read | null |
| 2024-09-10 | PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas et.al. | 2409.06535 | translate | read | null |
| 2024-09-10 | DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement | Jia-Wei Liao et.al. | 2409.06355 | translate | read | link |
| 2024-09-10 | Spectral oversubtraction? An approach for speech enhancement after robot ego speech filtering in semi-real-time | Yue Li et.al. | 2409.06274 | translate | read | null |
| 2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | translate | read | link |
| 2024-09-09 | SVS-GAN: Leveraging GANs for Semantic Video Synthesis | Khaled M. Seyam et.al. | 2409.06074 | translate | read | null |
| 2024-09-09 | Statistical Mechanics of Min-Max Problems | Yuma Ichikawa et.al. | 2409.06053 | translate | read | null |
| 2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | translate | read | null |
| 2024-09-09 | Quantum Wasserstein Compilation: Unitary Compilation using the Quantum Earth Mover’s Distance | Marvin Richter et.al. | 2409.05849 | translate | read | null |
| 2024-09-09 | CipherDM: Secure Three-Party Inference for Diffusion Model Sampling | Xin Zhao et.al. | 2409.05414 | translate | read | null |
| 2024-09-09 | Sequential Posterior Sampling with Diffusion Models | Tristan S. W. Stevens et.al. | 2409.05399 | translate | read | null |
| 2024-09-09 | Decoupling Contact for Fine-Grained Motion Style Transfer | Xiangjun Tang et.al. | 2409.05387 | translate | read | null |
| 2024-09-09 | TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors | Yichuan Mo et.al. | 2409.05294 | translate | read | null |
| 2024-09-09 | Disentangled Representations for Short-Term and Long-Term Person Re-Identification | Chanho Eom et.al. | 2409.05277 | translate | read | null |
| 2024-09-09 | MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference | Jiancheng Huang et.al. | 2409.05250 | translate | read | null |
| 2024-09-08 | Can OOD Object Detectors Learn from Foundation Models? | Jiahui Liu et.al. | 2409.05162 | translate | read | link |
| 2024-09-08 | Physics-augmented Deep Learning with Adversarial Domain Adaptation: Applications to STM Image Denoising | Jianxin Xie et.al. | 2409.05118 | translate | read | null |
| 2024-09-07 | Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Jiaxin Cheng et.al. | 2409.04847 | translate | read | link |
| 2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | translate | read | link |
| 2024-09-06 | Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Zhuoyan Luo et.al. | 2409.04410 | translate | read | link |
| 2024-09-06 | How Fair is Your Diffusion Recommender Model? | Daniele Malitesta et.al. | 2409.04339 | translate | read | null |
| 2024-09-06 | Secure Traffic Sign Recognition: An Attention-Enabled Universal Image Inpainting Mechanism against Light Patch Attacks | Hangcheng Cao et.al. | 2409.04133 | translate | read | null |
| 2024-09-06 | Bi-modality Images Transfer with a Discrete Process Matching Method | Zhe Xiong et.al. | 2409.03977 | translate | read | null |
| 2024-09-05 | Generating High Dimensional User-Specific Wireless Channels using Diffusion Models | Taekyun Lee et.al. | 2409.03924 | translate | read | null |
| 2024-09-05 | ArtiFade: Learning to Generate High-quality Subject from Blemished Images | Shuya Yang et.al. | 2409.03745 | translate | read | null |
| 2024-09-05 | Unsupervised Anomaly Detection and Localization with Generative Adversarial Networks | Khouloud Abdelli et.al. | 2409.03657 | translate | read | null |
| 2024-09-05 | RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images | Benzhi Wang et.al. | 2409.03644 | translate | read | null |
| 2024-09-05 | VFLGAN-TS: Vertical Federated Learning-based Generative Adversarial Networks for Publication of Vertically Partitioned Time-Series Data | Xun Yuan et.al. | 2409.03612 | translate | read | null |
| 2024-09-05 | TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Bernardo Biesseck et.al. | 2409.03600 | translate | read | link |
| 2024-09-05 | Blended Latent Diffusion under Attention Control for Real-World Video Editing | Deyin Liu et.al. | 2409.03514 | translate | read | null |
| 2024-09-05 | Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks | Akshay Jain et.al. | 2409.03458 | translate | read | link |
| 2024-09-05 | Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities | Wei Lu et.al. | 2409.03444 | translate | read | link |
| 2024-09-05 | RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning | Lawrence Yunliang Chen et.al. | 2409.03403 | translate | read | null |
| 2024-09-05 | Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images | Shaohua You et.al. | 2409.03265 | translate | read | null |
| 2024-09-04 | HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu et.al. | 2409.02919 | translate | read | link |
| 2024-09-04 | Independence Constrained Disentangled Representation Learning from Epistemological Perspective | Ruoyu Wang et.al. | 2409.02672 | translate | read | null |
| 2024-09-04 | Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo et.al. | 2409.02653 | translate | read | null |
| 2024-09-04 | StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Wen Li et.al. | 2409.02543 | translate | read | link |
| 2024-09-04 | A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu et.al. | 2409.02497 | translate | read | null |
| 2024-09-04 | Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis | Aishwarya Agarwal et.al. | 2409.02429 | translate | read | null |
| 2024-09-04 | Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing | Siyi Chen et.al. | 2409.02374 | translate | read | link |
| 2024-09-03 | QID $^2$ : An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data | Zijian Chen et.al. | 2409.02309 | translate | read | null |
| 2024-09-03 | FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation | Takuhiro Kaneko et.al. | 2409.02245 | translate | read | null |
| 2024-09-03 | LSTM-QGAN: Scalable NISQ Generative Adversarial Network | Cheng Chu et.al. | 2409.02212 | translate | read | null |
| 2024-09-02 | Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis | Theodoros Kouzelis et.al. | 2408.16845 | translate | read | null |
(<a href=../Image_Generation.md>back to Image Generation</a>)