Image Generation - 2024-07
Image Generation - 2024-07
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-07-31 | Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Yuxin Wen et.al. | 2407.21720 | translate | read | null |
| 2024-07-31 | Fine-gained Zero-shot Video Sampling | Dengsheng Chen et.al. | 2407.21475 | translate | read | null |
| 2024-07-31 | Deformable 3D Shape Diffusion Model | Dengsheng Chen et.al. | 2407.21428 | translate | read | null |
| 2024-07-31 | Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging | Wenhua Wu et.al. | 2407.21381 | translate | read | null |
| 2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | translate | read | null |
| 2024-07-30 | Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models | Jack He et.al. | 2407.21159 | translate | read | null |
| 2024-07-30 | Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | Yunfeng Diao et.al. | 2407.20836 | translate | read | null |
| 2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | translate | read | null |
| 2024-07-30 | DocXPand-25k: a large and diverse benchmark dataset for identity documents analysis | Julien Lerouge et.al. | 2407.20662 | translate | read | link |
| 2024-07-30 | Autonomous Improvement of Instruction Following Skills via Foundation Models | Zhiyuan Zhou et.al. | 2407.20635 | translate | read | null |
| 2024-07-30 | Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image | Yi Gu et.al. | 2407.20495 | translate | read | null |
| 2024-07-29 | Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Lorenzo Baraldi et.al. | 2407.20337 | translate | read | link |
| 2024-07-29 | LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework | Zhenqi He et.al. | 2407.20172 | translate | read | link |
| 2024-07-29 | MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Walid Bousselham et.al. | 2407.20034 | translate | read | null |
| 2024-07-29 | ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning | Delyan Boychev et.al. | 2407.20020 | translate | read | link |
| 2024-07-29 | Reproducibility Study of “ITI-GEN: Inclusive Text-to-Image Generation” | Daniel Gallo Fernández et.al. | 2407.19996 | translate | read | null |
| 2024-07-29 | From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks | Jacob Sam et.al. | 2407.19970 | translate | read | null |
| 2024-07-29 | Synthetic Thermal and RGB Videos for Automatic Pain Assessment utilizing a Vision-MLP Architecture | Stefanos Gkikas et.al. | 2407.19811 | translate | read | null |
| 2024-07-28 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | translate | read | null |
| 2024-07-28 | Deep Generative Models-Assisted Automated Labeling for Electron Microscopy Images Segmentation | Wenhao Yuan et.al. | 2407.19544 | translate | read | null |
| 2024-07-28 | VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Hanjun Luo et.al. | 2407.19524 | translate | read | null |
| 2024-07-28 | MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability | Buyu Liu et.al. | 2407.19468 | translate | read | link |
| 2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | translate | read | null |
| 2024-07-26 | Generative Adversarial Networks for Imputing Sparse Learning Performance | Liang Zhang et.al. | 2407.18875 | translate | read | null |
| 2024-07-26 | Adversarial Robustification via Text-to-Image Diffusion Models | Daewon Choi et.al. | 2407.18658 | translate | read | link |
| 2024-07-26 | Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Bo Wang et.al. | 2407.18625 | translate | read | null |
| 2024-07-26 | Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks | Mahmoud Salhab et.al. | 2407.18571 | translate | read | null |
| 2024-07-26 | Machine Unlearning using a Multi-GAN based Model | Amartya Hatua et.al. | 2407.18467 | translate | read | null |
| 2024-07-25 | Generative AI like ChatGPT in Blockchain Federated Learning: use cases, opportunities and future | Sai Puppala et.al. | 2407.18358 | translate | read | null |
| 2024-07-25 | AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Junho Park et.al. | 2407.18034 | translate | read | null |
| 2024-07-25 | Guided Latent Slot Diffusion for Object-Centric Learning | Krishnakant Singh et.al. | 2407.17929 | translate | read | null |
| 2024-07-25 | ReCorD: Reasoning and Correcting Diffusion for HOI Generation | Jian-Yu Jiang-Lin et.al. | 2407.17911 | translate | read | link |
| 2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | translate | read | null |
| 2024-07-25 | Enhancing Eye Disease Diagnosis with Deep Learning and Synthetic Data Augmentation | Saideep Kilaru et.al. | 2407.17755 | translate | read | null |
| 2024-07-24 | Synthetic High-resolution Cryo-EM Density Maps with Generative Adversarial Networks | Chenwei Zhang et.al. | 2407.17674 | translate | read | link |
| 2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | translate | read | null |
| 2024-07-24 | ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi et.al. | 2407.17365 | translate | read | null |
| 2024-07-24 | DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation | Qian Feng et.al. | 2407.17348 | translate | read | null |
| 2024-07-25 | LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model | Wanggong Yang et.al. | 2407.17229 | translate | read | null |
| 2024-07-24 | MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models | Chunsan Hong et.al. | 2407.17095 | translate | read | null |
| 2024-07-24 | Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model | Lirui Zhao et.al. | 2407.16982 | translate | read | link |
| 2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | translate | read | null |
| 2024-07-24 | An Adaptive Gradient Regularization Method | Huixiu Jiang et.al. | 2407.16944 | translate | read | null |
| 2024-07-24 | McGAN: Generating Manufacturable Designs by Embedding Manufacturing Rules into Conditional Generative Adversarial Network | Zhichao Wang et.al. | 2407.16943 | translate | read | null |
| 2024-07-24 | Synthetic Trajectory Generation Through Convolutional Neural Networks | Jesse Merhi et.al. | 2407.16938 | translate | read | link |
| 2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | translate | read | link |
| 2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | translate | read | link |
| 2024-07-23 | CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction | Liang Zhao et.al. | 2407.16204 | translate | read | null |
| 2024-07-23 | MxT: Mamba x Transformer for Image Inpainting | Shuang Chen et.al. | 2407.16126 | translate | read | null |
| 2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | translate | read | link |
| 2024-07-22 | FDWST: Fingerphoto Deblurring using Wavelet Style Transfer | David Keaton et.al. | 2407.15964 | translate | read | null |
| 2024-07-22 | Semantics Guided Disentangled GAN for Chest X-ray Image Rib Segmentation | Lili Huang et.al. | 2407.15903 | translate | read | null |
| 2024-07-22 | DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Zhi Hao Luo et.al. | 2407.15723 | translate | read | link |
| 2024-07-22 | SETTP: Style Extraction and Tunable Inference via Dual-level Transferable Prompt Learning | Chunzhen Jin et.al. | 2407.15556 | translate | read | null |
| 2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | translate | read | null |
| 2024-07-22 | TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping | Despina Konstantinidou et.al. | 2407.15500 | translate | read | null |
| 2024-07-22 | DiffX: Guide Your Layout to Cross-Modal Generative Modeling | Zeyu Wang et.al. | 2407.15488 | translate | read | link |
| 2024-07-22 | Text2Place: Affordance-aware Text Guided Human Placement | Rishubh Parihar et.al. | 2407.15446 | translate | read | null |
| 2024-07-22 | X-Recon: Learning-based Patient-specific High-Resolution CT Reconstruction from Orthogonal X-Ray Images | Yunpeng Wang et.al. | 2407.15356 | translate | read | link |
| 2024-07-21 | MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI | Malek Ben Alaya et.al. | 2407.15270 | translate | read | null |
| 2024-07-21 | BIGbench: A Unified Benchmark for Social Bias in Text-to-Image Generative Models Based on Multi-modal LLM | Hanjun Luo et.al. | 2407.15240 | translate | read | null |
| 2024-07-21 | Variational Potential Flow: A Novel Probabilistic Framework for Energy-Based Generative Modelling | Junn Yong Loo et.al. | 2407.15238 | translate | read | null |
| 2024-07-19 | Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models | Hyun-Jic Oh et.al. | 2407.14426 | translate | read | null |
| 2024-07-19 | Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations | Decheng Liu et.al. | 2407.14367 | translate | read | null |
| 2024-07-19 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | Kun Zhao et.al. | 2407.14326 | translate | read | null |
| 2024-07-19 | Zero-Shot Underwater Gesture Recognition | Sandipan Sarma et.al. | 2407.14103 | translate | read | link |
| 2024-07-19 | Time Series Generative Learning with Application to Brain Imaging Analysis | Zhenghao Li et.al. | 2407.14003 | translate | read | null |
| 2024-07-18 | BRSR-OpGAN: Blind Radar Signal Restoration using Operational Generative Adversarial Network | Muhammad Uzair Zahid et.al. | 2407.13949 | translate | read | null |
| 2024-07-18 | A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks | Yixiang Qiu et.al. | 2407.13863 | translate | read | link |
| 2024-07-18 | HPix: Generating Vector Maps from Satellite Images | Aditya Taparia et.al. | 2407.13680 | translate | read | link |
| 2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | translate | read | null |
| 2024-07-18 | Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jiaqi Liu et.al. | 2407.13609 | translate | read | link |
| 2024-07-18 | Reducing Barriers to the Use of Marginalised Music Genres in AI | Nick Bryan-Kinns et.al. | 2407.13439 | translate | read | null |
| 2024-07-18 | URCDM: Ultra-Resolution Image Synthesis in Histopathology | Sarah Cechnicka et.al. | 2407.13277 | translate | read | null |
| 2024-07-18 | Motif-Consistent Counterfactuals with Adversarial Refinement for Graph-Level Anomaly Detection | Chunjing Xiao et.al. | 2407.13251 | translate | read | null |
| 2024-07-18 | Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking | Zhiyuan Ma et.al. | 2407.13188 | translate | read | null |
| 2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | translate | read | null |
| 2024-07-17 | From Principles to Practices: Lessons Learned from Applying Partnership on AI’s (PAI) Synthetic Media Framework to 11 Use Cases | Claire R. Leibowicz et.al. | 2407.13025 | translate | read | null |
| 2024-07-17 | Denoising Diffusions in Latent Space for Medical Image Segmentation | Fahim Ahmed Zaman et.al. | 2407.12952 | translate | read | null |
| 2024-07-17 | IMAGDressing-v1: Customizable Virtual Dressing | Fei Shen et.al. | 2407.12705 | translate | read | link |
| 2024-07-17 | Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs | Yiqing Shen et.al. | 2407.12678 | translate | read | null |
| 2024-07-17 | Enhancing the Utility of Privacy-Preserving Cancer Classification using Synthetic Data | Richard Osuala et.al. | 2407.12669 | translate | read | null |
| 2024-07-17 | Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon et.al. | 2407.12642 | translate | read | null |
| 2024-07-17 | Towards Understanding Unsafe Video Generation | Yan Pang et.al. | 2407.12581 | translate | read | link |
| 2024-07-17 | The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | Yi Yao et.al. | 2407.12579 | translate | read | null |
| 2024-07-17 | I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps | Junseo Park et.al. | 2407.12331 | translate | read | null |
| 2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | translate | read | null |
| 2024-07-16 | Towards Dataset-scale and Feature-oriented Evaluation of Text Summarization in Large Language Model Prompts | Sam Yu-Te Lee et.al. | 2407.12192 | translate | read | null |
| 2024-07-16 | Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis | Haeil Lee et.al. | 2407.12173 | translate | read | null |
| 2024-07-16 | Efficient Training with Denoised Neural Weights | Yifan Gong et.al. | 2407.11966 | translate | read | null |
| 2024-07-16 | DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition | Amr Ghoneim et.al. | 2407.11890 | translate | read | null |
| 2024-07-16 | Novel Hybrid Integrated Pix2Pix and WGAN Model with Gradient Penalty for Binary Images Denoising | Luca Tirel et.al. | 2407.11865 | translate | read | null |
| 2024-07-16 | Cycle Contrastive Adversarial Learning for Unsupervised image Deraining | Chen Zhao et.al. | 2407.11750 | translate | read | null |
| 2024-07-16 | Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model | Dominik Winter et.al. | 2407.11664 | translate | read | null |
| 2024-07-16 | Scaling Diffusion Transformers to 16 Billion Parameters | Zhengcong Fei et.al. | 2407.11633 | translate | read | link |
| 2024-07-16 | DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training | Guillermo Jimenez-Perez et.al. | 2407.11594 | translate | read | null |
| 2024-07-16 | How Control Information Influences Multilingual Text Image Generation and Editing? | Boqiang Zhang et.al. | 2407.11502 | translate | read | null |
| 2024-07-16 | Diff-MTS: Temporal-Augmented Conditional Diffusion-based AIGC for Industrial Time Series Towards the Large Model Era | Lei Ren et.al. | 2407.11501 | translate | read | null |
| 2024-07-16 | AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models | Lei Ren et.al. | 2407.11480 | translate | read | null |
| 2024-07-15 | OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting | Penglei Gao et.al. | 2407.10923 | translate | read | null |
| 2024-07-15 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | translate | read | link |
| 2024-07-15 | Optical Diffusion Models for Image Generation | Ilker Oguz et.al. | 2407.10897 | translate | read | null |
| 2024-07-15 | Leveraging Multimodal CycleGAN for the Generation of Anatomically Accurate Synthetic CT Scans from MRIs | Leonardo Crespi et.al. | 2407.10888 | translate | read | null |
| 2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | translate | read | null |
| 2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | translate | read | null |
| 2024-07-15 | An Autonomous Drone Swarm for Detecting and Tracking Anomalies among Dense Vegetation | Rakesh John Amala Arokia Nathan et.al. | 2407.10754 | translate | read | null |
| 2024-07-15 | AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Zhihang Lin et.al. | 2407.10738 | translate | read | link |
| 2024-07-15 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | translate | read | null |
| 2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | translate | read | null |
| 2024-07-12 | StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Sahil Jain et.al. | 2407.09473 | translate | read | null |
| 2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | translate | read | null |
| 2024-07-12 | PID: Physics-Informed Diffusion Model for Infrared Image Generation | Fangyuan Mao et.al. | 2407.09299 | translate | read | link |
| 2024-07-12 | Region Attention Transformer for Medical Image Restoration | Zhiwen Yang et.al. | 2407.09268 | translate | read | link |
| 2024-07-12 | Surgical Text-to-Image Generation | Chinedu Innocent Nwoye et.al. | 2407.09230 | translate | read | null |
| 2024-07-12 | DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training | Chen Xin et.al. | 2407.09174 | translate | read | link |
| 2024-07-12 | Machine Apophenia: The Kaleidoscopic Generation of Architectural Images | Alexey Tikhonov et.al. | 2407.09172 | translate | read | null |
| 2024-07-12 | LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models | Yabin Zhang et.al. | 2407.08966 | translate | read | link |
| 2024-07-11 | Diff-MST: Differentiable Mixing Style Transfer | Soumya Sai Vanka et.al. | 2407.08889 | translate | read | null |
| 2024-07-11 | A Hybrid Spiking-Convolutional Neural Network Approach for Advancing Machine Learning Models | Sanaullah et.al. | 2407.08861 | translate | read | null |
| 2024-07-11 | SEED-Story: Multimodal Long Story Generation with Large Language Model | Shuai Yang et.al. | 2407.08683 | translate | read | link |
| 2024-07-11 | CAD-Prompted Generative Models: A Pathway to Feasible and Novel Engineering Designs | Leah Chong et.al. | 2407.08675 | translate | read | null |
| 2024-07-11 | Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans | Edward Wang et.al. | 2407.08650 | translate | read | link |
| 2024-07-11 | Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration | Shuang Xu et.al. | 2407.08509 | translate | read | null |
| 2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | translate | read | null |
| 2024-07-11 | GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views | Vinayak Gupta et.al. | 2407.08221 | translate | read | null |
| 2024-07-11 | Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Qin Lei et.al. | 2407.08209 | translate | read | link |
| 2024-07-11 | fairBERTs: Erasing Sensitive Information Through Semantic and Fairness-aware Perturbations | Jinfeng Li et.al. | 2407.08189 | translate | read | null |
| 2024-07-11 | Synthetic Electroretinogram Signal Generation Using Conditional Generative Adversarial Network for Enhancing Classification of Autism Spectrum Disorder | Mikhail Kulyabin et.al. | 2407.08166 | translate | read | null |
| 2024-07-10 | NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving | Donghyun Kim et.al. | 2407.08073 | translate | read | null |
| 2024-07-10 | Generative Image as Action Models | Mohit Shridhar et.al. | 2407.07875 | translate | read | link |
| 2024-07-10 | StoryDiffusion: How to Support UX Storyboarding With Generative-AI | Zhaohui Liang et.al. | 2407.07672 | translate | read | null |
| 2024-07-10 | Boosting Medical Image Synthesis via Registration-guided Consistency and Disentanglement Learning | Chuanpu Li et.al. | 2407.07660 | translate | read | null |
| 2024-07-11 | MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis | Wanggui He et.al. | 2407.07614 | translate | read | link |
| 2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | translate | read | null |
| 2024-07-10 | Federated PCA on Grassmann Manifold for IoT Anomaly Detection | Tung-Anh Nguyen et.al. | 2407.07421 | translate | read | link |
| 2024-07-10 | Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis | Jian-Qing Zheng et.al. | 2407.07295 | translate | read | link |
| 2024-07-10 | HoneyGAN Pots: A Deep Learning Approach for Generating Honeypots | Ryan Gabrys et.al. | 2407.07292 | translate | read | null |
| 2024-07-09 | Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion | Yu Cao et.al. | 2407.07249 | translate | read | null |
| 2024-07-09 | Accelerating Mobile Edge Generation (MEG) by Constrained Learning | Xiaoxia Xu et.al. | 2407.07245 | translate | read | null |
| 2024-07-09 | ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Shaozhe Hao et.al. | 2407.07077 | translate | read | link |
| 2024-07-09 | Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation | Filipe Lauar et.al. | 2407.06950 | translate | read | link |
| 2024-07-09 | HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Guian Fang et.al. | 2407.06937 | translate | read | link |
| 2024-07-09 | Towards Physics-informed Cyclic Adversarial Multi-PSF Lensless Imaging | Abeer Banerjee et.al. | 2407.06727 | translate | read | null |
| 2024-07-09 | Deep-Motion-Net: GNN-based volumetric organ shape reconstruction from single-view 2D projections | Isuru Wijesinghe et.al. | 2407.06692 | translate | read | null |
| 2024-07-09 | Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Fanyue Wei et.al. | 2407.06642 | translate | read | link |
| 2024-07-09 | Attack GAN (AGAN ): A new Security Evaluation Tool for Perceptual Encryption | Umesh Kashyap et.al. | 2407.06570 | translate | read | null |
| 2024-07-09 | DriftGAN: Using historical data for Unsupervised Recurring Drift Detection | Christofer Fellicious et.al. | 2407.06543 | translate | read | null |
| 2024-07-09 | Sketch-Guided Scene Image Generation | Tianyu Zhang et.al. | 2407.06469 | translate | read | null |
| 2024-07-08 | FairDiff: Fair Segmentation with Point-Image Diffusion | Wenyi Li et.al. | 2407.06250 | translate | read | null |
| 2024-07-08 | Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images | Zhangyang Qi et.al. | 2407.06191 | translate | read | null |
| 2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187 | translate | read | null |
| 2024-07-08 | The Tug-of-War Between Deepfake Generation and Detection | Hannah Lee et.al. | 2407.06174 | translate | read | null |
| 2024-07-08 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | translate | read | link |
| 2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | translate | read | null |
| 2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | translate | read | null |
| 2024-07-08 | MMIS: Multimodal Dataset for Interior Scene Visual Generation and Recognition | Hozaifa Kassab et.al. | 2407.05980 | translate | read | null |
| 2024-07-08 | Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling | Lintao Zhang et.al. | 2407.05875 | translate | read | link |
| 2024-07-08 | 3D Vessel Graph Generation Using Denoising Diffusion | Chinmay Prabhakar et.al. | 2407.05842 | translate | read | link |
| 2024-07-08 | MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices | Jianwen Jiang et.al. | 2407.05712 | translate | read | null |
| 2024-07-05 | Smell and Emotion: Recognising emotions in smell-related artworks | Vishal Patoliya et.al. | 2407.04592 | translate | read | null |
| 2024-07-05 | FA-GAN: Artifacts-free and Phase-aware High-fidelity GAN-based Vocoder | Rubing Shen et.al. | 2407.04575 | translate | read | null |
| 2024-07-05 | PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation | Yinghua Yao et.al. | 2407.04493 | translate | read | null |
| 2024-07-05 | Efficient GANs for Document Image Binarization Based on DWT and Normalization | Rui-Yang Ju et.al. | 2407.04231 | translate | read | link |
| 2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | translate | read | link |
| 2024-07-04 | Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection | Federico Girella et.al. | 2407.03961 | translate | read | link |
| 2024-07-04 | DiCTI: Diffusion-based Clothing Designer via Text-guided Input | Ajda Lampe et.al. | 2407.03901 | translate | read | null |
| 2024-07-04 | Deep learning architectures for data-driven damage detection in nonlinear dynamic systems | Harrish Joseph et.al. | 2407.03700 | translate | read | null |
| 2024-07-04 | Generative Technology for Human Emotion Recognition: A Scope Review | Fei Ma et.al. | 2407.03640 | translate | read | null |
| 2024-07-04 | Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations | Zhiyang Xu et.al. | 2407.03604 | translate | read | null |
| 2024-07-03 | BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations | Zhantao Yang et.al. | 2407.03314 | translate | read | null |
| 2024-07-03 | DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Yilun Xu et.al. | 2407.03300 | translate | read | link |
| 2024-07-03 | Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios | Patricia A. Apellániz et.al. | 2407.03080 | translate | read | null |
| 2024-07-03 | Towards High Resolution Real-Time Optical Flow Particle Image Velocimetry | Juan Pimienta et.al. | 2407.03057 | translate | read | null |
| 2024-07-03 | An Organism Starts with a Single Pix-Cell: A Neural Cellular Diffusion for High-Resolution Image Synthesis | Marawan Elbatel et.al. | 2407.03018 | translate | read | null |
| 2024-07-03 | Representation learning with CGAN for casual inference | Zhaotian Weng et.al. | 2407.02825 | translate | read | null |
| 2024-07-03 | Mobile Edge Generation-Enabled Digital Twin: Architecture Design and Research Opportunities | Xiaoxia Xu et.al. | 2407.02804 | translate | read | null |
| 2024-07-02 | Change My Frame: Reframing in the Wild in r/ChangeMyView | Arturo Martínez Peguero et.al. | 2407.02637 | translate | read | null |
| 2024-07-02 | Diffusion Models for Tabular Data Imputation and Synthetic Data Generation | Mario Villaizán-Vallelado et.al. | 2407.02549 | translate | read | null |
| 2024-07-02 | A Pattern Language for Machine Learning Tasks | Benjamin Rodatz et.al. | 2407.02424 | translate | read | null |
| 2024-07-02 | MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Dewei Zhou et.al. | 2407.02329 | translate | read | link |
| 2024-07-02 | UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks | Jingjing Ren et.al. | 2407.02158 | translate | read | null |
| 2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | translate | read | null |
| 2024-07-02 | Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics | Syu-Siang Wang et.al. | 2407.01939 | translate | read | null |
| 2024-07-02 | Enhancing Multi-Class Anomaly Detection via Diffusion Refinement with Dual Conditioning | Jiawei Zhan et.al. | 2407.01905 | translate | read | null |
| 2024-07-01 | Purple-teaming LLMs with Adversarial Defender Training | Jingyan Zhou et.al. | 2407.01850 | translate | read | null |
| 2024-07-01 | Label-free Neural Semantic Image Synthesis | Jiayi Wang et.al. | 2407.01790 | translate | read | null |
| 2024-07-01 | Universal Quantum Tomography With Deep Neural Networks | Nhan T. Luu et.al. | 2407.01734 | translate | read | null |
| 2024-07-01 | Scalable Nested Optimization for Deep Learning | Jonathan Lorraine et.al. | 2407.01526 | translate | read | null |
(<a href=../Image_Generation.md>back to Image Generation</a>)