Classification - 2025-03
Classification - 2025-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2025-03-31 | NoProp: Training Neural Networks without Back-propagation or Forward-propagation | Qinyu Li et.al. | 2503.24322 | translate | read | null |
| 2025-03-31 | CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | Yingrui Ji et.al. | 2503.24182 | translate | read | null |
| 2025-03-31 | PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization | Alexis Guichemerre et.al. | 2503.24135 | translate | read | link |
| 2025-03-31 | Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | Chenqi Guo et.al. | 2503.24017 | translate | read | null |
| 2025-03-31 | FlexiMo: A Flexible Remote Sensing Foundation Model | Xuyang Li et.al. | 2503.23844 | translate | read | null |
| 2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | translate | read | link |
| 2025-03-31 | WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation | Zhengyi Zhao et.al. | 2503.23673 | translate | read | null |
| 2025-03-30 | Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Guandong Li et.al. | 2503.23472 | translate | read | null |
| 2025-03-30 | KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters | Haiduo Huang et.al. | 2503.23379 | translate | read | link |
| 2025-03-29 | Optimizing Distributed Training Approaches for Scaling Neural Networks | Vishnu Vardhan Baligodugula et.al. | 2503.23186 | translate | read | null |
| 2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205 | translate | read | link |
| 2025-03-28 | Route-and-Aggregate Decentralized Federated Learning Under Communication Errors | Weicai Li et.al. | 2503.22186 | translate | read | null |
| 2025-03-27 | On Large Multimodal Models as Open-World Image Classifiers | Alessandro Conti et.al. | 2503.21851 | translate | read | link |
| 2025-03-27 | Bayesian Pseudo Posterior Mechanism for Differentially Private Machine Learning | Robert Chew et.al. | 2503.21528 | translate | read | null |
| 2025-03-27 | Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures | Deependra Singh et.al. | 2503.21465 | translate | read | link |
| 2025-03-27 | Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records | Noah Losch et.al. | 2503.21349 | translate | read | null |
| 2025-03-27 | Improving $(α, f)$ -Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance | Mario García-Márquez et.al. | 2503.21244 | translate | read | link |
| 2025-03-27 | Neural Architecture Search by Learning a Hierarchical Search Space | Mehraveh Javan Roshtkhari et.al. | 2503.21061 | translate | read | null |
| 2025-03-26 | TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models | Caspar Meijer et.al. | 2503.20952 | translate | read | link |
| 2025-03-26 | VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers | Ching-Yao Chen et.al. | 2503.20246 | translate | read | null |
| 2025-03-26 | BeLightRec: A lightweight recommender system enhanced with BERT | Manh Mai Van et.al. | 2503.20206 | translate | read | null |
| 2025-03-25 | Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders | Paul Koch et.al. | 2503.19947 | translate | read | null |
| 2025-03-25 | Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Daniel G. P. Petrini et.al. | 2503.19945 | translate | read | link |
| 2025-03-25 | Extensions of regret-minimization algorithm for optimal design | Youguang Chen et.al. | 2503.19874 | translate | read | null |
| 2025-03-25 | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | Suhas G Hegde et.al. | 2503.19530 | translate | read | null |
| 2025-03-25 | LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Weizhi Chen et.al. | 2503.19311 | translate | read | link |
| 2025-03-25 | Face Spoofing Detection using Deep Learning | Najeebullah et.al. | 2503.19223 | translate | read | link |
| 2025-03-24 | Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | DeShin Hwa et.al. | 2503.18862 | translate | read | null |
| 2025-03-24 | Latent Space Class Dispersion: Effective Test Data Quality Assessment for DNNs | Vivek Vekariya et.al. | 2503.18799 | translate | read | null |
| 2025-03-24 | Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks | Nina Shvetsova et.al. | 2503.18637 | translate | read | null |
| 2025-03-24 | Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Zequn Zeng et.al. | 2503.18483 | translate | read | null |
| 2025-03-24 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Junsong Li et.al. | 2503.18432 | translate | read | null |
| 2025-03-24 | Sun-Shine: A Large Language Model for Tibetan Culture | Cheng Huang et.al. | 2503.18288 | translate | read | null |
| 2025-03-23 | Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry | Chi-Ning Chou et.al. | 2503.18114 | translate | read | null |
| 2025-03-23 | What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images | Dongheng Lin et.al. | 2503.17899 | translate | read | null |
| 2025-03-21 | Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis | Gary Y. Li et.al. | 2503.17475 | translate | read | null |
| 2025-03-21 | Leveraging Text-to-Image Generation for Handling Spurious Correlation | Aryan Yazdan Parast et.al. | 2503.17226 | translate | read | null |
| 2025-03-21 | CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images | Tonmoy Hossain ana Miaomiao Zhang et.al. | 2503.17162 | translate | read | null |
| 2025-03-21 | Beyond Accuracy: What Matters in Designing Well-Behaved Models? | Robin Hesse et.al. | 2503.17110 | translate | read | null |
| 2025-03-21 | Symbolic Audio Classification via Modal Decision Tree Learning | Enrico Marzano et.al. | 2503.17018 | translate | read | null |
| 2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975 | translate | read | link |
| 2025-03-21 | City2Scene: Improving Acoustic Scene Classification with City Features | Yiqiang Cai et.al. | 2503.16862 | translate | read | null |
| 2025-03-20 | MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification | Moshiur Rahman Tonmoy et.al. | 2503.16628 | translate | read | null |
| 2025-03-20 | PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Sharon Peled et.al. | 2503.16284 | translate | read | link |
| 2025-03-20 | CLS-RL: Image Classification with Rule-Based Reinforcement Learning | Ming Li et.al. | 2503.16188 | translate | read | link |
| 2025-03-20 | Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models | Mario Sanz-Guerrero et.al. | 2503.16022 | translate | read | link |
| 2025-03-20 | Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation | Clive Tinashe Marimo et.al. | 2503.15969 | translate | read | link |
| 2025-03-19 | Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification | Yuqing Zhang et.al. | 2503.15731 | translate | read | null |
| 2025-03-20 | Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification | ZhengLin Lai et.al. | 2503.15469 | translate | read | link |
| 2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | translate | read | null |
| 2025-03-19 | Efficient allocation of image recognition and LLM tasks on multi-GPU system | Marcin Lawenda et.al. | 2503.15252 | translate | read | null |
| 2025-03-19 | Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks | Yuting Guo et.al. | 2503.15169 | translate | read | null |
| 2025-03-19 | ARC: Anchored Representation Clouds for High-Resolution INR Classification | Joost Luijmes et.al. | 2503.15156 | translate | read | null |
| 2025-03-19 | Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models | Tingxiu Chen et.al. | 2503.14966 | translate | read | null |
| 2025-03-19 | Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification | Zhong Ji et.al. | 2503.14938 | translate | read | null |
| 2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | translate | read | null |
| 2025-03-18 | LipShiFT: A Certifiably Robust Shift-based Vision Transformer | Rohan Menon et.al. | 2503.14751 | translate | read | null |
| 2025-03-18 | Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | Gihan Jayatilaka et.al. | 2503.14500 | translate | read | null |
| 2025-03-17 | Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition | Atharva Agashe et.al. | 2503.13763 | translate | read | null |
| 2025-03-17 | Micro Text Classification Based on Balanced Positive-Unlabeled Learning | Lin-Han Jia et.al. | 2503.13562 | translate | read | null |
| 2025-03-17 | Escaping Plato’s Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes | Nhi Pham et.al. | 2503.13429 | translate | read | link |
| 2025-03-17 | Do Vision Models Develop Human-Like Progressive Difficulty Understanding? | Zeyi Huang et.al. | 2503.13058 | translate | read | null |
| 2025-03-16 | Domain Generalization for Improved Human Activity Recognition in Office Space Videos Using Adaptive Pre-processing | Partho Ghosh et.al. | 2503.12678 | translate | read | null |
| 2025-03-16 | Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance | Anthony Lamelas et.al. | 2503.12617 | translate | read | null |
| 2025-03-16 | Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy | Jian-Ping Mei et.al. | 2503.12497 | translate | read | null |
| 2025-03-16 | GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Zilun Zhang et.al. | 2503.12490 | translate | read | null |
| 2025-03-16 | Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation | Edgar Heinert et.al. | 2503.12453 | translate | read | null |
| 2025-03-16 | MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jianwei Zhao et.al. | 2503.12401 | translate | read | null |
| 2025-03-15 | TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Ans Munir et.al. | 2503.12206 | translate | read | null |
| 2025-03-15 | Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification | Ahcen Aliouat et.al. | 2503.11954 | translate | read | null |
| 2025-03-14 | Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification | Tobias Morocutti et.al. | 2503.11363 | translate | read | null |
| 2025-03-14 | PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models | Mayank Nautiyal et.al. | 2503.11360 | translate | read | null |
| 2025-03-14 | APLA: A Simple Adaptation Method for Vision Transformers | Moein Sorkhei et.al. | 2503.11335 | translate | read | link |
| 2025-03-14 | Open-Set Plankton Recognition | Joona Kareinen et.al. | 2503.11318 | translate | read | null |
| 2025-03-14 | MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery | Yansheng Li et.al. | 2503.11219 | translate | read | null |
| 2025-03-14 | Falcon: A Remote Sensing Vision-Language Foundation Model | Kelu Yao et.al. | 2503.11070 | translate | read | link |
| 2025-03-13 | $(\varepsilon, δ)$ Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees | Juan Felipe Gomez et.al. | 2503.10945 | translate | read | null |
| 2025-03-13 | Learning Interpretable Logic Rules from Deep Vision Models | Chuqin Geng et.al. | 2503.10547 | translate | read | null |
| 2025-03-13 | Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification | Rajiv Krishnakumar et.al. | 2503.10510 | translate | read | null |
| 2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | translate | read | link |
| 2025-03-13 | PS3C: An Ensemble-Based Two-Step Framework for Classification of Pep Smear Cell Images | Theo Di Piazza et.al. | 2503.10312 | translate | read | link |
| 2025-03-13 | Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions | Hsuvas Borkakoty et.al. | 2503.10294 | translate | read | null |
| 2025-03-13 | A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2503.10262 | translate | read | null |
| 2025-03-13 | Interpretable Image Classification via Non-parametric Part Prototype Learning | Zhijie Zhu et.al. | 2503.10247 | translate | read | null |
| 2025-03-13 | Multiplicative Learning | Han Kim et.al. | 2503.10144 | translate | read | null |
| 2025-03-13 | Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text | Avinash Patil et.al. | 2503.10095 | translate | read | null |
| 2025-03-13 | Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild | Damien Teney et.al. | 2503.10065 | translate | read | null |
| 2025-03-12 | Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Nannan Wu et.al. | 2503.09587 | translate | read | null |
| 2025-03-12 | Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework | Bakary Badjie et.al. | 2503.09504 | translate | read | null |
| 2025-03-12 | ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation | Tobias Christian Nauen et.al. | 2503.09399 | translate | read | link |
| 2025-03-12 | Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Daniel Jiménez-López et.al. | 2503.09365 | translate | read | null |
| 2025-03-12 | Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Katharina Prasse et.al. | 2503.09361 | translate | read | null |
| 2025-03-12 | Bayesian Test-Time Adaptation for Vision-Language Models | Lihua Zhou et.al. | 2503.09248 | translate | read | null |
| 2025-03-12 | Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information | Youngju Joung et.al. | 2503.09068 | translate | read | null |
| 2025-03-12 | Discovering Influential Neuron Path in Vision Transformers | Yifan Wang et.al. | 2503.09046 | translate | read | link |
| 2025-03-11 | KAN-Mixers: a new deep learning architecture for image classification | Jorge Luiz dos Santos Canuto et.al. | 2503.08939 | translate | read | null |
| 2025-03-12 | MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification | Jiangping Wen et.al. | 2503.08581 | translate | read | null |
| 2025-03-11 | Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview | Ahmad Chaddad et.al. | 2503.08420 | translate | read | null |
| 2025-03-11 | Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification | Susu Sun et.al. | 2503.08384 | translate | read | null |
| 2025-03-11 | Tangentially Aligned Integrated Gradients for User-Friendly Explanations | Lachlan Simpson et.al. | 2503.08240 | translate | read | null |
| 2025-03-11 | EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification | Saad Sohail et.al. | 2503.08239 | translate | read | null |
| 2025-03-11 | Identification of Star Clusters in M31 from PAndAS Images Based on Deep Learning | Baisong Zhang et.al. | 2503.08130 | translate | read | null |
| 2025-03-11 | LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking | Yan Yan et.al. | 2503.07968 | translate | read | null |
| 2025-03-12 | Measuring directional bias amplification in image captions using predictability | Rahul Nair et.al. | 2503.07878 | translate | read | null |
| 2025-03-10 | Fair Text Classification via Transferable Representations | Thibaud Leteno et.al. | 2503.07691 | translate | read | null |
| 2025-03-10 | Keeping Representation Similarity in Finetuning for Medical Image Analysis | Wenqiang Zu et.al. | 2503.07399 | translate | read | null |
| 2025-03-10 | Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification | Kexin Di et.al. | 2503.07396 | translate | read | null |
| 2025-03-10 | Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Gonzalo Mancera et.al. | 2503.07384 | translate | read | null |
| 2025-03-10 | Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | Thomas Boucher et.al. | 2503.07294 | translate | read | null |
| 2025-03-10 | A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | Bingchen Liu et.al. | 2503.07202 | translate | read | null |
| 2025-03-10 | Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization | Ziqing Xu et.al. | 2503.06982 | translate | read | null |
| 2025-03-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al. | 2503.06921 | translate | read | link |
| 2025-03-10 | MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | Xiangyan Qu et.al. | 2503.06847 | translate | read | null |
| 2025-03-09 | Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | Hanze Li et.al. | 2503.06473 | translate | read | null |
| 2025-03-09 | M $^3$ amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification | Mingxiang Cao et.al. | 2503.06446 | translate | read | null |
| 2025-03-07 | Similarity-Based Domain Adaptation with LLMs | Jie He et.al. | 2503.05281 | translate | read | null |
| 2025-03-07 | Spatial Context-Driven Positive Pair Sampling for Enhanced Histopathology Image Classification | Willmer Rafell Quinones Robles et.al. | 2503.05170 | translate | read | null |
| 2025-03-07 | Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy | Ruixi Lin et.al. | 2503.05157 | translate | read | link |
| 2025-03-07 | Grouped Sequential Optimization Strategy – the Application of Hyperparameter Importance Assessment in Deep Learning | Ruinan Wang et.al. | 2503.05106 | translate | read | null |
| 2025-03-06 | HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model | Xuheng Cai et.al. | 2503.04996 | translate | read | null |
| 2025-03-06 | Label Distribution Learning-Enhanced Dual-KNN for Text Classification | Bo Yuan et.al. | 2503.04869 | translate | read | null |
| 2025-03-06 | Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification | Van Bach Nguyen et.al. | 2503.04463 | translate | read | null |
| 2025-03-06 | WeakSupCon: Weakly Supervised Contrastive Learning for Encoder Pre-training | Bodong Zhang et.al. | 2503.04165 | translate | read | null |
| 2025-03-04 | Measurement noise scaling laws for cellular representation learning | Gokul Gowri et.al. | 2503.02726 | translate | read | null |
| 2025-03-04 | XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification | Xiaoyu Zheng et.al. | 2503.02619 | translate | read | link |
| 2025-03-04 | Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques | Mustafa Majeed Abd Zaid et.al. | 2503.02510 | translate | read | null |
| 2025-03-06 | Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Yujiao Yang et.al. | 2503.02495 | translate | read | link |
| 2025-03-04 | Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts | Tong Liang et.al. | 2503.02248 | translate | read | null |
| 2025-03-04 | Sharpness-Aware Minimization: General Analysis and Improved Rates | Dimitris Oikonomou et.al. | 2503.02225 | translate | read | null |
| 2025-03-03 | Mathematical Foundation of Interpretable Equivariant Surrogate Models | Jacopo Joy Colombini et.al. | 2503.01942 | translate | read | null |
| 2025-03-03 | Visual-RFT: Visual Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2503.01785 | translate | read | link |
| 2025-03-03 | Mamba base PKD for efficient knowledge compression | José Medina et.al. | 2503.01727 | translate | read | null |
| 2025-03-04 | SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting | Ali Caglayan et.al. | 2503.01181 | translate | read | null |
| 2025-03-03 | Large Language Models for Healthcare Text Classification: A Systematic Review | Hajar Sakai et.al. | 2503.01159 | translate | read | null |
| 2025-03-03 | Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning | Jiuyang Dong et.al. | 2502.21130 | translate | read | null |
| 2025-03-03 | Gradient-Guided Annealing for Domain Generalization | Aristotelis Ballas et.al. | 2502.20162 | translate | read | link |
(<a href=../Classification.md>back to Classification</a>)