Classification
Classification
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-18 | Blog Data Showdown: Machine Learning vs Neuro-Symbolic Models for Gender Classification | Natnael Tilahun Sinshaw et.al. | 2512.16687 | null |
| 2025-12-18 | Protecting Deep Neural Network Intellectual Property with Chaos-Based White-Box Watermarking | Sangeeth B et.al. | 2512.16658 | null |
| 2025-12-10 | D3G: Diverse Demographic Data Generation Increases Zero-Shot Image Classification Accuracy within Multimodal Models | Javon Hickmon et.al. | 2512.15747 | null |
| 2025-12-17 | Stylized Synthetic Augmentation further improves Corruption Robustness | Georg Siedel et.al. | 2512.15675 | null |
| 2025-12-17 | Vision-based module for accurately reading linear scales in a laboratory | Parvesh Saini et.al. | 2512.15327 | null |
| 2025-12-17 | TrajSyn: Privacy-Preserving Dataset Distillation from Federated Model Trajectories for Server-Side Adversarial Training | Mukur Gupta et.al. | 2512.15123 | null |
| 2025-12-09 | A Critical Perspective on Finite Sample Conformal Prediction Theory in Medical Applications | Klaus-Rudolf Kladny et.al. | 2512.14727 | null |
| 2025-12-16 | An Energy-Efficient Adiabatic Capacitive Neural Network Chip | Himadri Singh Raghav et.al. | 2512.14642 | null |
| 2025-12-16 | Low-Resource, High-Impact: Building Corpora for Inclusive Language Technologies | Ekaterina Artemova et.al. | 2512.14576 | null |
| 2025-12-16 | FoodLogAthl-218: Constructing a Real-World Food Image Dataset Using Dietary Management Applications | Mitsuki Watanabe et.al. | 2512.14574 | null |
| 2025-12-16 | Improving Semantic Uncertainty Quantification in LVLMs with Semantic Gaussian Processes | Joseph Hoche et.al. | 2512.14177 | null |
| 2025-12-15 | Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices | Hossein Sharify et.al. | 2512.13905 | null |
| 2025-12-14 | DL $^3$ M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models | Md. Najib Hasan et.al. | 2512.13742 | null |
| 2025-12-15 | REVERB-FL: Server-Side Adversarial and Reserve-Enhanced Federated Learning for Robust Audio Classification | Sathwika Peechara et.al. | 2512.13647 | null |
| 2025-12-15 | On the Ability of Deep Learning to Detect Signals with Unknown Parameters | Tom Anders et.al. | 2512.13542 | null |
| 2025-12-15 | Dual-Qubit Hierarchical Fuzzy Neural Network for Image Classification: Enabling Relational Learning via Quantum Entanglement | Wenwei Zhang et.al. | 2512.13274 | null |
| 2025-12-15 | Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views | Tingyang Chen et.al. | 2512.12980 | link |
| 2025-12-15 | Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification | Han Liu et.al. | 2512.12887 | null |
| 2025-12-14 | Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners | N. K. B. M. P. K. B. Narasinghe et.al. | 2512.12824 | null |
| 2025-12-14 | Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches | Amirhossein Yousefiramandi et.al. | 2512.12677 | null |
| 2025-12-13 | Advancing Cache-Based Few-Shot Classification via Patch-Driven Relational Gated Graph Attention | Tasweer Ahmad et.al. | 2512.12498 | null |
| 2025-12-13 | Semantic Distance Measurement based on Multi-Kernel Gaussian Processes | Yinzhu Cheng et.al. | 2512.12238 | null |
| 2025-12-12 | A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer | Sushmita Nath et.al. | 2512.11977 | null |
| 2025-12-12 | ACCOR: Attention-Enhanced Complex-Valued Contrastive Learning for Occluded Object Classification Using mmWave Radar IQ Signals | Stefan Hägele et.al. | 2512.11556 | null |
| 2025-12-12 | FRQI Pairs method for image classification using Quantum Recurrent Neural Network | Rafał Potempa et.al. | 2512.11499 | null |
| 2025-12-12 | VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing | Emanuel Sánchez Aimar et.al. | 2512.11490 | null |
| 2025-12-12 | Multi-task Learning with Extended Temporal Shift Module for Temporal Action Localization | Anh-Kiet Duong et.al. | 2512.11189 | null |
| 2025-12-11 | VL-JEPA: Joint Embedding Predictive Architecture for Vision-language | Delong Chen et.al. | 2512.10942 | null |
| 2025-12-11 | LabelFusion: Learning to Fuse LLMs and Transformer Classifiers for Robust Text Classification | Michael Schlee et.al. | 2512.10793 | null |
| 2025-12-11 | Uncertainty-Preserving QBNNs: Multi-Level Quantization of SVI-Based Bayesian Neural Networks for Image Classification | Hendrik Borras et.al. | 2512.10602 | null |
| 2025-12-10 | MedXAI: A Retrieval-Augmented and Self-Verifying Framework for Knowledge-Guided Medical Image Analysis | Midhat Urooj et.al. | 2512.10098 | null |
| 2025-12-10 | Text2Graph: Combining Lightweight LLMs and GNNs for Efficient Text Classification in Label-Scarce Scenarios | João Lucas Luz Lima Sarcinelli et.al. | 2512.10061 | null |
| 2025-12-10 | Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts | Romain Mussard et.al. | 2512.09773 | null |
| 2025-12-10 | OxEnsemble: Fair Ensembles for Low-Data Classification | Jonathan Rystrøm et.al. | 2512.09665 | null |
| 2025-12-10 | Hands-on Evaluation of Visual Transformers for Object Recognition and Detection | Dimitrios N. Vlachogiannis et.al. | 2512.09579 | null |
| 2025-12-10 | NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization | Gaorui Zhang et.al. | 2512.09524 | link |
| 2025-12-10 | Advancing Text Classification with Large Language Models and Neural Attention Mechanisms | Ning Lyu et.al. | 2512.09444 | null |
| 2025-12-10 | Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model | Jiantao Tan et.al. | 2512.09441 | null |
| 2025-12-10 | Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook | Yuan Ma et.al. | 2512.09315 | null |
| 2025-12-09 | GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions | Oscar Eliasson et.al. | 2512.09084 | null |
| 2025-12-09 | Improving Multi-Class Calibration through Normalization-Aware Isotonic Techniques | Alon Arad et.al. | 2512.09054 | null |
| 2025-12-09 | Luxical: High-Speed Lexical-Dense Text Embeddings | DatologyAI et.al. | 2512.09015 | null |
| 2025-12-08 | Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration | Lu Huo et.al. | 2512.08989 | null |
| 2025-12-09 | Automated Pollen Recognition in Optical and Holographic Microscopy Images | Swarn Singh Warshaneyan et.al. | 2512.08589 | null |
| 2025-12-09 | Low Rank Support Quaternion Matrix Machine | Wang Chen et.al. | 2512.08327 | null |
| 2025-12-08 | Applicability of Metalenses for Generalizable Computer Vision | Yubo Zhang et.al. | 2512.08109 | null |
| 2025-12-08 | Exploiting the Randomness of Large Language Models (LLM) in Text Classification Tasks: Locating Privileged Documents in Legal Matters | Keith Huffman et.al. | 2512.08083 | null |
| 2025-11-28 | GSPN-2: Efficient Parallel Sequence Modeling | Hongjun Wang et.al. | 2512.07884 | null |
| 2025-11-27 | Semi-Supervised Contrastive Learning with Orthonormal Prototypes | Huanran Li et.al. | 2512.07880 | null |
| 2025-12-08 | Complementary Learning Approach for Text Classification using Large Language Models | Navid Asgari et.al. | 2512.07583 | null |
| 2025-12-08 | Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification | Pengfei Gu et.al. | 2512.07190 | null |
| 2025-12-08 | Winning the Lottery by Preserving Network Training Dynamics with Concrete Ticket Search | Tanay Arora et.al. | 2512.07142 | null |
| 2025-12-08 | Dual Refinement Cycle Learning: Unsupervised Text Classification of Mamba and Community Detection on Text Attributed Graph | Hong Wang et.al. | 2512.07100 | null |
| 2025-12-07 | Toward Reliable Machine Unlearning: Theory, Algorithms, and Evaluation | Ali Ebrahimpour-Boroojeny et.al. | 2512.06993 | null |
| 2025-12-07 | SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification | Mohammed Q. Alkhatib et.al. | 2512.06877 | null |
| 2025-12-07 | Arc Gradient Descent: A Mathematically Derived Reformulation of Gradient Descent with Phase-Aware, User-Controlled Step Dynamics | Nikhil Verma et.al. | 2512.06737 | null |
| 2025-12-07 | Hierarchical Deep Learning for Diatom Image Classification: A Multi-Level Taxonomic Approach | Yueying Ke et.al. | 2512.06613 | null |
| 2025-12-06 | Proof of Concept for Mammography Classification with Enhanced Compactness and Separability Modules | Fariza Dahes et.al. | 2512.06575 | null |
| 2025-12-06 | LOCUS: A System and Method for Low-Cost Customization for Universal Specialization | Dhanasekar Sundararaman et.al. | 2512.06239 | null |
| 2025-12-05 | Efficient Text Classification with Conformal In-Context Learning | Ippokratis Pantelidis et.al. | 2512.05732 | null |
| 2025-12-05 | NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections | Juho Korkeala et.al. | 2512.05610 | null |
| 2025-12-04 | GeoPE:A Unified Geometric Positional Embedding for Structured Tensors | Yupu Yao et.al. | 2512.04963 | null |
| 2025-12-04 | An all-optical convolutional neural network for image identification | Wei-Wei Fu et.al. | 2512.04569 | null |
| 2025-12-04 | Performance Evaluation of Transfer Learning Based Medical Image Classification Techniques for Disease Detection | Zeeshan Ahmad et.al. | 2512.04397 | null |
| 2025-11-20 | Memory-DD: A Low-Complexity Dendrite-Inspired Neuron for Temporal Prediction Tasks | Dongjian Yang et.al. | 2512.04094 | null |
| 2025-12-03 | Improving Alignment Between Human and Machine Codes: An Empirical Assessment of Prompt Engineering for Construct Identification in Psychology | Kylie L. Anglin et.al. | 2512.03818 | null |
| 2025-12-03 | Research on Brain Tumor Classification Method Based on Improved ResNet34 Network | Yufeng Li et.al. | 2512.03751 | null |
| 2025-12-03 | Multi-Scale Visual Prompting for Lightweight Small-Image Classification | Salim Khazem et.al. | 2512.03663 | null |
| 2025-12-03 | FeatureLens: A Highly Generalizable and Interpretable Framework for Detecting Adversarial Examples Based on Image Features | Zhigang Yang et.al. | 2512.03625 | null |
| 2025-12-03 | Label-Efficient Hyperspectral Image Classification via Spectral FiLM Modulation of Low-Level Pretrained Diffusion Features | Yuzhen Hu et.al. | 2512.03430 | null |
| 2025-12-01 | ALARM: Automated MLLM-Based Anomaly Detection in Complex-EnviRonment Monitoring with Uncertainty Quantification | Congjing Zhang et.al. | 2512.03101 | null |
| 2025-12-01 | Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework | Haojin Deng et.al. | 2512.02152 | null |
| 2025-11-29 | Parallel Multi-Circuit Quantum Feature Fusion in Hybrid Quantum-Classical Convolutional Neural Networks for Breast Tumor Classification | Ece Yurtseven et.al. | 2512.02066 | null |
| 2025-12-01 | ViT $^3$ : Unlocking Test-Time Training in Vision | Dongchen Han et.al. | 2512.01643 | null |
| 2025-12-01 | Supervised Contrastive Machine Unlearning of Background Bias in Sonar Image Classification with Fine-Grained Explainable AI | Kamal Basha S et.al. | 2512.01291 | null |
| 2025-12-01 | nnMobileNet++: Towards Efficient Hybrid Networks for Retinal Image Analysis | Xin Li et.al. | 2512.01273 | null |
| 2025-12-01 | Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement | Harshil Vejendla et.al. | 2512.01187 | null |
| 2025-11-30 | Projection-Free CNN Pruning via Frank-Wolfe with Momentum: Sparser Models with Less Pretraining | Hamza ElMokhtar Shili et.al. | 2512.01147 | null |
| 2025-11-30 | OmniFD: A Unified Model for Versatile Face Forgery Detection | Haotian Liu et.al. | 2512.01128 | link |
| 2025-11-29 | Financial Text Classification Based On rLoRA Finetuning On Qwen3-8B model | Zhiming Lian et.al. | 2512.00630 | null |
| 2025-11-29 | Learning What Helps: Task-Aligned Context Selection for Vision Tasks | Jingyu Guo et.al. | 2512.00489 | null |
| 2025-11-29 | Vision Transformer for Classification of UAV and Helicopters Using Micro-Doppler Spectrograms in Surveillance Radar | Arkadiusz Czuba et.al. | 2512.00374 | null |
| 2025-11-26 | SemImage: Semantic Image Representation for Text, a Novel Framework for Embedding Disentangled Linguistic Features | Mohammad Zare et.al. | 2512.00088 | null |
| 2025-11-28 | Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts | Paulo J. N. Pinto et.al. | 2511.23056 | null |
| 2025-11-27 | The Collapse of Patches | Wei Guo et.al. | 2511.22281 | link |
| 2025-11-27 | Support Vector Machine Classifier with Rescaled Huberized Pinball Loss | Shibo Diao et.al. | 2511.22065 | null |
| 2025-11-27 | When Do Domain-Specific Foundation Models Justify Their Cost? A Systematic Evaluation Across Retinal Imaging Tasks | David Isztl et.al. | 2511.22001 | null |
| 2025-11-26 | DeepGI: Explainable Deep Learning for Gastrointestinal Image Classification | Walid Houmaidi et.al. | 2511.21959 | null |
| 2025-11-23 | Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification | Yanxi Li et.al. | 2511.21752 | null |
| 2025-11-18 | DNNs, Dataset Statistics, and Correlation Functions | Robert W. Batterman et.al. | 2511.21715 | null |
| 2025-11-26 | Continual Error Correction on Low-Resource Devices | Kirill Paramonov et.al. | 2511.21652 | null |
| 2025-11-25 | CHiQPM: Calibrated Hierarchical Interpretable Image Classification | Thomas Norrenbrock et.al. | 2511.20779 | null |
| 2025-11-25 | Adaptive Hopfield Network: Rethinking Similarities in Associative Memory | Shurong Wang et.al. | 2511.20609 | null |
| 2025-11-25 | HVAdam: A Full-Dimension Adaptive Optimizer | Yiheng Zhang et.al. | 2511.20277 | null |
| 2025-11-25 | ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis | Advik Sinha et.al. | 2511.20274 | null |
| 2025-11-25 | Advancing Image Classification with Discrete Diffusion Classification Modeling | Omer Belhasin et.al. | 2511.20263 | null |
| 2025-11-25 | Exploring State-of-the-art models for Early Detection of Forest Fires | Sharjeel Ahmed et.al. | 2511.20096 | null |
| 2025-11-24 | Multiscale Vector-Quantized Variational Autoencoder for Endoscopic Image Synthesis | Dimitrios E. Diamantis et.al. | 2511.19578 | null |
| 2025-11-24 | An Anatomy Aware Hybrid Deep Learning Framework for Lung Cancer Tumor Stage Classification | Saniah Kayenat Chowdhury et.al. | 2511.19367 | null |
| 2025-11-24 | Neural Architecture Search for Quantum Autoencoders | Hibah Agha et.al. | 2511.19246 | null |
| 2025-11-24 | Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification | Aakash Gore et.al. | 2511.18826 | null |
| 2025-11-24 | Dendritic Convolution for Noise Image Recognition | Jiarui Xue et.al. | 2511.18699 | null |
| 2025-11-24 | EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification | Kazi Reyazul Hasan et.al. | 2511.18691 | null |
| 2025-11-23 | A Unified BERT-CNN-BiLSTM Framework for Simultaneous Headline Classification and Sentiment Analysis of Bangla News | Mirza Raquib et.al. | 2511.18618 | null |
| 2025-11-22 | AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens | Purvish Jajal et.al. | 2511.18105 | null |
| 2025-11-22 | Less Is More: An Explainable AI Framework for Lightweight Malaria Classification | Md Abdullah Al Kafi et.al. | 2511.18083 | null |
| 2025-11-22 | Hierarchical Semi-Supervised Active Learning for Remote Sensing | Wei Huang et.al. | 2511.18058 | null |
| 2025-11-21 | A Hybrid Classical-Quantum Fine Tuned BERT for Text Classification | Abu Kaisar Mohammad Masum et.al. | 2511.17677 | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | null |
| 2025-11-21 | DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture | Xiangteng He et.al. | 2511.17354 | link |
| 2025-11-21 | Attention-Guided Feature Fusion (AGFF) Model for Integrating Statistical and Semantic Features in News Text Classification | Mohammad Zare et.al. | 2511.17184 | null |
| 2025-11-15 | Concept-Based Interpretability for Toxicity Detection | Samarth Garg et.al. | 2511.16689 | null |
| 2025-11-20 | Teacher-Guided One-Shot Pruning via Context-Aware Knowledge Distillation | Md. Samiul Alim et.al. | 2511.16653 | null |
| 2025-11-20 | Formal Abductive Latent Explanations for Prototype-Based Networks | Jules Soria et.al. | 2511.16588 | link |
| 2025-11-20 | Unsupervised Image Classification with Adaptive Nearest Neighbor Selection and Cluster Ensembles | Melih Baydar et.al. | 2511.16213 | null |
| 2025-11-20 | SpectralTrain: A Universal Framework for Hyperspectral Image Classification | Meihua Zhou et.al. | 2511.16084 | null |
| 2025-11-19 | RB-FT: Rationale-Bootstrapped Fine-Tuning for Video Classification | Meilong Xu et.al. | 2511.15923 | null |
| 2025-11-19 | Hyperspectral Image Classification using Spectral-Spatial Mixer Network | Mohammed Q. Alkhatib et.al. | 2511.15692 | null |
| 2025-11-19 | IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers | Gihwan Kim et.al. | 2511.15369 | null |
| 2025-11-19 | Computer Vision Modeling of the Development of Geometric and Numerical Concepts in Humans | Zekun Wang et.al. | 2511.15029 | null |
| 2025-11-18 | Logit-Based Losses Limit the Effectiveness of Feature Knowledge Distillation | Nicholas Cooper et.al. | 2511.14981 | null |
| 2025-11-18 | Vision Large Language Models Are Good Noise Handlers in Engagement Analysis | Alexander Vedernikov et.al. | 2511.14749 | null |
| 2025-11-18 | Task Addition and Weight Disentanglement in Closed-Vocabulary Models | Adam Hazimeh et.al. | 2511.14569 | null |
| 2025-11-18 | Step by Step Network | Dongchen Han et.al. | 2511.14329 | null |
| 2025-11-18 | Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification | Yao Qin et.al. | 2511.14082 | null |
| 2025-11-16 | Semantic Multiplexing | Mohammad Abdi et.al. | 2511.13779 | null |
| 2025-11-17 | Cross-Learning from Scarce Data via Multi-Task Constrained Optimization | Leopoldo Agorio et.al. | 2511.13680 | null |
| 2025-11-17 | Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures | Haohui Wang et.al. | 2511.13640 | null |
| 2025-11-17 | Minimax Multi-Target Conformal Prediction with Applications to Imaging Inverse Problems | Jeffrey Wen et.al. | 2511.13533 | null |
| 2025-11-17 | Tight and Practical Privacy Auditing for Differentially Private In-Context Learning | Yuyang Xia et.al. | 2511.13502 | null |
| 2025-11-17 | Aspect-Level Obfuscated Sentiment in Thai Financial Disclosures and Its Impact on Abnormal Returns | Attapol T. Rutherford et.al. | 2511.13481 | null |
| 2025-11-17 | Hardware optimization on Android for inference of AI models | Iulius Gherasim et.al. | 2511.13453 | null |
| 2025-11-17 | MedDCR: Learning to Design Agentic Workflows for Medical Coding | Jiyang Zheng et.al. | 2511.13361 | null |
| 2025-11-17 | Synthetic Forgetting without Access: A Few-shot Zero-glance Framework for Machine Unlearning | Qipeng Song et.al. | 2511.13116 | null |
| 2025-11-17 | Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks | Minsoo Jo et.al. | 2511.12985 | null |
| 2025-11-17 | CalibrateMix: Guided-Mixup Calibration of Image Semi-Supervised Models | Mehrab Mustafy Rahman et.al. | 2511.12964 | null |
| 2025-11-16 | Catastrophic Forgetting in Kolmogorov-Arnold Networks | Mohammad Marufur Rahman et.al. | 2511.12828 | null |
| 2025-11-16 | Medical Knowledge Intervention Prompt Tuning for Medical Image Classification | Ye Du et.al. | 2511.12639 | null |
| 2025-11-15 | AGGRNet: Selective Feature Extraction and Aggregation for Enhanced Medical Image Classification | Ansh Makwe et.al. | 2511.12382 | null |
| 2025-11-15 | CLAReSNet: When Convolution Meets Latent Attention for Hyperspectral Image Classification | Asmit Bandyopadhyay et.al. | 2511.12346 | null |
| 2025-11-15 | Learning Time in Static Classifiers | Xi Ding et.al. | 2511.12321 | null |
| 2025-11-15 | Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method | Chi Liu et.al. | 2511.12301 | null |
| 2025-11-15 | FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention | Peng Zhang et.al. | 2511.12215 | null |
| 2025-11-15 | MPD-SGR: Robust Spiking Neural Networks with Membrane Potential Distribution-Driven Surrogate Gradient Regularization | Runhao Jiang et.al. | 2511.12199 | null |
| 2025-11-15 | Breaking the Modality Wall: Time-step Mixup for Efficient Spiking Knowledge Transfer from Static to Event Domain | Yuqi Xie et.al. | 2511.12150 | null |
| 2025-11-15 | Supervised Multilabel Image Classification Using Residual Networks with Probabilistic Reasoning | Lokender Singh et.al. | 2511.12082 | null |
| 2025-11-15 | FedSDA: Federated Stain Distribution Alignment for Non-IID Histopathological Image Classification | Cheng-Chang Tsai et.al. | 2511.12044 | null |
| 2025-11-14 | Additive Large Language Models for Semi-Structured Text | Karthikeyan K et.al. | 2511.11922 | null |
| 2025-11-14 | Quantifying and Improving Adaptivity in Conformal Prediction through Input Transformations | Sooyong Jang et.al. | 2511.11472 | null |
| 2025-11-06 | Google-MedGemma Based Abnormality Detection in Musculoskeletal radiographs | Soumyajit Maity et.al. | 2511.05600 | null |
| 2025-11-06 | EETnet: a CNN for Gaze Detection and Tracking for Smart-Eyewear | Andrea Aspesi et.al. | 2511.04779 | null |
| 2025-11-06 | Courant algebroid lifts and curved Courant algebroids | Filip Moučka et.al. | 2511.04743 | null |
| 2025-11-06 | Trustworthiness Calibration Framework for Phishing Email Detection Using Large Language Models | Daniyal Ganiuly et.al. | 2511.04728 | null |
| 2025-11-06 | When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection | Alamgir Munir Qazi et.al. | 2511.04643 | link |
| 2025-11-06 | CardioPHON: Quality assessment and self-supervised pretraining for screening of cardiac function based on phonocardiogram recordings | Vladimir Despotovic et.al. | 2511.04533 | null |
| 2025-11-06 | IntelliProof: An Argumentation Network-based Conversational Helper for Organized Reflection | Kaveh Eskandari Miandoab et.al. | 2511.04528 | null |
| 2025-11-06 | Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways | Paloma Rabaey et.al. | 2511.04506 | null |
| 2025-11-06 | Differentially Private In-Context Learning with Nearest Neighbor Search | Antti Koskela et.al. | 2511.04332 | null |
| 2025-11-06 | Classification of four-quark operators with $ΔF\le 2$ under flavor symmetry and their renormalization in a gauge-invariant scheme | Gregoris Spanoudes et.al. | 2511.04305 | null |
| 2025-11-06 | Covariance Descriptors Meet General Vision Encoders: Riemannian Deep Learning for Medical Image Classification | Josef Mayr et.al. | 2511.04190 | null |
| 2025-11-06 | SynQuE: Estimating Synthetic Dataset Quality Without Annotations | Arthur Chen et.al. | 2511.03928 | null |
| 2025-11-05 | Divide, Cache, Conquer: Dichotomic Prompting for Efficient Multi-Label LLM-Based Classification | Mikołaj Langner et.al. | 2511.03830 | null |
| 2025-11-04 | Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification | Mikhael Djajapermana et.al. | 2511.02992 | null |
| 2025-11-04 | Diffusion Models are Robust Pretrainers | Mika Yagoda et.al. | 2511.02793 | null |
| 2025-11-03 | Towards Continuous-variable Quantum Neural Networks for Biomedical Imaging | Daniel Alejandro Lopez et.al. | 2511.02051 | null |
| 2025-11-03 | Reliability Assessment Framework Based on Feature Separability for Pathological Cell Image Classification under Prior Bias | Takaaki Tachibana et.al. | 2511.01953 | null |
| 2025-11-03 | Game-theoretic distributed learning of generative models for heterogeneous data collections | Dmitrij Schlesinger et.al. | 2511.01740 | null |
| 2025-11-03 | Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering | Hossein Abdi et.al. | 2511.01694 | null |
| 2025-11-03 | Protecting the Neural Networks against FGSM Attack Using Machine Unlearning | Amir Hossein Khorasani et.al. | 2511.01377 | null |
| 2025-11-02 | Parameter Interpolation Adversarial Training for Robust Image Classification | Xin Liu et.al. | 2511.00836 | null |
| 2025-11-01 | FeNN-DMA: A RISC-V SoC for SNN acceleration | Zainab Aizaz et.al. | 2511.00732 | null |
| 2025-11-01 | Leveraging Hierarchical Image-Text Misalignment for Universal Fake Image Detection | Daichi Zhang et.al. | 2511.00427 | null |
| 2025-11-01 | LGCA: Enhancing Semantic Representation via Progressive Expansion | Thanh Hieu Cao et.al. | 2511.00419 | null |
| 2025-10-31 | Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models | Sai Niranjan Ramachandran et.al. | 2511.00124 | null |
| 2025-10-28 | ReLaX-Net: Reusing Layers for Parameter-Efficient Physical Neural Networks | Kohei Tsuchiyama et.al. | 2511.00044 | null |
| 2025-10-31 | C-LEAD: Contrastive Learning for Enhanced Adversarial Defense | Suklav Ghosh et.al. | 2510.27249 | null |
| 2025-10-31 | SpecAware: A Spectral-Content Aware Foundation Model for Unifying Multi-Sensor Learning in Hyperspectral Remote Sensing Mapping | Renjie Ji et.al. | 2510.27219 | null |
| 2025-10-31 | AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification | Yuanhao Tang et.al. | 2510.27155 | link |
| 2025-10-30 | Overspecified Mixture Discriminant Analysis: Exponential Convergence, Statistical Guarantees, and Remote Sensing Applications | Arman Bolatov et.al. | 2510.27056 | null |
| 2025-10-30 | Non-Convex Over-the-Air Heterogeneous Federated Learning: A Bias-Variance Trade-off | Muhammad Faraz Ul Abrar et.al. | 2510.26722 | null |
| 2025-10-30 | FlowQ-Net: A Generative Framework for Automated Quantum Circuit Design | Jun Dai et.al. | 2510.26688 | null |
| 2025-10-30 | Do Students Debias Like Teachers? On the Distillability of Bias Mitigation Methods | Jiali Cheng et.al. | 2510.26038 | null |
| 2025-10-29 | Binaspect – A Python Library for Binaural Audio Analysis, Visualization & Feature Generation | Dan Barry et.al. | 2510.25714 | null |
| 2025-10-29 | Monitoring the calibration of probability forecasts with an application to concept drift detection involving image classification | Christopher T. Franck et.al. | 2510.25573 | null |
| 2025-10-29 | Neighborhood Feature Pooling for Remote Sensing Image Classification | Fahimeh Orvati Nia et.al. | 2510.25077 | null |
| 2025-10-29 | Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Training of Sound Events With Partial Labels | Keisuke Imoto et.al. | 2510.25075 | null |
| 2025-10-28 | Fair Indivisible Payoffs through Shapley Value | Mikołaj Czarnecki et.al. | 2510.24906 | null |
| 2025-10-25 | CFL-SparseMed: Communication-Efficient Federated Learning for Medical Imaging with Top-k Sparse Updates | Gousia Habib et.al. | 2510.24776 | null |
| 2025-10-28 | All in one timestep: Enhancing Sparsity and Energy efficiency in Multi-level Spiking Neural Networks | Andrea Castagnetti et.al. | 2510.24637 | null |
| 2025-10-28 | Few-Shot Remote Sensing Image Scene Classification with CLIP and Prompt Learning | Ivica Dimitrovski et.al. | 2510.24321 | null |
| 2025-10-26 | Quantum Machine Learning for Image Classification: A Hybrid Model of Residual Network with Quantum Support Vector Machine | Md. Farhan Shahriyar et.al. | 2510.23659 | null |
| 2025-10-27 | iPac: Incorporating Intra-image Patch Context into Graph Neural Networks for Medical Image Classification | Usama Zidan et.al. | 2510.23504 | null |
| 2025-10-27 | Mixed Precision Training of Neural ODEs | Elena Celledoni et.al. | 2510.23498 | null |
| 2025-10-27 | Human-AI Collaborative Uncertainty Quantification | Sima Noorani et.al. | 2510.23476 | null |
| 2025-10-27 | CURVETE: Curriculum Learning and Progressive Self-supervised Training for Medical Image Classification | Asmaa Abbas et.al. | 2510.23442 | null |
| 2025-10-27 | One-Timestep is Enough: Achieving High-performance ANN-to-SNN Conversion via Scale-and-Fire Neurons | Qiuyang Chen et.al. | 2510.23383 | null |
| 2025-10-27 | The Benchmarking Epistemology: Construct Validity for Evaluating Machine Learning Models | Timo Freiesleben et.al. | 2510.23191 | null |
| 2025-10-27 | Generating Auxiliary Tasks with Reinforcement Learning | Judah Goldfeder et.al. | 2510.22940 | null |
| 2025-10-26 | SALSA: Single-pass Autoregressive LLM Structured Classification | Ruslan Berdichevsky et.al. | 2510.22691 | null |
| 2025-10-26 | Alias-Free ViT: Fractional Shift Invariance via Linear Attention | Hagay Michaeli et.al. | 2510.22673 | null |
| 2025-10-25 | Stable neural networks and connections to continuous dynamical systems | Matthias J. Ehrhardt et.al. | 2510.22299 | null |
| 2025-10-25 | WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models | Issa Sugiura et.al. | 2510.22276 | null |
| 2025-10-25 | Simplifying Knowledge Transfer in Pretrained Models | Siddharth Jain et.al. | 2510.22208 | null |
| 2025-10-23 | Framework for Machine Evaluation of Reasoning Completeness in Large Language Models For Classification Tasks | Avinash Patil et.al. | 2510.21884 | null |
| 2025-10-23 | TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge | Shu-Hao Zhang et.al. | 2510.21879 | null |
| 2025-10-22 | Towards Accurate and Efficient Waste Image Classification: A Hybrid Deep Learning and Machine Learning Approach | Ngoc-Bao-Quang Nguyen et.al. | 2510.21833 | null |
| 2025-10-13 | A Multi-lingual Dataset of Classified Paragraphs from Open Access Scientific Publications | Eric Jeangirard et.al. | 2510.21762 | null |
| 2025-10-24 | Head Pursuit: Probing Attention Specialization in Multimodal Transformers | Lorenzo Basile et.al. | 2510.21518 | null |
| 2025-10-24 | Compressing Quaternion Convolutional Neural Networks for Audio Classification | Arshdeep Singh et.al. | 2510.21388 | null |
| 2025-10-24 | Weak-to-Strong Generalization under Distribution Shifts | Myeongho Jeon et.al. | 2510.21332 | null |
| 2025-10-24 | VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set | Shufan Shen et.al. | 2510.21323 | link |
| 2025-10-23 | H-SPLID: HSIC-based Saliency Preserving Latent Information Decomposition | Lukas Miklautz et.al. | 2510.20627 | null |
| 2025-10-23 | Breakdance Video classification in the age of Generative AI | Sauptik Dhar et.al. | 2510.20287 | null |
| 2025-10-22 | Improving Predictive Confidence in Medical Imaging via Online Label Smoothing | Kushan Choudhury et.al. | 2510.20011 | null |
| 2025-10-22 | Uncertainty evaluation of segmentation models for Earth observation | Melanie Rey et.al. | 2510.19586 | null |
| 2025-10-22 | AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields | Woo Jae Kim et.al. | 2510.19371 | null |
| 2025-10-22 | AMAuT: A Flexible and Efficient Multiview Audio Transformer Framework Trained from Scratch | Weichuang Shao et.al. | 2510.19368 | null |
| 2025-10-22 | Feature Space Adaptation for Robust Model Fine-Tuning | Peng Wang et.al. | 2510.19155 | null |
| 2025-10-21 | Robustness Verification of Graph Neural Networks Via Lightweight Satisfiability Testing | Chia-Hsuan Lu et.al. | 2510.18591 | null |
| 2025-10-21 | DWaste: Greener AI for Waste Sorting using Mobile and Edge Devices | Suman Kunwar et.al. | 2510.18513 | null |
| 2025-10-21 | ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters | Zhiwei Hao et.al. | 2510.18431 | null |
| 2025-10-21 | Learning from N-Tuple Data with M Positive Instances: Unbiased Risk Estimation and Theoretical Guarantees | Miao Zhang et.al. | 2510.18406 | null |
| 2025-10-21 | Ensembling Pruned Attention Heads For Uncertainty-Aware Efficient Transformers | Firas Gabetni et.al. | 2510.18358 | null |
| 2025-10-21 | Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing | Hanyu Meng et.al. | 2510.18206 | null |
| 2025-10-18 | Advances in Pre-trained Language Models for Domain-Specific Text Classification: A Systematic Review | Zhyar Rzgar K. Rostam et.al. | 2510.17892 | null |
| 2025-10-10 | MAT-Agent: Adaptive Multi-Agent Training Optimization | Jusheng Zhang et.al. | 2510.17845 | null |
| 2025-10-20 | Reliable Inference in Edge-Cloud Model Cascades via Conformal Alignment | Jiayi Huang et.al. | 2510.17543 | null |
| 2025-10-20 | BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine | Jiacheng Xie et.al. | 2510.17415 | null |
| 2025-10-20 | DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift | Peihong Zhang et.al. | 2510.17345 | null |
| 2025-10-20 | EndoCIL: A Class-Incremental Learning Framework for Endoscopic Image Classification | Bingrong Liu et.al. | 2510.17200 | null |
| 2025-10-19 | ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI | Eleftherios Tzanis et.al. | 2510.17004 | null |
| 2025-10-18 | Adversarially Robust Quantum Transfer Learning | Amena Khatun et.al. | 2510.16301 | null |
| 2025-10-17 | Expert Merging in Sparse Mixture of Experts with Nash Bargaining | Dung V. Nguyen et.al. | 2510.16138 | null |
| 2025-10-18 | Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch | Zia Badar et.al. | 2510.16088 | null |
| 2025-10-17 | Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation | Farjana Yesmin et.al. | 2510.16072 | null |
| 2025-10-17 | FedPURIN: Programmed Update and Reduced INformation for Sparse Personalized Federated Learning | Lunchen Xie et.al. | 2510.16065 | null |
| 2025-10-14 | Layer-Aware Influence for Online Data Valuation Estimation | Ziao Yang et.al. | 2510.16007 | null |
| 2025-10-17 | Balanced Multi-Task Attention for Satellite Image Classification: A Systematic Approach to Achieving 97.23% Accuracy on EuroSAT Without Pre-Training | Aditya Vir et.al. | 2510.15527 | null |
| 2025-10-17 | A Tsetlin Machine Image Classification Accelerator on a Flexible Substrate | Yushu Qin et.al. | 2510.15519 | null |
| 2025-10-17 | Adaptive transfer learning for surgical tool presence detection in laparoscopic videos through gradual freezing fine-tuning | Ana Davila et.al. | 2510.15372 | null |
| 2025-10-16 | Fourier Transform Multiple Instance Learning for Whole Slide Image Classification | Anthony Bilic et.al. | 2510.15138 | null |
| 2025-10-16 | Programmatic Representation Learning with Language Models | Gabriel Poesia et.al. | 2510.14825 | null |
| 2025-10-16 | Rethinking Hebbian Principle: Low-Dimensional Structural Projection for Unsupervised Learning | Shikuang Deng et.al. | 2510.14810 | null |
| 2025-10-16 | Free-Grained Hierarchical Recognition | Seulki Park et.al. | 2510.14737 | null |
| 2025-10-16 | Camera Movement Classification in Historical Footage: A Comparative Study of Deep Video Models | Tingyu Lin et.al. | 2510.14713 | null |
| 2025-10-16 | FedPPA: Progressive Parameter Alignment for Personalized Federated Learning | Maulidi Adi Prasetia et.al. | 2510.14698 | null |
| 2025-10-16 | Geometric Moment Alignment for Domain Adaptation via Siegel Embeddings | Shayan Gharib et.al. | 2510.14666 | null |
| 2025-10-16 | Vision Mamba for Permeability Prediction of Porous Media | Ali Kashefi et.al. | 2510.14516 | null |
| 2025-10-15 | NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations | Junjie Nan et.al. | 2510.14025 | null |
| 2025-10-14 | MultiFoodhat: A potential new paradigm for intelligent food quality inspection | Yue Hu et.al. | 2510.13889 | null |
| 2025-10-14 | Large Language Model Agents Enable Autonomous Design and Image Analysis of Microwell Microfluidics | Dinh-Nguyen Nguyen et.al. | 2510.13883 | null |
| 2025-10-15 | Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNs | Mustafa Munir et.al. | 2510.13740 | null |
| 2025-10-15 | Automated document processing system for government agencies using DBNET++ and BART models | Aya Kaysan Bahjat et.al. | 2510.13303 | null |
| 2025-10-15 | Approximate Bilevel Graph Structure Learning for Histopathology Image Classification | Sudipta Paul et.al. | 2510.13188 | null |
| 2025-10-14 | ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification | Utsav Kumar Nareti et.al. | 2510.12534 | null |
| 2025-10-14 | A Function Centric Perspective On Flat and Sharp Minima | Israel Mason-Williams et.al. | 2510.12451 | null |
| 2025-10-14 | Deep Attention-guided Adaptive Subsampling | Sharath M Shankaranarayana et.al. | 2510.12376 | null |
| 2025-10-14 | Hybrid Vision Transformer and Quantum Convolutional Neural Network for Image Classification | Mingzhu Wang et.al. | 2510.12291 | null |
| 2025-10-14 | State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding | Jiahuan Zhou et.al. | 2510.12160 | null |
| 2025-10-14 | A Review on Domain Adaption and Generative Adversarial Networks(GANs) | Aashish Dhawan et.al. | 2510.12075 | null |
| 2025-10-13 | Evaluating the Explainability of Vision Transformers in Medical Imaging | Leili Barekatain et.al. | 2510.12021 | null |
| 2025-10-13 | Bayesian Topological Convolutional Neural Nets | Sarah Harkins Dayton et.al. | 2510.11704 | null |
| 2025-10-13 | Benchmarking foundation models for hyperspectral image classification: Application to cereal crop type mapping | Walid Elbarz et.al. | 2510.11576 | null |
| 2025-10-13 | Investigating Large Language Models’ Linguistic Abilities for Text Preprocessing | Marco Braga et.al. | 2510.11482 | null |
| 2025-10-13 | GADA: Graph Attention-based Detection Aggregation for Ultrasound Video Classification | Li Chen et.al. | 2510.11437 | null |
| 2025-10-13 | Exploring and Leveraging Class Vectors for Classifier Editing | Jaeik Kim et.al. | 2510.11268 | null |
| 2025-10-13 | Multiview Manifold Evidential Fusion for PolSAR Image Classification | Junfei Shi et.al. | 2510.11171 | null |
| 2025-10-13 | One Size Does Not Fit All: Exploring Variable Thresholds for Distance-Based Multi-Label Text Classification | Jens Van Nooten et.al. | 2510.11160 | null |
| 2025-10-13 | Efficient Edge Test-Time Adaptation via Latent Feature Coordinate Correction | Xinyu Luo et.al. | 2510.11068 | null |
| 2025-10-12 | Identifying bias in CNN image classification using image scrambling and transforms | Sai Teja Erukude et.al. | 2510.10383 | null |
| 2025-10-11 | Weed Out, Then Harvest: Dual Low-Rank Adaptation is an Effective Noisy Label Detector for Noise-Robust Learning | Bo Yuan et.al. | 2510.10208 | null |
| 2025-10-11 | Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default | Jiaqi Liu et.al. | 2510.10025 | null |
| 2025-10-10 | Phase-Aware Deep Learning with Complex-Valued CNNs for Audio Signal Applications | Naman Agrawal et.al. | 2510.09926 | null |
| 2025-10-10 | One Sentence, Two Embeddings: Contrastive Learning of Explicit and Implicit Semantic Representations | Kohei Oda et.al. | 2510.09293 | null |
| 2025-10-10 | Instance-Level Generation for Representation Learning | Yankun Wu et.al. | 2510.09171 | null |
| 2025-10-10 | Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels | Weitong Kong et.al. | 2510.09035 | null |
| 2025-10-10 | Defense against Unauthorized Distillation in Image Restoration via Feature Space Perturbation | Han Hu et.al. | 2510.08925 | null |
| 2025-10-09 | The Boundaries of Fair AI in Medical Image Prognosis: A Causal Perspective | Thai-Hoang Pham et.al. | 2510.08840 | null |
| 2025-10-09 | Structured Output Regularization: a framework for few-shot transfer learning | Nicolas Ewen et.al. | 2510.08728 | null |
| 2025-10-09 | Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints | Zilin Kang et.al. | 2510.08549 | null |
| 2025-10-09 | Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator | Hyunji Lee et.al. | 2510.08524 | null |
| 2025-10-09 | Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification | Chenying Liu et.al. | 2510.08269 | null |
| 2025-10-09 | Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation | Shohei Enomoto et.al. | 2510.07823 | null |
| 2025-10-08 | Multi-Task Pre-Finetuning of Lightweight Transformer Encoders for Text Classification and NER | Junyi Zhu et.al. | 2510.07566 | null |
| 2025-10-08 | Label Semantics for Robust Hyperspectral Image Classification | Rafin Hassan et.al. | 2510.07556 | null |
| 2025-10-08 | Reasoning for Hierarchical Text Classification: The Case of Patents | Lekang Jiang et.al. | 2510.07167 | null |
| 2025-10-08 | Few-Shot Adaptation Benchmark for Remote Sensing Vision-Language Models | Karim El Khoury et.al. | 2510.07135 | link |
| 2025-10-08 | Textual interpretation of transient image classifications from large language models | Fiorenzo Stoppa et.al. | 2510.06931 | null |
| 2025-10-08 | Lamb wave-based MVDR imaging and CNN classification of defects in pipelines | Shuangshuang Li et.al. | 2510.06899 | null |
| 2025-10-08 | Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization | Kanglei Zhou et.al. | 2510.06842 | null |
| 2025-10-08 | CLAQS: Compact Learnable All-Quantum Token Mixer with Shared-ansatz for Text Classification | Junhao Chen et.al. | 2510.06532 | null |
| 2025-10-02 | User to Video: A Model for Spammer Detection Inspired by Video Classification Technology | Haoyang Zhang et.al. | 2510.06233 | null |
| 2025-10-07 | Shaken or Stirred? An Analysis of MetaFormer’s Token Mixing for Medical Imaging | Ron Keuth et.al. | 2510.05971 | null |
| 2025-10-07 | Leveraging Vision Transformers for Enhanced Classification of Emotions using ECG Signals | Pubudu L. Indrasiri et.al. | 2510.05826 | null |
| 2025-10-07 | A Novel Technique for Robust Training of Deep Networks With Multisource Weak Labeled Remote Sensing Data | Gianmarco Perantoni et.al. | 2510.05760 | null |
| 2025-10-07 | Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography | Hanna Kreutzer et.al. | 2510.05664 | null |
| 2025-10-06 | NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification | Fadi Al Machot et.al. | 2510.05451 | null |
| 2025-10-06 | Neuroplastic Modular Framework: Cross-Domain Image Classification of Garbage and Industrial Surfaces | Debojyoti Ghosh et.al. | 2510.05071 | null |
| 2025-10-06 | AWARE, Beyond Sentence Boundaries: A Contextual Transformer Framework for Identifying Cultural Capital in STEM Narratives | Khalid Mehtab Khan et.al. | 2510.04983 | null |
| 2025-10-06 | REN: Anatomically-Informed Mixture-of-Experts for Interstitial Lung Disease Diagnosis | Alec K. Peltekian et.al. | 2510.04923 | null |
| 2025-10-06 | A Semantics-Aware Hierarchical Self-Supervised Approach to Classification of Remote Sensing Images | Giulio Weikmann et.al. | 2510.04916 | null |
| 2025-10-06 | ERDE: Entropy-Regularized Distillation for Early-exit | Martial Guidez et.al. | 2510.04856 | null |
| 2025-10-06 | Federated Learning for Surgical Vision in Appendicitis Classification: Results of the FedSurg EndoVis 2024 Challenge | Max Kirchner et.al. | 2510.04772 | null |
| 2025-10-06 | Do Superpixel Segmentation Methods Influence Deforestation Image Classification? | Hugo Resende et.al. | 2510.04645 | null |
| 2025-10-06 | A Spatial-Spectral-Frequency Interactive Network for Multimodal Remote Sensing Classification | Hao Liu et.al. | 2510.04628 | null |
| 2025-10-05 | LLM Based Bayesian Optimization for Prompt Search | Adam Ballew et.al. | 2510.04384 | null |
| 2025-10-05 | Unmasking Backdoors: An Explainable Defense via Gradient-Attention Anomaly Scoring for Pre-trained Language Models | Anindya Sundar Das et.al. | 2510.04347 | null |
| 2025-10-05 | SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling | Harshil Vejendla et.al. | 2510.04286 | null |
| 2025-10-05 | From Segments to Concepts: Interpretable Image Classification via Concept-Guided Segmentation | Ran Eisenberg et.al. | 2510.04180 | null |
| 2025-10-05 | Quantization Range Estimation for Convolutional Neural Networks | Bingtao Yang et.al. | 2510.04044 | link |
| 2025-10-05 | Replacing Softmax Similarity with a Sharpened Angular Similarity: Theory and Practice of Scaling To Billion-Context Attention | Sahil Joshi et.al. | 2510.04008 | null |
| 2025-10-04 | Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models | Md. Atabuzzaman et.al. | 2510.03903 | null |
| 2025-10-04 | Cross-View Open-Vocabulary Object Detection in Aerial Imagery | Jyoti Kini et.al. | 2510.03858 | null |
| 2025-10-04 | Lightweight and Generalizable Acoustic Scene Representations via Contrastive Fine-Tuning and Distillation | Kuang Yuan et.al. | 2510.03728 | null |
| 2025-10-04 | Exploring the Hierarchical Reasoning Model for Small Natural-Image Classification Without Augmentation | Alexander V. Mantzaris et.al. | 2510.03598 | null |
| 2025-10-03 | What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification | Andrew Halterman et.al. | 2510.03541 | null |
| 2025-10-03 | A Robust Clustered Federated Learning Approach for Non-IID Data with Quantity Skew | Michael Ben Ali et.al. | 2510.03380 | null |
| 2025-10-02 | Error correction in multiclass image classification of facial emotion on unbalanced samples | Andrey A. Lebedev et.al. | 2510.03337 | null |
| 2025-10-01 | SVDefense: Effective Defense against Gradient Inversion Attacks via Singular Value Decomposition | Chenxiang Luo et.al. | 2510.03319 | null |
| 2025-09-28 | QuadEnhancer: Leveraging Quadratic Transformations to Enhance Deep Neural Networks | Qian Chen et.al. | 2510.03276 | null |
| 2025-10-03 | Test-Time Defense Against Adversarial Attacks via Stochastic Resonance of Latent Ensembles | Dong Lao et.al. | 2510.03224 | null |
| 2025-10-02 | In-memory Training on Analog Devices with Limited Conductance States via Multi-tile Residual Learning | Jindan Li et.al. | 2510.02516 | null |
| 2025-09-27 | Language, Culture, and Ideology: Personalizing Offensiveness Detection in Political Tweets with Reasoning LLMs | Dzmitry Pihulski et.al. | 2510.02351 | null |
| 2025-10-02 | Knowledge Distillation Detection for Open-weights Models | Qin Shi et.al. | 2510.02302 | null |
| 2025-10-02 | microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification | Sathira Silva et.al. | 2510.02270 | null |
| 2025-10-02 | StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold | Zhizhong Li et.al. | 2510.01938 | null |
| 2025-10-02 | A Methodology for Transparent Logic-Based Classification Using a Multi-Task Convolutional Tsetlin Machine | Mayur Kishor Shende et.al. | 2510.01906 | null |
| 2025-10-01 | Intuitions of Machine Learning Researchers about Transfer Learning for Medical Image Classification | Yucheng Lu et.al. | 2510.00902 | null |
| 2025-10-01 | Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability | Haifei Zhang et.al. | 2510.00773 | null |
| 2025-10-01 | Quantum Probabilistic Label Refining: Enhancing Label Quality for Robust Image Classification | Fang Qi et.al. | 2510.00528 | null |
| 2025-09-30 | Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction | Zhexiong Liu et.al. | 2510.00268 | null |
| 2025-09-30 | Object-Centric Case-Based Reasoning via Argumentation | Gabriel de Olim Gaul et.al. | 2510.00185 | null |
| 2025-09-30 | Zero-Shot Decentralized Federated Learning | Alessio Masano et.al. | 2509.26462 | null |
| 2025-09-30 | Attention over Scene Graphs: Indoor Scene Representations Toward CSAI Classification | Artur Barros et.al. | 2509.26457 | null |
| 2025-09-30 | MAPLE: Multi-scale Attribute-enhanced Prompt Learning for Few-shot Whole Slide Image Classification | Junjie Zhou et.al. | 2509.25863 | null |
| 2025-09-29 | Accelerating Dynamic Image Graph Construction on FPGA for Vision GNNs | Anvitha Ramachandran et.al. | 2509.25121 | null |
| 2025-09-29 | Unmute the Patch Tokens: Rethinking Probing in Multi-Label Audio Classification | Lukas Rauch et.al. | 2509.24901 | null |
| 2025-09-29 | A TRIANGLE Enables Multimodal Alignment Beyond Cosine Similarity | Giordano Cicchetti et.al. | 2509.24734 | null |
| 2025-09-29 | VNODE: A Piecewise Continuous Volterra Neural Network | Siddharth Roheda et.al. | 2509.24659 | null |
| 2025-09-29 | Combining Discrepancy-Confusion Uncertainty and Calibration Diversity for Active Fine-Grained Image Classification | Yinghao Jin et.al. | 2509.24181 | null |
| 2025-09-29 | High-Order Progressive Trajectory Matching for Medical Image Dataset Distillation | Le Dong et.al. | 2509.24177 | null |
| 2025-09-28 | Singleton-Optimized Conformal Prediction | Tao Wang et.al. | 2509.24095 | null |
| 2025-09-28 | Bridging the Task Gap: Multi-Task Adversarial Transferability in CLIP and Its Derivatives | Kuanrong Liu et.al. | 2509.23917 | null |
| 2025-09-28 | CE-FAM: Concept-Based Explanation via Fusion of Activation Maps | Michihiro Kuroki et.al. | 2509.23849 | null |
| 2025-09-28 | Spatially Parallel All-optical Neural Networks | Jianwei Qin et.al. | 2509.23611 | null |
| 2025-09-28 | Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery | Zekun Wang et.al. | 2509.23602 | null |
| 2025-09-27 | The Impact of Role Design in In-Context Learning for Large Language Models | Hamidreza Rouzegar et.al. | 2509.23501 | null |
| 2025-09-27 | S $^3$ F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network | Md. Saiful Bari Siddiqui et.al. | 2509.23442 | null |
| 2025-09-27 | Dynamics of Learning: Generative Schedules from Latent ODEs | Matt L. Sampson et.al. | 2509.23052 | null |
| 2025-09-26 | MonoCon: A general framework for learning ultra-compact high-fidelity representations using monotonicity constraints | Shreyas Gokhale et.al. | 2509.22931 | null |
| 2025-09-26 | FishAI 2.0: Marine Fish Image Classification with Multi-modal Few-shot Learning | Chenghan Yang et.al. | 2509.22930 | null |
| 2025-09-24 | Achieving Fair Skin Lesion Detection through Skin Tone Normalization and Channel Pruning | Zihan Wei et.al. | 2509.22712 | null |
| 2025-09-26 | Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance | Luc Boudier et.al. | 2509.22635 | null |
| 2025-09-26 | Rule-Based Reinforcement Learning for Document Image Classification with Vision Language Models | Michael Jungo et.al. | 2509.22283 | null |
| 2025-09-26 | Universal Legal Article Prediction via Tight Collaboration between Supervised Classification Model and LLM | Xiao Chi et.al. | 2509.22119 | null |
| 2025-09-25 | Filtering with Confidence: When Data Augmentation Meets Conformal Prediction | Zixuan Wu et.al. | 2509.21479 | null |
| 2025-09-23 | Coreset selection based on Intra-class diversity | Imran Ashraf et.al. | 2509.21380 | null |
| 2025-09-21 | MDF-MLLM: Deep Fusion Through Cross-Modal Feature Alignment for Contextually Aware Fundoscopic Image Classification | Jason Jordan et.al. | 2509.21358 | null |
| 2025-09-25 | AutoIntent: AutoML for Text Classification | Ilya Alekseev et.al. | 2509.21138 | link |
| 2025-09-25 | Sparse Representations Improve Adversarial Robustness of Neural Network Classifiers | Killian Steunou et.al. | 2509.21130 | link |
| 2025-09-25 | Concepts in Motion: Temporal Bottlenecks for Interpretable Video Classification | Patrick Knab et.al. | 2509.20899 | link |
| 2025-09-25 | Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017) | Herve Goeau et.al. | 2509.20856 | null |
| 2025-09-25 | Punching Above Precision: Small Quantized Model Distillation with Learnable Regularizer | Abdur Rehman et.al. | 2509.20854 | null |
| 2025-09-24 | Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits | Weixin Chen et.al. | 2509.20549 | null |
| 2025-09-24 | Efficiently Attacking Memorization Scores | Tue Do et.al. | 2509.20463 | null |
| 2025-09-24 | Enabling Multi-Species Bird Classification on Low-Power Bioacoustic Loggers | Stefano Ciapponi et.al. | 2509.20103 | null |
| 2025-09-24 | Anatomically Constrained Transformers for Cardiac Amyloidosis Classification | Alexander Thorley et.al. | 2509.19691 | null |
| 2025-09-24 | Thinking While Listening: Simple Test Time Scaling For Audio Classification | Prateek Verma et.al. | 2509.19676 | null |
| 2025-09-14 | Holographic Transformers for Complex-Valued Signal Processing: Integrating Phase Interference into Self-Attention | Enhao Huang et.al. | 2509.19331 | null |
| 2025-09-23 | Algorithms for Adversarially Robust Deep Learning | Alexander Robey et.al. | 2509.19100 | null |
| 2025-09-23 | No Labels Needed: Zero-Shot Image Classification with Collaborative Self-Learning | Matheus Vinícius Todescato et.al. | 2509.18938 | null |
| 2025-09-23 | Benchmarking Vision-Language and Multimodal Large Language Models in Zero-shot and Few-shot Scenarios: A study on Christian Iconography | Gianmarco Spinaci et.al. | 2509.18839 | null |
| 2025-09-23 | Lightweight Vision Transformer with Window and Spatial Attention for Food Image Classification | Xinle Gao et.al. | 2509.18692 | null |
| 2025-09-23 | An overview of neural architectures for self-supervised audio representation learning from masked spectrograms | Sarthak Yadav et.al. | 2509.18691 | null |
| 2025-09-21 | Automatic Classification of Magnetic Chirality of Solar Filaments from H-Alpha Observations | Alexis Chalmers et.al. | 2509.18214 | null |
| 2025-09-17 | Self Identity Mapping | Xiuding Cai et.al. | 2509.18165 | null |
| 2025-09-22 | Elucidating the Design Space of FP4 training | Robert Hu et.al. | 2509.17791 | null |
| 2025-09-22 | Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification | Sheng Huang et.al. | 2509.17747 | null |
| 2025-09-22 | WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification | Yiwen Jiang et.al. | 2509.17740 | null |
| 2025-09-22 | Enhancing Cross-Lingual Transfer through Reversible Transliteration: A Huffman-Based Approach for Low-Resource Languages | Wenhao Zhuang et.al. | 2509.17493 | null |
| 2025-09-22 | Multimodal Medical Image Classification via Synergistic Learning Pre-training | Qinghua Lin et.al. | 2509.17492 | null |
| 2025-09-21 | DeepASA: An Object-Oriented One-for-All Network for Auditory Scene Analysis | Dongheon Lee et.al. | 2509.17247 | null |
| 2025-09-21 | Flow-Induced Diagonal Gaussian Processes | Moule Lin et.al. | 2509.17153 | null |
| 2025-09-20 | Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models | Townim Faisal Chowdhury et.al. | 2509.16822 | null |
| 2025-09-20 | Towards a Transparent and Interpretable AI Model for Medical Image Classifications | Binbin Wen et.al. | 2509.16685 | null |
| 2025-09-20 | LLM-Guided Co-Training for Text Classification | Md Mezbaur Rahman et.al. | 2509.16516 | null |
| 2025-09-19 | Training Variational Quantum Circuits Using Particle Swarm Optimization | Marco Mordacci et.al. | 2509.15726 | null |
| 2025-09-19 | Impact of Single Rotations and Entanglement Topologies in Quantum Neural Networks | Marco Mordacci et.al. | 2509.15722 | null |
| 2025-09-18 | Training thermodynamic computers by gradient descent | Stephen Whitelam et.al. | 2509.15324 | null |
| 2025-09-18 | Which Direction to Choose? An Analysis on the Representation Power of Self-Supervised ViTs in Downstream Tasks | Yannis Kaltampanidis et.al. | 2509.15272 | null |
| 2025-09-17 | M-PACE: Mother Child Framework for Multimodal Compliance | Shreyash Verma et.al. | 2509.15241 | null |
| 2025-09-18 | Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models | Haobo Yang et.al. | 2509.15156 | null |
| 2025-09-18 | Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers | Andrei Chertkov et.al. | 2509.15113 | null |
| 2025-09-18 | MARIC: Multi-Agent Reasoning for Image Classification | Wonduk Seo et.al. | 2509.14860 | null |
| 2025-09-18 | Threat Modeling for Enhancing Security of IoT Audio Classification Devices under a Secure Protocols Framework | Sergio Benlloch-Lopez et.al. | 2509.14657 | null |
| 2025-09-18 | Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System | Jun-Wei Yeow et.al. | 2509.14650 | null |
| 2025-09-16 | HQCNN: A Hybrid Quantum-Classical Neural Network for Medical Image Classification | Shahjalal et.al. | 2509.14277 | null |
| 2025-09-17 | CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts | Leonard Hackel et.al. | 2509.14104 | null |
| 2025-09-17 | Comprehensive Evaluation of CNN-Based Audio Tagging Models on Resource-Constrained Devices | Jordi Grau-Haro et.al. | 2509.14049 | null |
| 2025-09-17 | Quantum Variational Activation Functions Empower Kolmogorov-Arnold Networks | Jiun-Cheng Jiang et.al. | 2509.14026 | link |
| 2025-09-17 | Taylor-Series Expanded Kolmogorov-Arnold Network for Medical Imaging Classification | Kaniz Fatema et.al. | 2509.13687 | null |
| 2025-09-17 | Deep Lookup Network | Yulan Guo et.al. | 2509.13662 | null |
| 2025-09-16 | Multimodal Hate Detection Using Dual-Stream Graph Neural Networks | Jiangbei Yue et.al. | 2509.13515 | null |
| 2025-09-14 | Hybrid Quantum-Classical Model for Image Classification | Muhammad Adnan Shahzad et.al. | 2509.13353 | link |
| 2025-09-16 | Hierarchical Deep Fusion Framework for Multi-dimensional Facial Forgery Detection – The 2024 Global Deepfake Image Detection Challenge | Kohou Wang et.al. | 2509.13107 | null |
| 2025-09-16 | Time-step Mixup for Efficient Spiking Knowledge Transfer from Appearance to Event Domain | Yuqi Xie et.al. | 2509.12959 | null |
| 2025-09-16 | Reversible Deep Equilibrium Models | Sam McCallum et.al. | 2509.12917 | null |
| 2025-09-15 | GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images | Florian Zager et.al. | 2509.12380 | null |
| 2025-09-13 | A Modern Look at Simplicity Bias in Image Classification Tasks | Xiaoguang Chang et.al. | 2509.12265 | null |
| 2025-09-10 | RU-Net for Automatic Characterization of TRISO Fuel Cross Sections | Lu Cai et.al. | 2509.12244 | null |
| 2025-09-15 | GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models | Min Zeng et.al. | 2509.12108 | null |
| 2025-09-15 | Neuromorphic Photonic Circuits with Nonlinear Dynamics and Memory for Time Sequence Classification | Alessandro Foradori et.al. | 2509.11721 | null |
| 2025-09-15 | Optimizing Class Distributions for Bias-Aware Multi-Class Learning | Mirco Felske et.al. | 2509.11588 | null |
| 2025-09-14 | Decoding Musical Origins: Distinguishing Human and AI Composers | Cheng-Yang Tsai et.al. | 2509.11369 | null |
| 2025-09-14 | Promoting Shape Bias in CNNs: Frequency-Based and Contrastive Regularization for Corruption Robustness | Robin Narsingh Ranabhat et.al. | 2509.11355 | null |
| 2025-09-14 | The Impact of Skin Tone Label Granularity on the Performance and Fairness of AI Based Dermatology Image Classification Models | Partha Shah et.al. | 2509.11184 | null |
| 2025-09-14 | An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift | Peihong Zhang et.al. | 2509.11168 | null |
| 2025-09-14 | A Collaborative Framework for Quantum Optimisation and Quantum Neural Networks: Credit Feature Selection and Image Classification | JiaNing Long et.al. | 2509.11110 | null |
| 2025-09-14 | UltraUPConvNet: A UPerNet- and ConvNeXt-Based Multi-Task Network for Ultrasound Tissue Segmentation and Disease Prediction | Zhi Chen et.al. | 2509.11108 | null |
| 2025-09-12 | A Comparison and Evaluation of Fine-tuned Convolutional Neural Networks to Large Language Models for Image Classification and Segmentation of Brain Tumors on MRI | Felicia Liu et.al. | 2509.10683 | null |
| 2025-09-10 | Combining Audio and Non-Audio Inputs in Evolved Neural Networks for Ovenbird | Sergio Poo Hernandez et.al. | 2509.10566 | null |
| 2025-09-02 | FireGNN: Neuro-Symbolic Graph Neural Networks with Trainable Fuzzy Rules for Interpretable Medical Image Classification | Prajit Sengupta et.al. | 2509.10510 | link |
| 2025-09-12 | Beyond Token Limits: Assessing Language Model Performance on Long Text Classification | Miklós Sebők et.al. | 2509.10199 | null |
| 2025-09-12 | Prototypical Contrastive Learning For Improved Few-Shot Audio Classification | Christos Sgouropoulos et.al. | 2509.10074 | null |
| 2025-09-12 | Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation | Ee-Leng Tan et.al. | 2509.09931 | null |
| 2025-09-11 | Images in Motion?: A First Look into Video Leakage in Collaborative Deep Learning | Md Fazle Rasul et.al. | 2509.09742 | null |
| 2025-09-11 | Image Recognition with Vision and Language Embeddings of VLMs | Illia Volkov et.al. | 2509.09311 | null |
| 2025-09-11 | Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification | Seung Gyu Jeong et.al. | 2509.09262 | null |
| 2025-09-11 | CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution | Yulin Tong et.al. | 2509.09163 | null |
| 2025-09-10 | CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision | Puskal Khadka et.al. | 2509.08959 | link |
| 2025-09-10 | UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation | Zhihao Zhao et.al. | 2509.08624 | null |
| 2025-09-10 | HyperTTA: Test-Time Adaptation for Hyperspectral Image Classification under Distribution Shifts | Xia Yue et.al. | 2509.08436 | null |
| 2025-09-10 | Boosted Training of Lightweight Early Exits for Optimizing CNN Image Classification Inference | Yehudit Aperstein et.al. | 2509.08318 | null |
| 2025-09-10 | SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training | Rongsheng Wang et.al. | 2509.08311 | link |
| 2025-09-09 | Are Humans as Brittle as Large Language Models? | Jiahui Li et.al. | 2509.07869 | null |
| 2025-09-09 | Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks | Friedrich Wolf-Monheim et.al. | 2509.07756 | null |
| 2025-09-09 | Nearest Neighbor Projection Removal Adversarial Training | Himanshu Singh et.al. | 2509.07673 | null |
| 2025-09-09 | MedicalPatchNet: A Patch-Based Self-Explainable AI Architecture for Chest X-ray Classification | Patrick Wienholt et.al. | 2509.07477 | link |
| 2025-09-08 | Dimensionally Reduced Open-World Clustering: DROWCULA | Erencem Ozbey et.al. | 2509.07184 | null |
| 2025-09-07 | 1 bit is all we need: binary normalized neural networks | Eduardo Lobo Lustoda Cabral et.al. | 2509.07025 | null |
| 2025-09-03 | FedAPT: Federated Adversarial Prompt Tuning for Vision-Language Models | Kun Zhai et.al. | 2509.06992 | null |
| 2025-09-08 | Entanglement and Classical Simulability in Quantum Extreme Learning Machines | A. De Lorenzis et.al. | 2509.06873 | null |
| 2025-09-08 | Video-Based MPAA Rating Prediction: An Attention-Driven Hybrid Architecture Using Contrastive Learning | Dipta Neogi et.al. | 2509.06826 | null |
| 2025-09-08 | Classical Neural Networks on Quantum Devices via Tensor Network Disentanglers: A Case Study in Image Classification | Borja Aizpurua et.al. | 2509.06653 | null |
| 2025-09-08 | IGAff: Benchmarking Adversarial Iterative and Genetic Affine Algorithms on Deep Neural Networks | Sebastian-Vasile Echim et.al. | 2509.06459 | null |
| 2025-09-07 | Khana: A Comprehensive Indian Cuisine Dataset | Omkar Prabhu et.al. | 2509.06006 | null |
| 2025-09-07 | A brain-inspired paradigm for scalable quantum vision | Chenghua Duan et.al. | 2509.05919 | null |
| 2025-09-06 | Brain Tumor Detection Through Diverse CNN Architectures in IoT Healthcare Industries: Fast R-CNN, U-Net, Transfer Learning-Based CNN, and Fully Connected CNN | Mohsen Asghari Ilani et.al. | 2509.05821 | null |
| 2025-09-06 | High Utilization Energy-Aware Real-Time Inference Deep Convolutional Neural Network Accelerator | Kuan-Ting Lin et.al. | 2509.05688 | null |
| 2025-09-06 | LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding | Yuxuan Hu et.al. | 2509.05657 | null |
| 2025-09-05 | Quaternion Approximation Networks for Enhanced Image Classification and Oriented Object Detection | Bryce Grant et.al. | 2509.05512 | null |
| 2025-09-05 | Prior Distribution and Model Confidence | Maksim Kazanskii et.al. | 2509.05485 | null |
| 2025-09-06 | Universality of physical neural networks with multivariate nonlinearity | Benjamin Savinson et.al. | 2509.05420 | null |
| 2025-08-30 | Application of discrete Ricci curvature in pruning randomly wired neural networks: A case study with chest x-ray classification of COVID-19 | Pavithra Elumalai et.al. | 2509.05322 | null |
| 2025-08-30 | Context-Aware Knowledge Distillation with Adaptive Weighting for Image Classification | Zhengda Li et.al. | 2509.05319 | null |
| 2025-09-04 | Noisy Label Refinement with Semantically Reliable Synthetic Images | Yingxuan Li et.al. | 2509.04298 | null |
| 2025-09-04 | An Automated, Scalable Machine Learning Model Inversion Assessment Pipeline | Tyler Shumaker et.al. | 2509.04214 | null |
| 2025-09-04 | Real Time FPGA Based Transformers & VLMs for Vision Tasks: SOTA Designs and Optimizations | Safa Mohammed Sali et.al. | 2509.04162 | null |
| 2025-09-04 | SAC-MIL: Spatial-Aware Correlated Multiple Instance Learning for Histopathology Whole Slide Image Classification | Yu Bai et.al. | 2509.03973 | null |
| 2025-09-03 | Learning Mechanism Underlying NLP Pre-Training and Fine-Tuning | Yarden Tzach et.al. | 2509.03407 | null |
| 2025-09-03 | TinyDrop: Tiny Model Guided Token Dropping for Vision Transformers | Guoxin Wang et.al. | 2509.03379 | null |
| 2025-08-24 | The Lifecycle Principle: Stabilizing Dynamic Neural Networks with State Memory | Zichuan Yang et.al. | 2509.02575 | null |
| 2025-09-02 | Ordinal Adaptive Correction: A Data-Centric Approach to Ordinal Image Classification with Noisy Labels | Alireza Sedighi Moghaddam et.al. | 2509.02351 | null |
| 2025-09-02 | Extrapolated Markov Chain Oversampling Method for Imbalanced Text Classification | Aleksi Avela et.al. | 2509.02332 | null |
| 2025-09-02 | HydroVision: Predicting Optically Active Parameters in Surface Water Using Computer Vision | Shubham Laxmikant Deshmukh et.al. | 2509.01882 | null |
| 2025-09-01 | Modeling and benchmarking quantum optical neurons for efficient neural computation | Andrea Andrisani et.al. | 2509.01784 | null |
| 2025-09-01 | Examination of PCA Utilisation for Multilabel Classifier of Multispectral Images | Filip Karpowicz et.al. | 2509.01691 | null |
| 2025-09-01 | AgroSense: An Integrated Deep Learning System for Crop Recommendation via Soil Image Analysis and Nutrient Profiling | Vishal Pandey et.al. | 2509.01344 | null |
| 2025-08-31 | Hybrid Topic-Semantic Labeling and Graph Embeddings for Unsupervised Legal Document Clustering | Deepak Bastola et.al. | 2509.00990 | null |
| 2025-08-31 | Performance Analysis of Supervised Machine Learning Algorithms for Text Classification | Sadia Zaman Mishu et.al. | 2509.00983 | null |
| 2025-08-31 | Quantization Meets OOD: Generalizable Quantization-aware Training from a Flatness Perspective | Jiacheng Jiang et.al. | 2509.00859 | null |
| 2025-08-31 | A computer vision-based approach to enhance seismic catalogues | Michele De Solda et.al. | 2509.00791 | null |
| 2025-08-31 | Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification | Y Hop Nguyen et.al. | 2509.00752 | null |
| 2025-08-31 | CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification | Qingyu Wang et.al. | 2509.00677 | null |
| 2025-08-30 | All-optical classification of real biomedical cell images using a diffractive neural network: a simulation study | Norihide Sagami et.al. | 2509.00370 | null |
| 2025-08-30 | Target-Oriented Single Domain Generalization | Marzi Heidari et.al. | 2509.00351 | null |
| 2025-08-29 | Principled Approximation Methods for Efficient and Scalable Deep Learning | Pedro Savarese et.al. | 2509.00174 | null |
| 2025-08-27 | Yet Unnoticed in LSTM: Binary Tree Based Input Reordering, Weight Regularization, and Gate Nonlinearization | Mojtaba Moattari et.al. | 2509.00087 | null |
| 2025-08-24 | Performance is not All You Need: Sustainability Considerations for Algorithms | Xiang Li et.al. | 2509.00045 | null |
| 2025-08-29 | I Stolenly Swear That I Am Up to (No) Good: Design and Evaluation of Model Stealing Attacks | Daryna Oliynyk et.al. | 2508.21654 | null |
| 2025-08-28 | Full-Frequency Temporal Patching and Structured Masking for Enhanced Audio Classification | Aditya Makineni et.al. | 2508.21243 | null |
| 2025-08-28 | Online incremental learning for audio classification using a pretrained audio model | Manjunath Mulimani et.al. | 2508.20732 | null |
| 2025-08-28 | Domain Adaptation Techniques for Natural and Medical Image Classification | Ahmad Chaddad et.al. | 2508.20537 | null |
| 2025-08-28 | Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification | Ayaka Tsutsumi et.al. | 2508.20461 | null |
| 2025-08-27 | Exploring Selective Retrieval-Augmentation for Long-Tail Legal Text Classification | Boheng Mao et.al. | 2508.19997 | null |
| 2025-08-27 | Dhati+: Fine-tuned Large Language Models for Arabic Subjectivity Evaluation | Slimane Bellaouar et.al. | 2508.19966 | null |
| 2025-08-27 | Microscale optoelectronic reservoir networks of halide perovskite for in-sensor computing | Jeroen J. de Boer et.al. | 2508.19916 | null |
| 2025-08-27 | Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models | Xiaoqi Wang et.al. | 2508.19850 | link |
| 2025-08-26 | Time Series Analysis of Spiking Neural Systems via Transfer Entropy and Directed Persistent Homology | Dylan Peek et.al. | 2508.19048 | null |
| 2025-08-26 | Automatic Prompt Optimization with Prompt Distillation | Ernest A. Dyagin et.al. | 2508.18992 | link |
| 2025-08-26 | Flatness-aware Curriculum Learning via Adversarial Difficulty | Hiroaki Aizawa et.al. | 2508.18726 | null |
| 2025-08-26 | Class-wise Flooding Regularization for Imbalanced Image Classification | Hiroaki Aizawa et.al. | 2508.18723 | null |
| 2025-08-26 | Natural Image Classification via Quasi-Cyclic Graph Ensembles and Random-Bond Ising Models at the Nishimori Temperature | V. S. Usatyuk et.al. | 2508.18717 | null |
| 2025-08-25 | Analise de Desaprendizado de Maquina em Modelos de Classificacao de Imagens Medicas | Andreza M. C. Falcao et.al. | 2508.18509 | null |
| 2025-08-25 | Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance | Xiangxiang Wang et.al. | 2508.18177 | null |
| 2025-08-25 | Hybrid Quantum-Classical Learning for Multiclass Image Classification | Shuchismita Anwar et.al. | 2508.18161 | null |
| 2025-08-25 | Designing Practical Models for Isolated Word Visual Speech Recognition | Iason Ioannis Panagos et.al. | 2508.17894 | null |
| 2025-08-25 | Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection | Daniel Frees et.al. | 2508.17567 | null |
| 2025-08-24 | Efficient Zero-Shot Long Document Classification by Reducing Context Through Sentence Ranking | Prathamesh Kokate et.al. | 2508.17490 | null |
| 2025-08-24 | Morphological Cognition: Classifying MNIST Digits Through Morphological Computation Alone | Alican Mertan et.al. | 2508.17469 | null |
| 2025-08-24 | ResLink: A Novel Deep Learning Architecture for Brain Tumor Classification with Area Attention and Residual Connections | Sumedha Arya et.al. | 2508.17259 | null |
| 2025-08-23 | GRAID: Synthetic Data Generation with Geometric Constraints and Multi-Agentic Reflection for Harmful Content Detection | Melissa Kazemi Rad et.al. | 2508.17057 | null |
| 2025-08-22 | Enhanced NIRMAL Optimizer With Damped Nesterov Acceleration: A Comparative Analysis | Nirmal Gaud et.al. | 2508.16550 | null |
| 2025-08-22 | LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models | Doohee You et.al. | 2508.16478 | null |
| 2025-08-22 | Vision encoders should be image size agnostic and task driven | Nedyalko Prisadnikov et.al. | 2508.16317 | null |
| 2025-08-22 | An Investigation of Visual Foundation Models Robustness | Sandeep Gupta et.al. | 2508.16225 | null |
| 2025-08-21 | Contributions to Label-Efficient Learning in Computer Vision and Remote Sensing | Minh-Tan Pham et.al. | 2508.15973 | null |
| 2025-08-21 | Glo-VLMs: Leveraging Vision-Language Models for Fine-Grained Diseased Glomerulus Classification | Zhenhao Guo et.al. | 2508.15960 | null |
| 2025-08-21 | Investigating Different Geo Priors for Image Classification | Angela Zhu et.al. | 2508.15946 | null |
| 2025-08-21 | Strategic Sample Selection for Improved Clean-Label Backdoor Attacks in Text Classification | Onur Alp Kirci et.al. | 2508.15934 | null |
| 2025-08-21 | Structure-Preserving Medical Image Generation from a Latent Graph Representation | Kevin Arias et.al. | 2508.15920 | null |
| 2025-08-13 | A BERT-based Hierarchical Classification Model with Applications in Chinese Commodity Classification | Kun Liu et.al. | 2508.15800 | null |
| 2025-08-21 | Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning, and Generative AI | Mohammed Elmusrati et.al. | 2508.15719 | null |
| 2025-08-21 | ASCMamba: Multimodal Time-Frequency Mamba for Acoustic Scene Classification | Bochao Sun et.al. | 2508.15632 | null |
| 2025-08-21 | AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation | Yulin Sun et.al. | 2508.15429 | null |
| 2025-08-21 | Transfer learning optimization based on evolutionary selective fine tuning | Jacinto Colan et.al. | 2508.15367 | null |
| 2025-08-21 | Explainable Knowledge Distillation for Efficient Medical Image Classification | Aqib Nazir Mir et.al. | 2508.15251 | null |
| 2025-08-21 | Robust and Efficient Quantum Reservoir Computing with Discrete Time Crystal | Da Zhang et.al. | 2508.15230 | null |
| 2025-08-20 | Fast Graph Neural Network for Image Classification | Mustafa Mohammadi Gharasuie et.al. | 2508.14958 | null |
| 2025-08-20 | HHNAS-AM: Hierarchical Hybrid Neural Architecture Search using Adaptive Mutation Policies | Anurag Tripathi et.al. | 2508.14946 | null |
| 2025-08-19 | TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation | Jiacheng Xie et.al. | 2508.14932 | null |
| 2025-08-20 | Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models | Jiabo Huang et.al. | 2508.14707 | null |
| 2025-08-20 | SMTrack: End-to-End Trained Spiking Neural Networks for Multi-Object Tracking in RGB Videos | Pengzhi Zhong et.al. | 2508.14607 | null |
| 2025-08-20 | Incremental Object Detection with Prompt-based Methods | Matthias Neuwirth-Trapp et.al. | 2508.14599 | null |
| 2025-08-20 | Multi-view Graph Condensation via Tensor Decomposition | Nícolas Roque dos Santos et.al. | 2508.14330 | null |
| 2025-08-19 | Graph Concept Bottleneck Models | Haotian Xu et.al. | 2508.14255 | null |
| 2025-08-19 | Accelerating Image Classification with Graph Convolutional Neural Networks using Voronoi Diagrams | Mustafa Mohammadi Gharasuie et.al. | 2508.14218 | null |
| 2025-08-19 | Comparing energy consumption and accuracy in text classification inference | Johannes Zschache et.al. | 2508.14170 | null |
| 2025-08-12 | Toward Lifelong Learning in Equilibrium Propagation: Sleep-like and Awake Rehearsal for Enhanced Stability | Yoshimasa Kubo et.al. | 2508.14081 | null |
| 2025-08-19 | Towards Efficient Vision State Space Models via Token Merging | Jinyoung Park et.al. | 2508.13599 | null |
| 2025-08-19 | A fully-programmable integrated photonic processor for both domain-specific and general-purpose computing | Feng-Kai Han et.al. | 2508.13551 | null |
| 2025-08-19 | Compressed Models are NOT Trust-equivalent to Their Large Counterparts | Rohit Raj Rai et.al. | 2508.13533 | null |
| 2025-08-19 | Vision Transformers for Kidney Stone Image Classification: A Comparative Study with CNNs | Ivan Reyes-Amezcua et.al. | 2508.13461 | null |
| 2025-08-18 | Applications of Small Language Models in Medical Imaging Classification with a Focus on Prompt Strategies | Yiting Wang et.al. | 2508.13378 | null |
| 2025-08-18 | Preserve and Sculpt: Manifold-Aligned Fine-tuning of Vision-Language Models for Few-Shot Learning | Dexia Chen et.al. | 2508.12877 | null |
| 2025-08-18 | CLAIRE-DSA: Fluoroscopic Image Classification for Quality Assurance of Computer Vision Pipelines in Acute Ischemic Stroke | Cristo J. van den Berg et.al. | 2508.12755 | null |
| 2025-08-17 | Skin Cancer Classification: Hybrid CNN-Transformer Models with KAN-Based Fusion | Shubhi Agarwal et.al. | 2508.12484 | null |
| 2025-08-17 | Federated Cross-Modal Style-Aware Prompt Generation | Suraj Prasad et.al. | 2508.12399 | null |
| 2025-08-17 | Attention Pooling Enhances NCA-based Classification of Microscopy Images | Chen Yang et.al. | 2508.12324 | null |
| 2025-08-17 | CryptPEFT: Efficient and Private Neural Network Inference via Parameter-Efficient Fine-Tuning | Saisai Xia et.al. | 2508.12264 | null |
| 2025-08-16 | Extending Straight-Through Estimation for Robust Neural Networks on Analog CIM Hardware | Yuannuo Feng et.al. | 2508.11940 | null |
| 2025-08-15 | An Efficient Medical Image Classification Method Based on a Lightweight Improved ConvNeXt-Tiny Architecture | Jingsong Xia et.al. | 2508.11532 | null |
| 2025-08-15 | Robust Convolution Neural ODEs via Contractivity-promoting regularization | Muhammad Zakwan et.al. | 2508.11432 | null |
| 2025-08-15 | Model Interpretability and Rationale Extraction by Input Mask Optimization | Marc Brinner et.al. | 2508.11388 | null |
| 2025-08-15 | Noise Matters: Optimizing Matching Noise for Diffusion Classifiers | Yanghao Wang et.al. | 2508.11330 | null |
| 2025-08-13 | NIRMAL Pooling: An Adaptive Max Pooling Approach with Non-linear Activation for Enhanced Image Classification | Nirmal Gaud et.al. | 2508.10940 | null |
| 2025-08-14 | X-Node: Self-Explanation is All We Need | Prajit Sengupta et.al. | 2508.10461 | link |
| 2025-08-13 | Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design | Yuhao Sun et.al. | 2508.10065 | null |
| 2025-08-10 | Multi-task Adversarial Attacks against Black-box Model with Few-shot Queries | Wenqiang Wang et.al. | 2508.10039 | null |
| 2025-08-08 | LLMCARE: Alzheimer’s Detection via Transformer Models Enhanced by LLM-Generated Synthetic Data | Ali Zolnour et.al. | 2508.10027 | null |
| 2025-08-04 | AutoGeTS: Knowledge-based Automated Generation of Text Synthetics for Improving Text Classification | Chenhao Xue et.al. | 2508.10000 | null |
| 2025-08-13 | MOC: Meta-Optimized Classifier for Few-Shot Whole Slide Image Classification | Tianqi Xiang et.al. | 2508.09967 | null |
| 2025-08-13 | HKT: A Biologically Inspired Framework for Modular Hereditary Knowledge Transfer in Neural Networks | Yanick Chistian Tchenko et.al. | 2508.09743 | null |
| 2025-08-13 | Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification | Haowen Wang et.al. | 2508.09550 | null |
| 2025-08-13 | CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection | Zhipeng Yuan et.al. | 2508.09477 | null |
| 2025-08-12 | SinLlama – A Large Language Model for Sinhala | H. W. K. Aravinda et.al. | 2508.09115 | null |
| 2025-08-12 | 3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs | Noor Ahmed et.al. | 2508.08821 | null |
| 2025-08-12 | Classifier Language Models: Unifying Sparse Finetuning and Adaptive Tokenization for Specialized Classification Tasks | Adit Krishnan et.al. | 2508.08635 | null |
| 2025-08-11 | Incoherent Light-Driven Nonlinear Optical Extreme Learner via Data Reverberation | Bofeng Liu et.al. | 2508.08428 | null |
| 2025-08-11 | Neural Tangent Knowledge Distillation for Optical Convolutional Networks | Jinlin Xiang et.al. | 2508.08421 | null |
| 2025-08-11 | SHeRL-FL: When Representation Learning Meets Split Learning in Hierarchical Federated Learning | Dung T. Tran et.al. | 2508.08339 | null |
| 2025-08-11 | FairFLRep: Fairness aware fault localization and repair of Deep Neural Networks | Moses Openja et.al. | 2508.08151 | null |
| 2025-08-11 | Pindrop it! Audio and Visual Deepfake Countermeasures for Robust Detection and Fine Grained-Localization | Nicholas Klein et.al. | 2508.08141 | null |
| 2025-08-11 | Data-Efficient Biomedical In-Context Learning: A Diversity-Enhanced Submodular Perspective | Jun Wang et.al. | 2508.08140 | null |
| 2025-08-11 | Auditory Intelligence: Understanding the World Through Sound | Hyeonuk Nam et.al. | 2508.07829 | null |
| 2025-08-11 | Importance-Aware Semantic Communication in MIMO-OFDM Systems Using Vision Transformer | Joohyuk Park et.al. | 2508.07696 | null |
| 2025-08-11 | GLiClass: Generalist Lightweight Model for Sequence Classification Tasks | Ihor Stepanov et.al. | 2508.07662 | link |
| 2025-08-09 | Sensory robustness through top-down feedback and neural stochasticity in recurrent vision models | Antonino Greco et.al. | 2508.07115 | null |
| 2025-08-09 | Nonlinear Photonic Neuromorphic Chips for Spiking Reinforcement Learning | Shuiying Xiang et.al. | 2508.06962 | null |
| 2025-08-09 | Beyond Frequency: Seeing Subtle Cues Through the Lens of Spatial Decomposition for Fine-Grained Visual Classification | Qin Xu et.al. | 2508.06959 | null |
| 2025-08-08 | Large Language Models for Oral History Understanding with Text Classification and Sentiment Analysis | Komala Subramanyam Cherukuri et.al. | 2508.06729 | link |
| 2025-08-06 | Slice or the Whole Pie? Utility Control for AI Models | Ye Tao et.al. | 2508.06551 | null |
| 2025-08-02 | Large Language Models Facilitate Vision Reflection in Image Classification | Guoyuan An et.al. | 2508.06525 | null |
| 2025-08-08 | Blockchain-Enabled Federated Learning | Murtaza Rangwala et.al. | 2508.06406 | null |
| 2025-08-08 | Text as Any-Modality for Zero-Shot Classification by Consistent Prompt Tuning | Xiangyu Wu et.al. | 2508.06382 | null |
| 2025-08-08 | FedX: Explanation-Guided Pruning for Communication-Efficient Federated Learning in Remote Sensing | Barış Büyüktaş et.al. | 2508.06256 | null |
| 2025-08-07 | AHDMIL: Asymmetric Hierarchical Distillation Multi-Instance Learning for Fast and Accurate Whole-Slide Image Classification | Jiuyang Dong et.al. | 2508.05114 | null |
| 2025-08-07 | ULU: A Unified Activation Function | Simin Huo et.al. | 2508.05073 | null |
| 2025-08-07 | MedMambaLite: Hardware-Aware Mamba for Medical Image Classification | Romina Aalishah et.al. | 2508.05049 | null |
| 2025-08-06 | Revealing Temporal Label Noise in Multimodal Hateful Video Classification | Shuonan Yang et.al. | 2508.04900 | null |
| 2025-08-06 | Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning | Magauiya Zhussip et.al. | 2508.04581 | null |
| 2025-08-06 | Benchmarking Uncertainty and its Disentanglement in multi-label Chest X-Ray Classification | Simon Baur et.al. | 2508.04457 | null |
| 2025-08-06 | Matrix-Free Two-to-Infinity and One-to-Two Norms Estimation | Askar Tsyganov et.al. | 2508.04444 | null |
| 2025-08-06 | WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification | Thang Duc Tran et.al. | 2508.04308 | null |
| 2025-08-06 | Comparative Analysis of Novel NIRMAL Optimizer Against Adam and SGD with Momentum | Nirmal Gaud et.al. | 2508.04293 | null |
| 2025-08-06 | A machine learning approach for image classification in synthetic aperture RADAR | Romina Gaburro et.al. | 2508.04234 | null |
| 2025-08-06 | DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification | Saifullah Saifullah et.al. | 2508.04233 | null |
| 2025-08-06 | Hierarchical Text Classification Using Black Box Large Language Models | Kosuke Yoshimura et.al. | 2508.04219 | null |
| 2025-08-06 | DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models | Saifullah Saifullah et.al. | 2508.04208 | null |
| 2025-08-06 | Dual Prompt Learning for Adapting Vision-Language Models to Downstream Image-Text Retrieval | Yifan Wang et.al. | 2508.04028 | null |
| 2025-08-05 | FedPromo: Federated Lightweight Proxy Models at the Edge Bring New Domains to Foundation Models | Matteo Caligiuri et.al. | 2508.03356 | null |
| 2025-08-05 | Adaptive Sparse Softmax: An Effective and Efficient Softmax Variant | Qi Lv et.al. | 2508.03175 | null |
| 2025-08-05 | Augmenting Continual Learning of Diseases with LLM-Generated Visual Concepts | Jiantao Tan et.al. | 2508.03094 | null |
| 2025-08-05 | Contrastive Cross-Bag Augmentation for Multiple Instance Learning-based Whole Slide Image Classification | Bo Zhang et.al. | 2508.03081 | null |
| 2025-08-05 | The Geometry of Cortical Computation: Manifold Disentanglement and Predictive Dynamics in VCNet | Brennen A. Hill et.al. | 2508.02995 | null |
| 2025-08-04 | Tricks and Plug-ins for Gradient Boosting with Transformers | Biyi Fang et.al. | 2508.02924 | null |
| 2025-08-04 | ASMR: Angular Support for Malfunctioning Client Resilience in Federated Learning | Mirko Konstantin et.al. | 2508.02414 | null |
| 2025-08-04 | Semi-Supervised Dual-Threshold Contrastive Learning for Ultrasound Image Classification and Segmentation | Peng Zhang et.al. | 2508.02265 | null |
| 2025-08-04 | Reservoir Computing with Evolved Critical Neural Cellular Automata | Sidney Pontes-Filho et.al. | 2508.02218 | null |
| 2025-08-04 | Large-Scale Model Enabled Semantic Communication Based on Robust Knowledge Distillation | Kuiyuan Ding et.al. | 2508.02148 | null |
| 2025-08-04 | FedLAD: A Linear Algebra Based Data Poisoning Defence for Federated Learning | Qi Xiong et.al. | 2508.02136 | null |
| 2025-08-04 | REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification | Hongzhao Chen et.al. | 2508.02104 | null |
| 2025-08-04 | Deeply Dual Supervised learning for melanoma recognition | Rujosh Polma et.al. | 2508.01994 | null |
| 2025-08-03 | Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations | Dahee Kwon et.al. | 2508.01728 | null |
| 2025-08-03 | HateClipSeg: A Segment-Level Annotated Dataset for Fine-Grained Hate Video Detection | Han Wang et.al. | 2508.01712 | null |
| 2025-08-03 | TopoImages: Incorporating Local Topology Encoding into Deep Learning Models for Medical Image Classification | Pengfei Gu et.al. | 2508.01574 | null |
| 2025-08-03 | EvoVLMA: Evolutionary Vision-Language Model Adaptation | Kun Ding et.al. | 2508.01558 | null |
| 2025-08-02 | TeSent: A Benchmark Dataset for Fairness-aware Explainable Sentiment Classification in Telugu | Vallabhaneni Raj Kumar et.al. | 2508.01486 | null |
| 2025-08-02 | GMAT: Grounded Multi-Agent Clinical Description Generation for Text Encoder in Vision-Language MIL for Whole Slide Image Classification | Ngoc Bui Lam Quang et.al. | 2508.01293 | null |
| 2025-08-02 | Eigen Neural Network: Unlocking Generalizable Vision with Eigenbasis | Anzhe Cheng et.al. | 2508.01219 | null |
| 2025-08-01 | Small sample-based adaptive text classification through iterative and contrastive description refinement | Amrit Rajeev et.al. | 2508.00957 | null |
| 2025-07-30 | XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML | Ernesto L. Estevanell-Valladares et.al. | 2508.00924 | null |
| 2025-07-31 | Object-Centric Cropping for Visual Few-Shot Classification | Aymane Abdali et.al. | 2508.00218 | null |
| 2025-07-31 | Explainable Image Classification with Reduced Overconfidence for Tissue Characterisation | Alfie Roddan et.al. | 2507.23709 | null |
| 2025-07-31 | I Am Big, You Are Little; I Am Right, You Are Wrong | David A. Kelly et.al. | 2507.23509 | null |
| 2025-07-31 | Causal Identification of Sufficient, Contrastive and Complete Feature Sets in Image Classification | David A Kelly et.al. | 2507.23497 | null |
| 2025-07-31 | Smart Video Capsule Endoscopy: Raw Image-Based Localization for Enhanced GI Tract Investigation | Oliver Bause et.al. | 2507.23398 | null |
| 2025-07-31 | Popov Mirror-Prox Method for Variational Inequalities | Abhishek Chakraborty et.al. | 2507.23395 | null |
| 2025-07-31 | Analysis of Hyperparameter Optimization Effects on Lightweight Deep Models for Real-Time Image Classification | Vineet Kumar Rakesh et.al. | 2507.23315 | null |
| 2025-07-30 | Vocabulary-free Fine-grained Visual Recognition via Enriched Contextually Grounded Vision-Language Model | Dmitry Demidov et.al. | 2507.23070 | null |
| 2025-07-30 | Tricks and Plug-ins for Gradient Boosting in Image Classification | Biyi Fang et.al. | 2507.22842 | null |
| 2025-07-30 | Label-free estimation of clinically relevant performance metrics under distribution shifts | Tim Flühmann et.al. | 2507.22776 | null |
| 2025-07-30 | RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning | Kiseong Hong et.al. | 2507.22553 | null |
| 2025-07-30 | LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning | Xiang Li et.al. | 2507.22499 | null |
| 2025-07-30 | Visual Language Models as Zero-Shot Deepfake Detectors | Viacheslav Pirogov et.al. | 2507.22469 | null |
| 2025-07-29 | HOG-CNN: Integrating Histogram of Oriented Gradients with Convolutional Neural Networks for Retinal Image Classification | Faisal Ahmed et.al. | 2507.22274 | null |
| 2025-07-29 | LLM-based Content Classification Approach for GitHub Repositories by the README Files | Malik Uzair Mehmood et.al. | 2507.21899 | null |
| 2025-07-29 | Improving Neural Network Training using Dynamic Learning Rate Schedule for PINNs and Image Classification | D. Veerababu et.al. | 2507.21749 | null |
| 2025-07-29 | Ethical Classification of Non-Coding Contributions in Open-Source Projects via Large Language Models | Sergio Cobos et.al. | 2507.21583 | null |
| 2025-07-28 | Evaluating Deep Learning Models for African Wildlife Image Classification: From DenseNet to Vision Transformers | Lukman Jibril Aliyu et.al. | 2507.21364 | null |
| 2025-07-28 | Can human clinical rationales improve the performance and explainability of clinical text classification models? | Christoph Metzner et.al. | 2507.21302 | null |
| 2025-07-28 | Dual Guidance Semi-Supervised Action Detection | Ankit Singh et.al. | 2507.21247 | null |
| 2025-07-27 | Contrast-CAT: Contrasting Activations for Enhanced Interpretability in Transformer-based Text Classifiers | Sungmin Han et.al. | 2507.21186 | null |
| 2025-07-24 | Comparative Analysis of Vision Transformers and Convolutional Neural Networks for Medical Image Classification | Kunal Kawadkar et.al. | 2507.21156 | null |
| 2025-07-28 | Compositional Function Networks: A High-Performance Alternative to Deep Neural Networks with Built-in Interpretability | Fang Li et.al. | 2507.21004 | null |
| 2025-07-28 | Lightweight Remote Sensing Scene Classification on Edge Devices via Knowledge Distillation and Early-exit | Yang Zhao et.al. | 2507.20623 | null |
| 2025-07-28 | PhaseNAS: Language-Model Driven Architecture Search with Dynamic Phase Adaptation | Fei Kong et.al. | 2507.20592 | null |
| 2025-07-27 | L-MCAT: Unpaired Multimodal Transformer with Contrastive Attention for Label-Efficient Satellite Image Classification | Mitul Goswami et.al. | 2507.20259 | null |
| 2025-07-27 | Dual-Stream Global-Local Feature Collaborative Representation Network for Scene Classification of Mining Area | Shuqi Fan et.al. | 2507.20216 | null |
| 2025-07-26 | Improving Audio Classification by Transitioning from Zero- to Few-Shot | James Taylor et.al. | 2507.20036 | null |
| 2025-07-26 | AF-CLIP: Zero-Shot Anomaly Detection via Anomaly-Focused CLIP Adaptation | Qingqing Fang et.al. | 2507.19949 | null |
| 2025-07-26 | Causality-aligned Prompt Learning via Diffusion-based Counterfactual Generation | Xinshu Li et.al. | 2507.19882 | null |
| 2025-07-26 | FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving | Tao Lian et.al. | 2507.19881 | null |
| 2025-07-26 | Debunking Optimization Myths in Federated Learning for Medical Image Classification | Youngjoon Lee et.al. | 2507.19822 | null |
| 2025-07-25 | Joint Feature and Output Distillation for Low-complexity Acoustic Scene Classification | Haowen Li et.al. | 2507.19557 | null |
| 2025-07-25 | MedSymmFlow: Bridging Generative Modeling and Classification in Medical Imaging through Symmetrical Flow Matching | Francisco Caetano et.al. | 2507.19098 | link |
| 2025-07-25 | A New One-Shot Federated Learning Framework for Medical Imaging Classification with Feature-Guided Rectified Flow and Knowledge Distillation | Yufei Ma et.al. | 2507.19045 | null |
| 2025-07-24 | The Role of Orthographic Consistency in Multilingual Embedding Models for Text Classification in Arabic-Script Languages | Abdulhady Abas Abdullah et.al. | 2507.18762 | null |
| 2025-07-24 | CatchPhrase: EXPrompt-Guided Encoder Adaptation for Audio-to-Image Generation | Hyunwoo Oh et.al. | 2507.18750 | null |
| 2025-07-23 | VGS-ATD: Robust Distributed Learning for Multi-Label Medical Image Classification Under Heterogeneous and Imbalanced Conditions | Zehui Zhao et.al. | 2507.18657 | null |
| 2025-07-24 | On the Performance of Concept Probing: The Influence of the Data (Extended Version) | Manuel de Sousa Ribeiro et.al. | 2507.18550 | null |
| 2025-07-24 | GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface | Urchade Zaratiana et.al. | 2507.18546 | link |
| 2025-07-24 | Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows | Simin Huo et.al. | 2507.18405 | null |
| 2025-07-24 | FedSA-GCL: A Semi-Asynchronous Federated Graph Learning Framework with Personalized Aggregation and Cluster-Aware Broadcasting | Zhongzheng Yuan et.al. | 2507.18219 | null |
| 2025-07-23 | LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning | Luca Salvatore Lorello et.al. | 2507.17482 | null |
| 2025-07-23 | Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation | Zixuan Wang et.al. | 2507.17204 | null |
| 2025-07-22 | Combining Language and Topic Models for Hierarchical Text Classification | Jaco du Toit et.al. | 2507.16490 | null |
| 2025-07-22 | The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation | Sara Ahmadian et.al. | 2507.16345 | null |
| 2025-07-22 | Cross-Modal Distillation For Widely Differing Modalities | Cairong Zhao et.al. | 2507.16296 | null |
| 2025-07-22 | MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks | Junhao Su et.al. | 2507.16279 | null |
| 2025-07-22 | Quality Text, Robust Vision: The Role of Language in Enhancing Visual Robustness of Vision-Language Models | Futa Waseda et.al. | 2507.16257 | null |
| 2025-07-21 | Stop-band Energy Constraint for Orthogonal Tunable Wavelet Units in Convolutional Neural Networks for Computer Vision problems | An D. Le et.al. | 2507.16114 | null |
| 2025-07-21 | Optimizing Canaries for Privacy Auditing with Metagradient Descent | Matteo Boglioni et.al. | 2507.15836 | null |
| 2025-07-21 | GeMix: Conditional GAN-Based Mixup for Improved Medical Image Augmentation | Hugo Carlesso et.al. | 2507.15577 | null |
| 2025-07-21 | Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging | Nicolas Poggi et.al. | 2507.15576 | null |
| 2025-07-21 | An Investigation of Test-time Adaptation for Audio Classification under Background Noise | Weichuang Shao et.al. | 2507.15523 | null |
| 2025-07-20 | Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices | Saeid Ghafouri et.al. | 2507.14959 | null |
| 2025-07-20 | Probabilistic smooth attention for deep multiple instance learning in medical imaging | Francisco M. Castro-Macías et.al. | 2507.14932 | null |
| 2025-07-20 | Semantic-Aware Representation Learning for Multi-label Image Classification | Ren-Dong Xie et.al. | 2507.14918 | null |
| 2025-07-20 | The Tsetlin Machine Goes Deep: Logical Learning and Reasoning With Graphs | Ole-Christoffer Granmo et.al. | 2507.14874 | null |
| 2025-07-19 | Performance comparison of medical image classification systems using TensorFlow Keras, PyTorch, and JAX | Merjem Bećirović et.al. | 2507.14587 | null |
| 2025-07-18 | Classification of Histopathology Slides with Persistence Homology Convolutions | Shrunal Pothagoni et.al. | 2507.14378 | null |
| 2025-07-18 | Quantum Boltzmann Machines using Parallel Annealing for Medical Image Classification | Daniëlle Schuman et.al. | 2507.14116 | null |
| 2025-07-18 | Foundation Models as Class-Incremental Learners for Dermatological Image Classification | Mohamed Elkhayat et.al. | 2507.14050 | null |
| 2025-07-18 | Evaluating the Effectiveness of Cost-Efficient Large Language Models in Benchmark Biomedical Tasks | Israt Jahan et.al. | 2507.14045 | null |
| 2025-07-18 | Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations | Yong Feng et.al. | 2507.14010 | null |
| 2025-07-18 | Feature Engineering is Not Dead: Reviving Classical Machine Learning with Entropy, HOG, and LBP Feature Fusion for Image Classification | Abhijit Sen et.al. | 2507.13772 | null |
| 2025-07-18 | Adversarial Training Improves Generalization Under Distribution Shifts in Bioacoustics | René Heinrich et.al. | 2507.13727 | null |
| 2025-07-18 | Enhanced image classification via hybridizing quantum dynamics with classical neural networks | Ruiyang Zhou et.al. | 2507.13587 | null |
| 2025-07-17 | Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy | Yiting Yang et.al. | 2507.13260 | null |
| 2025-07-17 | Adversarial attacks to image classification systems using evolutionary algorithms | Sergio Nesmachnow et.al. | 2507.13136 | null |
| 2025-07-17 | MUPAX: Multidimensional Problem Agnostic eXplainable AI | Vincenzo Dentamaro et.al. | 2507.13090 | null |
| 2025-07-17 | Making Language Model a Hierarchical Classifier and Generator | Yihong Wang et.al. | 2507.12930 | null |
| 2025-07-17 | Federated Learning for Commercial Image Sources | Shreyansh Jain et.al. | 2507.12903 | null |
| 2025-07-17 | LanePerf: a Performance Estimation Framework for Lane Detection | Yin Wu et.al. | 2507.12894 | null |
| 2025-07-17 | Feature-Enhanced TResNet for Fine-Grained Food Image Classification | Lulu Liu et.al. | 2507.12828 | null |
| 2025-07-17 | Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Anastasia Kuznetsova et.al. | 2507.12701 | null |
| 2025-07-16 | Comparative Analysis of CNN Performance in Keras, PyTorch and JAX on PathMNIST | Anida Nezović et.al. | 2507.12248 | null |
| 2025-07-16 | PRISM: Distributed Inference for Foundation Models at Edge | Muhammad Azlan Qazi et.al. | 2507.12145 | null |
| 2025-07-16 | Effective Fine-Tuning of Vision Transformers with Low-Rank Adaptation for Privacy-Preserving Image Classification | Haiwei Lin et.al. | 2507.11943 | null |
| 2025-07-16 | Spatial Frequency Modulation for Semantic Segmentation | Linwei Chen et.al. | 2507.11893 | link |
| 2025-07-16 | ProtoConNet: Prototypical Augmentation and Alignment for Open-Set Few-Shot Image Classification | Kexuan Shi et.al. | 2507.11845 | null |
| 2025-07-15 | Quantum Adaptive Excitation Network with Variational Quantum Circuits for Channel Attention | Yu-Chao Hsu et.al. | 2507.11217 | null |
| 2025-07-15 | Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking | Yuan Yao et.al. | 2507.11137 | link |
| 2025-07-15 | Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification | Chetan Madan et.al. | 2507.10869 | null |
| 2025-07-14 | AudioMAE++: learning better masked audio representations with SwiGLU FFNs | Sarthak Yadav et.al. | 2507.10464 | null |
| 2025-07-14 | Improving Remote Sensing Classification using Topological Data Analysis and Convolutional Neural Networks | Aaryam Sharma et.al. | 2507.10381 | null |
| 2025-07-14 | FTCFormer: Fuzzy Token Clustering Transformer for Image Classification | Muyi Bao et.al. | 2507.10283 | null |
| 2025-07-14 | Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | Ben Hamscher et.al. | 2507.10239 | null |
| 2025-07-14 | MEDebiaser: A Human-AI Feedback System for Mitigating Bias in Multi-label Medical Image Classification | Shaohan Shi et.al. | 2507.10044 | null |
| 2025-07-14 | Effects of structural properties of neural networks on machine learning performance | Yash Arya et.al. | 2507.10005 | null |
| 2025-07-14 | Hierarchical Job Classification with Similarity Graph Integration | Md Ahsanul Kabir et.al. | 2507.09949 | null |
| 2025-07-13 | Post-Training Quantization of Generative and Discriminative LSTM Text Classifiers: A Study of Calibration, Class Balance, and Robustness | Md Mushfiqur Rahaman et.al. | 2507.09687 | null |
| 2025-07-13 | MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression | Ofir Gordon et.al. | 2507.09616 | null |
| 2025-07-13 | SDTN and TRN: Adaptive Spectral-Spatial Feature Extraction for Hyperspectral Image Classification | Fuyin Ye et.al. | 2507.09492 | null |
| 2025-07-11 | A Hybrid Multi-Well Hopfield-CNN with Feature Extraction and K-Means for MNIST Classification | Ahmed Farooq et.al. | 2507.08766 | null |
| 2025-07-11 | DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images | Haoran Sun et.al. | 2507.08648 | null |
| 2025-07-11 | Onboard Neuromorphic Split Computing via Optical Links for LEO Remote Sensing | Zihang Song et.al. | 2507.08490 | null |
| 2025-07-11 | Interpretability-Aware Pruning for Efficient Medical Image Analysis | Nikita Malik et.al. | 2507.08330 | null |
| 2025-07-11 | Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks | Sofia Ivolgina et.al. | 2507.08261 | null |
| 2025-07-10 | A Hybrid Multilayer Extreme Learning Machine for Image Classification with an Application to Quadcopters | Rolando A. Hernandez-Hernandez et.al. | 2507.08047 | null |
| 2025-07-10 | Where are we with calibration under dataset shift in image classification? | Mélanie Roschewitz et.al. | 2507.07780 | null |
| 2025-07-10 | TRIX- Trading Adversarial Fairness via Mixed Adversarial Training | Tejaswini Medi et.al. | 2507.07768 | null |
| 2025-07-10 | OPC: One-Point-Contraction Unlearning Toward Deep Feature Forgetting | Jaeheun Jung et.al. | 2507.07754 | null |
| 2025-07-10 | Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking | Qiangqiang Wu et.al. | 2507.07483 | null |
| 2025-07-10 | EPIC: Efficient Prompt Interaction for Text-Image Classification | Xinyao Yu et.al. | 2507.07415 | null |
| 2025-07-10 | GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation | Fardin Rastakhiz et.al. | 2507.07414 | null |
| 2025-07-09 | GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | S M Taslim Uddin Raju et.al. | 2507.07006 | null |
| 2025-07-09 | Unifying Re-Identification, Attribute Inference, and Data Reconstruction Risks in Differential Privacy | Bogdan Kulynych et.al. | 2507.06969 | null |
| 2025-07-09 | Steps Adaptive Decay DPSGD: Enhancing Performance on Imbalanced Datasets with Differential Privacy with HAM10000 | Xiaobo Huang et.al. | 2507.06619 | null |
| 2025-07-08 | Capsule-ConvKAN: A Hybrid Neural Approach to Medical Image Classification | Laura Pituková et.al. | 2507.06417 | null |
| 2025-07-08 | SoftReMish: A Novel Activation Function for Enhanced Convolutional Neural Networks for Visual Recognition Performance | Mustafa Bayram Gücen et.al. | 2507.06148 | null |
| 2025-07-08 | On the Effectiveness of Methods and Metrics for Explainable AI in Remote Sensing Image Scene Classification | Jonas Klotz et.al. | 2507.05916 | null |
| 2025-07-08 | Knowledge-guided Complex Diffusion Model for PolSAR Image Classification in Contourlet Domain | Junfei Shi et.al. | 2507.05666 | null |
| 2025-07-08 | Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization | Yuhang Li et.al. | 2507.05583 | null |
| 2025-07-07 | Experimental data re-uploading with provable enhanced learning capabilities | Martin F. X. Mauser et.al. | 2507.05120 | null |
| 2025-07-07 | Verified Language Processing with Hybrid Explainability: A Technical Report | Oliver Robert Fox et.al. | 2507.05017 | null |
| 2025-07-07 | Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification | Chenfei Xiong et.al. | 2507.05010 | null |
| 2025-07-07 | Bridging KAN and MLP: MJKAN, a Hybrid Architecture with Both Efficiency and Expressiveness | Hanseon Joo et.al. | 2507.04690 | null |
| 2025-07-07 | Recovering Plasticity of Neural Networks via Soft Weight Rescaling | Seungwon Oh et.al. | 2507.04683 | null |
| 2025-07-07 | VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents | Rui Meng et.al. | 2507.04590 | link |
| 2025-07-06 | MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone Architecture | Guandong Li et.al. | 2507.04409 | null |
| 2025-07-06 | Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic | Yuya Yoshikawa et.al. | 2507.04380 | null |
| 2025-07-06 | Efficient Training of Deep Networks using Guided Spectral Data Selection: A Step Toward Learning What You Need | Mohammadreza Sharifi et.al. | 2507.04269 | null |
| 2025-07-06 | Siberian radioheliograph image classification using ensemble of CLIP, EfficientNet and CatBoost models | Yaroslav Egorov et.al. | 2507.04211 | null |
| 2025-07-03 | Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics | Alex Colagrande et.al. | 2507.02748 | link |
| 2025-07-03 | ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning | Junyu Wang et.al. | 2507.02666 | null |
| 2025-07-03 | MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention | Zunhui Xia et.al. | 2507.02488 | null |
| 2025-07-03 | F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning | Wei Li et.al. | 2507.02437 | null |
| 2025-07-03 | Cross-domain Hyperspectral Image Classification based on Bi-directional Domain Adaptation | Yuxiang Zhang et.al. | 2507.02268 | null |
| 2025-07-03 | High-Fidelity Differential-information Driven Binary Vision Transformer | Tian Gao et.al. | 2507.02222 | null |
| 2025-07-02 | Selective Feature Re-Encoded Quantum Convolutional Neural Network with Joint Optimization for Image Classification | Shaswata Mahernob Sarkar et.al. | 2507.02086 | null |
| 2025-07-02 | How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks | Rahul Ramachandran et.al. | 2507.01955 | link |
| 2025-07-02 | evMLP: An Efficient Event-Driven MLP Architecture for Vision | Zhentan Zheng et.al. | 2507.01927 | link |
| 2025-07-02 | mGRADE: Minimal Recurrent Gating Meets Delay Convolutions for Lightweight Sequence Modeling | Tristan Torchet et.al. | 2507.01829 | null |
| 2025-07-02 | Are Vision Transformer Representations Semantically Meaningful? A Case Study in Medical Imaging | Montasir Shams et.al. | 2507.01788 | null |
| 2025-07-02 | Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation | Andrei Jelea et.al. | 2507.01347 | null |
| 2025-07-01 | Biorthogonal Tunable Wavelet Unit with Lifting Scheme in Convolutional Neural Network | An Le et.al. | 2507.00739 | null |
| 2025-07-01 | Rectifying Magnitude Neglect in Linear Attention | Qihang Fan et.al. | 2507.00698 | link |
| 2025-07-01 | Few-shot Classification as Multi-instance Verification: Effective Backbone-agnostic Transfer across Domains | Xin Xu et.al. | 2507.00401 | null |
| 2025-06-30 | Two-Stage Reasoning-Infused Learning: Improving Classification with LLM-Generated Reasoning | Mads Henrichsen et.al. | 2507.00214 | null |
| 2025-06-30 | Toward Simple and Robust Contrastive Explanations for Image Classification by Leveraging Instance Similarity and Concept Relevance | Yuliia Kaidashova et.al. | 2506.23975 | null |
| 2025-06-30 | Unveiling Decision-Making in LLMs for Text Classification : Extraction of influential and interpretable concepts with Sparse Autoencoders | Mathis Le Bail et.al. | 2506.23951 | null |
| 2025-06-30 | Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors | Ce Wang et.al. | 2506.23801 | null |
| 2025-07-01 | Towards the Training of Deeper Predictive Coding Neural Networks | Chang Qi et.al. | 2506.23800 | null |
| 2025-06-30 | A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement | Gaozheng Pei et.al. | 2506.23676 | null |
| 2025-06-30 | Robustness of Misinformation Classification Systems to Adversarial Examples Through BeamAttack | Arnisa Fazla et.al. | 2506.23661 | null |
| 2025-06-30 | AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays | Chenlang Yi et.al. | 2506.23467 | null |
| 2025-06-29 | Federated Breast Cancer Detection Enhanced by Synthetic Ultrasound Image Augmentation | Hongyi Pan et.al. | 2506.23334 | null |
| 2025-07-01 | Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification | Xing Shen et.al. | 2506.23298 | null |
| 2025-06-29 | Aggregating Local Saliency Maps for Semi-Global Explainable Image Classification | James Hinns et.al. | 2506.23247 | null |
| 2025-06-27 | Boosting Classification with Quantum-Inspired Augmentations | Matthias Tschöpe et.al. | 2506.22241 | null |
| 2025-06-27 | Remote Sensing Large Vision-Language Model: Semantic-augmented Multi-level Alignment and Semantic-aware Expert Modeling | Sungjune Park et.al. | 2506.21863 | null |
| 2025-06-27 | LinguaSynth: Heterogeneous Linguistic Signals for News Classification | Duo Zhang et.al. | 2506.21848 | null |
| 2025-06-25 | Disentangled representations of microscopy images | Jacopo Dapueto et.al. | 2506.20649 | null |
| 2025-06-25 | Counterfactual Influence as a Distributional Quantity | Matthieu Meeus et.al. | 2506.20481 | null |
| 2025-06-25 | Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networks | Jesús Lozano-Cruz et.al. | 2506.20355 | link |
| 2025-06-25 | Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding | Kazuki Yoda et.al. | 2506.20305 | null |
| 2025-06-25 | Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image Classification | Ning Luo et.al. | 2506.20263 | null |
| 2025-06-25 | Perspectives in Play: A Multi-Perspective Approach for More Inclusive NLP Systems | Benedetta Muscato et.al. | 2506.20209 | null |
| 2025-06-26 | Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition | Man Duc Chuc et.al. | 2506.20174 | null |
| 2025-06-24 | Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons | Dengyu Wu et.al. | 2506.20015 | null |
| 2025-06-24 | Ensemble nonlinear optical learner by electrically tunable linear scattering | Tunan Xia et.al. | 2506.19976 | null |
| 2025-06-25 | One Prototype Is Enough: Single-Prototype Activation for Interpretable Image Classification | Yitao Peng et.al. | 2506.19808 | null |
| 2025-06-24 | MambaOutRS: A Hybrid CNN-Fourier Architecture for Remote Sensing Image Classification | Minjong Cheon et.al. | 2506.19561 | null |
| 2025-06-24 | Iterative Quantum Feature Maps | Nasa Matsumoto et.al. | 2506.19461 | null |
| 2025-06-24 | Comparative Performance of Finetuned ImageNet Pre-trained Models for Electronic Component Classification | Yidi Shao et.al. | 2506.19330 | null |
| 2025-06-23 | LKA: Large Kernel Adapter for Enhanced Medical Image Classification | Ziquan Zhu et.al. | 2506.19118 | null |
| 2025-06-23 | Sensitivity Analysis of Image Classification Models using Generalized Polynomial Chaos | Lukas Bahr et.al. | 2506.18751 | null |
| 2025-06-23 | SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification | Youcef Sklab et.al. | 2506.18683 | null |
| 2025-06-23 | SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds | Mauricio Byrd Victorica et.al. | 2506.18591 | null |
| 2025-06-23 | Geometry-aware Distance Measure for Diverse Hierarchical Structures in Hyperbolic Spaces | Pengxiang Li et.al. | 2506.18533 | null |
| 2025-06-23 | A Set-to-Set Distance Measure in Hyperbolic Space | Pengxiang Li et.al. | 2506.18529 | null |
| 2025-06-23 | Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier | Yongjie Si et.al. | 2506.18406 | null |
| 2025-06-23 | Open Set Recognition for Endoscopic Image Classification: A Deep Learning Approach on the Kvasir Dataset | Kasra Moazzami et.al. | 2506.18284 | null |
| 2025-06-22 | Pitfalls of Conformal Predictions for Medical Image Classification | Hendrik Mehrtens et.al. | 2506.18162 | null |
| 2025-06-22 | HE-LRM: Encrypted Deep Learning Recommendation Models using Fully Homomorphic Encryption | Karthik Garimella et.al. | 2506.18150 | null |
| 2025-06-22 | Training-free Test-time Improvement for Explainable Medical Image Classification | Hangzhou He et.al. | 2506.18070 | link |
| 2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133 | null |
| 2025-06-20 | Acquiring and Accumulating Knowledge from Diverse Datasets for Multi-label Driving Scene Classification | Ke Li et.al. | 2506.17101 | null |
| 2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052 | null |
| 2025-06-20 | With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You | Fabian Gröger et.al. | 2506.16895 | null |
| 2025-06-20 | Transition of AI Models in dependence of noise | Thomas Seidler et.al. | 2506.16715 | null |
| 2025-06-19 | Efficient Transformations in Deep Learning Convolutional Neural Networks | Berk Yilmaz et.al. | 2506.16418 | null |
| 2025-06-19 | SHREC and PHEONA: Using Large Language Models to Advance Next-Generation Computational Phenotyping | Sarah Pungitore et.al. | 2506.16359 | null |
| 2025-06-19 | Polyline Path Masked Attention for Vision Transformer | Zhongchen Zhao et.al. | 2506.15940 | link |
| 2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365 | link |
| 2025-06-18 | Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference | Terrance Liu et.al. | 2506.15349 | null |
| 2025-06-19 | OpenPath: Open-Set Active Learning for Pathology Image Classification via Pre-trained Vision-Language Models | Lanfeng Zhong et.al. | 2506.15318 | null |
| 2025-06-18 | J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor | Benoit Tain et.al. | 2506.15316 | null |
| 2025-06-18 | Domain Adaptation for Image Classification of Defects in Semiconductor Manufacturing | Adrian Poniatowski et.al. | 2506.15260 | null |
| 2025-06-18 | A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals | Andrea Cadeddu et.al. | 2506.15208 | null |
| 2025-06-18 | Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods | Drew Walker et.al. | 2506.15030 | null |
| 2025-06-17 | DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification | Matt Poyser et.al. | 2506.14667 | null |
| 2025-06-17 | Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning | Prabhav Sanga et.al. | 2506.14515 | null |
| 2025-06-17 | Compositional Attribute Imbalance in Vision Datasets | Jiayi Chen et.al. | 2506.14418 | null |
| 2025-06-17 | One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | Renao Yan et.al. | 2506.14176 | null |
| 2025-06-17 | SeqPE: Transformer with Sequential Position Encoding | Huayang Li et.al. | 2506.13277 | link |
| 2025-06-15 | Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | Lu Chen et.al. | 2506.12875 | null |
| 2025-06-15 | Medical Argument Mining: Exploitation of Scarce Data Using NLI Systems | Maitane Urruela et.al. | 2506.12823 | null |
| 2025-06-15 | Cross-architecture universal feature coding via distribution alignment | Changsheng Gao et.al. | 2506.12737 | null |
| 2025-06-15 | Unsupervised Contrastive Learning Using Out-Of-Distribution Data for Long-Tailed Dataset | Cuong Manh Hoang et.al. | 2506.12698 | null |
| 2025-06-15 | Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Samarth Singhal et.al. | 2506.12683 | null |
| 2025-06-14 | OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification | Wenxiao Cai et.al. | 2506.12610 | null |
| 2025-06-14 | DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Darryl Ho et.al. | 2506.12585 | null |
| 2025-06-14 | MVP-CBM:Multi-layer Visual Preference-enhanced Concept Bottleneck Model for Explainable Medical Image Classification | Chunjiang Wang et.al. | 2506.12568 | null |
| 2025-06-14 | PLD: A Choice-Theoretic List-Wise Knowledge Distillation | Ejafa Bassam et.al. | 2506.12542 | null |
| 2025-06-13 | GeistBERT: Breathing Life into German NLP | Raphael Scheible-Schmitt et.al. | 2506.11903 | null |
| 2025-06-13 | Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression | Kuniko Paxton et.al. | 2506.11627 | null |
| 2025-06-13 | Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments | Deliang Jin et.al. | 2506.11615 | null |
| 2025-06-13 | Black-Box Edge AI Model Selection with Conformal Latency and Accuracy Guarantees | Anders E. Kalør et.al. | 2506.11391 | null |
| 2025-06-12 | SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification | Ali Waqas et.al. | 2506.10699 | null |
| 2025-06-13 | PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis | Marzieh Oghbaie et.al. | 2506.10669 | link |
| 2025-06-12 | Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | Chun Liu et.al. | 2506.10459 | null |
| 2025-06-12 | Can We Infer Confidential Properties of Training Data from LLMs? | Penguin Huang et.al. | 2506.10364 | null |
| 2025-06-12 | Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages | Ali Almutairi et.al. | 2506.10292 | null |
| 2025-06-11 | FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification | Jun Bai et.al. | 2506.10207 | null |
| 2025-06-11 | Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers | Natanael Lucena et.al. | 2506.10119 | null |
| 2025-06-11 | DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding | Bin Guo et.al. | 2506.10084 | null |
| 2025-06-11 | Evidential Deep Learning with Spectral-Spatial Uncertainty Disentanglement for Open-Set Hyperspectral Domain Generalization | Amirreza Khoshbakht et.al. | 2506.09460 | null |
| 2025-06-11 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning | Tong Wang et.al. | 2506.09327 | null |
| 2025-06-10 | ScalableHD: Scalable and High-Throughput Hyperdimensional Computing Inference on Multi-Core CPUs | Dhruv Parikh et.al. | 2506.09282 | null |
| 2025-06-10 | Hyperbolic Dual Feature Augmentation for Open-Environment | Peilin Yu et.al. | 2506.08906 | null |
| 2025-06-10 | Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification | Matthias Beckmann et.al. | 2506.08761 | null |
| 2025-06-12 | InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba | Yuhang Wang et.al. | 2506.08735 | null |
| 2025-06-10 | Biologically Inspired Deep Learning Approaches for Fetal Ultrasound Image Classification | Rinat Prochii et.al. | 2506.08623 | null |
| 2025-06-10 | mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks | Luel Hagos Beyene et.al. | 2506.08400 | null |
| 2025-06-10 | An Adaptive Method Stabilizing Activations for Enhanced Generalization | Hyunseok Seung et.al. | 2506.08353 | null |
| 2025-06-11 | Hyperspectral Image Classification via Transformer-based Spectral-Spatial Attention Decoupling and Adaptive Gating | Guandong Li et.al. | 2506.08324 | null |
| 2025-06-09 | TokenBreak: Bypassing Text Classification Models Through Token Manipulation | Kasimir Schulz et.al. | 2506.07948 | null |
| 2025-06-09 | MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification | Iustin Sirbu et.al. | 2506.07801 | null |
| 2025-06-09 | Improving Memory Efficiency for Training KANs via Meta Learning | Zhangchi Zhao et.al. | 2506.07549 | null |
| 2025-06-09 | Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks | Shakir Yousefi et.al. | 2506.07500 | null |
| 2025-06-08 | Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification | Jintao Yan et.al. | 2506.07328 | null |
| 2025-06-08 | A Stable Whitening Optimizer for Efficient Neural Network Training | Kevin Frans et.al. | 2506.07254 | null |
| 2025-06-08 | Hierarchical Feature-level Reverse Propagation for Post-Training Neural Networks | Ni Ding et.al. | 2506.07188 | null |
| 2025-06-08 | CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI – XXI Simpósio Brasileiro de Sistemas de Informação | Washington Cunha et.al. | 2506.07169 | null |
| 2025-06-08 | pFedSOP : Accelerating Training Of Personalized Federated Learning Using Second-Order Optimization | Mrinmay Sen et.al. | 2506.07159 | null |
| 2025-06-07 | Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry | Mahdi Salmani et.al. | 2506.06933 | null |
| 2025-06-06 | Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Yuanzhe Hu et.al. | 2506.06280 | null |
| 2025-06-06 | FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound | Minglang Chen et.al. | 2506.06054 | null |
| 2025-06-06 | Enhancing Orthopox Image Classification Using Hybrid Machine Learning and Deep Learning Models | Alejandro Puente-Castro et.al. | 2506.06007 | null |
| 2025-06-06 | LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles | Egil Rønningstad et.al. | 2506.05976 | null |
| 2025-06-06 | Integer Binary-Range Alignment Neuron for Spiking Neural Networks | Binghao Ye et.al. | 2506.05679 | null |
| 2025-06-05 | FRAME: Pre-Training Video Feature Representations via Anticipation and Memory | Sethuraman TV et.al. | 2506.05543 | null |
| 2025-06-05 | Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum | Snir Hordan et.al. | 2506.05530 | null |
| 2025-06-05 | Robustness Evaluation for Video Models with Reinforcement Learning | Ashwin Ramesh Babu et.al. | 2506.05431 | null |
| 2025-06-05 | Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts | Zhong Ji et.al. | 2506.04673 | null |
| 2025-06-04 | Deep Learning for Absorption-Image Analysis | Jacob Morrey et.al. | 2506.04517 | null |
| 2025-06-04 | KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Zixuan Xia et.al. | 2506.04432 | null |
| 2025-06-04 | Benchmarking Time-localized Explanations for Audio Classification Models | Cecilia Bolaños et.al. | 2506.04391 | null |
| 2025-06-04 | Hierarchical Text Classification Using Contrastive Learning Informed Path Guided Hierarchy | Neeraj Agrawal et.al. | 2506.04381 | null |
| 2025-06-04 | Recent Advances in Medical Image Classification | Loan Dao et.al. | 2506.04129 | null |
| 2025-06-04 | Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Mingxuan Xia et.al. | 2506.03857 | null |
| 2025-06-04 | RhoDARTS: Differentiable Quantum Architecture Search with Density Matrix Simulations | Swagat Kumar et.al. | 2506.03697 | null |
| 2025-06-04 | Directional Non-Commutative Monoidal Embeddings for MNIST | Mahesh Godavarti et.al. | 2506.03472 | null |
| 2025-06-03 | RoNFA: Robust Neural Field-based Approach for Few-Shot Image Classification with Noisy Labels | Nan Xiang et.al. | 2506.03461 | null |
| 2025-06-02 | Quantifying task-relevant representational similarity using decision variable correlation | Yu et.al. | 2506.02164 | null |
| 2025-06-02 | Towards Better Generalization and Interpretability in Unsupervised Concept-Based Models | Francesco De Santis et.al. | 2506.02092 | null |
| 2025-06-02 | OD3: Optimization-free Dataset Distillation for Object Detection | Salwa K. Al Khatib et.al. | 2506.01942 | null |
| 2025-06-02 | Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$ -Smoothness | Thomas Pethick et.al. | 2506.01913 | null |
| 2025-06-02 | Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research | Jennifer Haase et.al. | 2506.01839 | null |
| 2025-06-02 | mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection | Dominik Macko et.al. | 2506.01702 | null |
| 2025-06-02 | Data Pruning by Information Maximization | Haoru Tan et.al. | 2506.01701 | null |
| 2025-06-02 | Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data | Zixiao Zhu et.al. | 2506.01621 | null |
| 2025-06-02 | Speed-up of Vision Transformer Models by Attention-aware Token Filtering | Takahiro Naruko et.al. | 2506.01519 | null |
| 2025-06-02 | A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification | Kamal Basha S et.al. | 2506.01445 | null |
| 2025-05-30 | Optimal Weighted Convolution for Classification and Denosing | Simone Cammarasana et.al. | 2505.24558 | null |
| 2025-05-30 | SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification | Zheng Wang et.al. | 2505.24380 | null |
| 2025-05-30 | Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification | Maciej Wielgosz et.al. | 2505.24375 | null |
| 2025-05-30 | GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Gilles Quentin Hacheme et.al. | 2505.24340 | null |
| 2025-05-30 | Provably Improving Generalization of Few-Shot Models with Synthetic Data | Lan-Cuong Nguyen et.al. | 2505.24190 | null |
| 2025-05-30 | FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System | Bhawana Chhaglani et.al. | 2505.24115 | null |
| 2025-05-30 | Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting | Chen Huang et.al. | 2505.24088 | null |
| 2025-05-29 | BIRD: Behavior Induction via Representation-structure Distillation | Galen Pogoncheff et.al. | 2505.23933 | null |
| 2025-05-29 | Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Qiang Wang et.al. | 2505.23744 | null |
| 2025-05-29 | Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds | Andrew Chang et.al. | 2505.23509 | link |
| 2025-05-29 | MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification | Yang Qiao et.al. | 2505.23365 | null |
| 2025-05-29 | DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification | Daoxi Cao et.al. | 2505.23341 | null |
| 2025-05-29 | Deep Modeling and Optimization of Medical Image Classification | Yihang Wu et.al. | 2505.23040 | link |
| 2025-05-28 | Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | Sylvey Lin et.al. | 2505.22926 | null |
| 2025-05-28 | Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision | Ziyue Kang et.al. | 2505.22701 | null |
| 2025-05-28 | S2AFormer: Strip Self-Attention for Efficient Vision Transformer | Guoan Xu et.al. | 2505.22195 | null |
| 2025-05-28 | Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets | Dongyue Li et.al. | 2505.21930 | null |
| 2025-05-28 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Mehrdad Noori et.al. | 2505.21844 | null |
| 2025-05-27 | MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis | Yitong Li et.al. | 2505.21698 | null |
| 2025-05-27 | Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts | Yuxin Zhu et.al. | 2505.21324 | null |
| 2025-05-27 | Making Every Event Count: Balancing Data Efficiency and Accuracy in Event Camera Subsampling | Hesam Araghi et.al. | 2505.21187 | null |
| 2025-05-27 | Information-Theoretic Complementary Prompts for Improved Continual Text Classification | Duzhen Zhang et.al. | 2505.20933 | null |
| 2025-05-27 | Evidential Deep Active Learning for Semi-Supervised Classification | Shenkai Zhao et.al. | 2505.20691 | null |
| 2025-05-26 | UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | Xueyan Zhang et.al. | 2505.20154 | null |
| 2025-05-26 | Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases | Cheng-Yu Tai et.al. | 2505.20149 | null |
| 2025-05-26 | Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models | Antti Koskela et.al. | 2505.19969 | null |
| 2025-05-26 | Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning | Run Gu et.al. | 2505.19940 | null |
| 2025-05-26 | Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models | Mobina Mansoori et.al. | 2505.19779 | link |
| 2025-05-26 | Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Junming Liu et.al. | 2505.19699 | null |
| 2025-05-26 | Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models | Rui Cai et.al. | 2505.19616 | null |
| 2025-05-26 | Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning | Jiyu Hu et.al. | 2505.19522 | null |
| 2025-05-26 | DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models | Niloufar Alipour Talemi et.al. | 2505.19373 | null |
| 2025-05-25 | Remote Sensing Image Classification with Decoupled Knowledge Distillation | Yaping He et.al. | 2505.19111 | null |
| 2025-05-24 | MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images | Han Li et.al. | 2505.18741 | null |
| 2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | null |
| 2025-05-23 | KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches | Mingquan Feng et.al. | 2505.17919 | null |
| 2025-05-23 | Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation | Teruki Sano et.al. | 2505.17579 | null |
| 2025-05-23 | Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | Cheng Peng et.al. | 2505.17436 | null |
| 2025-05-23 | EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion | Zichuan Yang et.al. | 2505.17367 | null |
| 2025-05-22 | Extending Dataset Pruning to Object Detection: A Variance-based Approach | Ryota Yagi et.al. | 2505.17245 | null |
| 2025-05-23 | TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation | Yuhui Zhang et.al. | 2505.16923 | null |
| 2025-05-22 | Incremental Sequence Classification with Temporal Consistency | Lucas Maystre et.al. | 2505.16548 | null |
| 2025-05-22 | Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification | Amirreza Mahbod et.al. | 2505.16338 | null |
| 2025-05-22 | Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings | Arjhun Swaminathan et.al. | 2505.16313 | link |
| 2025-05-22 | Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces | Preeti Mehta et.al. | 2505.16253 | null |
| 2025-05-22 | When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification | Zirui Pang et.al. | 2505.16149 | null |
| 2025-05-21 | Small Language Models in the Real World: Insights from Industrial Text Classification | Lujun Li et.al. | 2505.16078 | null |
| 2025-05-21 | GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection | Mariia Seleznova et.al. | 2505.16017 | null |
| 2025-05-21 | Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers | Mehran Zoravar et.al. | 2505.15997 | null |
| 2025-05-21 | Large Language Models as Computable Approximations to Solomonoff Induction | Jun Wan et.al. | 2505.15784 | null |
| 2025-05-21 | FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models | Zhen Sun et.al. | 2505.15644 | null |
| 2025-05-21 | SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Iuliia Kotseruba et.al. | 2505.15628 | link |
| 2025-05-21 | Aligning Explanations with Human Communication | Jacopo Teneggi et.al. | 2505.15626 | null |
| 2025-05-21 | Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification | Conghao Xiong et.al. | 2505.15504 | null |
| 2025-05-21 | Adaptive Temperature Scaling with Conformal Prediction | Nikita Kotelevskii et.al. | 2505.15437 | null |
| 2025-05-21 | Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification | Bernardin Ligan et.al. | 2505.15334 | null |
| 2025-05-21 | Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data | Adib Bazgir et.al. | 2505.15132 | null |
| 2025-05-20 | Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications | Fadel M. Megahed et.al. | 2505.14918 | null |
| 2025-05-20 | Solving MNIST with a globally trained Mixture of Quantum Experts | Paolo Alessandro Xavier Tognini et.al. | 2505.14789 | null |
| 2025-05-20 | Guarded Query Routing for Large Language Models | Richard Šléher et.al. | 2505.14524 | null |
| 2025-05-20 | PRL: Prompts from Reinforcement Learning | Paweł Batorski et.al. | 2505.14412 | null |
| 2025-05-20 | Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach | Inder Pal Singh et.al. | 2505.14333 | link |
| 2025-05-20 | HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | Shamsuddeen Hassan Muhammad et.al. | 2505.14311 | null |
| 2025-05-20 | Intra-class Patch Swap for Self-Distillation | Hongjun Choi et.al. | 2505.14124 | link |
| 2025-05-20 | Scaling Vision Mamba Across Resolutions via Fractal Traversal | Bo Li et.al. | 2505.14062 | null |
| 2025-05-20 | Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification | Yibo Gao et.al. | 2505.14049 | null |
| 2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | null |
| 2025-05-19 | Synthetic-Powered Predictive Inference | Meshi Bashari et.al. | 2505.13432 | null |
| 2025-05-20 | Unlabeled Data or Pre-trained Model: Rethinking Semi-Supervised Learning and Pretrain-Finetuning | Song-Lin Li et.al. | 2505.13317 | null |
| 2025-05-19 | A Physics-Inspired Optimizer: Velocity Regularized Adam | Pranav Vaidhyanathan et.al. | 2505.13196 | null |
| 2025-05-19 | Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision | Pengcheng Pan et.al. | 2505.13191 | null |
| 2025-05-19 | Learning to Adapt to Position Bias in Vision Transformer Classifiers | Robert-Jan Bruintjes et.al. | 2505.13137 | link |
| 2025-05-19 | When majority rules, minority loses: bias amplification of gradient descent | François Bachoc et.al. | 2505.13122 | null |
| 2025-05-19 | Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification | Xiao Wu et.al. | 2505.13039 | null |
| 2025-05-19 | EPIC: Explanation of Pretrained Image Classification Networks via Prototype | Piotr Borycki et.al. | 2505.12897 | link |
| 2025-05-19 | Enhancing Transformers Through Conditioned Embedded Tokens | Hemanth Saratchandran et.al. | 2505.12789 | null |
| 2025-05-19 | An approach based on class activation maps for investigating the effects of data augmentation on neural networks for image classification | Lucas M. Dorneles et.al. | 2505.12581 | null |
| 2025-05-16 | Energy efficiency analysis of Spiking Neural Networks for space applications | Paolo Lunghi et.al. | 2505.11418 | null |
| 2025-05-16 | Harnessing Photon Indistinguishability in Quantum Extreme Learning Machines | Malo Joly et.al. | 2505.11238 | null |
| 2025-05-16 | CheX-DS: Improving Chest X-ray Image Classification with Ensemble Learning Based on DenseNet and Swin Transformer | Xinran Li et.al. | 2505.11168 | null |
| 2025-05-16 | Privacy-Aware Lifelong Learning | Ozan Özdenizci et.al. | 2505.10941 | null |
| 2025-05-16 | MCU: Improving Machine Unlearning through Mode Connectivity | Yingdan Shi et.al. | 2505.10859 | null |
| 2025-05-15 | CLIP Embeddings for AI-Generated Image Detection: A Few-Shot Study with Lightweight Classifier | Ziyang Ou et.al. | 2505.10664 | null |
| 2025-05-15 | Research of the Variational Shadow Quantum Circuit Based on the Whale Optimization Algorithm in Image Classification | Shuang Wu et.al. | 2505.09994 | null |
| 2025-05-14 | Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting | Chen-Yu Liu et.al. | 2505.09395 | null |
| 2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | link |
| 2025-05-17 | PrePrompt: Predictive prompting for class incremental learning | Libo Huang et.al. | 2505.08586 | link |
| 2025-05-13 | Convolutional Spiking Neural Network for Image Classification | Mikhail Kiselev et.al. | 2505.08514 | null |
| 2025-05-13 | CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets | Aidar Amangeldi et.al. | 2505.08259 | null |
| 2025-05-13 | Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification | Xiaoshuo Yan et.al. | 2505.08173 | null |
| 2025-05-13 | MoKD: Multi-Task Optimization for Knowledge Distillation | Zeeshan Hayder et.al. | 2505.08170 | null |
| 2025-05-12 | Hierarchical Sparse Attention Framework for Computationally Efficient Classification of Biological Cells | Elad Yoshai et.al. | 2505.07661 | null |
| 2025-05-12 | Synthetic Similarity Search in Automotive Production | Christoph Huber et.al. | 2505.07256 | null |
| 2025-05-12 | Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models | Yan Xie et.al. | 2505.07209 | null |
| 2025-05-12 | KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification | Hajar Sakai et.al. | 2505.07162 | null |
| 2025-05-11 | A Vision-Language Foundation Model for Leaf Disease Identification | Khang Nguyen Quoc et.al. | 2505.07019 | null |
| 2025-05-11 | Image Classification Using a Diffusion Model as a Pre-Training Model | Kosuke Ukita et.al. | 2505.06890 | null |
| 2025-05-11 | NeuRN: Neuro-inspired Domain Generalization for Image Classification | Hamd Jalil et.al. | 2505.06881 | null |
| 2025-05-11 | Active Learning for Multi-class Image Classification | Thien Nhan Vo et.al. | 2505.06825 | null |
| 2025-05-10 | FNBench: Benchmarking Robust Federated Learning against Noisy Labels | Xuefeng Jiang et.al. | 2505.06684 | link |
| 2025-05-10 | The Efficiency of Pre-training with Objective Masking in Pseudo Labeling for Semi-Supervised Text Classification | Arezoo Hatefi et.al. | 2505.06624 | null |
| 2025-05-09 | Adapting a Segmentation Foundation Model for Medical Image Classification | Pengfei Gu et.al. | 2505.06217 | null |
| 2025-05-09 | Towards Robust Few-Shot Text Classification Using Transformer Architectures and Dual Loss Strategies | Xu Han et.al. | 2505.06145 | null |
| 2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | link |
| 2025-05-09 | Efficient Quantum Convolutional Neural Networks for Image Classification: Overcoming Hardware Constraints | Peter Röseler et.al. | 2505.05957 | null |
| 2025-05-09 | Achieving 3D Attention via Triplet Squeeze and Excitation Block | Maan Alhazmi et.al. | 2505.05943 | null |
| 2025-05-09 | Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes | Youngjoon Lee et.al. | 2505.05798 | null |
| 2025-05-09 | Variational Bayesian Logistic Tensor Regression with Application to Image Recognition | Yunzhi Jin et.al. | 2505.05730 | null |
| 2025-05-08 | V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models | Guilherme Vieira Neto et.al. | 2505.05659 | link |
| 2025-05-08 | KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification | Qianbo Zang et.al. | 2505.05583 | link |
| 2025-05-08 | Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It | Marvin F. da Silva et.al. | 2505.05409 | null |
| 2025-05-08 | Quantum Surrogate-Driven Image Classifier: A Gradient-Free Approach to Avoid Barren Plateaus | Yichen Xie et.al. | 2505.05249 | null |
| 2025-05-08 | Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models | Wei Peng et.al. | 2505.05189 | null |
| 2025-05-08 | CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models | Mengjun Yi et.al. | 2505.05130 | null |
| 2025-05-08 | Direct Image Classification from Fourier Ptychographic Microscopy Measurements without Reconstruction | Navya Sonal Agarwal et.al. | 2505.05054 | null |
| 2025-05-07 | ORXE: Orchestrating Experts for Dynamically Configurable Efficiency | Qingyuan Wang et.al. | 2505.04850 | null |
| 2025-05-07 | Label-efficient Single Photon Images Classification via Active Learning | Zili Zhang et.al. | 2505.04376 | null |
| 2025-05-07 | FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning | Sanghyeon Park et.al. | 2505.04223 | null |
| 2025-05-06 | Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment | João Alves et.al. | 2505.03554 | null |
| 2025-05-06 | Noisy HQNNs: A Comprehensive Analysis of Noise Robustness in Hybrid Quantum Neural Networks | Tasnim Ahmed et.al. | 2505.03378 | null |
| 2025-05-06 | A Vision-Language Model for Focal Liver Lesion Classification | Song Jian et.al. | 2505.03350 | null |
| 2025-05-06 | Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices | Tasnim Shahriar et.al. | 2505.03303 | null |
| 2025-05-06 | Survey of Abstract Meaning Representation: Then, Now, Future | Behrooz Mansouri et.al. | 2505.03229 | null |
| 2025-05-06 | seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models | Hafez Ghaemi et.al. | 2505.03176 | null |
| 2025-05-06 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al. | 2505.03134 | null |
| 2025-05-05 | Bayesian Robust Aggregation for Federated Learning | Aleksandr Karakulev et.al. | 2505.02490 | null |
| 2025-05-06 | Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets | Wei Liu et.al. | 2505.02118 | null |
| 2025-05-03 | Backdoor Attacks Against Patch-based Mixture of Experts | Cedric Chan et.al. | 2505.01811 | null |
| 2025-05-03 | Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge | Florian Schmid et.al. | 2505.01747 | null |
| 2025-05-03 | CLOG-CD: Curriculum Learning based on Oscillating Granularity of Class Decomposed Medical Image Classification | Asmaa Abbas et.al. | 2505.01741 | null |
| 2025-05-02 | TActiLE: Tiny Active LEarning for wearable devices | Massimo Pavan et.al. | 2505.01160 | null |
| 2025-04-30 | Towards Improved Cervical Cancer Screening: Vision Transformer-Based Classification and Interpretability | Khoa Tuan Nguyen et.al. | 2504.21340 | null |
| 2025-04-28 | AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection | Jianbo Gao et.al. | 2504.21044 | null |
| 2025-04-29 | Photonic Quantum Convolutional Neural Networks with Adaptive State Injection | Léo Monbroussou et.al. | 2504.20989 | null |
| 2025-04-30 | DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition | Yanghui Song et.al. | 2504.20948 | link |
| 2025-04-29 | MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification | Yichu Xu et.al. | 2504.20509 | null |
| 2025-04-28 | DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes | Junlin Guo et.al. | 2504.20303 | null |
| 2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null |
| 2025-04-28 | Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications | Zhilin Lu et.al. | 2504.19806 | null |
| 2025-04-28 | Explaining Vision GNNs: A Semantic and Visual Analysis of Graph-based Image Classification | Nikolaos Chaidos et.al. | 2504.19682 | null |
| 2025-04-28 | Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs | Muhammad Sabih et.al. | 2504.19659 | null |
| 2025-04-28 | Neural network task specialization via domain constraining | Roman Malashin et.al. | 2504.19592 | null |
| 2025-04-28 | GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability | Sehyeong Jo et.al. | 2504.19414 | null |
| 2025-04-27 | Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype | Anyong Qin et.al. | 2504.19074 | null |
| 2025-04-26 | Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting | Zhyar Rzgar K Rostam et.al. | 2504.19021 | null |
| 2025-04-26 | A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification | Junichiro Niimi et.al. | 2504.18884 | link |
| 2025-04-26 | IoT Botnet Detection: Application of Vision Transformer to Classification of Network Flow Traffic | Hassan Wasswa et.al. | 2504.18781 | null |
| 2025-04-25 | Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models | Patrick Müller et.al. | 2504.18510 | null |
| 2025-04-25 | Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training | Hiroki Naganuma et.al. | 2504.18454 | null |
| 2025-04-25 | Passive All-Optical Nonlinear Neuron Activation via PPLN Nanophotonic Waveguides | Wujie Fu et.al. | 2504.18145 | null |
| 2025-04-25 | DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification | Guohao Huo et.al. | 2504.18046 | null |
| 2025-04-24 | Disaggregated Deep Learning via In-Physics Computing at Radio Frequency | Zhihui Gao et.al. | 2504.17752 | null |
| 2025-04-24 | Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction | Farhad Pourkamali-Anaraki et.al. | 2504.17655 | null |
| 2025-04-24 | Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled yet Hard-to-Learn Samples in Noisy Data | Weiran Pan et.al. | 2504.17474 | null |
| 2025-04-24 | Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks | Tran Thuy Nga Truong et.al. | 2504.17346 | null |
| 2025-04-24 | Evaluating and Mitigating Bias in AI-Based Medical Text Generation | Xiuying Chen et.al. | 2504.17279 | null |
| 2025-04-24 | Group Downsampling with Equivariant Anti-aliasing | Md Ashiqur Rahman et.al. | 2504.17258 | link |
| 2025-04-24 | Multi-Modal Traffic Analysis: Integrating Time-Series Forecasting, Accident Prediction, and Image Classification | Nivedita M et.al. | 2504.17232 | null |
| 2025-04-23 | A Diff-Attention Aware State Space Fusion Model for Remote Sensing Classification | Wenping Ma et.al. | 2504.16665 | null |
| 2025-04-23 | Streetscape Analysis with Generative AI (SAGAI): Vision-Language Assessment and Mapping of Urban Scenes | Joan Perez et.al. | 2504.16538 | null |
| 2025-04-24 | An Effective Gram Matrix Characterizes Generalization in Deep Networks | Rubing Yang et.al. | 2504.16450 | null |
| 2025-04-23 | FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing | Hariseetharam Gunduboina et.al. | 2504.16433 | null |
| 2025-04-22 | CLIP-IT: CLIP-based Pairing for Histology Images Classification | Banafsheh Karimian et.al. | 2504.16181 | null |
| 2025-04-22 | Automated Bug Report Prioritization in Large Open-Source Projects | Riley Pierson et.al. | 2504.15912 | null |
| 2025-04-22 | Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases | Modhurita Mitra et.al. | 2504.15829 | null |
| 2025-04-22 | DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers | Xuyang Zhong et.al. | 2504.15827 | null |
| 2025-04-22 | HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification | Hongxing Peng et.al. | 2504.15612 | null |
| 2025-04-22 | LLM-based Semantic Augmentation for Harmful Content Detection | Elyas Meguellati et.al. | 2504.15548 | null |
| 2025-04-21 | Feeding LLM Annotations to BERT Classifiers at Your Own Risk | Yucheng Lu et.al. | 2504.15432 | null |
| 2025-04-21 | Dynamic 3D KAN Convolution with Adaptive Grid Optimization for Hyperspectral Image Classification | Guandong Li et.al. | 2504.15155 | null |
| 2025-04-21 | Application of Sensitivity Analysis Methods for Studying Neural Network Models | Jiaxuan Miao et.al. | 2504.15100 | null |
| 2025-04-21 | Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks | Keisuke Murota et.al. | 2504.14995 | null |
| 2025-04-21 | ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages | Zhoujie Qian et.al. | 2504.14825 | link |
| 2025-04-21 | What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale | Xiaoyong Yuan et.al. | 2504.14815 | null |
| 2025-04-21 | A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm | Kazuhisa Fujita et.al. | 2504.14814 | null |
| 2025-04-19 | Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation | Muhammad Haseeb Aslam et.al. | 2504.14307 | null |
| 2025-04-19 | Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation | Johannes Spoecklberger et.al. | 2504.14231 | null |
| 2025-04-19 | Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization | Huiyi Chen et.al. | 2504.14200 | null |
| 2025-04-19 | ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification | Hai Pham-Ngoc et.al. | 2504.14139 | null |
| 2025-04-18 | Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models | Junjie Yang et.al. | 2504.13825 | null |
| 2025-04-18 | CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Yang Yue et.al. | 2504.13820 | link |
| 2025-04-18 | Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis | Zhu Zhu et.al. | 2504.13754 | null |
| 2025-04-18 | Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration | Gianluca Carloni et.al. | 2504.13717 | null |
| 2025-04-18 | Word Embedding Techniques for Classification of Star Ratings | Hesham Abdelmotaleb et.al. | 2504.13653 | null |
| 2025-04-18 | Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification | Pengxiang Gao et.al. | 2504.13608 | null |
| 2025-04-18 | MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.13574 | null |
| 2025-04-18 | Bayesian continual learning and forgetting in neural networks | Djohan Bonnet et.al. | 2504.13569 | null |
| 2025-04-17 | Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2504.13242 | null |
| 2025-04-17 | Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya et.al. | 2504.13181 | link |
| 2025-04-17 | Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image Classification | Guandong Li et.al. | 2504.13045 | null |
| 2025-04-17 | Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification | Reek Majumder et.al. | 2504.12644 | null |
| 2025-04-16 | GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image Classification | Thanh-Dung Le et.al. | 2504.12484 | null |
| 2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | link |
| 2025-04-16 | Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision | Linhao Qu et.al. | 2504.12132 | null |
| 2025-04-16 | Exploring Video-Based Driver Activity Recognition under Noisy Labels | Linjuan Fan et.al. | 2504.11966 | link |
| 2025-04-17 | Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification | Yue Li et.al. | 2504.11793 | null |
| 2025-04-15 | The Pontryagin Maximum Principle for Training Convolutional Neural Networks | Sebastian Hofmann et.al. | 2504.11647 | null |
| 2025-04-15 | Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey | Siteng Ma et.al. | 2504.11588 | null |
| 2025-04-15 | Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models | Gergely D. Németh et.al. | 2504.11216 | null |
| 2025-04-15 | Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification | Zhenyu Yang et.al. | 2504.10916 | null |
| 2025-04-15 | Progressive Rock Music Classification | Arpan Nagar et.al. | 2504.10821 | null |
| 2025-04-15 | 3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image Classification | Guandong Li et.al. | 2504.10795 | null |
| 2025-04-14 | Quantum Image Classification: Experiments on Utility-Scale Quantum Computers | Hrant Gharibyan et.al. | 2504.10595 | null |
| 2025-04-14 | LEMUR Neural Network Dataset: Towards Seamless AutoML | Arash Torabi Goodarzi et.al. | 2504.10552 | link |
| 2025-04-13 | An Efficient Quantum Classifier Based on Hamiltonian Representations | Federico Tiblias et.al. | 2504.10542 | null |
| 2025-04-14 | Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning | LeiLei Ma et.al. | 2504.09990 | null |
| 2025-04-14 | GFT: Gradient Focal Transformer | Boris Kriuk et.al. | 2504.09852 | null |
| 2025-04-13 | PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification | Pengfei Wang et.al. | 2504.09502 | null |
| 2025-04-13 | InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection | Lin Zhu et.al. | 2504.09448 | null |
| 2025-04-13 | Sparse Deformable Mamba for Hyperspectral Image Classification | Lincoln Linlin Xu et.al. | 2504.09446 | null |
| 2025-04-12 | Cycle Training with Semi-Supervised Domain Adaptation: Bridging Accuracy and Efficiency for Real-Time Mobile Scene Detection | Huu-Phong Phan-Nguyen et.al. | 2504.09297 | null |
| 2025-04-12 | Sparse Hybrid Linear-Morphological Networks | Konstantinos Fotopoulos et.al. | 2504.09289 | null |
| 2025-04-12 | Mixture of Group Experts for Learning Invariant Representations | Lei Kang et.al. | 2504.09265 | null |
| 2025-04-12 | Langformers: Unified NLP Pipelines for Language Models | Rabindra Lamsal et.al. | 2504.09170 | null |
| 2025-04-12 | Evolved Hierarchical Masking for Self-Supervised Learning | Zhanzhou Feng et.al. | 2504.09155 | null |
| 2025-04-11 | Hypergraph Vision Transformers: Images are More than Nodes, More than Edges | Joshua Fixelle et.al. | 2504.08710 | null |
| 2025-04-11 | Integrated ensemble of BERT- and features-based models for authorship attribution in Japanese literary works | Taisei Kanda et.al. | 2504.08527 | null |
| 2025-04-11 | An Early Experience with Confidential Computing Architecture for On-Device Model Protection | Sina Abdollahi et.al. | 2504.08508 | null |
| 2025-04-11 | The inherent convolution property of quantum neural networks | Guangkai Qu et.al. | 2504.08487 | null |
| 2025-04-11 | A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification | Kerol Djoumessi et.al. | 2504.08481 | null |
| 2025-04-11 | FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations | Cheng-Yu Hsieh et.al. | 2504.08368 | null |
| 2025-04-11 | Comparative Analysis of Different Methods for Classifying Polychromatic Sketches | Fahd Baba et.al. | 2504.08186 | null |
| 2025-04-11 | Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Erin Carson et.al. | 2504.07835 | link |
| 2025-04-10 | Traversal Learning Coordination For Lossless And Efficient Distributed Learning | Erdenebileg Batbaatar et.al. | 2504.07471 | null |
| 2025-04-09 | Identifying regions of interest in whole slide images of renal cell carcinoma | Mohammed Lamine Benomar et.al. | 2504.07313 | null |
| 2025-04-09 | A new training approach for text classification in Mental Health: LatentGLoss | Korhan Sevinç et.al. | 2504.07245 | null |
| 2025-04-09 | Deep Learning for Cardiovascular Risk Assessment: Proxy Features from Carotid Sonography as Predictors of Arterial Damage | Christoph Balada et.al. | 2504.06680 | null |
| 2025-04-08 | Memory-Modular Classification: Learning to Generalize with Memory Replacement | Dahyun Kang et.al. | 2504.06021 | null |
| 2025-04-08 | Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-Gradients | Alessio Mora et.al. | 2504.05822 | null |
| 2025-04-08 | DefMamba: Deformable Visual State Space Model | Leiye Liu et.al. | 2504.05794 | null |
| 2025-04-08 | Layer-Aware Embedding Fusion for LLMs in Text Classifications | Jiho Gwak et.al. | 2504.05764 | null |
| 2025-04-07 | REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding | Sakib Reza et.al. | 2504.05491 | null |
| 2025-04-07 | Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability | Mohammad Hossein Najafi et.al. | 2504.05483 | null |
| 2025-04-07 | Explaining Low Perception Model Competency with High-Competency Counterfactuals | Sara Pohland et.al. | 2504.05254 | null |
| 2025-04-07 | Federated Learning for Medical Image Classification: A Comprehensive Benchmark | Zhekai Zhou et.al. | 2504.05238 | null |
| 2025-04-07 | Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data | Charco Hui et.al. | 2504.05020 | null |
| 2025-04-07 | RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Congcong Wen et.al. | 2504.04988 | null |
| 2025-04-06 | Your Image Generator Is Your New Private Dataset | Nicolo Resmini et.al. | 2504.04582 | null |
| 2025-04-06 | Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification | Shijian Wang et.al. | 2504.04510 | null |
| 2025-04-06 | Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image Classification | Guandong Li et.al. | 2504.04463 | null |
| 2025-04-05 | A Comparative Study of Explainable AI Methods: Model-Agnostic vs. Model-Specific Approaches | Keerthi Devireddy et.al. | 2504.04276 | null |
| 2025-04-05 | GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models | Hengyu Luo et.al. | 2504.04155 | link |
| 2025-04-05 | Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology Images | Andrei-Alexandru Preda et.al. | 2504.04130 | null |
| 2025-04-04 | Adaptive Classification of Interval-Valued Time Series | Wan Tian et.al. | 2504.03318 | null |
| 2025-04-04 | Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Junlang Qian et.al. | 2504.03159 | null |
| 2025-04-03 | HQViT: Hybrid Quantum Vision Transformer for Image Classification | Hui Zhang et.al. | 2504.02730 | null |
| 2025-04-03 | LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection | YiMing Yu et.al. | 2504.02280 | null |
| 2025-04-02 | Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs | Lewis Matheson Creed et.al. | 2504.02163 | null |
| 2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | link |
| 2025-04-02 | A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning | Yuyang Qiu et.al. | 2504.01839 | null |
| 2025-04-02 | A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines | Calvin Kinateder et.al. | 2504.01798 | null |
| 2025-04-02 | Token Pruning in Audio Transformers: Optimizing Performance and Decoding Patch Importance | Taehan Lee et.al. | 2504.01690 | link |
| 2025-04-02 | All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning | Zheng Yang et.al. | 2504.01396 | null |
| 2025-04-01 | TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification | Kimia haghjooei et.al. | 2504.01228 | null |
| 2025-04-01 | PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image Classification | Salim Khazem et.al. | 2504.01214 | link |
| 2025-04-01 | Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | Rachmad Vidya Wicaksana Putra et.al. | 2504.00957 | null |
| 2025-04-01 | Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models | Alireza Aghabagherloo et.al. | 2504.00638 | null |
| 2025-04-01 | Geometric Median Matching for Robust k-Subset Selection from Noisy Data | Anish Acharya et.al. | 2504.00564 | null |
| 2025-03-31 | NoProp: Training Neural Networks without Back-propagation or Forward-propagation | Qinyu Li et.al. | 2503.24322 | null |
| 2025-03-31 | CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | Yingrui Ji et.al. | 2503.24182 | null |
| 2025-03-31 | PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization | Alexis Guichemerre et.al. | 2503.24135 | link |
| 2025-03-31 | Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | Chenqi Guo et.al. | 2503.24017 | null |
| 2025-03-31 | FlexiMo: A Flexible Remote Sensing Foundation Model | Xuyang Li et.al. | 2503.23844 | null |
| 2025-03-31 | Expanding-and-Shrinking Binary Neural Networks | Xulong Shi et.al. | 2503.23709 | link |
| 2025-03-31 | WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation | Zhengyi Zhao et.al. | 2503.23673 | null |
| 2025-03-30 | Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Guandong Li et.al. | 2503.23472 | null |
| 2025-03-30 | KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters | Haiduo Huang et.al. | 2503.23379 | link |
| 2025-03-29 | Optimizing Distributed Training Approaches for Scaling Neural Networks | Vishnu Vardhan Baligodugula et.al. | 2503.23186 | null |
| 2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205 | link |
| 2025-03-28 | Route-and-Aggregate Decentralized Federated Learning Under Communication Errors | Weicai Li et.al. | 2503.22186 | null |
| 2025-03-27 | On Large Multimodal Models as Open-World Image Classifiers | Alessandro Conti et.al. | 2503.21851 | link |
| 2025-03-27 | Bayesian Pseudo Posterior Mechanism for Differentially Private Machine Learning | Robert Chew et.al. | 2503.21528 | null |
| 2025-03-27 | Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures | Deependra Singh et.al. | 2503.21465 | link |
| 2025-03-27 | Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records | Noah Losch et.al. | 2503.21349 | null |
| 2025-03-27 | Improving $(α, f)$ -Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance | Mario García-Márquez et.al. | 2503.21244 | link |
| 2025-03-27 | Neural Architecture Search by Learning a Hierarchical Search Space | Mehraveh Javan Roshtkhari et.al. | 2503.21061 | null |
| 2025-03-26 | TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models | Caspar Meijer et.al. | 2503.20952 | link |
| 2025-03-26 | VESTA: A Versatile SNN-Based Transformer Accelerator with Unified PEs for Multiple Computational Layers | Ching-Yao Chen et.al. | 2503.20246 | null |
| 2025-03-26 | BeLightRec: A lightweight recommender system enhanced with BERT | Manh Mai Van et.al. | 2503.20206 | null |
| 2025-03-25 | Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders | Paul Koch et.al. | 2503.19947 | null |
| 2025-03-25 | Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Daniel G. P. Petrini et.al. | 2503.19945 | link |
| 2025-03-25 | Extensions of regret-minimization algorithm for optimal design | Youguang Chen et.al. | 2503.19874 | null |
| 2025-03-25 | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | Suhas G Hegde et.al. | 2503.19530 | null |
| 2025-03-25 | LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Weizhi Chen et.al. | 2503.19311 | link |
| 2025-03-25 | Face Spoofing Detection using Deep Learning | Najeebullah et.al. | 2503.19223 | link |
| 2025-03-24 | Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | DeShin Hwa et.al. | 2503.18862 | null |
| 2025-03-24 | Latent Space Class Dispersion: Effective Test Data Quality Assessment for DNNs | Vivek Vekariya et.al. | 2503.18799 | null |
| 2025-03-24 | Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks | Nina Shvetsova et.al. | 2503.18637 | null |
| 2025-03-24 | Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Zequn Zeng et.al. | 2503.18483 | null |
| 2025-03-24 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Junsong Li et.al. | 2503.18432 | null |
| 2025-03-24 | Sun-Shine: A Large Language Model for Tibetan Culture | Cheng Huang et.al. | 2503.18288 | null |
| 2025-03-23 | Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry | Chi-Ning Chou et.al. | 2503.18114 | null |
| 2025-03-23 | What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images | Dongheng Lin et.al. | 2503.17899 | null |
| 2025-03-21 | Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis | Gary Y. Li et.al. | 2503.17475 | null |
| 2025-03-21 | Leveraging Text-to-Image Generation for Handling Spurious Correlation | Aryan Yazdan Parast et.al. | 2503.17226 | null |
| 2025-03-21 | CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images | Tonmoy Hossain ana Miaomiao Zhang et.al. | 2503.17162 | null |
| 2025-03-21 | Beyond Accuracy: What Matters in Designing Well-Behaved Models? | Robin Hesse et.al. | 2503.17110 | null |
| 2025-03-21 | Symbolic Audio Classification via Modal Decision Tree Learning | Enrico Marzano et.al. | 2503.17018 | null |
| 2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975 | link |
| 2025-03-21 | City2Scene: Improving Acoustic Scene Classification with City Features | Yiqiang Cai et.al. | 2503.16862 | null |
| 2025-03-20 | MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification | Moshiur Rahman Tonmoy et.al. | 2503.16628 | null |
| 2025-03-20 | PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Sharon Peled et.al. | 2503.16284 | link |
| 2025-03-20 | CLS-RL: Image Classification with Rule-Based Reinforcement Learning | Ming Li et.al. | 2503.16188 | link |
| 2025-03-20 | Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models | Mario Sanz-Guerrero et.al. | 2503.16022 | link |
| 2025-03-20 | Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation | Clive Tinashe Marimo et.al. | 2503.15969 | link |
| 2025-03-19 | Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification | Yuqing Zhang et.al. | 2503.15731 | null |
| 2025-03-20 | Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification | ZhengLin Lai et.al. | 2503.15469 | link |
| 2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | null |
| 2025-03-19 | Efficient allocation of image recognition and LLM tasks on multi-GPU system | Marcin Lawenda et.al. | 2503.15252 | null |
| 2025-03-19 | Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks | Yuting Guo et.al. | 2503.15169 | null |
| 2025-03-19 | ARC: Anchored Representation Clouds for High-Resolution INR Classification | Joost Luijmes et.al. | 2503.15156 | null |
| 2025-03-19 | Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models | Tingxiu Chen et.al. | 2503.14966 | null |
| 2025-03-19 | Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification | Zhong Ji et.al. | 2503.14938 | null |
| 2025-03-18 | RAT: Boosting Misclassification Detection Ability without Extra Data | Ge Yan et.al. | 2503.14783 | null |
| 2025-03-18 | LipShiFT: A Certifiably Robust Shift-based Vision Transformer | Rohan Menon et.al. | 2503.14751 | null |
| 2025-03-18 | Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | Gihan Jayatilaka et.al. | 2503.14500 | null |
| 2025-03-17 | Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition | Atharva Agashe et.al. | 2503.13763 | null |
| 2025-03-17 | Micro Text Classification Based on Balanced Positive-Unlabeled Learning | Lin-Han Jia et.al. | 2503.13562 | null |
| 2025-03-17 | Escaping Plato’s Cave: Robust Conceptual Reasoning through Interpretable 3D Neural Object Volumes | Nhi Pham et.al. | 2503.13429 | link |
| 2025-03-17 | Do Vision Models Develop Human-Like Progressive Difficulty Understanding? | Zeyi Huang et.al. | 2503.13058 | null |
| 2025-03-16 | Domain Generalization for Improved Human Activity Recognition in Office Space Videos Using Adaptive Pre-processing | Partho Ghosh et.al. | 2503.12678 | null |
| 2025-03-16 | Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance | Anthony Lamelas et.al. | 2503.12617 | null |
| 2025-03-16 | Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy | Jian-Ping Mei et.al. | 2503.12497 | null |
| 2025-03-16 | GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Zilun Zhang et.al. | 2503.12490 | null |
| 2025-03-16 | Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation | Edgar Heinert et.al. | 2503.12453 | null |
| 2025-03-16 | MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jianwei Zhao et.al. | 2503.12401 | null |
| 2025-03-15 | TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Ans Munir et.al. | 2503.12206 | null |
| 2025-03-15 | Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification | Ahcen Aliouat et.al. | 2503.11954 | null |
| 2025-03-14 | Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification | Tobias Morocutti et.al. | 2503.11363 | null |
| 2025-03-14 | PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models | Mayank Nautiyal et.al. | 2503.11360 | null |
| 2025-03-14 | APLA: A Simple Adaptation Method for Vision Transformers | Moein Sorkhei et.al. | 2503.11335 | link |
| 2025-03-14 | Open-Set Plankton Recognition | Joona Kareinen et.al. | 2503.11318 | null |
| 2025-03-14 | MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery | Yansheng Li et.al. | 2503.11219 | null |
| 2025-03-14 | Falcon: A Remote Sensing Vision-Language Foundation Model | Kelu Yao et.al. | 2503.11070 | link |
| 2025-03-13 | $(\varepsilon, δ)$ Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees | Juan Felipe Gomez et.al. | 2503.10945 | null |
| 2025-03-13 | Learning Interpretable Logic Rules from Deep Vision Models | Chuqin Geng et.al. | 2503.10547 | null |
| 2025-03-13 | Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification | Rajiv Krishnakumar et.al. | 2503.10510 | null |
| 2025-03-13 | RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Fengxiang Wang et.al. | 2503.10392 | link |
| 2025-03-13 | PS3C: An Ensemble-Based Two-Step Framework for Classification of Pep Smear Cell Images | Theo Di Piazza et.al. | 2503.10312 | link |
| 2025-03-13 | Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions | Hsuvas Borkakoty et.al. | 2503.10294 | null |
| 2025-03-13 | A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2503.10262 | null |
| 2025-03-13 | Interpretable Image Classification via Non-parametric Part Prototype Learning | Zhijie Zhu et.al. | 2503.10247 | null |
| 2025-03-13 | Multiplicative Learning | Han Kim et.al. | 2503.10144 | null |
| 2025-03-13 | Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text | Avinash Patil et.al. | 2503.10095 | null |
| 2025-03-13 | Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild | Damien Teney et.al. | 2503.10065 | null |
| 2025-03-12 | Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Nannan Wu et.al. | 2503.09587 | null |
| 2025-03-12 | Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework | Bakary Badjie et.al. | 2503.09504 | null |
| 2025-03-12 | ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation | Tobias Christian Nauen et.al. | 2503.09399 | link |
| 2025-03-12 | Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Daniel Jiménez-López et.al. | 2503.09365 | null |
| 2025-03-12 | Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Katharina Prasse et.al. | 2503.09361 | null |
| 2025-03-12 | Bayesian Test-Time Adaptation for Vision-Language Models | Lihua Zhou et.al. | 2503.09248 | null |
| 2025-03-12 | Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information | Youngju Joung et.al. | 2503.09068 | null |
| 2025-03-12 | Discovering Influential Neuron Path in Vision Transformers | Yifan Wang et.al. | 2503.09046 | link |
| 2025-03-11 | KAN-Mixers: a new deep learning architecture for image classification | Jorge Luiz dos Santos Canuto et.al. | 2503.08939 | null |
| 2025-03-12 | MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification | Jiangping Wen et.al. | 2503.08581 | null |
| 2025-03-11 | Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview | Ahmad Chaddad et.al. | 2503.08420 | null |
| 2025-03-11 | Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification | Susu Sun et.al. | 2503.08384 | null |
| 2025-03-11 | Tangentially Aligned Integrated Gradients for User-Friendly Explanations | Lachlan Simpson et.al. | 2503.08240 | null |
| 2025-03-11 | EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification | Saad Sohail et.al. | 2503.08239 | null |
| 2025-03-11 | Identification of Star Clusters in M31 from PAndAS Images Based on Deep Learning | Baisong Zhang et.al. | 2503.08130 | null |
| 2025-03-11 | LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking | Yan Yan et.al. | 2503.07968 | null |
| 2025-03-12 | Measuring directional bias amplification in image captions using predictability | Rahul Nair et.al. | 2503.07878 | null |
| 2025-03-10 | Fair Text Classification via Transferable Representations | Thibaud Leteno et.al. | 2503.07691 | null |
| 2025-03-10 | Keeping Representation Similarity in Finetuning for Medical Image Analysis | Wenqiang Zu et.al. | 2503.07399 | null |
| 2025-03-10 | Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification | Kexin Di et.al. | 2503.07396 | null |
| 2025-03-10 | Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Gonzalo Mancera et.al. | 2503.07384 | null |
| 2025-03-10 | Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | Thomas Boucher et.al. | 2503.07294 | null |
| 2025-03-10 | A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | Bingchen Liu et.al. | 2503.07202 | null |
| 2025-03-10 | Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization | Ziqing Xu et.al. | 2503.06982 | null |
| 2025-03-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al. | 2503.06921 | link |
| 2025-03-10 | MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | Xiangyan Qu et.al. | 2503.06847 | null |
| 2025-03-09 | Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | Hanze Li et.al. | 2503.06473 | null |
| 2025-03-09 | M $^3$ amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification | Mingxiang Cao et.al. | 2503.06446 | null |
| 2025-03-07 | Similarity-Based Domain Adaptation with LLMs | Jie He et.al. | 2503.05281 | null |
| 2025-03-07 | Spatial Context-Driven Positive Pair Sampling for Enhanced Histopathology Image Classification | Willmer Rafell Quinones Robles et.al. | 2503.05170 | null |
| 2025-03-07 | Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy | Ruixi Lin et.al. | 2503.05157 | link |
| 2025-03-07 | Grouped Sequential Optimization Strategy – the Application of Hyperparameter Importance Assessment in Deep Learning | Ruinan Wang et.al. | 2503.05106 | null |
| 2025-03-06 | HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model | Xuheng Cai et.al. | 2503.04996 | null |
| 2025-03-06 | Label Distribution Learning-Enhanced Dual-KNN for Text Classification | Bo Yuan et.al. | 2503.04869 | null |
| 2025-03-06 | Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification | Van Bach Nguyen et.al. | 2503.04463 | null |
| 2025-03-06 | WeakSupCon: Weakly Supervised Contrastive Learning for Encoder Pre-training | Bodong Zhang et.al. | 2503.04165 | null |
| 2025-03-04 | Measurement noise scaling laws for cellular representation learning | Gokul Gowri et.al. | 2503.02726 | null |
| 2025-03-04 | XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification | Xiaoyu Zheng et.al. | 2503.02619 | link |
| 2025-03-04 | Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques | Mustafa Majeed Abd Zaid et.al. | 2503.02510 | null |
| 2025-03-06 | Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Yujiao Yang et.al. | 2503.02495 | link |
| 2025-03-04 | Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts | Tong Liang et.al. | 2503.02248 | null |
| 2025-03-04 | Sharpness-Aware Minimization: General Analysis and Improved Rates | Dimitris Oikonomou et.al. | 2503.02225 | null |
| 2025-03-03 | Mathematical Foundation of Interpretable Equivariant Surrogate Models | Jacopo Joy Colombini et.al. | 2503.01942 | null |
| 2025-03-03 | Visual-RFT: Visual Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2503.01785 | link |
| 2025-03-03 | Mamba base PKD for efficient knowledge compression | José Medina et.al. | 2503.01727 | null |
| 2025-03-04 | SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting | Ali Caglayan et.al. | 2503.01181 | null |
| 2025-03-03 | Large Language Models for Healthcare Text Classification: A Systematic Review | Hajar Sakai et.al. | 2503.01159 | null |
| 2025-03-03 | Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning | Jiuyang Dong et.al. | 2502.21130 | null |
| 2025-02-28 | Comparative study of the ansätze in quantum language models | Jordi Del Castillo et.al. | 2502.20744 | null |
| 2025-02-28 | Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness | Hao Xuan et.al. | 2502.20604 | null |
| 2025-02-27 | In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models | Hu Wang et.al. | 2502.20516 | null |
| 2025-02-27 | Online Meta-learning for AutoML in Real-time (OnMAR) | Mia Gerber et.al. | 2502.20279 | null |
| 2025-03-03 | Gradient-Guided Annealing for Domain Generalization | Aristotelis Ballas et.al. | 2502.20162 | link |
| 2025-02-27 | QPM: Discrete Optimization for Globally Interpretable Image Classification | Thomas Norrenbrock et.al. | 2502.20130 | link |
| 2025-02-27 | ProAPO: Progressively Automatic Prompt Optimization for Visual Classification | Xiangyan Qu et.al. | 2502.19844 | link |
| 2025-02-27 | Text classification using machine learning methods | Bogdan Oancea et.al. | 2502.19801 | null |
| 2025-02-27 | InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models | Shuchang Zhou et.al. | 2502.19777 | null |
| 2025-02-27 | Learning Mask Invariant Mutual Information for Masked Image Modeling | Tao Huang et.al. | 2502.19718 | null |
| 2025-02-27 | Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | Yimin Zhu et.al. | 2502.19700 | null |
| 2025-02-27 | Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification | Yimin Zhu et.al. | 2502.19699 | null |
| 2025-02-27 | A Residual Multi-task Network for Joint Classification and Regression in Medical Imaging | Junji Lin et.al. | 2502.19692 | null |
| 2025-02-26 | I Know What I Don’t Know: Improving Model Cascades Through Confidence Tuning | Stephan Rabanser et.al. | 2502.19335 | null |
| 2025-02-26 | Active Few-Shot Learning for Text Classification | Saeed Ahmadnia et.al. | 2502.18782 | null |
| 2025-02-25 | Enhancing Image Classification with Augmentation: Data Augmentation Techniques for Improved Image Classification | Saorj Kumar et.al. | 2502.18691 | null |
| 2025-02-25 | Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT | Hediyeh Baban et.al. | 2502.18653 | null |
| 2025-02-25 | MedKAN: An Advanced Kolmogorov-Arnold Network for Medical Image Classification | Zhuoqin Yang et.al. | 2502.18416 | null |
| 2025-02-26 | A Fusion Model for Art Author Identification Based on Convolutional Neural Networks and Transformers | Zhenyu Wang et.al. | 2502.18083 | null |
| 2025-02-25 | MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification | Varun Vashisht et.al. | 2502.17987 | null |
| 2025-02-25 | Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification | Rong Liu et.al. | 2502.17879 | null |
| 2025-02-24 | Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? | Sushmita Sarker et.al. | 2502.17727 | link |
| 2025-02-24 | A Priori Generalizability Estimate for a CNN | Cito Balsells et.al. | 2502.17622 | null |
| 2025-02-24 | Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models | Andrew DiGiugno et.al. | 2502.17206 | link |
| 2025-02-24 | Disentangling Visual Transformers: Patch-level Interpretability for Image Classification | Guillaume Jeanneret et.al. | 2502.17196 | null |
| 2025-02-24 | Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Chenghao Fan et.al. | 2502.16894 | link |
| 2025-02-24 | Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data | Yejian Zhang et.al. | 2502.16892 | null |
| 2025-02-24 | A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | Dewan Tauhid Rahman et.al. | 2502.16762 | null |
| 2025-02-23 | AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction | Rui Liu et.al. | 2502.16736 | null |
| 2025-02-22 | MOB-GCN: A Novel Multiscale Object-Based Graph Neural Network for Hyperspectral Image Classification | Tuan-Anh Yang et.al. | 2502.16289 | link |
| 2025-02-22 | A Multi-Scale Isolation Forest Approach for Real-Time Detection and Filtering of FGSM Adversarial Attacks in Video Streams of Autonomous Vehicles | Richard Abhulimhen et.al. | 2502.16044 | null |
| 2025-02-21 | MMRAG: Multi-Mode Retrieval-Augmented Generation with Large Language Models for Biomedical In-Context Learning | Zaifu Zhan et.al. | 2502.15954 | null |
| 2025-02-21 | Directional Gradient Projection for Robust Fine-Tuning of Foundation Models | Chengyue Huang et.al. | 2502.15895 | null |
| 2025-02-21 | MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models | Suraj Racha et.al. | 2502.15418 | null |
| 2025-02-21 | A Novel Riemannian Sparse Representation Learning Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.15302 | null |
| 2025-02-21 | Quantum autoencoders for image classification | Hinako Asaoka et.al. | 2502.15254 | null |
| 2025-02-21 | Steganographic Embeddings as an Effective Data Augmentation | Nicholas DiSalvo et.al. | 2502.15245 | null |
| 2025-02-21 | Learning to Collaborate: A Capability Vectors-based Architecture for Adaptive Human-AI Decision Making | Renlong Jie et.al. | 2502.15196 | null |
| 2025-02-21 | TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba | Xiuwei Chen et.al. | 2502.15130 | null |
| 2025-02-20 | Fundamental Survey on Neuromorphic Based Audio Classification | Amlan Basu et.al. | 2502.15056 | null |
| 2025-02-20 | Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Maha Ezzelarab et.al. | 2502.14995 | null |
| 2025-02-20 | Sparse Activations as Conformal Predictors | Margarida M. Campos et.al. | 2502.14773 | link |
| 2025-02-20 | An Enhancement of Jiang, Z., et al.s Compression-Based Classification Algorithm Applied to News Article Categorization | Sean Lester C. Benavides et.al. | 2502.14444 | null |
| 2025-02-20 | Stochastic Resonance Improves the Detection of Low Contrast Images in Deep Learning Models | Siegfried Ludwig et.al. | 2502.14442 | null |
| 2025-02-20 | Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Artem Vazhentsev et.al. | 2502.14427 | null |
| 2025-02-20 | Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2502.14416 | null |
| 2025-02-20 | QUAD-LLM-MLTC: Large Language Models Ensemble Learning for Healthcare Text Multi-Label Classification | Hajar Sakai et.al. | 2502.14189 | null |
| 2025-02-19 | Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification | Xuansheng Wu et.al. | 2502.14133 | null |
| 2025-02-19 | Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention | Omid Nejati Manzari et.al. | 2502.13693 | link |
| 2025-02-18 | Language Models Can Predict Their Own Behavior | Dhananjay Ashok et.al. | 2502.13329 | null |
| 2025-02-18 | Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models | Sirisha Velampalli et.al. | 2502.13278 | null |
| 2025-02-18 | Private Text Generation by Seeding Large Language Model Prompts | Supriya Nagesh et.al. | 2502.13193 | null |
| 2025-02-18 | RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals | Jaemu Heo et.al. | 2502.13181 | null |
| 2025-02-18 | Benchmarking MedMNIST dataset on real quantum hardware | Gurinder Singh et.al. | 2502.13056 | null |
| 2025-02-18 | Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts | Sunay Joshi et.al. | 2502.13030 | null |
| 2025-02-18 | A Survey of Text Classification Under Class Distribution Shift | Adriana Valentina Costache et.al. | 2502.12965 | null |
| 2025-02-18 | Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Andrei Jarca et.al. | 2502.12953 | null |
| 2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
| 2025-02-18 | When Segmentation Meets Hyperspectral Image: New Paradigm for Hyperspectral Image Classification | Weilian Zhou et.al. | 2502.12541 | null |
| 2025-02-17 | Achieving Upper Bound Accuracy of Joint Training in Continual Learning | Saleh Momeni et.al. | 2502.12388 | null |
| 2025-02-17 | OCT Data is All You Need: How Vision Transformers with and without Pre-training Benefit Imaging | Zihao Han et.al. | 2502.12379 | null |
| 2025-02-17 | AdaSplash: Adaptive Sparse Flash Attention | Nuno Gonçalves et.al. | 2502.12082 | null |
| 2025-02-17 | Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Aurian Quelennec et.al. | 2502.12031 | null |
| 2025-02-17 | Text Classification in the LLM Era - Where do we stand? | Sowmya Vajjala et.al. | 2502.11830 | null |
| 2025-02-17 | Variable-frame CNNLSTM for Breast Nodule Classification using Ultrasound Videos | Xiangxiang Cui et.al. | 2502.11481 | null |
| 2025-02-16 | Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | Thanushon Sivakaran et.al. | 2502.11258 | null |
| 2025-02-16 | UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation | Arka Mukherjee et.al. | 2502.11132 | null |
| 2025-02-16 | Towards Achieving Concept Completeness for Unsupervised Textual Concept Bottleneck Models | Milan Bhan et.al. | 2502.11100 | null |
| 2025-02-16 | Leveraging Large Language Models for Cybersecurity: Enhancing SMS Spam Detection with Robust and Context-Aware Text Classification | Mohsen Ahmadi et.al. | 2502.11014 | null |
| 2025-02-15 | Simulations of Common Unsupervised Domain Adaptation Algorithms for Image Classification | Ahmad Chaddad et.al. | 2502.10694 | null |
| 2025-02-15 | REAL: Realism Evaluation of Text-to-Image Generation Models for Effective Data Augmentation | Ran Li et.al. | 2502.10663 | null |
| 2025-02-14 | Simplifying DINO via Coding Rate Regularization | Ziyang Wu et.al. | 2502.10385 | null |
| 2025-02-14 | Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network | Arun Kunwar et.al. | 2502.10334 | null |
| 2025-02-14 | SeWA: Selective Weight Average via Probabilistic Masking | Peng Wang et.al. | 2502.10119 | null |
| 2025-02-14 | On Space Folds of ReLU Neural Networks | Michal Lewandowski et.al. | 2502.09954 | null |
| 2025-02-13 | A CNN Approach to Automated Detection and Classification of Brain Tumors | Md. Zahid Hasan et.al. | 2502.09731 | null |
| 2025-02-13 | GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis | Angelos Zavras et.al. | 2502.09598 | link |
| 2025-02-14 | Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering | Mark Beliaev et.al. | 2502.09573 | null |
| 2025-02-13 | Feature-based Graph Attention Networks Improve Online Continual Learning | Adjovi Sim et.al. | 2502.09143 | null |
| 2025-02-13 | A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning | Jia Gao et.al. | 2502.09086 | null |
| 2025-02-13 | Hierarchical Vision Transformer with Prototypes for Interpretable Medical Image Classification | Luisa Gallée et.al. | 2502.08997 | null |
| 2025-02-13 | Quantum Approaches for Dysphonia Assessment in Small Speech Datasets | Ha Tran et.al. | 2502.08968 | null |
| 2025-02-12 | Measuring Diversity in Synthetic Datasets | Yuchang Zhu et.al. | 2502.08512 | null |
| 2025-02-12 | ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Jiangbo Shi et.al. | 2502.08391 | null |
| 2025-02-12 | Keep your distance: learning dispersed embeddings on $\mathbb{S}_d$ | Evgeniia Tokarchuk et.al. | 2502.08231 | null |
| 2025-02-12 | Riemannian Complex Hermit Positive Definite Convolution Network for Polarimetric SAR Image Classification | Junfei Shi et.al. | 2502.08137 | null |
| 2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
| 2025-02-12 | Can Machine Learning Support the Selection of Studies for Systematic Literature Review Updates? | Marcelo Costalonga et.al. | 2502.08050 | null |
| 2025-02-11 | ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans | Ashkan Shahbazi et.al. | 2502.07962 | null |
| 2025-02-11 | Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | Zhaodong Bing et.al. | 2502.07436 | null |
| 2025-02-11 | MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks | Lotfi Abdelkrim Mecharbat et.al. | 2502.07422 | null |
| 2025-02-11 | MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Anh-Tien Nguyen et.al. | 2502.07409 | null |
| 2025-02-11 | Don’t Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification | Peipei Wei et.al. | 2502.07165 | null |
| 2025-02-10 | From Image to Video: An Empirical Study of Diffusion Representations | Pedro Vélez et.al. | 2502.07001 | null |
| 2025-02-10 | Krum Federated Chain (KFC): Using blockchain to defend against adversarial attacks in Federated Learning | Mario García-Márquez et.al. | 2502.06917 | null |
| 2025-02-10 | Enhancing Performance of Explainable AI Models with Constrained Concept Refinement | Geyu Liang et.al. | 2502.06775 | null |
| 2025-02-10 | Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments | Marc Felix Brinner et.al. | 2502.06551 | null |
| 2025-02-10 | Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2502.06427 | null |
| 2025-02-10 | Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead | Won-Jun Jang et.al. | 2502.06349 | null |
| 2025-02-10 | From Pixels to Components: Eigenvector Masking for Visual Representation Learning | Alice Bizeul et.al. | 2502.06314 | null |
| 2025-02-10 | Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation | Lingkun Luo et.al. | 2502.06272 | null |
| 2025-02-10 | Multi-Scale Transformer Architecture for Accurate Medical Image Classification | Jiacheng Hu et.al. | 2502.06243 | null |
| 2025-02-10 | Low Tensor-Rank Adaptation of Kolmogorov–Arnold Networks | Yihang Gao et.al. | 2502.06153 | null |
| 2025-02-09 | Benchmarking Prompt Sensitivity in Large Language Models | Amirhossein Razavi et.al. | 2502.06065 | null |
| 2025-02-09 | ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification | Yashwanth M. et.al. | 2502.05923 | null |
| 2025-02-07 | Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights | Ondřej Týbl et.al. | 2502.04975 | null |
| 2025-02-07 | Enhancing Disinformation Detection with Explainable AI and Named Entity Replacement | Santiago González-Silot et.al. | 2502.04863 | null |
| 2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
| 2025-02-06 | Augmented Conditioning Is Enough For Effective Training Image Generation | Jiahui Chen et.al. | 2502.04475 | null |
| 2025-02-06 | How does a Multilingual LM Handle Multiple Languages? | Santhosh Kakarla et.al. | 2502.04269 | null |
| 2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | link |
| 2025-02-06 | Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis | Juming Xiong et.al. | 2502.04199 | null |
| 2025-02-06 | Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis | Lin Yuan et.al. | 2502.03843 | null |
| 2025-02-06 | Self-Supervised Learning for Solar Radio Spectrum Classification | Siqi Li et.al. | 2502.03778 | null |
| 2025-02-06 | Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free | Gian Mario Favero et.al. | 2502.03687 | null |
| 2025-02-05 | A Study in Dataset Distillation for Image Super-Resolution | Tobias Dietz et.al. | 2502.03656 | null |
| 2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | link |
| 2025-02-05 | Clinically-Inspired Hierarchical Multi-Label Classification of Chest X-rays with a Penalty-Based Loss Function | Mehrdad Asadi et.al. | 2502.03591 | link |
| 2025-02-05 | Optimal Task Order for Continual Learning of Multiple Tasks | Ziyan Li et.al. | 2502.03350 | null |
| 2025-02-05 | Out-of-Distribution Detection using Synthetic Data Generation | Momin Abbas et.al. | 2502.03323 | null |
| 2025-02-05 | Long-tailed Medical Diagnosis with Relation-aware Representation Learning and Iterative Classifier Calibration | Li Pan et.al. | 2502.03238 | null |
| 2025-02-05 | Adversarial Dependence Minimization | Pierre-François De Plaen et.al. | 2502.03227 | null |
| 2025-02-05 | Disentangling CLIP Features for Enhanced Localized Understanding | Samyak Rawelekar et.al. | 2502.02977 | null |
| 2025-02-05 | Slowing Learning by Erasing Simple Features | Lucia Quirke et.al. | 2502.02820 | null |
| 2025-02-04 | The Skin Game: Revolutionizing Standards for AI Dermatology Model Comparison | Łukasz Miętkiewicz et.al. | 2502.02500 | null |
| 2025-02-04 | BRIDLE: Generalized Self-supervised Learning with Quantization | Hoang M. Nguyen et.al. | 2502.02118 | null |
| 2025-02-04 | DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification | Weijia Cao et.al. | 2502.01986 | null |
| 2025-02-04 | Generative Data Mining with Longtail-Guided Diffusion | David S. Hayden et.al. | 2502.01980 | null |
| 2025-02-03 | A Multi-Scale Feature Fusion Framework Integrating Frequency Domain and Cross-View Attention for Dual-View X-ray Security Inspections | Shilong Hong et.al. | 2502.01710 | null |
| 2025-02-03 | Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss | Sangyeon Park et.al. | 2502.01342 | null |
| 2025-02-03 | A Framework for Double-Blind Federated Adaptation of Foundation Models | Nurbek Tastan et.al. | 2502.01289 | null |
| 2025-02-02 | Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications | Yixin Wu et.al. | 2502.00808 | null |
| 2025-02-02 | Enhanced Convolutional Neural Networks for Improved Image Classification | Xiaoran Yang et.al. | 2502.00663 | null |
| 2025-02-01 | Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing | Saarthak Kapse et.al. | 2502.00594 | null |
| 2025-01-31 | Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach | Yingdan Shi et.al. | 2501.19403 | null |
| 2025-01-31 | An All-digital 65-nm Tsetlin Machine Image Classification Accelerator with 8.6 nJ per MNIST Frame at 60.3k Frames per Second | Svein Anders Tunheim et.al. | 2501.19347 | null |
| 2025-01-31 | Through the Looking Glass: LLM-Based Analysis of AR/VR Android Applications Privacy Policies | Abdulaziz Alghamdi et.al. | 2501.19223 | null |
| 2025-01-31 | Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification | Xiangyu Sun et.al. | 2501.19086 | null |
| 2025-01-31 | Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin et.al. | 2501.18824 | null |
| 2025-01-30 | OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization | Kelvin Kan et.al. | 2501.18793 | null |
| 2025-01-29 | Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis | Kunrong Li et.al. | 2501.17598 | null |
| 2025-01-28 | Extending Information Bottleneck Attribution to Video Sequences | Veronika Solopova et.al. | 2501.16889 | link |
| 2025-01-28 | Misspellings in Natural Language Processing: A survey | Gianluca Sperduti et.al. | 2501.16836 | null |
| 2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
| 2025-01-28 | Toward Relative Positional Encoding in Spiking Transformers | Changze Lv et.al. | 2501.16745 | null |
| 2025-01-28 | Improving Interpretability and Accuracy in Neuro-Symbolic Rule Extraction Using Class-Specific Sparse Filters | Parth Padalkar et.al. | 2501.16677 | null |
| 2025-01-27 | Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM | Payal Kamboj et.al. | 2501.16481 | link |
| 2025-01-28 | SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP | Li Pang et.al. | 2501.16222 | link |
| 2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
| 2025-01-27 | Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data | Judith Sáinz-Pardo Díaz et.al. | 2501.15949 | null |
| 2025-01-26 | Quantum-Enhanced Attention Mechanism in NLP: A Hybrid Classical-Quantum Approach | S. M. Yousuf Iqbal Tomal et.al. | 2501.15630 | null |
| 2025-01-26 | Building Efficient Lightweight CNN Models | Nathan Isong et.al. | 2501.15547 | null |
| 2025-01-26 | Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition | Ying Zheng et.al. | 2501.15519 | null |
| 2025-01-26 | Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer | Hu Hu et.al. | 2501.15496 | null |
| 2025-01-25 | Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | Yu Qiao et.al. | 2501.15257 | null |
| 2025-01-24 | Feasible Learning | Juan Ramirez et.al. | 2501.14912 | link |
| 2025-01-24 | Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST | Fuping Wu et.al. | 2501.14685 | null |
| 2025-01-24 | Geometric Mean Improves Loss For Few-Shot Learning | Tong Wu et.al. | 2501.14593 | null |
| 2025-01-24 | Idiom Detection in Sorani Kurdish Texts | Skala Kamaran Omer et.al. | 2501.14528 | null |
| 2025-01-24 | $SpikePack$ : Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility | Guobin Shen et.al. | 2501.14484 | null |
| 2025-01-24 | Impact of Batch Normalization on Convolutional Network Representations | Hermanus L. Potgieter et.al. | 2501.14441 | null |
| 2025-01-24 | Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation | Tasnim Ahmed et.al. | 2501.14412 | null |
| 2025-01-24 | Correlation-Based Band Selection for Hyperspectral Image Classification | Dibyabha Deb et.al. | 2501.14338 | link |
| 2025-01-24 | Relative Layer-Wise Relevance Propagation: a more Robust Neural Networks eXplaination | Eric Nyiri et.al. | 2501.14322 | null |
| 2025-01-24 | A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques | Lifu Gao et.al. | 2501.14288 | null |
| 2025-01-24 | TLXML: Task-Level Explanation of Meta-Learning via Influence Functions | Yoshihiro Mitsuka et.al. | 2501.14271 | null |
| 2025-01-23 | A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference | Duc Hau Nguyen et.al. | 2501.13735 | null |
| 2025-01-23 | A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification | Younes Yousef et.al. | 2501.13598 | link |
| 2025-01-23 | Multi-Level Attention and Contrastive Learning for Enhanced Text Classification with an Optimized Transformer | Jia Gao et.al. | 2501.13467 | null |
| 2025-01-23 | Atmospheric Noise-Resilient Image Classification in a Real-World Scenario: Using Hybrid CNN and Pin-GTSVM | Shlok Mehendale et.al. | 2501.13422 | null |
| 2025-01-23 | AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning | Arpit Garg et.al. | 2501.13389 | null |
| 2025-01-23 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | link |
| 2025-01-22 | Revisiting Data Augmentation for Ultrasound Images | Adam Tupper et.al. | 2501.13193 | link |
| 2025-01-22 | Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation | Duc Hau Nguyen et.al. | 2501.12775 | link |
| 2025-01-22 | Estimating the Conformal Prediction Threshold from Noisy Labels | Coby Penso et.al. | 2501.12749 | link |
| 2025-01-22 | Adapting OpenAI’s CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples | Fadel M. Megahed et.al. | 2501.12596 | null |
| 2025-01-21 | Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor | Jiaqi Guo et.al. | 2501.12524 | null |
| 2025-01-21 | CCESAR: Coastline Classification-Extraction From SAR Images Using CNN-U-Net Combination | Vidhu Arora et.al. | 2501.12384 | null |
| 2025-01-21 | CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification | Cristiano Patrício et.al. | 2501.12266 | null |
| 2025-01-21 | Early Detection and Classification of Breast Cancer Using Deep Learning Techniques | Mst. Mumtahina Labonno et.al. | 2501.12217 | null |
| 2025-01-21 | UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model | Branislava Jankovic et.al. | 2501.12087 | null |
| 2025-01-20 | Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification | Jonas Klotz et.al. | 2501.11493 | null |
| 2025-01-22 | QGAIC: Quantum Inspired Genetic Algorithm for Image Classification | Akhilesh Kumar Singh et.al. | 2501.11477 | null |
| 2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
| 2025-01-20 | KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jiaxiang Liu et.al. | 2501.11231 | link |
| 2025-01-19 | CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning | William Doherty et.al. | 2501.11140 | link |
| 2025-01-19 | Leveraging counterfactual concepts for debugging and improving CNN model performance | Syed Ali Tariq et.al. | 2501.11087 | null |
| 2025-01-17 | A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features | Enes Karanfil et.al. | 2501.10144 | null |
| 2025-01-17 | Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification | Michael Schulze et.al. | 2501.10089 | null |
| 2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
| 2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | link |
| 2025-01-16 | Empirical Evaluation of Embedding Models in the Context of Text Classification in Document Review in Construction Delay Disputes | Fusheng Wei et.al. | 2501.09859 | null |
| 2025-01-16 | SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification | Yuexi Du et.al. | 2501.09753 | link |
| 2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | link |
| 2025-01-16 | Multimodal Marvels of Deep Learning in Medical Diagnosis: A Comprehensive Review of COVID-19 Detection | Md Shofiqul Islama et.al. | 2501.09506 | link |
| 2025-01-16 | HydraMix: Multi-Image Feature Mixing for Small Data Image Classification | Christoph Reinders et.al. | 2501.09504 | null |
| 2025-01-16 | Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments | Minh K. Quan et.al. | 2501.09394 | null |
| 2025-01-16 | Shape-Based Single Object Classification Using Ensemble Method Classifiers | Nur Shazwani Kamarudin et.al. | 2501.09311 | null |
| 2025-01-16 | Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning | Harrison Fuller et.al. | 2501.09294 | null |
| 2025-01-16 | A Simple Graph Contrastive Learning Framework for Short Text Classification | Yonghao Liu et.al. | 2501.09219 | link |
| 2025-01-16 | Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning | Yonghao Liu et.al. | 2501.09214 | link |
| 2025-01-15 | Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment | Conrad Borchers et.al. | 2501.09126 | null |
| 2025-01-15 | IDEA: Image Description Enhanced CLIP-Adapter | Zhipeng Ye et.al. | 2501.08816 | null |
| 2025-01-15 | MIAFEx: An Attention-based Feature Extraction Method for Medical Image Classification | Oscar Ramos-Soto et.al. | 2501.08562 | null |
| 2025-01-14 | Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time | Mihai Masala et.al. | 2501.08460 | null |
| 2025-01-14 | Large Language Models For Text Classification: Case Study And Comprehensive Review | Arina Kostina et.al. | 2501.08457 | null |
| 2025-01-14 | READ: Reinforcement-based Adversarial Learning for Text Classification with Limited Labeled Data | Rohit Sharma et.al. | 2501.08035 | null |
| 2025-01-14 | Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins | Ilker Oguz et.al. | 2501.07991 | null |
| 2025-01-14 | deepTerra – AI Land Classification Made Easy | Andrew Keith Wilkinson et.al. | 2501.07859 | null |
| 2025-01-14 | A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition | Mingke Xiao et.al. | 2501.07808 | null |
| 2025-01-14 | Balance Divergence for Knowledge Distillation | Yafei Qi et.al. | 2501.07804 | null |
| 2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | link |
| 2025-01-13 | Universal Training of Neural Networks to Achieve Bayes Optimal Classification Accuracy | Mohammadreza Tavasoli Naeini et.al. | 2501.07754 | null |
| 2025-01-13 | Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction | Paul Melki et.al. | 2501.07185 | null |
| 2025-01-13 | Adaptive Noise-Tolerant Network for Image Segmentation | Weizhi Li et.al. | 2501.07163 | null |
| 2025-01-12 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier | Haojun Yu et.al. | 2501.06862 | link |
| 2025-01-12 | Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures | Samia Mehnaz et.al. | 2501.06740 | null |
| 2025-01-12 | Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution | Ashitha Mudraje et.al. | 2501.06720 | null |
| 2025-01-11 | Synthetic Feature Augmentation Improves Generalization Performance of Language Models | Ashok Choudhary et.al. | 2501.06434 | null |
| 2025-01-10 | Kolmogorov-Arnold networks for metal surface defect classification | Maciej Krzywda et.al. | 2501.06389 | null |
| 2025-01-10 | Merging Feed-Forward Sublayers for Compressed Transformers | Neha Verma et.al. | 2501.06126 | link |
| 2025-01-10 | Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problems | Steffen Dereich et.al. | 2501.06081 | link |
| 2025-01-10 | Constrained Over-the-Air Model Updating for Wireless Online Federated Learning with Delayed Information | Juncheng Wang et.al. | 2501.05637 | null |
| 2025-01-10 | The Impact of Model Scaling on Seen and Unseen Language Performance | Rhitabrat Pokharel et.al. | 2501.05629 | null |
| 2025-01-09 | Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding | Mohammed Elhenawy et.al. | 2501.05566 | null |
| 2025-01-09 | Spatial Information Integration in Small Language Models for Document Layout Generation and Classification | Pablo Melendez et.al. | 2501.05497 | null |
| 2025-01-09 | An Empirical Study of Autoregressive Pre-training from Videos | Jathushan Rajasegaran et.al. | 2501.05453 | null |
| 2025-01-09 | A 1Mb mixed-precision quantized encoder for image classification and patch-based compression | Van Thien Nguyen et.al. | 2501.05097 | null |
| 2025-01-09 | A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field | Ziyang Gao et.al. | 2501.04996 | null |
| 2025-01-09 | MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification | Yapeng Li et.al. | 2501.04944 | null |
| 2025-01-09 | A New Perspective on Privacy Protection in Federated Learning with Granular-Ball Computing | Guannan Lai et.al. | 2501.04940 | link |
| 2025-01-09 | ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries | Keke Huang et.al. | 2501.04901 | null |
| 2025-01-09 | Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Seyed Amir Bidaki et.al. | 2501.04897 | link |
| 2025-01-08 | Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures | Ziyuan Huang et.al. | 2501.04700 | null |
| 2025-01-08 | Discrete Wavelet Transform-Based Capsule Network for Hyperspectral Image Classification | Zhiqiang Gao et.al. | 2501.04643 | null |
| 2025-01-08 | Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images | Yuze Wang et.al. | 2501.04283 | null |
| 2025-01-08 | Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection | Jimi Togni et.al. | 2501.04196 | null |
| 2025-01-07 | Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification | Satchel French et.al. | 2501.03967 | link |
| 2025-01-07 | Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback | Jiakang Yuan et.al. | 2501.03916 | null |
| 2025-01-07 | MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | Aadya Arora et.al. | 2501.03839 | null |
| 2025-01-07 | LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | Shubhr Singh et.al. | 2501.03464 | null |
| 2025-01-06 | FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification | Keyvan RahimiZadeh et.al. | 2501.03349 | link |
| 2025-01-06 | CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets | Tanay Agrawal et.al. | 2501.03332 | null |
| 2025-01-06 | Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh’s Perspective | Md. Jalal Uddin Chowdhury et.al. | 2501.03305 | null |
| 2025-01-06 | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | Muyun Li et.al. | 2501.03162 | null |
| 2025-01-06 | Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification | Yubo Wang et.al. | 2501.02844 | null |
| 2025-01-06 | TARDiS : Text Augmentation for Refining Diversity and Separability | Kyungmin Kim et.al. | 2501.02739 | null |
| 2025-01-05 | FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models | Hui Lin et.al. | 2501.02461 | null |
| 2025-01-04 | Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 | Umesh Yadav et.al. | 2501.02147 | null |
| 2025-01-03 | A Separable Self-attention Inspired by the State Space Model for Computer Vision | Juntao Zhang et.al. | 2501.02040 | link |
| 2025-01-03 | Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model | Haixu Liu et.al. | 2501.01611 | null |
| 2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422 | null |
| 2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | link |
| 2025-01-02 | Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers | Bohang Sun et.al. | 2501.01311 | null |
| 2025-01-02 | FAST: Fast Audio Spectrogram Transformer | Anugunj Naman et.al. | 2501.01104 | null |
| 2025-01-01 | A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia | Hirthik Mathesh GV et.al. | 2501.00876 | null |
| 2025-01-01 | Ensuring superior learning outcomes and data security for authorized learner | Jeongho Bang et.al. | 2501.00754 | null |
| 2024-12-31 | TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification | Nishit Anand et.al. | 2501.00398 | null |
| 2024-12-31 | Exploring Variability in Fine-Tuned Models for Text Classification with DistilBERT | Giuliano Lorenzoni et.al. | 2501.00241 | null |
| 2024-12-30 | The Text Classification Pipeline: Starting Shallow going Deeper | Marco Siino et.al. | 2501.00174 | null |
| 2024-12-30 | Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models | Christos Petridis et.al. | 2412.21022 | null |
| 2024-12-30 | FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Zhengdong Li et.al. | 2412.20974 | null |
| 2024-12-30 | Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes | Yang Chen et.al. | 2412.20918 | null |
| 2024-12-30 | UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language Models | Yujie Li et.al. | 2412.20742 | null |
| 2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
| 2024-12-29 | Hilbert Curve Based Molecular Sequence Analysis | Sarwan Ali et.al. | 2412.20616 | null |
| 2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
| 2024-12-29 | HindiLLM: Large Language Model for Hindi | Sanjay Chouhan et.al. | 2412.20357 | null |
| 2024-12-29 | Deep Learning in Image Classification: Evaluating VGG19’s Performance on Complex Visual Data | Weijie He et.al. | 2412.20345 | null |
| 2024-12-28 | Few-shot Algorithm Assurance | Dang Nguyen et.al. | 2412.20275 | null |
| 2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | null |
| 2024-12-27 | Enhancing Fine-grained Image Classification through Attentive Batch Training | Duy M. Le et.al. | 2412.19606 | null |
| 2024-12-27 | A Comparative Study of Machine Unlearning Techniques for Image and Text Classification Models | Omar M. Safa et.al. | 2412.19583 | null |
| 2024-12-27 | Multi-label Classification using Deep Multi-order Context-aware Kernel Networks | Mingyuan Jiu et.al. | 2412.19491 | null |
| 2024-12-27 | Residual Feature-Reutilization Inception Network for Image Classification | Yuanpeng He et.al. | 2412.19433 | null |
| 2024-12-27 | An In-Depth Analysis of Adversarial Discriminative Domain Adaptation for Digit Classification | Eugene Choi et.al. | 2412.19391 | link |
| 2024-12-26 | Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components | Tengxue Zhang et.al. | 2412.19085 | null |
| 2024-12-26 | Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability | Ruixi Lin et.al. | 2412.19018 | null |
| 2024-12-25 | Injecting Bias into Text Classification Models using Backdoor Attacks | A. Dilara Yavuz et.al. | 2412.18975 | null |
| 2024-12-25 | Research Experiment on Multi-Model Comparison for Chinese Text Classification Tasks | JiaCheng Li et.al. | 2412.18908 | null |
| 2024-12-24 | VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Shicheng Yin et.al. | 2412.18178 | link |
| 2024-12-24 | Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Francois Chaubard et.al. | 2412.18052 | null |
| 2024-12-23 | Explainability in Neural Networks for Natural Language Processing Tasks | Melkamu Mersha et.al. | 2412.18036 | null |
| 2024-12-23 | COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning | Arnav M. Das et.al. | 2412.17684 | null |
| 2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
| 2024-12-23 | Domain-Incremental Learning for Audio Classification | Manjunath Mulimani et.al. | 2412.17424 | null |
| 2024-12-23 | An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification | Andre Rusli et.al. | 2412.17361 | link |
| 2024-12-23 | DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2412.17350 | link |
| 2024-12-22 | Survey on Abstractive Text Summarization: Dataset, Models, and Metrics | Gospel Ozioma Nnadi et.al. | 2412.17165 | link |
| 2024-12-22 | LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning | Fanshuang Kong et.al. | 2412.16963 | link |
| 2024-12-22 | Predicting the Reliability of an Image Classifier under Image Distortion | Dang Nguyen et.al. | 2412.16881 | null |
| 2024-12-21 | Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification | Changchang Sun et.al. | 2412.16780 | null |
| 2024-12-21 | UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning | Long Zhou et.al. | 2412.16739 | link |
| 2024-12-20 | Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks | Enis Baty et.al. | 2412.16146 | link |
| 2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
| 2024-12-20 | A Thorough Investigation into the Application of Deep CNN for Enhancing Natural Language Processing Capabilities | Chang Weng et.al. | 2412.15900 | null |
| 2024-12-20 | Continual Learning Using a Kernel-Based Method Over Foundation Models | Saleh Momeni et.al. | 2412.15571 | link |
| 2024-12-19 | Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models | Tianchen Zhang et.al. | 2412.15431 | null |
| 2024-12-19 | Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers | Zhu Liao et.al. | 2412.15077 | null |
| 2024-12-18 | Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models | Anna Scius-Bertrand et.al. | 2412.13859 | null |
| 2024-12-18 | Modelling Multi-modal Cross-interaction for ML-FSIC Based on Local Feature Selection | Kun Yan et.al. | 2412.13732 | null |
| 2024-12-18 | MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency | Fatemeh Froughirad et.al. | 2412.13703 | null |
| 2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
| 2024-12-17 | Token-Level Graphs for Short Text Classification | Gregor Donabauer et.al. | 2412.12754 | link |
| 2024-12-17 | Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification | Nan Li et.al. | 2412.12744 | link |
| 2024-12-17 | ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Wangyu Xue et.al. | 2412.12675 | null |
| 2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
| 2024-12-19 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guangwenjie Zou et.al. | 2412.12603 | link |
| 2024-12-17 | Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling | Iman Khazrak et.al. | 2412.12532 | link |
| 2024-12-16 | Gramian Multimodal Representation Learning and Alignment | Giordano Cicchetti et.al. | 2412.11959 | link |
| 2024-12-16 | The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification | Ahmad Hassanpour et.al. | 2412.11951 | null |
| 2024-12-16 | Does VLM Classification Benefit from LLM Description Semantics? | Pingchuan Ma et.al. | 2412.11917 | link |
| 2024-12-16 | Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning | RunLin Yu et.al. | 2412.11715 | null |
| 2024-12-16 | LMM-Regularized CLIP Embeddings for Image Classification | Maria Tzelepi et.al. | 2412.11663 | null |
| 2024-12-16 | Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning | Dipanwita Thakur et.al. | 2412.11660 | null |
| 2024-12-16 | CNNtention: Can CNNs do better with Attention? | Julian Glattki et.al. | 2412.11657 | link |
| 2024-12-16 | Explicit and Implicit Graduated Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11501 | link |
| 2024-12-16 | Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models | Zaifu Zhan et.al. | 2412.11455 | null |
| 2024-12-16 | Scaled Conjugate Gradient Method for Nonconvex Optimization in Deep Neural Networks | Naoki Sato et.al. | 2412.11400 | null |
| 2024-12-13 | Robust image classification with multi-modal large language models | Francesco Villani et.al. | 2412.10353 | null |
| 2024-12-13 | MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
| 2024-12-13 | Label-template based Few-Shot Text Classification with Contrastive Learning | Guanghua Hou et.al. | 2412.10110 | null |
| 2024-12-13 | Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification | Zi Yang et.al. | 2412.10091 | link |
| 2024-12-13 | Low-Resource Fast Text Classification Based on Intra-Class and Inter-Class Distance Calculation | Yanxu Mao et.al. | 2412.09922 | null |
| 2024-12-12 | DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations | Wenhao Hu et.al. | 2412.09687 | null |
| 2024-12-12 | Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis | Raj Hansini Khoiwal et.al. | 2412.09445 | null |
| 2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
| 2024-12-12 | Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation | Davor Vukadin et.al. | 2412.09311 | link |
| 2024-12-13 | An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Chunxiao Li et.al. | 2412.09063 | null |
| 2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | null |
| 2024-12-12 | Stochastic Learning of Non-Conjugate Variational Posterior for Image Classification | Kart-Leong Lim et.al. | 2412.08951 | null |
| 2024-12-11 | BDA: Bangla Text Data Augmentation Framework | Md. Tariquzzaman et.al. | 2412.08753 | null |
| 2024-12-11 | Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning | Hang Zhao et.al. | 2412.08587 | null |
| 2024-12-11 | ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts | Sinan Du et.al. | 2412.08341 | null |
| 2024-12-11 | Online training and pruning of photonic neural networks | Jiawei Zhang et.al. | 2412.08184 | null |
| 2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
| 2024-12-11 | Concept Bottleneck Large Language Models | Chung-En Sun et.al. | 2412.07992 | link |
| 2024-12-10 | FastDDS-Based Middleware System for Remote X-Ray Image Classification Using Raspberry Pi | Omar H. Khater et.al. | 2412.07818 | null |
| 2024-12-10 | Leveraging Content and Context Cues for Low-Light Image Enhancement | Igor Morawski et.al. | 2412.07693 | link |
| 2024-12-10 | Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Ahmed Luqman et.al. | 2412.07391 | null |
| 2024-12-10 | Image Classification Using Singular Value Decomposition and Optimization | Isabela M. Yepes et.al. | 2412.07288 | link |
| 2024-12-10 | An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications | Kayne Uriel K. Rodrigo et.al. | 2412.07182 | null |
| 2024-12-09 | Convolution goes higher-order: a biologically inspired mechanism empowers image classification | Simone Azeglio et.al. | 2412.06740 | null |
| 2024-12-09 | Impact of Privacy Parameters on Deep Learning Models for Image Classification | Basanta Chaulagain et.al. | 2412.06689 | null |
| 2024-12-10 | Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy | Min Zeng et.al. | 2412.06575 | null |
| 2024-12-09 | How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning | Yuanyuan Wang et.al. | 2412.06451 | null |
| 2024-12-09 | Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models | Zhen Qi et.al. | 2412.06249 | null |
| 2024-12-08 | Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis | Yuemei Ren et.al. | 2412.06075 | null |
| 2024-12-08 | Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Joohyuk Park et.al. | 2412.06038 | null |
| 2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link |
| 2024-12-06 | MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents | Avaneesh Devkota et.al. | 2412.04847 | null |
| 2024-12-05 | Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Shaunak Halbe et.al. | 2412.04429 | link |
| 2024-12-05 | FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning | Pranab Sahoo et.al. | 2412.04416 | link |
| 2024-12-05 | Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation | Ilán Carretero et.al. | 2412.04260 | null |
| 2024-12-05 | Demonstration Selection for In-Context Learning via Reinforcement Learning | Xubin Wang et.al. | 2412.03966 | null |
| 2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
| 2024-12-05 | Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification | Zhu Han et.al. | 2412.03897 | null |
| 2024-12-05 | Dual-Branch Subpixel-Guided Network for Hyperspectral Image Classification | Zhu Han et.al. | 2412.03893 | link |
| 2024-12-04 | Language Model Meets Prototypes: Towards Interpretable Text Classification Models through Prototypical Networks | Ximing Wen et.al. | 2412.03761 | null |
| 2024-12-05 | Continual Low-Rank Scaled Dot-product Attention | Ginés Carreto Picón et.al. | 2412.03214 | null |
| 2024-12-04 | Multi-Level Correlation Network For Few-Shot Image Classification | Yunkai Dang et.al. | 2412.03159 | link |
| 2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
| 2024-12-04 | Higher Order Transformers: Efficient Attention Mechanism for Tensor Structured Data | Soroush Omranpour et.al. | 2412.02919 | null |
| 2024-12-03 | Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing | Nanyang Ye et.al. | 2412.02779 | null |
| 2024-12-03 | Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning | Zhaozhi Wang et.al. | 2412.02759 | null |
| 2024-12-03 | Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Jinjin Cai et.al. | 2412.02531 | null |
| 2024-12-04 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
| 2024-12-03 | Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model | Xi Cao et.al. | 2412.02343 | null |
| 2024-12-03 | Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval | Leah Bar et.al. | 2412.02310 | link |
| 2024-12-03 | A Classic-Quantum Hybrid Network Framework: CQH-Net | Ao Liu et.al. | 2412.02059 | null |
| 2024-12-02 | PROFIT: A PROximal FIne Tuning Optimizer for Multi-Task Learning | Anirudh S Chakravarthy et.al. | 2412.01930 | null |
| 2024-12-02 | Concept Based Continuous Prompts for Interpretable Text Classification | Qian Chen et.al. | 2412.01644 | link |
| 2024-12-02 | NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers | Angel Yahir Loredo Lopez et.al. | 2412.01621 | null |
| 2024-12-02 | Explaining the Unexplained: Revealing Hidden Correlations for Better Interpretability | Wen-Dong Jiang et.al. | 2412.01365 | null |
| 2024-12-02 | Class Distance Weighted Cross Entropy Loss for Classification of Disease Severity | Gorkem Polat et.al. | 2412.01246 | null |
| 2024-11-29 | LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Taja Kuzman et.al. | 2411.19638 | link |
| 2024-11-29 | FairDD: Fair Dataset Distillation via Synchronized Matching | Qihang Zhou et.al. | 2411.19623 | null |
| 2024-11-29 | Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency | Akshaya Rajesh et.al. | 2411.19611 | null |
| 2024-11-29 | Contextual Checkerboard Denoise – A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising | Md. Touhidul Islam et.al. | 2411.19549 | link |
| 2024-11-28 | CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Mohamed Fazli Imam et.al. | 2411.19346 | link |
| 2024-11-28 | Quantum Neural Networks in Practice: A Comparative Study with Classical Models from Standard Data Sets to Industrial Images | Daniel Basilewitsch et.al. | 2411.19276 | null |
| 2024-11-28 | Controlling Participation in Federated Learning with Feedback | Michael Cummins et.al. | 2411.19242 | null |
| 2024-11-28 | Introducing Three New Benchmark Datasets for Hierarchical Text Classification | Jaco du Toit et.al. | 2411.19119 | null |
| 2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | null |
| 2024-11-27 | Fall Leaf Adversarial Attack on Traffic Sign Classification | Anthony Etim et.al. | 2411.18776 | null |
| 2024-11-27 | Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | Aoran Shen et.al. | 2411.18622 | null |
| 2024-11-27 | Pruning Deep Convolutional Neural Network Using Conditional Mutual Information | Tien Vu-Van et.al. | 2411.18578 | null |
| 2024-11-27 | Mixture of Experts in Image Classification: What’s the Sweet Spot? | Mathurin Videau et.al. | 2411.18322 | null |
| 2024-11-27 | KANs for Computer Vision: An Experimental Study | Karthik Mohan et.al. | 2411.18224 | null |
| 2024-11-27 | Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2411.18115 | link |
| 2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
| 2024-11-27 | Optimized Tradeoffs for Private Prediction with Majority Ensembling | Shuli Jiang et.al. | 2411.17965 | null |
| 2024-11-26 | What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics | Jordan J. Bird et.al. | 2411.17593 | null |
| 2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | link |
| 2024-11-26 | SpikeAtConv: An Integrated Spiking-Convolutional Attention Architecture for Energy-Efficient Neuromorphic Vision Processing | Wangdan Liao et.al. | 2411.17439 | null |
| 2024-11-26 | CoA: Chain-of-Action for Generative Semantic Labels | Meng Wei et.al. | 2411.17406 | link |
| 2024-11-26 | BadScan: An Architectural Backdoor Attack on Visual State Space Models | Om Suhas Deshmukh et.al. | 2411.17283 | null |
| 2024-11-26 | An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models | Yunzhe Hu et.al. | 2411.17182 | null |
| 2024-11-25 | Contrastive Multi-graph Learning with Neighbor Hierarchical Sifting for Semi-supervised Text Classification | Wei Ai et.al. | 2411.16787 | null |
| 2024-11-25 | A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports | Gabriel Okasa et.al. | 2411.16662 | link |
| 2024-11-25 | Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models | Donggeun Ko et.al. | 2411.16079 | null |
| 2024-11-24 | Context-Aware Detection of Mixed Critical Events using Video Classification | Filza Akhlaq et.al. | 2411.15773 | null |
| 2024-11-23 | MUNBa: Machine Unlearning via Nash Bargaining | Jing Wu et.al. | 2411.15537 | null |
| 2024-11-23 | Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Zhiying Li et.al. | 2411.15439 | null |
| 2024-11-22 | MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs | Chaoyou Fu et.al. | 2411.15296 | null |
| 2024-11-21 | CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning | Marco Paul E. Apolinario et.al. | 2411.15235 | null |
| 2024-11-21 | BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models | Taha Koleilat et.al. | 2411.15232 | link |
| 2024-11-22 | FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification | Zhengrui Guo et.al. | 2411.14743 | link |
| 2024-11-21 | Adaptable Embeddings Network (AEN) | Stan Loosmore et.al. | 2411.13786 | null |
| 2024-11-20 | Hierarchical Text Classification (HTC) vs. eXtreme Multilabel Classification (XML): Two Sides of the Same Medal | Nerijus Bertalis et.al. | 2411.13687 | link |
| 2024-11-20 | Combining Autoregressive and Autoencoder Language Models for Text Classification | João Gonçalves et.al. | 2411.13282 | link |
| 2024-11-20 | MEGL: Multimodal Explanation-Guided Learning | Yifei Zhang et.al. | 2411.13053 | null |
| 2024-11-19 | Problem-dependent convergence bounds for randomized linear gradient compression | Thomas Flynn et.al. | 2411.12898 | null |
| 2024-11-19 | Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs | Ahmed Akib Jawad Karim et.al. | 2411.12712 | null |
| 2024-11-22 | STREAM: A Universal State-Space Model for Sparse Geometric Data | Mark Schöne et.al. | 2411.12603 | null |
| 2024-11-19 | AdaCM $^2$ : On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Yuanbin Man et.al. | 2411.12593 | null |
| 2024-11-19 | Zero-Shot Crate Digging: DJ Tool Retrieval Using Speech Activity, Music Structure And CLAP Embeddings | Iroro Orife et.al. | 2411.12209 | link |
| 2024-11-19 | Invariant Shape Representation Learning For Image Classification | Tonmoy Hossain et.al. | 2411.12201 | link |
| 2024-11-19 | Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification | Yuyang Xiao et.al. | 2411.12151 | null |
| 2024-11-18 | Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning | Arundhati S. Shanbhag et.al. | 2411.12073 | link |
| 2024-11-18 | Vision Language Models Are Few-Shot Audio Spectrogram Classifiers | Satvik Dixit et.al. | 2411.12058 | null |
| 2024-11-18 | Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging | Milad Masroor et.al. | 2411.11939 | null |
| 2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
| 2024-11-16 | MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Yuhong Chou et.al. | 2411.10741 | null |
| 2024-11-16 | Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image | Jiawen Li et.al. | 2411.10709 | null |
| 2024-11-16 | Multi-perspective Contrastive Logit Distillation | Qi Wang et.al. | 2411.10693 | null |
| 2024-11-15 | Vision Eagle Attention: A New Lens for Advancing Image Classification | Mahmudul Hasan et.al. | 2411.10564 | link |
| 2024-11-15 | On the Cost of Model-Serving Frameworks: An Experimental Evaluation | Pasquale De Rosa et.al. | 2411.10337 | null |
| 2024-11-15 | Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin | Youngjoon Lee et.al. | 2411.10212 | link |
| 2024-11-15 | Outliers resistant image classification by anomaly detection | Anton Sergeev et.al. | 2411.10150 | null |
| 2024-11-15 | Adapting the Biological SSVEP Response to Artificial Neural Networks | Emirhan Böge et.al. | 2411.10084 | null |
| 2024-11-15 | Evidential Federated Learning for Skin Lesion Image Classification | Rutger Hendrix et.al. | 2411.10071 | null |
| 2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
| 2024-11-14 | ResidualDroppath: Enhancing Feature Reuse over Residual Connections | Sejik Park et.al. | 2411.09475 | null |
| 2024-11-14 | SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers | Shravan Venkatraman et.al. | 2411.09420 | null |
| 2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | link |
| 2024-11-13 | Computed tomography using meta-optics | Maksym Zhelyeznuyakov et.al. | 2411.08995 | null |
| 2024-11-13 | CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt | Mohammad Mahdi Mohajeri et.al. | 2411.08979 | null |
| 2024-11-13 | ScaleNet: Scale Invariance Learning in Directed Graphs | Qin Jiang et.al. | 2411.08758 | link |
| 2024-11-13 | Efficient Whole Slide Image Classification through Fisher Vector Representation | Ravi Kant Gupta et.al. | 2411.08530 | null |
| 2024-11-12 | HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification | Cheng Jin et.al. | 2411.07660 | null |
| 2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | null |
| 2024-11-11 | The Inherent Adversarial Robustness of Analog In-Memory Computing | Corey Lammie et.al. | 2411.07023 | null |
| 2024-11-11 | ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Jiawei Fan et.al. | 2411.06786 | link |
| 2024-11-11 | A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts | Liu Zhuoxian et.al. | 2411.06772 | null |
| 2024-11-11 | Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision | Yueyang Cang et.al. | 2411.06727 | null |
| 2024-11-10 | Deep Active Learning in the Open World | Tian Xie et.al. | 2411.06353 | null |
| 2024-11-09 | Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs | Shan Zhong et.al. | 2411.06175 | null |
| 2024-11-09 | AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | Zhiyu Zhu et.al. | 2411.06146 | null |
| 2024-11-09 | Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures | Evgeny Sedov et.al. | 2411.06124 | null |
| 2024-11-09 | Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning | Yuanxiu Wang et.al. | 2411.06100 | null |
| 2024-11-08 | GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification | Priya Mishra et.al. | 2411.05991 | link |
| 2024-11-08 | FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information | Shreen Gul et.al. | 2411.05752 | link |
| 2024-11-08 | Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification | Antonio De Santis et.al. | 2411.05698 | null |
| 2024-11-08 | Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2411.05603 | null |
| 2024-11-08 | Training objective drives the consistency of representational similarity across datasets | Laure Ciernik et.al. | 2411.05561 | link |
| 2024-11-08 | Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach | Gideon Yoffe et.al. | 2411.04950 | null |
| 2024-11-07 | Attention Masks Help Adversarial Attacks to Bypass Safety Detectors | Yunfan Shi et.al. | 2411.04772 | link |
| 2024-11-07 | Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks | Sanja Karilanova et.al. | 2411.04760 | null |
| 2024-11-07 | Is network fragmentation a useful complexity measure? | Coenraad Mouton et.al. | 2411.04695 | null |
| 2024-11-07 | DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models | Zijian Zhang et.al. | 2411.04649 | null |
| 2024-11-07 | Neural Fingerprints for Adversarial Attack Detection | Haim Fisher et.al. | 2411.04533 | link |
| 2024-11-06 | Multimodal Structure-Aware Quantum Data Processing | Hala Hawashin et.al. | 2411.04242 | null |
| 2024-11-06 | RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Maya Varma et.al. | 2411.04097 | link |
| 2024-11-06 | Overcoming label shift in targeted federated learning | Edvin Listo Zec et.al. | 2411.03799 | null |
| 2024-11-06 | Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization | Yuhao He et.al. | 2411.03752 | null |
| 2024-11-05 | Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification | Zhang Qixiang et.al. | 2411.03041 | null |
| 2024-11-06 | Confidence Calibration of Classifiers with Many Classes | Adrien LeCoz et.al. | 2411.02988 | link |
| 2024-11-05 | Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization | Pengkun Jiao et.al. | 2411.02920 | null |
| 2024-11-05 | ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate | Shohei Taniguchi et.al. | 2411.02853 | link |
| 2024-11-05 | Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion | Yaowen Hu et.al. | 2411.02734 | null |
| 2024-11-06 | Wave Network: An Ultra-Small Language Model | Xin Zhang et.al. | 2411.02674 | null |
| 2024-11-04 | FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification | Bidisha Chakraborty et.al. | 2411.02637 | null |
| 2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
| 2024-11-04 | A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification | Sorouralsadat Fatemi et.al. | 2411.02476 | null |
| 2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
| 2024-11-03 | Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification | Vaneeta Ahlawat et.al. | 2411.01652 | null |
| 2024-11-03 | ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis | Xinyu Geng et.al. | 2411.01564 | null |
| 2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
| 2024-11-02 | Combining Financial Data and News Articles for Stock Price Movement Prediction Using Large Language Models | Ali Elahi et.al. | 2411.01368 | null |
| 2024-11-02 | Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks | Aarjav Kavathia et.al. | 2411.01348 | null |
| 2024-11-02 | MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN | Nafiz Fahad et.al. | 2411.01163 | null |
| 2024-11-02 | Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement | Bryan Bo Cao et.al. | 2411.01099 | link |
| 2024-11-01 | Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning | Yuqing Zhou et.al. | 2411.01045 | null |
| 2024-11-01 | FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection | Simon Gutwein et.al. | 2411.01025 | link |
| 2024-10-31 | Video Token Merging for Long-form Video Understanding | Seon-Ho Lee et.al. | 2410.23782 | null |
| 2024-10-31 | Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 | Weijie Ke et.al. | 2410.23776 | null |
| 2024-10-31 | QUEST-A: Untrained Filtering with Trained Focusing led to Enhanced Quantum Architectures | Lian-Hui Yu et.al. | 2410.23560 | link |
| 2024-11-01 | Large Language Models for Patient Comments Multi-Label Classification | Hajar Sakai et.al. | 2410.23528 | null |
| 2024-10-30 | Multilingual Vision-Language Pre-training for the Remote Sensing Domain | João Daniel Silva et.al. | 2410.23370 | null |
| 2024-10-30 | Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks | Axel Klawonn et.al. | 2410.23359 | null |
| 2024-10-30 | CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP | Tianyu Yang et.al. | 2410.23330 | null |
| 2024-10-30 | Don’t Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification | Debjyoti Saharoy et.al. | 2410.23066 | null |
| 2024-10-30 | Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers | Lam Nguyen Tung et.al. | 2410.22663 | null |
| 2024-10-29 | Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm | Zaniar Sharifi et.al. | 2410.22487 | null |
| 2024-10-29 | EfficientNet with Hybrid Attention Mechanisms for Enhanced Breast Histopathology Classification: A Comprehensive Approach | Naren Sengodan et.al. | 2410.22392 | null |
| 2024-10-29 | DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R. Menon et.al. | 2410.22239 | null |
| 2024-10-29 | Class-Aware Contrastive Optimization for Imbalanced Text Classification | Grigorii Khvatskii et.al. | 2410.22197 | null |
| 2024-10-29 | Active Learning for Vision-Language Models | Bardia Safaei et.al. | 2410.22187 | null |
| 2024-10-29 | Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets | Adrian Iordache et.al. | 2410.22184 | link |
| 2024-10-29 | Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review | Muhammad Bilal et.al. | 2410.22180 | null |
| 2024-10-29 | FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection | Dat Nguyen et.al. | 2410.21964 | null |
| 2024-10-29 | Bayesian Optimization for Hyperparameters Tuning in Neural Networks | Gabriele Onorato et.al. | 2410.21886 | null |
| 2024-10-29 | Advancing Efficient Brain Tumor Multi-Class Classification – New Insights from the Vision Mamba Model in Transfer Learning | Yinyi Lai et.al. | 2410.21872 | null |
| 2024-10-28 | Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks | Noel Elias et.al. | 2410.21561 | null |
| 2024-10-30 | A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth | Noel Elias et.al. | 2410.21557 | null |
| 2024-10-28 | Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models | Piotr Przybyła et.al. | 2410.20940 | null |
| 2024-10-28 | Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning | Bing Han et.al. | 2410.20775 | null |
| 2024-10-28 | Interpretable Image Classification with Adaptive Prototype-based Vision Transformers | Chiyu Ma et.al. | 2410.20722 | null |
| 2024-10-27 | Graph Neural Networks on Discriminative Graphs of Words | Yassine Abbahaddou et.al. | 2410.20469 | null |
| 2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | null |
| 2024-10-27 | Sequential Large Language Model-Based Hyper-Parameter Optimization | Kanan Mahammadli et.al. | 2410.20302 | link |
| 2024-10-26 | Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search | Akhilbaran Ghosh et.al. | 2410.20234 | null |
| 2024-10-26 | Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Adit Jain et.al. | 2410.20041 | null |
| 2024-10-26 | Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions | Poojitha Thota et.al. | 2410.20019 | null |
| 2024-10-26 | Vulnerability of LLMs to Vertically Aligned Text Manipulations | Zhecheng Li et.al. | 2410.20016 | null |
| 2024-10-25 | Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective | Ethan Harvey et.al. | 2410.19675 | null |
| 2024-10-24 | Noise Adaption Network for Morse Code Image Classification | Xiaxia Wang et.al. | 2410.19180 | link |
| 2024-10-24 | Hybrid Quantum-Classical Feature Extraction approach for Image Classification using Autoencoders and Quantum SVMs | Donovan Slabbert et.al. | 2410.18814 | null |
| 2024-10-24 | Spatial-Temporal Search for Spiking Neural Networks | Kaiwei Che et.al. | 2410.18580 | null |
| 2024-10-25 | Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Lehan Wang et.al. | 2410.18387 | null |
| 2024-10-23 | Using Cartesian slice plots of a cosmological simulation as input of a convolutional neural network | Guillermo Arreaga-Garcia et.al. | 2410.18320 | null |
| 2024-10-25 | Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | Dongliang Guo et.al. | 2410.18267 | null |
| 2024-10-23 | Future Token Prediction – Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction | Nicholas Walker et.al. | 2410.18160 | null |
| 2024-10-23 | Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | Edoardo Legnaro et.al. | 2410.17816 | null |
| 2024-10-23 | New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture | Ach. Khozaimi et.al. | 2410.17735 | null |
| 2024-10-24 | Advancing Interpretability in Text Classification through Prototype Learning | Bowen Wei et.al. | 2410.17546 | null |
| 2024-10-23 | Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning | Jun-En Ding et.al. | 2410.17494 | null |
| 2024-10-22 | Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection | Mahesh Vaijainthymala Krishnamoorthy et.al. | 2410.17459 | null |
| 2024-10-22 | Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu et.al. | 2410.17251 | null |
| 2024-10-22 | KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Md Meftahul Ferdaus et.al. | 2410.17172 | link |
| 2024-10-22 | Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | Ganga Prasad Basyal et.al. | 2410.16711 | null |
| 2024-10-21 | Efficient Neural Network Training via Subset Pretraining | Jan Spörer et.al. | 2410.16523 | null |
| 2024-10-21 | 1024m at SMM4H 2024: Tasks 3, 5 & 6 – Ensembles of Transformers and Large Language Models for Medical Text Classification | Ram Mohan Rao Kadiyala et.al. | 2410.15998 | null |
| 2024-10-21 | Visual Representation Learning Guided By Multi-modal Prior Knowledge | Hongkuan Zhou et.al. | 2410.15981 | null |
| 2024-10-21 | AutoTrain: No-code training for state-of-the-art models | Abhishek Thakur et.al. | 2410.15735 | link |
| 2024-10-21 | ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | Xumeng Han et.al. | 2410.15732 | null |
| 2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
| 2024-10-20 | Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Yusuke Hosoya et.al. | 2410.15315 | link |
| 2024-10-19 | Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Chaodong Xiao et.al. | 2410.15091 | link |
| 2024-10-19 | PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification | Ashish Seth et.al. | 2410.15062 | null |
| 2024-10-19 | Weakly-supervised diagnosis identification from Italian discharge letters | Vittorio Torri et.al. | 2410.15051 | null |
| 2024-10-19 | Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | Seulbi Lee et.al. | 2410.14975 | null |
| 2024-10-18 | A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification | Maksuda Akter et.al. | 2410.14536 | null |
| 2024-10-18 | Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Shuai Zhao et.al. | 2410.14425 | link |
| 2024-10-18 | A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference | Bin Zhang et.al. | 2410.14236 | null |
| 2024-10-18 | Comparative Evaluation of Clustered Federated Learning Method | Michael Ben Ali et.al. | 2410.14212 | link |
| 2024-10-17 | Reproducibility study of “LICO: Explainable Models with Language-Image Consistency” | Luan Fletcher et.al. | 2410.13989 | link |
| 2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
| 2024-10-17 | Augmentation Policy Generation for Image Classification Using Large Language Models | Ant Duru et.al. | 2410.13453 | null |
| 2024-10-17 | Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification | Guangming Huang et.al. | 2410.13439 | null |
| 2024-10-16 | Interpreting and Analyzing CLIP’s Zero-Shot Image Classification via Mutual Knowledge | Fawaz Sammani et.al. | 2410.13016 | link |
| 2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
| 2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
| 2024-10-17 | From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs | Lukas Birkenmaier et.al. | 2410.12622 | null |
| 2024-10-16 | Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Yong Zhang et.al. | 2410.12396 | null |
| 2024-10-15 | Clustering doc2vec output for topic-dimensionality reduction: A MITRE ATT&CK calibration | Nathan Monnet et.al. | 2410.11573 | null |
| 2024-10-15 | LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models | Hossein Abdi et.al. | 2410.11551 | null |
| 2024-10-15 | Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning | Minoo Jafarlou et.al. | 2410.11355 | null |
| 2024-10-14 | Towards a More Complete Theory of Function Preserving Transforms | Michael Painter et.al. | 2410.11038 | null |
| 2024-10-14 | Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | Etai Littwin et.al. | 2410.10773 | null |
| 2024-10-15 | Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Yosuke Yamagishi et.al. | 2410.10710 | link |
| 2024-10-14 | Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification | Jiaxiang Gou et.al. | 2410.10573 | null |
| 2024-10-14 | Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC Units | Maedeh Ghaderi et.al. | 2410.10545 | null |
| 2024-10-14 | Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks | Xinyue Liu et.al. | 2410.10454 | link |
| 2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | link |
| 2024-10-14 | A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | Nikolaos Mylonas et.al. | 2410.10290 | null |
| 2024-10-14 | big.LITTLE Vision Transformer for Efficient Visual Recognition | He Guo et.al. | 2410.10267 | null |
| 2024-10-14 | SkillAggregation: Reference-free LLM-Dependent Aggregation | Guangzhi Sun et.al. | 2410.10215 | null |
| 2024-10-14 | Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | Zeliang Zhang et.al. | 2410.10160 | null |
| 2024-10-11 | Efficient Hyperparameter Importance Assessment for CNNs | Ruinan Wang et.al. | 2410.08920 | null |
| 2024-10-11 | Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning | Nusrat Jahan Prottasha et.al. | 2410.08598 | null |
| 2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | link |
| 2024-10-11 | Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | Yiyue Chen et.al. | 2410.08508 | null |
| 2024-10-11 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
| 2024-10-10 | Bilinear MLPs enable weight-based mechanistic interpretability | Michael T. Pearce et.al. | 2410.08417 | null |
| 2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
| 2024-10-10 | Time Traveling to Defend Against Adversarial Example Attacks in Image Classification | Anthony Etim et.al. | 2410.08338 | null |
| 2024-10-10 | More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Sagi Shaier et.al. | 2410.08003 | null |
| 2024-10-10 | When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections | Keryan Chelouche et.al. | 2410.07689 | null |
| 2024-10-10 | Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks | Minxing Zhang et.al. | 2410.07670 | null |
| 2024-10-10 | StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models | Minchan Kwon et.al. | 2410.07652 | null |
| 2024-10-10 | Explainability of Deep Neural Networks for Brain Tumor Detection | S. Park et.al. | 2410.07613 | link |
| 2024-10-10 | CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | Po-han Li et.al. | 2410.07610 | null |
| 2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
| 2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
| 2024-10-09 | Optimizing Estimators of Squared Calibration Errors in Classification | Sebastian G. Gruber et.al. | 2410.07014 | null |
| 2024-10-09 | Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks | Friedrich Wolf-Monheim et.al. | 2410.06927 | null |
| 2024-10-09 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | null |
| 2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
| 2024-10-08 | A Comparative Study of Hybrid Models in Health Misinformation Text Classification | Mkululi Sikosana et.al. | 2410.06311 | null |
| 2024-10-08 | Conformal Structured Prediction | Botong Zhang et.al. | 2410.06296 | link |
| 2024-10-08 | TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data | Jeremy Andrew Irvin et.al. | 2410.06234 | null |
| 2024-10-08 | Manual Verbalizer Enrichment for Few-Shot Text Classification | Quang Anh Nguyen et.al. | 2410.06173 | null |
| 2024-10-07 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
| 2024-10-07 | Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge | Senorita Deb et.al. | 2410.05189 | null |
| 2024-10-07 | IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification | Yan He et.al. | 2410.05100 | null |
| 2024-10-07 | Explanation sensitivity to the randomness of large language models: the case of journalistic text classification | Jeremie Bogaert et.al. | 2410.05085 | null |
| 2024-10-07 | Control-oriented Clustering of Visual Latent Representation | Han Qi et.al. | 2410.05063 | null |
| 2024-10-07 | SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification | Benjamin Feuer et.al. | 2410.05057 | link |
| 2024-10-07 | Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks | Sandro Boccuzzo et.al. | 2410.04866 | null |
| 2024-10-06 | MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network | Doanh C. Bui et.al. | 2410.04507 | null |
| 2024-10-06 | Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Zhaorui Tan et.al. | 2410.04492 | link |
| 2024-10-05 | IT $^3$ : Idempotent Test-Time Training | Nikita Durasov et.al. | 2410.04201 | null |
| 2024-10-04 | Classification-Denoising Networks | Louis Thiry et.al. | 2410.03505 | null |
| 2024-10-04 | A Multimodal Framework for Deepfake Detection | Kashish Gandhi et.al. | 2410.03487 | null |
| 2024-10-04 | On Uncertainty In Natural Language Processing | Dennis Ulmer et.al. | 2410.03446 | link |
| 2024-10-04 | Comparing zero-shot self-explanations with human rationales in multilingual text classification | Stephanie Brandl et.al. | 2410.03296 | null |
| 2024-10-04 | Sm: enhanced localization in Multiple Instance Learning for medical imaging classification | Francisco M. Castro-Macías et.al. | 2410.03276 | null |
| 2024-10-04 | Selective Transformer for Hyperspectral Image Classification | Yichu Xu et.al. | 2410.03171 | null |
| 2024-10-03 | CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification | Jinghao Shi et.al. | 2410.03038 | null |
| 2024-10-03 | On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions | Huy Nguyen et.al. | 2410.02935 | null |
| 2024-10-03 | Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | Zakhar Shumaylov et.al. | 2410.02698 | null |
| 2024-10-03 | LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | Duy M. H. Nguyen et.al. | 2410.02615 | null |
| 2024-10-03 | Personalized Quantum Federated Learning for Privacy Image Classification | Jinjing Shi et.al. | 2410.02547 | null |
| 2024-10-03 | BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning | Gustav Wagner Zakarias et.al. | 2410.02387 | null |
| 2024-10-03 | CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration | Thomas Buddenkotte et.al. | 2410.02316 | link |
| 2024-10-03 | Hard Negative Sample Mining for Whole Slide Image Classification | Wentao Huang et.al. | 2410.02212 | link |
| 2024-10-02 | Kolmogorov-Arnold Network Autoencoders | Mohammadamin Moradi et.al. | 2410.02077 | link |
| 2024-10-02 | Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data | Sreyan Ghosh et.al. | 2410.02056 | null |
| 2024-10-02 | FLAG: Financial Long Document Classification via AMR-based GNN | Bolun et.al. | 2410.02024 | link |
| 2024-10-02 | MONICA: Benchmarking on Long-tailed Medical Image Classification | Lie Ju et.al. | 2410.02010 | null |
| 2024-10-02 | Revisiting Hierarchical Text Classification: Inference and Metrics | Roman Plaud et.al. | 2410.01305 | link |
| 2024-10-02 | Automatic deductive coding in discourse analysis: an application of large language models in learning analytics | Lishan Zhang et.al. | 2410.01240 | null |
| 2024-10-01 | Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang et.al. | 2410.01083 | link |
| 2024-10-01 | Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | Mostafa Hajighasemloua et.al. | 2410.00779 | null |
| 2024-10-01 | NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Chi-Sheng Chen et.al. | 2410.00712 | null |
| 2024-10-01 | TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids | Mazen Balat et.al. | 2410.00403 | null |
| 2024-09-30 | KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA | Sachin Karmani et.al. | 2410.00267 | null |
| 2024-09-30 | A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification | Marina Ribeiro et.al. | 2410.00250 | null |
| 2024-09-30 | Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques | Tin Yuet Chung et.al. | 2410.00207 | null |
| 2024-10-02 | Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification | Kush Dubey et.al. | 2410.00179 | link |
| 2024-09-30 | POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator | Eugenio Lomurno et.al. | 2409.20447 | null |
| 2024-09-30 | Satellite image classification with neural quantum kernels | Pablo Rodriguez-Grasa et.al. | 2409.20356 | null |
| 2024-09-30 | All-optical autoencoder machine learning framework using diffractive processors | Peijie Feng et.al. | 2409.20346 | null |
| 2024-09-30 | Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients | Youssef Allouah et.al. | 2409.20329 | null |
| 2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
| 2024-09-30 | Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language | Vincent Beliveau et.al. | 2409.20147 | null |
| 2024-09-30 | SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers | Nick Nikzad et.al. | 2409.19850 | null |
| 2024-09-29 | Adversarial Examples for DNA Classification | Hyunwoo Yoo et.al. | 2409.19788 | null |
| 2024-09-29 | FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification | Kexue Fu et.al. | 2409.19720 | null |
| 2024-09-29 | Vision-Language Models are Strong Noisy Label Detectors | Tong Wei et.al. | 2409.19696 | link |
| 2024-09-27 | Unconditional stability of a recurrent neural circuit implementing divisive normalization | Shivang Rawat et.al. | 2409.18946 | null |
| 2024-09-27 | Subspace Preserving Quantum Convolutional Neural Network Architectures | Léo Monbroussou et.al. | 2409.18918 | null |
| 2024-09-27 | Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation | Md. Farhadul Islam et.al. | 2409.18506 | null |
| 2024-09-26 | Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective | Yu Wang et.al. | 2409.18316 | null |
| 2024-09-26 | Realistic Evaluation of Model Merging for Compositional Generalization | Derek Tam et.al. | 2409.18314 | null |
| 2024-09-26 | DARE: Diverse Visual Question Answering with Robustness Evaluation | Hannah Sterz et.al. | 2409.18023 | null |
| 2024-09-26 | The Lou Dataset – Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis et.al. | 2409.17929 | null |
| 2024-09-26 | Cascade Prompt Learning for Vision-Language Model Adaptation | Ge Wu et.al. | 2409.17805 | null |
| 2024-09-26 | Byzantine-Robust Aggregation for Securing Decentralized Federated Learning | Diego Cajaraville-Aboy et.al. | 2409.17754 | null |
| 2024-09-26 | Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components | Peiyong Wang et.al. | 2409.17583 | link |
| 2024-09-26 | Leveraging Annotator Disagreement for Text Classification | Jin Xu et.al. | 2409.17577 | null |
| 2024-09-26 | Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Xun Zhu et.al. | 2409.17508 | null |
| 2024-09-26 | Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification | Guanyi Mou et.al. | 2409.17474 | null |
| 2024-09-26 | Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models | Yuqing Zhou et.al. | 2409.17455 | null |
| 2024-09-25 | Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting | Jay Zoellin et.al. | 2409.17332 | null |
| 2024-09-25 | BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Yongqi Xu et.al. | 2409.17093 | link |
| 2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
| 2024-09-26 | HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space | Jacob Fein-Ashley et.al. | 2409.16897 | link |
| 2024-09-25 | Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification | Aram Khaksar et.al. | 2409.16884 | null |
| 2024-09-25 | Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | Lucas Piper et.al. | 2409.16838 | link |
| 2024-09-24 | Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification | Leire Benito-Del-Valle et.al. | 2409.16002 | link |
| 2024-09-24 | An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images | Dibyasree Guha et.al. | 2409.15958 | null |
| 2024-09-24 | iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification | Yuanzhe Jin et.al. | 2409.15848 | link |
| 2024-09-23 | Optimizing News Text Classification with Bi-LSTM and Attention Mechanism for Efficient Data Processing | Bingyao Liu et.al. | 2409.15576 | null |
| 2024-09-23 | Critic Loss for Image Classification | Brendan Hogan Rappazzo et.al. | 2409.15565 | null |
| 2024-09-23 | VLMine: Long-Tail Data Mining with Vision Language Models | Mao Ye et.al. | 2409.15486 | null |
| 2024-09-23 | HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning | Naghmeh Shafiee Roudbari et.al. | 2409.15213 | null |
| 2024-09-23 | Benchmarking Edge AI Platforms for High-Performance ML Inference | Rakshith Jayanth et.al. | 2409.14803 | null |
| 2024-09-23 | Less yet robust: crucial region selection for scene recognition | Jianqi Zhang et.al. | 2409.14741 | null |
| 2024-09-22 | Low-Light Enhancement Effect on Classification and Detection: An Empirical Study | Xu Wu et.al. | 2409.14461 | null |
| 2024-09-18 | Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes | Nikita Kiselev et.al. | 2409.11995 | link |
| 2024-09-18 | Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | Jin Jie Sean Yeo et.al. | 2409.11964 | null |
| 2024-09-18 | Agglomerative Token Clustering | Joakim Bruslund Haurum et.al. | 2409.11923 | null |
| 2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | null |
| 2024-09-18 | Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks | Amirhossein Dezhboro et.al. | 2409.11665 | null |
| 2024-09-18 | Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images | A. A. G. Yogi Pramana et.al. | 2409.11644 | null |
| 2024-09-18 | Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network | Yang Liu et.al. | 2409.11619 | null |
| 2024-09-17 | Multi-Cohort Framework with Cohort-Aware Attention and Adversarial Mutual-Information Minimization for Whole Slide Image Classification | Sharon Peled et.al. | 2409.11119 | null |
| 2024-09-17 | Anti-ESIA: Analyzing and Mitigating Impacts of Electromagnetic Signal Injection Attacks | Denglin Kang et.al. | 2409.10922 | null |
| 2024-09-16 | Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? | Kaleb Kassaw et.al. | 2409.10775 | null |
| 2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
| 2024-09-16 | InfoDisent: Explainability of Image Classification Models by Information Disentanglement | Łukasz Struski et.al. | 2409.10329 | null |
| 2024-09-16 | Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation | Neil De La Fuente et.al. | 2409.10286 | null |
| 2024-09-15 | Finetuning CLIP to Reason about Pairwise Differences | Dylan Sam et.al. | 2409.09721 | null |
| 2024-09-15 | Compositional Audio Representation Learning | Sripathi Sridhar et.al. | 2409.09619 | null |
| 2024-09-14 | One missing piece in Vision and Language: A Survey on Comics Understanding | Emanuele Vivoli et.al. | 2409.09502 | link |
| 2024-09-14 | Real-world Adversarial Defense against Patch Attacks based on Diffusion Model | Xingxing Wei et.al. | 2409.09406 | null |
| 2024-09-14 | Turbo your multi-modal classification with contrastive learning | Zhiyu Zhang et.al. | 2409.09282 | null |
| 2024-09-14 | Leveraging Foundation Models for Efficient Federated Learning in Resource-restricted Edge Networks | S. Kawa Atapour et.al. | 2409.09273 | null |
| 2024-09-13 | ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Sreyan Ghosh et.al. | 2409.09213 | link |
| 2024-09-13 | Pushing the boundaries of event subsampling in event-based video classification using CNNs | Hesam Araghi et.al. | 2409.08953 | link |
| 2024-09-13 | Pushing Joint Image Denoising and Classification to the Edge | Thomas C Markhorst et.al. | 2409.08943 | null |
| 2024-09-13 | Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering | Changxin Liu et.al. | 2409.08640 | null |
| 2024-09-13 | Anytime Continual Learning for Open Vocabulary Classification | Zhen Zhu et.al. | 2409.08518 | link |
| 2024-09-12 | Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms | Fatemeh Askari et.al. | 2409.07989 | link |
| 2024-09-12 | Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters | Shun Zou et.al. | 2409.07896 | link |
| 2024-09-12 | Classifying Images with CoLaNET Spiking Neural Network – the MNIST Example | Mikhail Kiselev et.al. | 2409.07833 | null |
| 2024-09-12 | Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption | Zhizheng Lai et.al. | 2409.07751 | null |
| 2024-09-12 | DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning | Kangyang Luo et.al. | 2409.07734 | null |
| 2024-09-12 | Cooperative Inference with Interleaved Operator Partitioning for CNNs | Zhibang Liu et.al. | 2409.07693 | null |
| 2024-09-11 | Token Turing Machines are Efficient Vision Models | Purvish Jajal et.al. | 2409.07613 | null |
| 2024-09-11 | Minimizing Embedding Distortion for Robust Out-of-Distribution Performance | Tom Shaked et.al. | 2409.07582 | null |
| 2024-09-11 | A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks | Erik B. Terres-Escudero et.al. | 2409.07387 | null |
| 2024-09-11 | Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding | Ronald Katende et.al. | 2409.07310 | null |
| 2024-09-11 | LLM-based feature generation from text for interpretable machine learning | Vojtěch Balek et.al. | 2409.07132 | null |
| 2024-09-11 | Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | Kangyang Luo et.al. | 2409.06955 | null |
| 2024-09-10 | Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm | Jinwei Zhao et.al. | 2409.06542 | null |
| 2024-09-10 | Seam Carving as Feature Pooling in CNN | Mohammad Imrul Jubair et.al. | 2409.06311 | null |
| 2024-09-10 | EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification | Suorong Yang et.al. | 2409.06290 | link |
| 2024-09-09 | A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets | Mariana Yukari Noguti et.al. | 2409.05972 | null |
| 2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
| 2024-09-09 | Adversarial Attacks on Data Attribution | Xinhe Wang et.al. | 2409.05657 | null |
| 2024-09-09 | Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition | Shiming Ge et.al. | 2409.05384 | null |
| 2024-09-09 | RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU | Chengyuan Liu et.al. | 2409.05275 | null |
| 2024-09-09 | Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space | Junho Lee et.al. | 2409.05260 | null |
| 2024-09-08 | PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels | Aayushman et.al. | 2409.04975 | link |
| 2024-09-07 | Activation Function Optimization Scheme for Image Classification | Abdur Rahman et.al. | 2409.04915 | null |
| 2024-09-07 | LoCa: Logit Calibration for Knowledge Distillation | Runming Yang et.al. | 2409.04778 | null |
| 2024-09-07 | Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis | Preetu Mehta et.al. | 2409.04734 | null |
| 2024-09-06 | Connectivity-Inspired Network for Context-Aware Recognition | Gianluca Carloni et.al. | 2409.04360 | null |
| 2024-09-06 | An optically accelerated extreme learning machine using hot atomic vapors | Pierre Azam et.al. | 2409.04312 | null |
| 2024-09-06 | PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Tianqi Wei et.al. | 2409.04038 | null |
| 2024-09-05 | Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning | Isaac Ray et.al. | 2409.03938 | null |
| 2024-09-05 | WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking | Carl De Sousa Trias et.al. | 2409.03902 | null |
| 2024-09-05 | On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models | Thanh-Dung Le et.al. | 2409.03901 | null |
| 2024-09-05 | Have Large Vision-Language Models Mastered Art History? | Ombretta Strafforello et.al. | 2409.03521 | null |
| 2024-09-05 | Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks | Akshay Jain et.al. | 2409.03458 | link |
| 2024-09-05 | Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications | Tong Bu et.al. | 2409.03368 | null |
| 2024-09-05 | PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning | Bowen Tian et.al. | 2409.03192 | null |
| 2024-09-05 | The AdEMAMix Optimizer: Better, Faster, Older | Matteo Pagliardini et.al. | 2409.03137 | null |
| 2024-09-04 | iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo et.al. | 2409.02838 | null |
| 2024-09-03 | MedUnA: Language guided Unsupervised Adaptation of Vision-Language Models for Medical Image Classification | Umaima Rahman et.al. | 2409.02729 | null |
| 2024-09-05 | OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation | Włodzimierz Lewoniewski et.al. | 2409.02649 | null |
| 2024-09-04 | Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization | Cho-Ying Wu et.al. | 2409.02486 | null |
| 2024-09-03 | Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems | Sanjita Prajapati et.al. | 2409.02278 | null |
| 2024-09-05 | Robust Clustering on High-Dimensional Data with Stochastic Quantization | Anton Kozyriev et.al. | 2409.02066 | link |
| 2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | null |
| 2024-09-03 | State-of-the-art Advances of Deep-learning Linguistic Steganalysis Research | Yihao Wang et.al. | 2409.01780 | null |
| 2024-09-03 | Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization | Avraham Chapman et.al. | 2409.01672 | null |
| 2024-09-03 | ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition | Shiting Xiao et.al. | 2409.01564 | null |
| 2024-08-30 | Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Francesca Grasso et.al. | 2408.17362 | link |
| 2024-08-30 | Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification | Zhiwei Zhang et.al. | 2408.17197 | null |
| 2024-08-30 | Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study | Shubham Agarwal et.al. | 2408.17181 | null |
| 2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
| 2024-08-30 | Generative Modeling Perspective for Control and Reasoning in Robotics | Takuma Yoneda et.al. | 2408.17041 | null |
| 2024-08-29 | Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector | Deepak Dagar et.al. | 2408.16892 | null |
| 2024-08-29 | SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection | Rohit Venkata Sai Dulam et.al. | 2408.16645 | null |
| 2024-08-29 | Android Malware Detection Based on RGB Images and Multi-feature Fusion | Zhiqiang Wang et.al. | 2408.16555 | null |
| 2024-08-29 | SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models | Guangxi Li et.al. | 2408.16273 | link |
| 2024-08-29 | Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation | Yanghao Wang et.al. | 2408.16266 | null |
| 2024-08-29 | Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification | Yu Liang et.al. | 2408.16265 | null |
| 2024-08-28 | EMP: Enhance Memory in Data Pruning | Jinying Xiao et.al. | 2408.16031 | null |
| 2024-08-28 | Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning | Bingchen Yan et.al. | 2408.15924 | null |
| 2024-08-28 | ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | Tiantian Feng et.al. | 2408.15803 | null |
| 2024-08-28 | Visual Prompt Engineering for Medical Vision Language Models in Radiology | Stefan Denner et.al. | 2408.15802 | null |
| 2024-08-28 | Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings | Lingyu Gao et.al. | 2408.15650 | null |
| 2024-08-27 | DCT-CryptoNets: Scaling Private Inference in the Frequency Domain | Arjun Roy et.al. | 2408.15231 | null |
| 2024-08-27 | A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships | Gracile Astlin Pereira et.al. | 2408.15178 | null |
| 2024-08-28 | AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection | Mykhailo Koshil et.al. | 2408.15113 | null |
| 2024-08-27 | Data downlink prioritization using image classification on-board a 6U CubeSat | Keenan A. A. Chatar et.al. | 2408.14865 | null |
| 2024-08-27 | Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification | Yiqiang Cai et.al. | 2408.14862 | null |
| 2024-08-27 | Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | Sirui Li et.al. | 2408.14770 | null |
| 2024-08-26 | On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise | M. Reza Eslami et.al. | 2408.14680 | null |
| 2024-08-26 | Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification | Mahrukh Awan et.al. | 2408.14441 | null |
| 2024-08-26 | Uncertainties of Latent Representations in Computer Vision | Michael Kirchhof et.al. | 2408.14281 | null |
| 2024-08-26 | MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification | Feng Gao et.al. | 2408.14255 | null |
| 2024-08-26 | Feature Aligning Few shot Learning Method Using Local Descriptors Weighted Rules | Bingchen Yan et.al. | 2408.14192 | null |
| 2024-08-26 | GenFormer – Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | Sven Oehri et.al. | 2408.14131 | null |
| 2024-08-25 | Few-Shot Histopathology Image Classification: Evaluating State-of-the-Art Methods and Unveiling Performance Insights | Ardhendu Sekhar et.al. | 2408.13816 | null |
| 2024-08-25 | On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective | Tal Alter et.al. | 2408.13809 | null |
| 2024-08-25 | Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion | Xu Zhang et.al. | 2408.13744 | link |
| 2024-08-25 | 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Haizhao Jing et.al. | 2408.13728 | null |
| 2024-08-24 | Enhanced Astronomical Source Classification with Integration of Attention Mechanisms and Vision Transformers | Srinadh Reddy Bhavanam et.al. | 2408.13634 | null |
| 2024-08-23 | Domain-specific long text classification from sparse relevant information | Célia D’Cruz et.al. | 2408.13253 | null |
| 2024-08-23 | EAViT: External Attention Vision Transformer for Audio Classification | Aquib Iqbal et.al. | 2408.13201 | null |
| 2024-08-23 | A gradient system based on anisotropic monochrome image processing with orientation auto-adjustment | Harbir Antil et.al. | 2408.12847 | null |
| 2024-08-23 | Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence | Purushothaman Natarajan et.al. | 2408.12837 | null |
| 2024-08-23 | VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models | Purushothaman Natarajan et.al. | 2408.12808 | null |
| 2024-08-23 | BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models | Yige Li et.al. | 2408.12798 | null |
| 2024-08-23 | Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling | Zongyao Lyu et.al. | 2408.12774 | null |
| 2024-08-23 | Symmetric masking strategy enhances the performance of Masked Image Modeling | Khanh-Binh Nguyen et.al. | 2408.12772 | null |
| 2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
| 2024-08-22 | The Russian-focused embedders’ exploration: ruMTEB benchmark and Russian embedding model design | Artem Snegirev et.al. | 2408.12503 | null |
| 2024-08-22 | Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification | Sudi Murindanyi et.al. | 2408.12426 | null |
| 2024-08-22 | AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network | Donghwa Kang et.al. | 2408.12293 | null |
| 2024-08-22 | Whole Slide Image Classification of Salivary Gland Tumours | John Charlton et.al. | 2408.12275 | null |
| 2024-08-22 | Query-Efficient Video Adversarial Attack with Stylized Logo | Duoxun Tang et.al. | 2408.12099 | null |
| 2024-08-21 | Approaching Deep Learning through the Spectral Dynamics of Weights | David Yunis et.al. | 2408.11804 | link |
| 2024-08-21 | SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Zhiqiang Wu et.al. | 2408.11760 | null |
| 2024-08-21 | Improving Calibration by Relating Focal Loss, Temperature Scaling, and Properness | Viacheslav Komisarenko et.al. | 2408.11598 | link |
| 2024-08-21 | MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Minghao Han et.al. | 2408.11505 | null |
| 2024-08-21 | Enabling Small Models for Zero-Shot Classification through Model Label Learning | Jia Zhang et.al. | 2408.11449 | null |
| 2024-08-21 | Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond | Minghao Liu et.al. | 2408.11338 | null |
| 2024-08-21 | Towards Evaluating Large Language Models on Sarcasm Understanding | Yazhou Zhang et.al. | 2408.11319 | null |
| 2024-08-20 | Privacy-preserving Universal Adversarial Defense for Black-box Models | Qiao Li et.al. | 2408.10647 | null |
| 2024-08-20 | A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping | Kevin Kam Fung Yuen et.al. | 2408.10572 | null |
| 2024-08-20 | NoMatterXAI: Generating “No Matter What” Alterfactual Examples for Explaining Black-Box Text Classification Models | Tuc Nguyen et.al. | 2408.10528 | null |
| 2024-08-20 | Cervical Cancer Detection Using Multi-Branch Deep Learning Model | Tatsuhiro Baba et.al. | 2408.10498 | null |
| 2024-08-19 | HaSPeR: An Image Repository for Hand Shadow Puppet Recognition | Syed Rifat Raiyan et.al. | 2408.10360 | link |
| 2024-08-19 | Leveraging Superfluous Information in Contrastive Representation Learning | Xuechu Yu et.al. | 2408.10292 | null |
| 2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
| 2024-08-19 | Towards Robust Federated Image Classification: An Empirical Study of Weight Selection Strategies in Manufacturing | Vinit Hegiste et.al. | 2408.10024 | null |
| 2024-08-19 | Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis | Kira Maag et.al. | 2408.10021 | null |
| 2024-08-19 | Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning | David Hanny et.al. | 2408.09914 | null |
| 2024-08-19 | Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions | Sebastian Heineking et.al. | 2408.09831 | null |
| 2024-08-19 | AutoML-guided Fusion of Entity and LLM-based representations | Boshko Koloski et.al. | 2408.09794 | null |
| 2024-08-19 | Dataset Distillation for Histopathology Image Classification | Cong Cong et.al. | 2408.09709 | null |
| 2024-08-19 | A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification | Claudio M. V. de Andrade et.al. | 2408.09629 | null |
| 2024-08-18 | Attention Is Not What You Need: Revisiting Multi-Instance Learning for Whole Slide Image Classification | Xin Liu et.al. | 2408.09449 | null |
| 2024-08-17 | Narrowing the Focus: Learned Optimizers for Pretrained Models | Gus Kristiansen et.al. | 2408.09310 | null |
| 2024-08-16 | DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models | Eman Ali et.al. | 2408.08855 | null |
| 2024-08-16 | LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | Mohamad Fares El Hajj Chehade et.al. | 2408.08824 | null |
| 2024-08-16 | Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification | Abdullah Al Imran et.al. | 2408.08803 | null |
| 2024-08-16 | Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers | Zihang Song et.al. | 2408.08794 | null |
| 2024-08-16 | Quantum convolutional neural networks for jet images classification | Hala Elhag et.al. | 2408.08701 | null |
| 2024-08-16 | MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation | Zunjie Xiao et.al. | 2408.08600 | null |
| 2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
| 2024-08-16 | Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness | Hefei Mei et.al. | 2408.08502 | link |
| 2024-08-15 | Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention | Zohaib Khan et.al. | 2408.08454 | null |
| 2024-08-15 | Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts | Abdur R. Fayjie et.al. | 2408.08432 | null |
| 2024-08-15 | SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training | Gengwei Zhang et.al. | 2408.08295 | link |
| 2024-08-15 | Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices | Tess Watt et.al. | 2408.08215 | null |
| 2024-08-15 | Towards flexible perception with visual memory | Robert Geirhos et.al. | 2408.08172 | null |
| 2024-08-15 | Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification | Jiexuan Yan et.al. | 2408.08125 | link |
| 2024-08-15 | HAIR: Hypernetworks-based All-in-One Image Restoration | Jin Cao et.al. | 2408.08091 | link |
| 2024-08-14 | Large Language Models Prompting With Episodic Memory | Dai Do et.al. | 2408.07465 | null |
| 2024-08-14 | Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | Raghavendra Singh et.al. | 2408.07243 | null |
| 2024-08-13 | Efficient Search for Customized Activation Functions with Gradient Descent | Lukas Strack et.al. | 2408.06820 | link |
| 2024-08-13 | Do Vision-Language Foundational models show Robust Visual Perception? | Shivam Chandhok et.al. | 2408.06781 | link |
| 2024-08-13 | Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model | Yongcheng Li et.al. | 2408.06716 | link |
| 2024-08-13 | Coherence Awareness in Diffractive Neural Networks | Matan Kleiner et.al. | 2408.06681 | null |
| 2024-08-12 | Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks | Lucas Félix et.al. | 2408.06341 | null |
| 2024-08-12 | Audio Enhancement for Computer Audition – An Iterative Training Paradigm Using Sample Importance | Manuel Milling et.al. | 2408.06264 | null |
| 2024-08-12 | Deep Learning System Boundary Testing through Latent Space Style Mixing | Amr Abdellatif et.al. | 2408.06258 | null |
| 2024-08-12 | Global-to-Local Support Spectrums for Language Model Explainability | Lucas Agussurja et.al. | 2408.05976 | null |
| 2024-08-12 | A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class | Qian Qiao et.al. | 2408.05953 | null |
| 2024-08-12 | Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information | Mingkun Zhang et.al. | 2408.05900 | null |
| 2024-08-11 | HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning | Zhijian Chen et.al. | 2408.05786 | null |
| 2024-08-11 | PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings | Vaibhav Ganatra et.al. | 2408.05754 | null |
| 2024-08-11 | Disposable-key-based image encryption for collaborative learning of Vision Transformer | Rei Aso et.al. | 2408.05737 | null |
| 2024-08-11 | A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation | Koushik Biswas et.al. | 2408.05692 | null |
| 2024-08-09 | A conformalized learning of a prediction set with applications to medical imaging classification | Roy Hirsch et.al. | 2408.05037 | null |
| 2024-08-09 | Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks | Verna Dankers et.al. | 2408.04965 | null |
| 2024-08-09 | LiD-FL: Towards List-Decodable Federated Learning | Hong Liu et.al. | 2408.04963 | null |
| 2024-08-09 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang et.al. | 2408.04961 | link |
| 2024-08-08 | Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes | Bhushan Atote et.al. | 2408.04606 | null |
| 2024-08-08 | SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals | Haoran Zheng et.al. | 2408.04575 | null |
| 2024-08-08 | An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification | Ben Crulis et.al. | 2408.04460 | null |
| 2024-08-08 | Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction | Yuchen Wang et.al. | 2408.04294 | null |
| 2024-08-07 | FMiFood: Multi-modal Contrastive Learning for Food Image Classification | Xinyue Pan et.al. | 2408.03922 | null |
| 2024-08-07 | Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning | Simret Araya Gebreegziabher et.al. | 2408.03819 | null |
| 2024-08-07 | Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification | Georgia Sovatzidi et.al. | 2408.03745 | null |
| 2024-08-07 | CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Tianfang Zhang et.al. | 2408.03703 | link |
| 2024-08-07 | Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Jaewook Lee et.al. | 2408.03663 | null |
| 2024-08-07 | Making Robust Generalizers Less Rigid with Soft Ascent-Descent | Matthew J. Holland et.al. | 2408.03619 | null |
| 2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | null |
| 2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
| 2024-08-06 | A Debiased Nearest Neighbors Framework for Multi-Label Text Classification | Zifeng Cheng et.al. | 2408.03202 | null |
| 2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
| 2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | null |
| 2024-08-06 | L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Elvys Linhares Pontes et.al. | 2408.03033 | null |
| 2024-08-06 | Adversarial Robustness of Open-source Text Classification Models and Fine-Tuning Chains | Hao Qin et.al. | 2408.02963 | null |
| 2024-08-06 | Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration | Xiaoqing Zhang et.al. | 2408.02906 | null |
| 2024-08-05 | Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space | Eduardo Sanchez-Karhunen et.al. | 2408.02838 | null |
| 2024-08-05 | Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services | Shaopeng Fu et.al. | 2408.02814 | null |
| 2024-08-05 | FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification | Yijin Huang et.al. | 2408.02426 | null |
| 2024-08-05 | On the Robustness of Malware Detectors to Adversarial Samples | Muhammad Salman et.al. | 2408.02310 | null |
| 2024-08-05 | Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Hojung Lee et.al. | 2408.02307 | null |
| 2024-08-05 | Network Fission Ensembles for Low-Cost Self-Ensembles | Hojung Lee et.al. | 2408.02301 | null |
| 2024-08-04 | VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Somnath Sendhil Kumar et.al. | 2408.02140 | link |
| 2024-08-04 | DeMansia: Mamba Never Forgets Any Tokens | Ricky Fang et.al. | 2408.01986 | null |
| 2024-08-06 | A Survey and Evaluation of Adversarial Attacks for Object Detection | Khoi Nguyen Tiet Nguyen et.al. | 2408.01934 | null |
| 2024-08-03 | Safe Semi-Supervised Contrastive Learning Using In-Distribution Data as Positive Examples | Min Gu Kwak et.al. | 2408.01872 | null |
| 2024-08-03 | LAM3D: Leveraging Attention for Monocular 3D Object Detection | Diana-Alexandra Sas et.al. | 2408.01739 | null |
| 2024-08-02 | Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder | Matan Atad et.al. | 2408.01571 | null |
| 2024-08-02 | Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01372 | link |
| 2024-08-02 | WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01231 | null |
| 2024-08-02 | Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2408.01224 | link |
| 2024-08-02 | Rethinking Pre-trained Feature Extractor Selection in Multiple Instance Learning for Whole Slide Image Classification | Bryan Wong et.al. | 2408.01167 | null |
| 2024-08-01 | CERT-ED: Certifiably Robust Text Classification for Edit Distance | Zhuoqun Huang et.al. | 2408.00728 | null |
| 2024-08-01 | Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | Xiaoyi Liu et.al. | 2408.00636 | null |
| 2024-08-01 | DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Rakshith Subramanyam et.al. | 2408.00331 | null |
| 2024-07-31 | Vera Verto: Multimodal Hijacking Attack | Minxing Zhang et.al. | 2408.00129 | null |
| 2024-07-31 | Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin et.al. | 2407.21757 | link |
| 2024-07-30 | Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation | Marcelo Matheus Gauy et.al. | 2407.20989 | null |
| 2024-07-30 | Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Adam Wojciechowski et.al. | 2407.20899 | null |
| 2024-08-01 | DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention | Wei Wang et.al. | 2407.20843 | null |
| 2024-08-01 | The Susceptibility of Example-Based Explainability Methods to Class Outliers | Ikhtiyor Nematov et.al. | 2407.20678 | null |
| 2024-07-30 | Knowledge Fused Recognition: Fusing Hierarchical Knowledge for Image Recognition through Quantitative Relativity Modeling and Deep Metric Learning | Yunfeng Zhao et.al. | 2407.20600 | null |
| 2024-07-30 | Exploring Liquid Neural Networks on Loihi-2 | Wiktoria Agata Pawlak et.al. | 2407.20590 | null |
| 2024-07-29 | Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation | Ashirbad Mishra et.al. | 2407.20462 | null |
| 2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
| 2024-07-29 | Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism | Tianhang Nan et.al. | 2407.19821 | null |
| 2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
| 2024-07-28 | Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Tianxiao Zhang et.al. | 2407.19394 | link |
| 2024-07-27 | Inference-Time Selective Debiasing | Gleb Kuzmin et.al. | 2407.19345 | null |
| 2024-07-27 | Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes | Chinedu Eleh et.al. | 2407.19297 | null |
| 2024-07-27 | Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation | Riyansha Singh et.al. | 2407.19265 | null |
| 2024-07-27 | A Survey of Malware Detection Using Deep Learning | Ahmed Bensaoud et.al. | 2407.19153 | null |
| 2024-07-26 | UniForensics: Face Forgery Detection via General Facial Representation | Ziyuan Fang et.al. | 2407.19079 | null |
| 2024-07-26 | A Scalable Quantum Non-local Neural Network for Image Classification | Sparsh Gupta et.al. | 2407.18906 | link |
| 2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
| 2024-07-26 | Local Binary Pattern(LBP) Optimization for Feature Extraction | Zeinab Sedaghatjoo et.al. | 2407.18665 | null |
| 2024-07-26 | Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Bo Wang et.al. | 2407.18625 | null |
| 2024-07-26 | Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification | Huiyan Bai et.al. | 2407.18593 | null |
| 2024-07-26 | VSSD: Vision Mamba with Non-Casual State Space Duality | Yuheng Shi et.al. | 2407.18559 | link |
| 2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
| 2024-07-25 | Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network | Sukwon Yun et.al. | 2407.17857 | link |
| 2024-07-25 | SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification | Heng Fang et.al. | 2407.17689 | link |
| 2024-07-26 | Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Qing Su et.al. | 2407.17671 | link |
| 2024-07-24 | Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference | Catherine Huang et.al. | 2407.17663 | null |
| 2024-07-23 | S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks | Neha A S et.al. | 2407.17587 | null |
| 2024-07-24 | A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks | Fabiano Belém et.al. | 2407.17284 | null |
| 2024-07-24 | Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? | Johannes Kiechle et.al. | 2407.17219 | link |
| 2024-07-24 | Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | Alessandro Sebastianelli et.al. | 2407.17108 | null |
| 2024-07-24 | An Adaptive Gradient Regularization Method | Huixiu Jiang et.al. | 2407.16944 | null |
| 2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | null |
| 2024-07-23 | Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging | Daniela L. Ramos et.al. | 2407.16608 | null |
| 2024-07-23 | Designing robust diffractive neural networks with improved transverse shift tolerance | Daniil V. Soshnikov et.al. | 2407.16456 | null |
| 2024-07-23 | Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks | Ayan Igali et.al. | 2407.16268 | null |
| 2024-07-23 | HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification | Shuyi Ouyang et.al. | 2407.16244 | null |
| 2024-07-23 | Improved Few-Shot Image Classification Through Multiple-Choice Questions | Dipika Khullar et.al. | 2407.16145 | null |
| 2024-07-22 | Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network | Zhen Wang et.al. | 2407.16021 | null |
| 2024-07-22 | AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations | Ikhtiyor Nematov et.al. | 2407.16010 | null |
| 2024-07-22 | Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Aayush Saxena et.al. | 2407.15904 | null |
| 2024-07-22 | Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning | Josiah Couch et.al. | 2407.15724 | null |
| 2024-07-22 | Retinomorphic Feature Detection and Machine Vision in a Network Laser | Wai Kit Ng et.al. | 2407.15558 | null |
| 2024-07-22 | Learning deep illumination-robust features from multispectral filter array images | Anis Amziane et.al. | 2407.15472 | null |
| 2024-07-22 | Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Junha Song et.al. | 2407.15383 | null |
| 2024-07-22 | FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification | Weiping Ding et.al. | 2407.15312 | null |
| 2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | null |
| 2024-07-21 | A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts | Gokcen Gokceoglu et.al. | 2407.15136 | null |
| 2024-07-20 | Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns | Christos Kyrkou et.al. | 2407.14831 | link |
| 2024-07-20 | Subgraph Clustering and Atom Learning for Improved Image Classification | Aryan Singh et.al. | 2407.14772 | null |
| 2024-07-20 | A Comprehensive Review of Few-shot Action Recognition | Yuyang Wanyan et.al. | 2407.14744 | null |
| 2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
| 2024-07-19 | Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models | Xuenan Xu et.al. | 2407.14355 | null |
| 2024-07-19 | EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Youssef Doulfoukar et.al. | 2407.14314 | null |
| 2024-07-18 | CoAPT: Context Attribute words for Prompt Tuning | Gun Lee et.al. | 2407.13808 | null |
| 2024-07-18 | GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model | Abdelrahman Shaker et.al. | 2407.13772 | link |
| 2024-07-18 | Addressing Imbalance for Class Incremental Learning in Medical Image Classification | Xuze Hao et.al. | 2407.13768 | null |
| 2024-07-18 | Differential Privacy Mechanisms in Neural Tangent Kernel Regression | Jiuxiang Gu et.al. | 2407.13621 | null |
| 2024-07-18 | CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data | Aristotelis Ballas et.al. | 2407.13421 | link |
| 2024-07-17 | LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner et.al. | 2407.12753 | null |
| 2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
| 2024-07-17 | Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? | Aman Sinha et.al. | 2407.12626 | null |
| 2024-07-18 | Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Antoni Kowalczuk et.al. | 2407.12588 | link |
| 2024-07-17 | Non-parametric regularization for class imbalance federated medical image classification | Jeffry Wicaksana et.al. | 2407.12446 | link |
| 2024-07-17 | FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification | Markus Weißflog et.al. | 2407.12375 | null |
| 2024-07-17 | Adaptive Cascading Network for Continual Test-Time Adaptation | Kien X. Nguyen et.al. | 2407.12240 | null |
| 2024-07-16 | Generalized Coverage for More Robust Low-Budget Active Learning | Wonho Bae et.al. | 2407.12212 | null |
| 2024-07-18 | A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification | Markus Marks et.al. | 2407.12210 | null |
| 2024-07-16 | Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces | Shumei Liu et.al. | 2407.11701 | null |
| 2024-07-16 | Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification | Naif Alkhunaizi et.al. | 2407.11573 | null |
| 2024-07-16 | TCFormer: Visual Recognition via Token Clustering Transformer | Wang Zeng et.al. | 2407.11321 | link |
| 2024-07-16 | PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Pierre-David Letourneau et.al. | 2407.11306 | null |
| 2024-07-15 | Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion | Philipp Allgeuer et.al. | 2407.11211 | null |
| 2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
| 2024-07-15 | Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | Linhao Qu et.al. | 2407.10814 | null |
| 2024-07-15 | Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain | Paweł Zyblewski et.al. | 2407.10807 | null |
| 2024-07-15 | Anticipating Future Object Compositions without Forgetting | Youssef Zahran et.al. | 2407.10723 | null |
| 2024-07-15 | GeoMix: Towards Geometry-Aware Data Augmentation | Wentao Zhao et.al. | 2407.10681 | link |
| 2024-07-15 | Learning Natural Consistency Representation for Face Forgery Video Detection | Daichi Zhang et.al. | 2407.10550 | null |
| 2024-07-15 | Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | Yifei Yang et.al. | 2407.10495 | null |
| 2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
| 2024-07-14 | Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia | Dimitris Papaioannou et.al. | 2407.10251 | null |
| 2024-07-14 | Advancing Continual Learning for Robust Deepfake Audio Classification | Feiyi Dong et.al. | 2407.10108 | null |
| 2024-07-12 | Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Levente Halmosi et.al. | 2407.09150 | link |
| 2024-07-12 | Open Vocabulary Multi-Label Video Classification | Rohit Gupta et.al. | 2407.09073 | null |
| 2024-07-12 | GPC: Generative and General Pathology Image Classifier | Anh Tien Nguyen et.al. | 2407.09035 | null |
| 2024-07-12 | CAMP: Continuous and Adaptive Learning Model in Pathology | Anh Tien Nguyen et.al. | 2407.09030 | null |
| 2024-07-12 | SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification | Tong Shu et.al. | 2407.08968 | null |
| 2024-07-12 | Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification | Ke Ji et.al. | 2407.08959 | null |
| 2024-07-11 | Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique | Jackson Hamel et.al. | 2407.08800 | null |
| 2024-07-11 | Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification | Wenshuo Peng et.al. | 2407.08787 | null |
| 2024-07-11 | ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions | Jiu Feng et.al. | 2407.08691 | link |
| 2024-07-11 | Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks | Andrey Ignatov et.al. | 2407.08625 | link |
| 2024-07-11 | BiasPruner: Debiased Continual Learning for Medical Image Classification | Nourhan Bayasi et.al. | 2407.08609 | link |
| 2024-07-11 | GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification | Aitao Yang et.al. | 2407.08255 | link |
| 2024-07-11 | Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis | Gerard Christopher Yeo et.al. | 2407.08182 | null |
| 2024-07-11 | Enrich the content of the image Using Context-Aware Copy Paste | Qiushi Guo et.al. | 2407.08151 | null |
| 2024-07-10 | MambaVision: A Hybrid Mamba-Transformer Vision Backbone | Ali Hatamizadeh et.al. | 2407.08083 | link |
| 2024-07-10 | The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others | Daniel Sikar et.al. | 2407.07818 | null |
| 2024-07-11 | Trainable Highly-expressive Activation Functions | Irit Chelly et.al. | 2407.07564 | null |
| 2024-07-10 | HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Omar S. EL-Assiouti et.al. | 2407.07516 | null |
| 2024-07-10 | Towards a text-based quantitative and explainable histopathology image analysis | Anh Tien Nguyen et.al. | 2407.07360 | null |
| 2024-07-11 | FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification | Doanh C. Bui et.al. | 2407.07340 | link |
| 2024-07-10 | Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu et.al. | 2407.07307 | link |
| 2024-07-09 | Exploring Camera Encoder Designs for Autonomous Driving Perception | Barath Lakshmanan et.al. | 2407.07276 | null |
| 2024-07-09 | CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion | Hosam S. EL-Assiouti et.al. | 2407.06673 | null |
| 2024-07-09 | NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification | Hongfei Huang et.al. | 2407.06579 | null |
| 2024-07-08 | Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches | Y. Cordero et.al. | 2407.06416 | null |
| 2024-07-08 | GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jon Crall et.al. | 2407.06337 | null |
| 2024-07-08 | Multi-Label Plant Species Classification with Self-Supervised Vision Transformers | Murilo Gustineli et.al. | 2407.06298 | link |
| 2024-07-08 | Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise | Bidur Khanal et.al. | 2407.05973 | null |
| 2024-07-08 | Wavelet Convolutions for Large Receptive Fields | Shahaf E. Finder et.al. | 2407.05848 | link |
| 2024-07-08 | Evaluating the Fairness of Neural Collapse in Medical Image Classification | Kaouther Mouheb et.al. | 2407.05843 | null |
| 2024-07-08 | Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification | Jiaying Shi et.al. | 2407.05647 | null |
| 2024-07-08 | New Directions in Text Classification Research: Maximizing The Performance of Sentiment Classification from Limited Data | Surya Agustian et.al. | 2407.05627 | null |
| 2024-07-08 | Momentum Auxiliary Network for Supervised Local Learning | Junhao Su et.al. | 2407.05623 | link |
| 2024-07-08 | Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li et.al. | 2407.05609 | link |
| 2024-07-08 | FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Jiedong Zhuang et.al. | 2407.05578 | null |
| 2024-07-08 | An accurate detection is not all you need to combat label noise in web-noisy datasets | Paul Albert et.al. | 2407.05528 | null |
| 2024-07-07 | Leveraging Topological Guidance for Improved Knowledge Distillation | Eun Som Jeon et.al. | 2407.05316 | link |
| 2024-07-05 | AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Yuhan Zhu et.al. | 2407.04603 | null |
| 2024-07-05 | AMD: Automatic Multi-step Distillation of Large-scale Vision Models | Cheng Han et.al. | 2407.04208 | null |
| 2024-07-04 | LeDNet: Localization-enabled Deep Neural Network for Multi-Label Radiography Image Classification | Lalit Pant et.al. | 2407.03931 | null |
| 2024-07-04 | DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification | Saifullah Saifullah et.al. | 2407.03830 | null |
| 2024-07-04 | reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis | Kai Norman Clasen et.al. | 2407.03653 | link |
| 2024-07-04 | Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota et.al. | 2407.03623 | null |
| 2024-07-04 | Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification | Xuerong Zhang et.al. | 2407.03596 | null |
| 2024-07-04 | DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Wenhui Zhu et.al. | 2407.03575 | link |
| 2024-07-03 | A multicategory jet image classification framework using deep neural network | Jairo Orozco Sandoval et.al. | 2407.03524 | null |
| 2024-07-03 | Model Guidance via Explanations Turns Image Classifiers into Segmentation Models | Xiaoyan Yu et.al. | 2407.03009 | null |
| 2024-07-03 | ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation | Yipin Guo et.al. | 2407.02881 | null |
| 2024-07-03 | Fine-Grained Scene Image Classification with Modality-Agnostic Adapter | Yiqun Wang et.al. | 2407.02769 | link |
| 2024-07-03 | ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers | Yanfeng Jiang et.al. | 2407.02763 | null |
| 2024-07-02 | Spectral Graph Reasoning Network for Hyperspectral Image Classification | Huiling Wang et.al. | 2407.02647 | null |
| 2024-07-01 | CGRclust: Chaos Game Representation for Twin Contrastive Clustering of Unlabelled DNA Sequences | Fatemeh Alipour et.al. | 2407.02538 | link |
| 2024-07-02 | Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts | Chunlan Ma et.al. | 2407.02320 | null |
| 2024-07-03 | Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis | Sufen Ren et.al. | 2407.02261 | null |
| 2024-07-02 | Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification | Shulei Qiu et.al. | 2407.02123 | null |
| 2024-07-01 | Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies | Sebastian A. Cruz Romero et.al. | 2407.01705 | null |
| 2024-07-02 | xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Tianrun Chen et.al. | 2407.01530 | link |
| 2024-07-01 | Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Balaji VS et.al. | 2407.01435 | null |
| 2024-07-01 | Semantic Compositions Enhance Vision-Language Contrastive Learning | Maxwell Aladago et.al. | 2407.01408 | null |
| 2024-07-01 | GalLoP: Learning Global and Local Prompts for Vision-Language Models | Marc Lafon et.al. | 2407.01400 | null |
| 2024-07-01 | Protecting Privacy in Classifiers by Token Manipulation | Re’em Harel et.al. | 2407.01334 | null |
| 2024-07-01 | Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks | Roberto Alcover-Couso et.al. | 2407.01327 | null |
| 2024-06-28 | Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes | Dmitry Demidov et.al. | 2406.19814 | link |
| 2024-06-27 | Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads | Ali Khaleghi Rahimian et.al. | 2406.19391 | link |
| 2024-06-27 | Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation | Yushun Tang et.al. | 2406.19341 | null |
| 2024-06-27 | Spiking Convolutional Neural Networks for Text Classification | Changze Lv et.al. | 2406.19230 | link |
| 2024-06-27 | Adaptive Stochastic Weight Averaging | Caglar Demir et.al. | 2406.19092 | link |
| 2024-06-27 | FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity | Zhaobin Sun et.al. | 2406.18995 | link |
| 2024-06-26 | Detecting Machine-Generated Texts: Not Just “AI vs Humans” and Explainability is Complicated | Jiazhou Ji et.al. | 2406.18259 | null |
| 2024-06-26 | ViT-1.58b: Mobile Vision Transformers in the 1-bit Era | Zhengqing Yuan et.al. | 2406.18051 | null |
| 2024-06-25 | Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation | Tushar Prasanna Swaminathan et.al. | 2406.17749 | link |
| 2024-06-25 | Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Arijit Sehanobish et.al. | 2406.17740 | null |
| 2024-06-25 | BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Zeinab Sherkatghanad et.al. | 2406.17640 | link |
| 2024-06-26 | Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Sedigheh Eslami et.al. | 2406.17639 | null |
| 2024-06-25 | Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels | Nicholas Pangakis et.al. | 2406.17633 | null |
| 2024-06-25 | Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Huiyao Chen et.al. | 2406.17534 | link |
| 2024-06-25 | TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | Joshua Niemeijer et.al. | 2406.17473 | null |
| 2024-06-25 | Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | Jintao Yan et.al. | 2406.17470 | null |
| 2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | null |
| 2024-06-25 | Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Peng Huang et.al. | 2406.17338 | null |
| 2024-06-24 | Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings | Andrea Posada et.al. | 2406.16611 | link |
| 2024-06-24 | Improving robustness to corruptions with multiplicative weight perturbations | Trung Trinh et.al. | 2406.16540 | null |
| 2024-06-24 | UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | Alvaro Lopez Pellicer et.al. | 2406.16501 | null |
| 2024-06-24 | Improving Quaternion Neural Networks with Quaternionic Activation Functions | Johannes Pöppelbaum et.al. | 2406.16481 | null |
| 2024-06-24 | Learning in Wilson-Cowan model for metapopulation | Raffaele Marino et.al. | 2406.16453 | link |
| 2024-06-24 | Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Sai Ganesh et.al. | 2406.16383 | null |
| 2024-06-24 | Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia et.al. | 2406.16293 | null |
| 2024-06-23 | Jacobian Descent for Multi-Objective Optimization | Pierre Quinton et.al. | 2406.16232 | null |
| 2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
| 2024-06-22 | PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | Alvaro Lopez Pellcier et.al. | 2406.15921 | null |
| 2024-06-21 | Retrieval Augmented Zero-Shot Text Classification | Tassallah Abdullahi et.al. | 2406.15241 | null |
| 2024-06-21 | DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Yingying Fang et.al. | 2406.15182 | null |
| 2024-06-21 | This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Kerol Djoumessi et.al. | 2406.15168 | link |
| 2024-06-21 | Hierarchical thematic classification of major conference proceedings | Arsentii Kuzmin et.al. | 2406.14983 | null |
| 2024-06-21 | Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Minjong Cheon et.al. | 2406.14916 | link |
| 2024-06-21 | MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning | Jiali Cheng et.al. | 2406.14796 | null |
| 2024-06-20 | Depth $F_1$ : Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability | Parker Seegmiller et.al. | 2406.14695 | null |
| 2024-06-20 | Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning | Niccolò Marini et.al. | 2406.14351 | null |
| 2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | null |
| 2024-06-20 | Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization | Tanapat Ratchatorn et.al. | 2406.14329 | null |
| 2024-06-20 | Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | Mohamed Fadhlallah Guerri et.al. | 2406.14120 | null |
| 2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
| 2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | null |
| 2024-06-20 | Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | Tim Tsz-Kit Lau et.al. | 2406.13936 | null |
| 2024-06-19 | WATT: Weight Average Test-Time Adaption of CLIP | David Osowiechi et.al. | 2406.13875 | link |
| 2024-06-19 | CNN Based Flank Predictor for Quadruped Animal Species | Vanessa Suessle et.al. | 2406.13588 | null |
| 2024-06-19 | Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations | Manjunath Mulimani et.al. | 2406.13386 | null |
| 2024-06-18 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | link |
| 2024-06-18 | Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Nikolas Koutsoubis et.al. | 2406.12815 | link |
| 2024-06-18 | Online Anchor-based Training for Image Classification Tasks | Maria Tzelepi et.al. | 2406.12662 | null |
| 2024-06-18 | Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation | Branislav Pecher et.al. | 2406.12471 | null |
| 2024-06-18 | GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Haoze Wu et.al. | 2406.12375 | null |
| 2024-06-18 | What Did I Do Wrong? Quantifying LLMs’ Sensitivity and Consistency to Prompt Engineering | Federico Errica et.al. | 2406.12334 | null |
| 2024-06-18 | Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification | Zehui Liao et.al. | 2406.12293 | null |
| 2024-06-18 | Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics | Hyojin Kim et.al. | 2406.12258 | null |
| 2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | null |
| 2024-06-17 | Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation | Hamidreza Rouzegar et.al. | 2406.12114 | link |
| 2024-06-17 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Lei Zhu et.al. | 2406.11837 | link |
| 2024-06-17 | PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification | Magdalena Trędowicz et.al. | 2406.11443 | null |
| 2024-06-17 | Cross-domain Open-world Discovery | Shuo Wen et.al. | 2406.11422 | link |
| 2024-06-17 | BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models | Xuefeng Hu et.al. | 2406.11309 | null |
| 2024-06-17 | An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers | Ashim Gupta et.al. | 2406.11307 | null |
| 2024-06-17 | Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng et.al. | 2406.11115 | null |
| 2024-06-16 | Fine-grained Classes and How to Find Them | Matej Grcić et.al. | 2406.11070 | link |
| 2024-06-16 | Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality | Liwei Che et.al. | 2406.11048 | null |
| 2024-06-16 | Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval | Rohan Chavan et.al. | 2406.11029 | link |
| 2024-06-16 | Universal Cross-Lingual Text Classification | Riya Savant et.al. | 2406.11028 | null |
| 2024-06-14 | UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner | Dongchao Yang et.al. | 2406.10056 | null |
| 2024-06-14 | Comparison of fine-tuning strategies for transfer learning in medical image classification | Ana Davila et.al. | 2406.10050 | null |
| 2024-06-14 | Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last | Guy Hacohen et.al. | 2406.09935 | null |
| 2024-06-13 | MirrorCheck: Efficient Adversarial Defense for Vision-Language Models | Samar Fares et.al. | 2406.09250 | null |
| 2024-06-13 | Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder et.al. | 2406.09206 | null |
| 2024-06-13 | Large-Scale Evaluation of Open-Set Image Classification Techniques | Halil Bisgin et.al. | 2406.09112 | link |
| 2024-06-13 | LaCoOT: Layer Collapse through Optimal Transport | Victor Quétu et.al. | 2406.08933 | null |
| 2024-06-13 | The Penalized Inverse Probability Measure for Conformal Classification | Paul Melki et.al. | 2406.08884 | null |
| 2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | link |
| 2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | null |
| 2024-06-12 | Fine-Tuned ‘Small’ LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification | Martin Juan José Bucher et.al. | 2406.08660 | null |
| 2024-06-12 | Intelligent Multi-View Test Time Augmentation | Efe Ozturk et.al. | 2406.08593 | null |
| 2024-06-12 | Transformation-Dependent Adversarial Attacks | Yaoteng Tan et.al. | 2406.08443 | null |
| 2024-06-12 | AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | Yitao Xu et.al. | 2406.08298 | null |
| 2024-06-12 | DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jordy Van Landeghem et.al. | 2406.08226 | null |
| 2024-06-12 | Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor | Yongjie Si et.al. | 2406.08122 | null |
| 2024-06-12 | Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network | Yanxiong Li et.al. | 2406.08119 | null |
| 2024-06-12 | A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Lixian Zhang et.al. | 2406.08079 | null |
| 2024-06-12 | Adversarial Evasion Attack Efficiency against Large Language Models | João Vitorino et.al. | 2406.08050 | null |
| 2024-06-12 | Accurate Explanation Model for Image Classifiers using Class Association Embedding | Ruitao Xie et.al. | 2406.07961 | link |
| 2024-06-12 | Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection | Jie Feng et.al. | 2406.07949 | null |
| 2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | link |
| 2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456 | link |
| 2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332 | null |
| 2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
| 2024-06-11 | EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels | Shuqi Zhu et.al. | 2406.07151 | link |
| 2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | null |
| 2024-06-11 | DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification | Jiamu Sheng et.al. | 2406.07050 | null |
| 2024-06-11 | Fairness-Aware Meta-Learning via Nash Bargaining | Yi Zeng et.al. | 2406.07029 | null |
| 2024-06-11 | Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models | Zhenyi Lu et.al. | 2406.07001 | link |
| 2024-06-11 | Scaling up masked audio encoder learning for general audio classification | Heinrich Dinkel et.al. | 2406.06992 | null |
| 2024-06-10 | Multi-Objective Neural Architecture Search for In-Memory Computing | Md Hasibul Amin et.al. | 2406.06746 | null |
| 2024-06-10 | Robust Latent Representation Tuning for Image-text Classification | Hao Sun et.al. | 2406.06048 | null |
| 2024-06-09 | Contrastive Learning from Synthetic Audio Doppelgangers | Manuel Cherep et.al. | 2406.05923 | null |
| 2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | link |
| 2024-06-09 | Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification | Yuxin Hong et.al. | 2406.05677 | null |
| 2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | link |
| 2024-06-08 | Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification | Yunhe Gao et.al. | 2406.05596 | null |
| 2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
| 2024-06-07 | A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification | Christian Giannetti et.al. | 2406.05096 | null |
| 2024-06-07 | Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations | Benjamin Fresz et.al. | 2406.05068 | link |
| 2024-06-07 | REP: Resource-Efficient Prompting for On-device Continual Learning | Sungho Jeon et.al. | 2406.04772 | null |
| 2024-06-07 | AICoderEval: Improving AI Domain Code Generation of Large Language Models | Yinghui Xia et.al. | 2406.04712 | null |
| 2024-06-07 | Cooperative Meta-Learning with Gradient Augmentation | Jongyun Shin et.al. | 2406.04639 | link |
| 2024-06-06 | OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference | Dujian Ding et.al. | 2406.04508 | null |
| 2024-06-06 | Can Language Models Use Forecasting Strategies? | Sarah Pratt et.al. | 2406.04446 | null |
| 2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | link |
| 2024-06-07 | BEADs: Bias Evaluation Across Domains | Shaina Raza et.al. | 2406.04220 | null |
| 2024-06-06 | What Do Language Models Learn in Context? The Structured Task Hypothesis | Jiaoda Li et.al. | 2406.04216 | null |
| 2024-06-06 | Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness | Lars Hillebrand et.al. | 2406.04156 | link |
| 2024-06-07 | ReDistill: Residual Encoded Distillation for Peak Memory Reduction | Fang Chen et.al. | 2406.03744 | null |
| 2024-06-06 | LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification | Chun Liu et.al. | 2406.03725 | link |
| 2024-06-05 | Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review | Sonia Bbouzidi et.al. | 2406.03478 | null |
| 2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | null |
| 2024-06-05 | Audio Mamba: Bidirectional State Space Model for Audio Representation Learning | Mehmet Hamza Erol et.al. | 2406.03344 | link |
| 2024-06-05 | FusionBench: A Comprehensive Benchmark of Deep Model Fusion | Anke Tang et.al. | 2406.03280 | null |
| 2024-06-05 | VWise: A novel benchmark for evaluating scene classification for vehicular applications | Pedro Azevedo et.al. | 2406.03273 | null |
| 2024-06-05 | Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Erik Landolsi et.al. | 2406.03146 | link |
| 2024-06-05 | Exploiting LMM-based knowledge for image classification tasks | Maria Tzelepi et.al. | 2406.03071 | null |
| 2024-06-04 | Randomized Geometric Algebra Methods for Convex Neural Networks | Yifei Wang et.al. | 2406.02806 | null |
| 2024-06-04 | DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | Chi-Jui Chang et.al. | 2406.02468 | null |
| 2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | link |
| 2024-06-04 | Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification | Kwokho Ng et.al. | 2406.02229 | null |
| 2024-06-03 | Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) | Zane Durante et.al. | 2406.01662 | link |
| 2024-06-03 | CoLa-DCE – Concept-guided Latent Diffusion Counterfactual Explanations | Franz Motzkus et.al. | 2406.01649 | null |
| 2024-06-03 | Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | Yuncong Zuo et.al. | 2406.01439 | null |
| 2024-06-03 | Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | Firas Khader et.al. | 2406.01314 | null |
| 2024-06-03 | Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE | Jiaxu Liu et.al. | 2406.01282 | null |
| 2024-06-04 | MultiMax: Sparse and Multi-Modal Attention Learning | Yuxuan Zhou et.al. | 2406.01189 | link |
| 2024-06-03 | Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling | Wrick Talukdar et.al. | 2406.01096 | null |
| 2024-05-31 | You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | Zhen Qin et.al. | 2405.21022 | null |
| 2024-05-31 | Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study | Pallavi Mitra et.al. | 2405.20876 | null |
| 2024-05-31 | Improving Generalization and Convergence by Enhancing Implicit Regularization | Mingze Wang et.al. | 2405.20763 | null |
| 2024-05-31 | Robust Stable Spiking Neural Networks | Jianhao Ding et.al. | 2405.20694 | null |
| 2024-05-31 | Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space | Yukai Zhang et.al. | 2405.20685 | null |
| 2024-05-31 | GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification | Hansang Lee et.al. | 2405.20650 | null |
| 2024-05-31 | ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos | Krishanu Maity et.al. | 2405.20628 | null |
| 2024-05-30 | Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation | Louis L. Chen et.al. | 2405.20531 | null |
| 2024-05-30 | DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Haoxing Chen et.al. | 2405.19707 | link |
| 2024-05-30 | A Novel Approach for Automated Design Information Mining from Issue Logs | Jiuang Zhao et.al. | 2405.19623 | null |
| 2024-05-29 | I Bet You Did Not Mean That: Testing Semantic Importance via Betting | Jacopo Teneggi et.al. | 2405.19146 | link |
| 2024-05-29 | Verifiably Robust Conformal Prediction | Linus Jeary et.al. | 2405.18942 | null |
| 2024-05-29 | Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks | Futa Waseda et.al. | 2405.18770 | null |
| 2024-05-29 | GIST: Greedy Independent Set Thresholding for Diverse Data Summarization | Matthew Fahrbach et.al. | 2405.18754 | null |
| 2024-05-29 | LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification | Renyi Qu et.al. | 2405.18672 | null |
| 2024-05-28 | Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap | Abrar Fahim et.al. | 2405.18570 | null |
| 2024-05-28 | Why are Visually-Grounded Language Models Bad at Image Classification? | Yuhui Zhang et.al. | 2405.18415 | link |
| 2024-05-28 | MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution | Wenzhuo Liu et.al. | 2405.18240 | null |
| 2024-05-28 | Confidence-aware multi-modality learning for eye disease screening | Ke Zou et.al. | 2405.18167 | link |
| 2024-05-28 | 4-bit Shampoo for Memory-Efficient Network Training | Sike Wang et.al. | 2405.18144 | null |
| 2024-05-28 | DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Shentong Mo et.al. | 2405.17995 | null |
| 2024-05-27 | WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average | Louis Fournier et.al. | 2405.17517 | null |
| 2024-05-27 | Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators | Yunian Pan et.al. | 2405.17370 | null |
| 2024-05-27 | On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao et.al. | 2405.17264 | null |
| 2024-05-27 | Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Shujun Yang et.al. | 2405.17110 | link |
| 2024-05-26 | Demystify Mamba in Vision: A Linear Attention Perspective | Dongchen Han et.al. | 2405.16605 | null |
| 2024-05-26 | AdaFisher: Adaptive Second Order Optimization via Fisher Information | Damien Martins Gomes et.al. | 2405.16397 | null |
| 2024-05-25 | ModelLock: Locking Your Model With a Spell | Yifeng Gao et.al. | 2405.16285 | null |
| 2024-05-25 | Accelerating Transformers with Spectrum-Preserving Token Merging | Hoai-Chau Tran et.al. | 2405.16148 | null |
| 2024-05-25 | Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | Mingli Zhu et.al. | 2405.16134 | null |
| 2024-05-24 | Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images | Yiran Luo et.al. | 2405.15961 | null |
| 2024-05-24 | A Neurosymbolic Framework for Bias Correction in CNNs | Parth Padalkar et.al. | 2405.15886 | null |
| 2024-05-24 | What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | Abdelrahman Abdelhamed et.al. | 2405.15668 | null |
| 2024-05-24 | Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning | Wenhan Chang et.al. | 2405.15662 | null |
| 2024-05-24 | Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables | James Hinns et.al. | 2405.15661 | null |
| 2024-05-24 | Harnessing Increased Client Participation with Cohort-Parallel Federated Learning | Akash Dhasade et.al. | 2405.15644 | null |
| 2024-05-24 | Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2405.15405 | null |
| 2024-05-24 | CLIP model is an Efficient Online Lifelong Learner | Leyuan Wang et.al. | 2405.15155 | null |
| 2024-05-24 | OptLLM: Optimal Assignment of Queries to Large Language Models | Yueyue Liu et.al. | 2405.15130 | null |
| 2024-05-23 | A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-time Adaptation for Vision-Language Models | Mario Döbler et.al. | 2405.14977 | link |
| 2024-05-23 | Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron | Can Cui1 et.al. | 2405.14851 | null |
| 2024-05-23 | Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property | Yuya Yoshikawa et.al. | 2405.14522 | null |
| 2024-05-23 | SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification | Zuoyong Li et.al. | 2405.14506 | null |
| 2024-05-23 | Scalable Visual State Space Model with Fractal Scanning | Lv Tang et.al. | 2405.14480 | null |
| 2024-05-23 | Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Daniel Kienzle et.al. | 2405.14467 | null |
| 2024-05-23 | Boosting Robustness by Clipping Gradients in Distributed Learning | Youssef Allouah et.al. | 2405.14432 | null |
| 2024-05-23 | Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators | Changze Lv et.al. | 2405.14362 | null |
| 2024-05-23 | Simple Hamiltonian dynamics is a powerful quantum processing resource | Akitada Sakurai et.al. | 2405.14245 | null |
| 2024-05-23 | ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks | T. Y. S. S Santosh et.al. | 2405.14211 | null |
| 2024-05-22 | Just rotate it! Uncertainty estimation in closed-source models via multiple queries | Konstantinos Pitas et.al. | 2405.13864 | null |
| 2024-05-21 | Decentralized Federated Learning Over Imperfect Communication Channels | Weicai Li et.al. | 2405.12894 | null |
| 2024-05-21 | Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting | Omar Hamed et.al. | 2405.12705 | null |
| 2024-05-21 | Exploration of Masked and Causal Language Modelling for Text Generation | Nicolo Micheletti et.al. | 2405.12630 | null |
| 2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
| 2024-05-20 | Alzheimer’s Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models | Nida Nasir et.al. | 2405.12126 | null |
| 2024-05-20 | Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Weilian Zhou et.al. | 2405.12003 | link |
| 2024-05-20 | A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers | Tom Roth et.al. | 2405.11904 | null |
| 2024-05-21 | A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Eduard Poesina et.al. | 2405.11877 | link |
| 2024-05-20 | SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Siavash Shams et.al. | 2405.11831 | link |
| 2024-05-20 | Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques | Siva Rajesh Kasa et.al. | 2405.11775 | null |
| 2024-05-19 | SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Jialong Guo et.al. | 2405.11582 | link |
| 2024-05-19 | Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Manan Shah et.al. | 2405.11574 | link |
| 2024-05-19 | An Invisible Backdoor Attack Based On Semantic Feature | Yangming Chen et.al. | 2405.11551 | null |
| 2024-05-19 | Verification technology for finger vein biometric | George Kumi Kyeremeh et.al. | 2405.11540 | null |
| 2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
| 2024-05-17 | Benchmarking Large Language Models on CFLUE – A Chinese Financial Language Understanding Evaluation Dataset | Jie Zhu et.al. | 2405.10542 | link |
| 2024-05-17 | Smart Expert System: Large Language Models as Text Classifiers | Zhiqiang Wang et.al. | 2405.10523 | link |
| 2024-05-16 | Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge | Florian Schmid et.al. | 2405.10018 | null |
| 2024-05-16 | ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset | Johannes Rückert et.al. | 2405.10004 | link |
| 2024-05-15 | Improving Label Error Detection and Elimination with Uncertainty Quantification | Johannes Jakubik et.al. | 2405.09602 | null |
| 2024-05-15 | Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck | Hongru Li et.al. | 2405.09514 | null |
| 2024-05-15 | Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy | Feng Wang et.al. | 2405.09014 | link |
| 2024-05-14 | The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Ziquan Liu et.al. | 2405.08886 | link |
| 2024-05-14 | Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling | Gregory Holste et.al. | 2405.08780 | null |
| 2024-05-14 | FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings | Nancy Hada et.al. | 2405.08776 | null |
| 2024-05-14 | The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | Carmela Calabrese et.al. | 2405.08695 | null |
| 2024-05-14 | Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis | Qingpeng Kong et.al. | 2405.08681 | link |
| 2024-05-14 | Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning | Alain Riou et.al. | 2405.08679 | null |
| 2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | null |
| 2024-05-13 | Who’s in and who’s out? A case study of multimodal CLIP-filtering in DataComp | Rachel Hong et.al. | 2405.08209 | link |
| 2024-05-14 | MambaOut: Do We Really Need Mamba for Vision? | Weihao Yu et.al. | 2405.07992 | link |
| 2024-05-13 | Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics | Haoyang Zheng et.al. | 2405.07839 | link |
| 2024-05-13 | Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent | Michael Kohler et.al. | 2405.07619 | null |
| 2024-05-13 | On-device Online Learning and Semantic Management of TinyML Systems | Haoyu Ren et.al. | 2405.07601 | null |
| 2024-05-13 | GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation | Andrey V. Galichin et.al. | 2405.07562 | null |
| 2024-05-13 | Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents | Juri Grosjean et.al. | 2405.07513 | null |
| 2024-05-13 | MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks | Haijiang Tian et.al. | 2405.07411 | null |
| 2024-05-12 | Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Fatema Tuj Johora Faria et.al. | 2405.07338 | null |
| 2024-05-12 | Differentiable Model Scaling using Differentiable Topk | Kai Liu et.al. | 2405.07194 | null |
| 2024-05-11 | A framework of text-dependent speaker verification for chinese numerical string corpus | Litong Zheng et.al. | 2405.07029 | null |
| 2024-05-10 | Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | Yaoqin Ye et.al. | 2405.06468 | null |
| 2024-05-10 | Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data | Rongyu Zhang et.al. | 2405.06413 | null |
| 2024-05-10 | SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora | Faisal Qarah et.al. | 2405.06239 | link |
| 2024-05-09 | Deep Multi-Task Learning for Malware Image Classification | Ahmed Bensaoud et.al. | 2405.05906 | null |
| 2024-05-09 | Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing | Matthew Squires et.al. | 2405.05795 | null |
| 2024-05-09 | CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks | Nick et.al. | 2405.05755 | null |
| 2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
| 2024-05-09 | End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base | Shuling Li et.al. | 2405.05738 | null |
| 2024-05-09 | Using Machine Translation to Augment Multilingual Classification | Adam King et.al. | 2405.05478 | null |
| 2024-05-08 | AFEN: Respiratory Disease Classification using Ensemble Learning | Rahul Nadkarni et.al. | 2405.05467 | null |
| 2024-05-08 | XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Peiqin Lin et.al. | 2405.05116 | link |
| 2024-05-08 | Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution | Shuo Shao et.al. | 2405.04825 | null |
| 2024-05-07 | Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification | Mukaffi Bin Moin et.al. | 2405.04610 | link |
| 2024-05-07 | Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs | Antonio Bikić et.al. | 2405.04386 | null |
| 2024-05-07 | Semi-Supervised Disease Classification based on Limited Medical Image Data | Yan Zhang et.al. | 2405.04295 | null |
| 2024-05-07 | DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects | Da Fu et.al. | 2405.04093 | null |
| 2024-05-07 | Feature Map Convergence Evaluation for Functional Module | Ludan Zhang et.al. | 2405.04041 | null |
| 2024-05-07 | VMambaCC: A Visual State Space Model for Crowd Counting | Hao-Yuan Ma et.al. | 2405.03978 | null |
| 2024-05-06 | On Adversarial Examples for Text Classification by Perturbing Latent Representations | Korn Sooksatra et.al. | 2405.03789 | null |
| 2024-05-06 | CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | Sankalp Sinha et.al. | 2405.03660 | null |
| 2024-05-06 | Deep Space Separable Distillation for Lightweight Acoustic Scene Classification | ShuQi Ye et.al. | 2405.03567 | null |
| 2024-05-06 | Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing | Han Liu et.al. | 2405.03565 | null |
| 2024-05-06 | A Lightweight Neural Architecture Search Model for Medical Image Classification | Lunchen Xie et.al. | 2405.03462 | null |
| 2024-05-06 | Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification | Matteo Bianchi et.al. | 2405.03301 | null |
| 2024-05-06 | TED: Accelerate Model Training by Internal Generalization | Jinying Xiao et.al. | 2405.03228 | null |
| 2024-05-06 | Advancing Multimodal Medical Capabilities of Gemini | Lin Yang et.al. | 2405.03162 | null |
| 2024-05-05 | A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) | Lingyao Li et.al. | 2405.03066 | null |
| 2024-05-05 | Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | Ziqi Gao et.al. | 2405.03003 | link |
| 2024-05-04 | MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi et.al. | 2405.02771 | null |
| 2024-05-03 | Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification | Siqi Yin et.al. | 2405.02155 | null |
| 2024-05-03 | The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification | Minh Duc Bui et.al. | 2405.02010 | null |
| 2024-05-03 | Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts | Felicia Riethmüller et.al. | 2405.01904 | null |
| 2024-05-02 | PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions | Xun Jiao et.al. | 2405.01741 | null |
| 2024-05-02 | Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey | Guoping Xu et.al. | 2405.01725 | link |
| 2024-05-02 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et.al. | 2405.01699 | null |
| 2024-05-02 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey | Rokas Gipiškis et.al. | 2405.01636 | null |
| 2024-05-02 | Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models | Nishad Singhi et.al. | 2405.01531 | null |
| 2024-05-03 | Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks | Mikkel Jordahn et.al. | 2405.01196 | null |
| 2024-05-02 | Uncertainty-aware self-training with expectation maximization basis transformation | Zijia Wang et.al. | 2405.01175 | null |
| 2024-05-02 | Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2405.01095 | null |
| 2024-05-02 | Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation | Tianyi Chen et.al. | 2405.01041 | null |
| 2024-05-02 | Benchmarking Representations for Speech, Music, and Acoustic Events | Moreno La Quatra et.al. | 2405.00934 | link |
| 2024-05-01 | Digital-analog quantum convolutional neural networks for image classification | Anton Simen et.al. | 2405.00548 | null |
| 2024-05-03 | BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine | Mingchen Li et.al. | 2405.00465 | null |
| 2024-05-01 | Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol | Konstantinos Apostolidis et.al. | 2405.00384 | null |
| 2024-05-01 | Data Augmentation Policy Search for Long-Term Forecasting | Liran Nochumsohn et.al. | 2405.00319 | null |
| 2024-04-30 | Let’s Focus: Focused Backdoor Attack against Federated Transfer Learning | Marco Arazzi et.al. | 2404.19420 | null |
| 2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
| 2024-04-30 | Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair | Jeonghoon Park et.al. | 2404.19250 | null |
| 2024-04-29 | Spectral-Spatial Mamba for Hyperspectral Image Classification | Lingbo Huang et.al. | 2404.18401 | null |
| 2024-04-28 | TextGram: Towards a better domain-adaptive pretraining | Sharayu Hiwarkhedkar et.al. | 2404.18228 | null |
| 2024-04-28 | L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Saloni Mittal et.al. | 2404.18216 | link |
| 2024-04-28 | S $^2$ Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification | Guanchun Wang et.al. | 2404.18213 | null |
| 2024-04-27 | Implicit Generative Prior for Bayesian Neural Networks | Yijia Liu et.al. | 2404.18008 | link |
| 2024-04-27 | Towards Privacy-Preserving Audio Classification Systems | Bhawana Chhaglani et.al. | 2404.18002 | null |
| 2024-04-27 | A Method of Moments Embedding Constraint and its Application to Semi-Supervised Learning | Michael Majurski et.al. | 2404.17978 | null |
| 2024-04-27 | Spatial, Temporal, and Geometric Fusion for Remote Sensing Images | Hessah Albanwan et.al. | 2404.17851 | null |
| 2024-04-27 | Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification | Chao Yi et.al. | 2404.17753 | link |
| 2024-04-26 | SPLICE – Streamlining Digital Pathology Image Processing | Areej Alsaafin et.al. | 2404.17704 | null |
| 2024-04-26 | SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes | Georgia Baltsou et.al. | 2404.17255 | null |
| 2024-04-25 | Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer | Jianyu Zheng et.al. | 2404.16627 | link |
| 2024-04-25 | IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks | Zitong Huang et.al. | 2404.16331 | null |
| 2024-04-25 | Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis | Akshatha Mohan et.al. | 2404.16268 | link |
| 2024-04-24 | MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models | Grace Guo et.al. | 2404.16174 | null |
| 2024-04-24 | MoDE: CLIP Data Experts via Clustering | Jiawei Ma et.al. | 2404.16030 | link |
| 2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
| 2024-04-24 | Vision Transformer-based Adversarial Domain Adaptation | Yahan Li et.al. | 2404.15817 | link |
| 2024-04-24 | Rethinking Model Prototyping through the MedMNIST+ Dataset Collection | Sebastian Doerrich et.al. | 2404.15786 | null |
| 2024-04-24 | Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning | Zuheng Kang et.al. | 2404.15704 | null |
| 2024-04-24 | Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification | Liang Qu et.al. | 2404.15585 | null |
| 2024-04-23 | An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | Yangchen Pan et.al. | 2404.15518 | null |
| 2024-04-23 | Deep multi-prototype capsule networks | Saeid Abbassi et.al. | 2404.15445 | null |
| 2024-04-23 | A review of deep learning-based information fusion techniques for multimodal medical image classification | Yihao Li et.al. | 2404.15022 | null |
| 2024-04-23 | Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case | Muhammad Asif Auyb et.al. | 2404.14977 | null |
| 2024-04-23 | Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14955 | link |
| 2024-04-23 | Pyramid Hierarchical Transformer for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14945 | link |
| 2024-04-23 | Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification | Muhammad Ahmad et.al. | 2404.14944 | link |
| 2024-04-23 | CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models | Teodor Chiaburu et.al. | 2404.14830 | link |
| 2024-04-22 | WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models | Ronald Xie et.al. | 2404.14567 | null |
| 2024-04-22 | CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective | Wencheng Zhu et.al. | 2404.14109 | null |
| 2024-04-21 | EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder | Hasanul Mahmud et.al. | 2404.13770 | null |
| 2024-04-21 | PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure | Feiqi Cao et.al. | 2404.13645 | link |
| 2024-04-21 | I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning | Songlin Dong et.al. | 2404.13576 | null |
| 2024-04-21 | IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models | Tao Feng et.al. | 2404.13504 | null |
| 2024-04-20 | Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing | Yuang Liu et.al. | 2404.13434 | null |
| 2024-04-20 | Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge | Khuyagbaatar Batsuren et.al. | 2404.13292 | link |
| 2024-04-20 | 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification | Shyam Varahagiri et.al. | 2404.13252 | link |
| 2024-04-19 | On-board classification of underwater images using hybrid classical-quantum CNN based method | Sreeraj Rajan Warrier et.al. | 2404.13130 | null |
| 2024-04-19 | Next Generation Loss Function for Image Classification | Shakhnaz Akhmedova et.al. | 2404.12948 | null |
| 2024-04-19 | A Hybrid Generative and Discriminative PointNet on Unordered Point Sets | Yang Ye et.al. | 2404.12925 | null |
| 2024-04-19 | Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment | Danqing Ma et.al. | 2404.12634 | null |
| 2024-04-18 | When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Asaf Yehudai et.al. | 2404.12365 | link |
| 2024-04-18 | Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Jin Gao et.al. | 2404.12210 | link |
| 2024-04-18 | Concept Induction using LLMs: a user experiment for assessment | Adrita Barua et.al. | 2404.11875 | null |
| 2024-04-17 | Pretraining Billion-scale Geospatial Foundational Models on Frontier | Aristeidis Tsaris et.al. | 2404.11706 | null |
| 2024-04-17 | AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Meng Jiang et.al. | 2404.11449 | null |
| 2024-04-17 | Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured | Hanlin Mo et.al. | 2404.11309 | null |
| 2024-04-17 | A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene | Wenbo Zhang et.al. | 2404.11249 | null |
| 2024-04-17 | A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation | Bin Zhang et.al. | 2404.11132 | null |
| 2024-04-17 | Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification | Pierre Lepagnol et.al. | 2404.11122 | null |
| 2024-04-18 | Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification | Mohammad Shiri et.al. | 2404.11052 | null |
| 2024-04-17 | InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification | Qi Han et.al. | 2404.11003 | link |
| 2024-04-16 | Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng et.al. | 2404.10877 | link |
| 2024-04-16 | Vocabulary-free Image Classification and Semantic Segmentation | Alessandro Conti et.al. | 2404.10864 | link |
| 2024-04-16 | Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification Tasks | Mohsen Hami et.al. | 2404.10664 | null |
| 2024-04-16 | Tree Bandits for Generative Bayes | Sean O’Hagan et.al. | 2404.10436 | null |
| 2024-04-16 | AudioProtoPNet: An interpretable deep learning model for bird sound classification | René Heinrich et.al. | 2404.10420 | null |
| 2024-04-16 | Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport | Eduardo Fernandes Montesuma et.al. | 2404.10261 | null |
| 2024-04-15 | Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection | Lisang Zhou et.al. | 2404.10026 | null |
| 2024-04-15 | Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models | Hyeonggeun Yun et.al. | 2404.09828 | null |
| 2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
| 2024-04-15 | Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model | Masahito Toba et.al. | 2404.09585 | null |
| 2024-04-14 | Breast Cancer Image Classification Method Based on Deep Transfer Learning | Weimin Wang et.al. | 2404.09226 | null |
| 2024-04-14 | Coreset Selection for Object Detection | Hojun Lee et.al. | 2404.09161 | null |
| 2024-04-13 | Exploring Explainability in Video Action Recognition | Avinab Saha et.al. | 2404.09067 | null |
| 2024-04-13 | Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification | Denis Huseljic et.al. | 2404.08981 | link |
| 2024-04-13 | PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification | Zhenwei Wang et.al. | 2404.08915 | null |
| 2024-04-12 | VertAttack: Taking advantage of Text Classifiers’ horizontal vision | Jonathan Rusert et.al. | 2404.08538 | null |
| 2024-04-12 | SpectralMamba: Efficient Mamba for Hyperspectral Image Classification | Jing Yao et.al. | 2404.08489 | null |
| 2024-04-12 | OTTER: Improving Zero-Shot Classification via Optimal Transport | Changho Shin et.al. | 2404.08461 | null |
| 2024-04-12 | A Survey of Neural Network Robustness Assessment in Image Recognition | Jie Wang et.al. | 2404.08285 | null |
| 2024-04-12 | Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example | MingXuan Xiao et.al. | 2404.08279 | null |
| 2024-04-11 | HGRN2: Gated Linear RNNs with State Expansion | Zhen Qin et.al. | 2404.07904 | link |
| 2024-04-11 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification | Ricardo Pereira et.al. | 2404.07739 | null |
| 2024-04-11 | Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification | Lucas Dedieu et.al. | 2404.07605 | link |
| 2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
| 2024-04-11 | Interactive Prompt Debugging with Sequence Salience | Ian Tenney et.al. | 2404.07498 | null |
| 2024-04-11 | Privacy preserving layer partitioning for Deep Neural Network models | Kishore Rajasekar et.al. | 2404.07437 | null |
| 2024-04-11 | CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models | Sheng Wang et.al. | 2404.07424 | null |
| 2024-04-11 | Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling | Sourajit Saha et.al. | 2404.07410 | null |
| 2024-04-10 | Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations | Ofir Shifman et.al. | 2404.07153 | null |
| 2024-04-10 | Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization | Michael Kohler et.al. | 2404.07128 | null |
| 2024-04-10 | Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach | Anam Hashmi et.al. | 2404.06941 | null |
| 2024-04-10 | Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark | Marina Ceccon et.al. | 2404.06859 | null |
| 2024-04-10 | Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution | Brandon Morgan et.al. | 2404.06679 | null |
| 2024-04-09 | Variational Stochastic Gradient Descent for Deep Neural Networks | Haotian Chen et.al. | 2404.06549 | link |
| 2024-04-09 | On adversarial training and the 1 Nearest Neighbor classifier | Amir Hagai et.al. | 2404.06313 | link |
| 2024-04-09 | Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models | David Kurzendörfer et.al. | 2404.06309 | link |
| 2024-04-09 | Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Ming-Kun Xie et.al. | 2404.06287 | null |
| 2024-04-09 | Quantum Circuit $C^*$ -algebra Net | Yuka Hashimoto et.al. | 2404.06218 | null |
| 2024-04-09 | VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection | Li-Ming Zhan et.al. | 2404.06217 | link |
| 2024-04-09 | Symmetry-guided gradient descent for quantum neural networks | Kaiming Bian et.al. | 2404.06108 | null |
| 2024-04-10 | Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures | Ching-Kai Lin et.al. | 2404.06080 | null |
| 2024-04-08 | Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images | Michael Deutges et.al. | 2404.05584 | null |
| 2024-04-08 | On the Convergence of Continual Learning with Adaptive Methods | Seungyub Han et.al. | 2404.05555 | null |
| 2024-04-08 | Multi-Task Learning for Features Extraction in Financial Annual Reports | Syrielle Montariol et.al. | 2404.05281 | link |
| 2024-04-08 | Allowing humans to interactively guide machines where to look does not always improve a human-AI team’s classification accuracy | Giang Nguyen et.al. | 2404.05238 | null |
| 2024-04-08 | iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection | Nan Zhou et.al. | 2404.05207 | null |
| 2024-04-08 | Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Roopkatha Dey et.al. | 2404.05159 | null |
| 2024-04-07 | PairAug: What Can Augmented Image-Text Pairs Do for Radiology? | Yutong Xie et.al. | 2404.04960 | link |
| 2024-04-07 | GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets | Dongjing Shan et.al. | 2404.04924 | null |
| 2024-04-06 | Focused Active Learning for Histopathological Image Classification | Arne Schmidt et.al. | 2404.04663 | null |
| 2024-04-06 | Trustless Audits without Revealing Data or Models | Suppakit Waiwitlikhit et.al. | 2404.04500 | null |
| 2024-04-05 | Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism | Trilokesh Ranjan Sarkar et.al. | 2404.04245 | null |
| 2024-04-05 | Noisy Label Processing for Classification: A Survey | Mengting Li et.al. | 2404.04159 | null |
| 2024-04-05 | Learning Correlation Structures for Vision Transformers | Manjin Kim et.al. | 2404.03924 | null |
| 2024-04-05 | LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification | Judy X Yang et.al. | 2404.03883 | null |
| 2024-04-04 | Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning | Spyridon Chavlis et.al. | 2404.03708 | null |
| 2024-04-05 | A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data | Iqra Bano et.al. | 2404.03493 | null |
| 2024-04-04 | Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks | Lei Zhang et.al. | 2404.03340 | null |
| 2024-04-04 | Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning | Andrei Semenov et.al. | 2404.03323 | link |
| 2024-04-04 | FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification | Xu Wang et.al. | 2404.03225 | null |
| 2024-04-03 | Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales | Lucas E. Resck et.al. | 2404.03098 | link |
| 2024-04-03 | Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds | Kamalika Chaudhuri et.al. | 2404.02866 | link |
| 2024-04-03 | FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Ziyang Wang et.al. | 2404.02772 | link |
| 2024-04-03 | Adversarial Attacks and Dimensionality in Text Classifiers | Nandish Chattopadhyay et.al. | 2404.02660 | null |
| 2024-04-04 | Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging | Keqiang Fan et.al. | 2404.02656 | null |
| 2024-04-03 | Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations | Emilio Villa-Cueva et.al. | 2404.02452 | link |
| 2024-04-03 | A Novel Approach to Breast Cancer Histopathological Image Classification Using Cross-Colour Space Feature Fusion and Quantum-Classical Stack Ensemble Method | Sambit Mallick et.al. | 2404.02447 | null |
| 2024-04-03 | Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data | Parth Patwa et.al. | 2404.02422 | null |
| 2024-04-02 | Smooth Deep Saliency | Rudolf Herdt et.al. | 2404.02282 | null |
| 2024-04-02 | Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models | Matthew Kowal et.al. | 2404.02233 | null |
| 2024-04-02 | ImageNot: A contrast with ImageNet preserves model rankings | Olawale Salaudeen et.al. | 2404.02112 | null |
| 2024-04-02 | Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows | Grace Guo et.al. | 2404.02081 | null |
| 2024-04-02 | Ukrainian Texts Classification: Exploration of Cross-lingual Knowledge Transfer Approaches | Daryna Dementieva et.al. | 2404.02043 | null |
| 2024-04-02 | CAM-Based Methods Can See through Walls | Magamed Taimeskhanov et.al. | 2404.01964 | link |
| 2024-04-02 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | null |
| 2024-04-02 | A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification | Quanwei Liu et.al. | 2404.01673 | null |
| 2024-04-01 | Can Biases in ImageNet Models Explain Generalization? | Paul Gavrikov et.al. | 2404.01509 | link |
| 2024-04-01 | Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification | Zuyu Xu et.al. | 2404.01359 | null |
| 2024-04-01 | Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Boran Han et.al. | 2404.01260 | link |
| 2024-04-01 | Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models | Amir Faghihi et.al. | 2404.01160 | null |
| 2024-03-29 | Learn “No” to Say “Yes” Better: Improving Vision-Language Models via Negations | Jaisidh Singh et.al. | 2403.20312 | link |
| 2024-03-29 | MCNet: A crowd denstity estimation network based on integrating multiscale attention module | Qiang Guo et.al. | 2403.20173 | null |
| 2024-03-29 | Segmentation, Classification and Interpretation of Breast Cancer Medical Images using Human-in-the-Loop Machine Learning | David Vázquez-Lema et.al. | 2403.20112 | null |
| 2024-03-29 | Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion | Juhwan Choi et.al. | 2403.20015 | link |
| 2024-03-29 | Diverse Feature Learning by Self-distillation and Reset | Sejik Park et.al. | 2403.19941 | null |
| 2024-03-29 | Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification | Jianfeng Cai et.al. | 2403.19902 | link |
| 2024-03-28 | X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization | Anna Kukleva et.al. | 2403.19811 | link |
| 2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
| 2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
| 2024-03-28 | The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation | Ozgu Goksu et.al. | 2403.19579 | null |
| 2024-03-28 | Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach | Wei Dong et.al. | 2403.19067 | link |
| 2024-03-27 | Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data | Yuting Guo et.al. | 2403.19031 | null |
| 2024-03-27 | Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning | Soumyendu Sarkar et.al. | 2403.18985 | null |
| 2024-03-27 | The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision | Andreas Müller et.al. | 2403.18587 | link |
| 2024-03-27 | Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks | Tian Ye et.al. | 2403.18318 | null |
| 2024-03-27 | Multi-scale Unified Network for Image Classification | Wenzhuo Liu et.al. | 2403.18294 | null |
| 2024-03-26 | The Need for Speed: Pruning Transformers with One Recipe | Samir Khaki et.al. | 2403.17921 | link |
| 2024-03-26 | Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation | Carlos Gomes et.al. | 2403.17886 | null |
| 2024-03-26 | PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Chenhongyi Yang et.al. | 2403.17695 | link |
| 2024-03-26 | Language Models for Text Classification: Is In-Context Learning Enough? | Aleksandra Edwards et.al. | 2403.17661 | null |
| 2024-03-26 | Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image Classification | Eva Pachetti et.al. | 2403.17530 | null |
| 2024-03-26 | HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification | He Zhu et.al. | 2403.17307 | link |
| 2024-03-25 | Histogram Layers for Neural Engineered Features | Joshua Peeples et.al. | 2403.17176 | link |
| 2024-03-25 | Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships | Rangel Daroya et.al. | 2403.17173 | link |
| 2024-03-25 | CipherFormer: Efficient Transformer Private Inference with Low Round Complexity | Weize Wang et.al. | 2403.16860 | null |
| 2024-03-25 | Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer | Dominik Müller et.al. | 2403.16695 | null |
| 2024-03-25 | DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks | Dominik Müller et.al. | 2403.16678 | link |
| 2024-03-25 | LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification | Liu Junhua et.al. | 2403.16504 | null |
| 2024-03-24 | On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition | Igor Sokolov et.al. | 2403.16230 | null |
| 2024-03-24 | Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis | Shaojie Li et.al. | 2403.16212 | null |
| 2024-03-24 | Multi-Task Learning with Multi-Task Optimization | Lu Bai et.al. | 2403.16162 | null |
| 2024-03-24 | CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data | Shreya Sharma et.al. | 2403.15974 | link |
| 2024-03-23 | A Deep Learning Architectures for Kidney Disease Classification | Muhammad Shoaib Farooq et.al. | 2403.15895 | null |
| 2024-03-23 | VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding | Phong Nguyen-Thuan Do et.al. | 2403.15882 | null |
| 2024-03-23 | VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Lanfeng Zhong et.al. | 2403.15836 | null |
| 2024-03-22 | Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Sofia Casarin et.al. | 2403.15194 | null |
| 2024-03-22 | Image Classification with Rotation-Invariant Variational Quantum Circuits | Paul San Sebastian et.al. | 2403.15031 | null |
| 2024-03-22 | Extracting Human Attention through Crowdsourced Patch Labeling | Minsuk Chang et.al. | 2403.15013 | null |
| 2024-03-22 | Clean-image Backdoor Attacks | Dazhong Rong et.al. | 2403.15010 | null |
| 2024-03-22 | ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding | Novendra Setyawan et.al. | 2403.15004 | null |
| 2024-03-22 | MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection | Sadiya Sayara Chowdhury Puspo et.al. | 2403.14989 | null |
| 2024-03-21 | Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention | Ethan N. Evans et.al. | 2403.14753 | null |
| 2024-03-21 | Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images | Tom Burgert et.al. | 2403.14547 | null |
| 2024-03-21 | Multi-Level Explanations for Generative Language Models | Lucas Monteiro Paes et.al. | 2403.14459 | link |
| 2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
| 2024-03-21 | LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | Masato Fujitake et.al. | 2403.14252 | null |
| 2024-03-21 | Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations | Xun Lin et.al. | 2403.14250 | null |
| 2024-03-21 | Improving Image Classification Accuracy through Complementary Intra-Class and Inter-Class Mixup | Ye Xu et.al. | 2403.14137 | link |
| 2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797 | null |
| 2024-03-20 | Leveraging feature communication in federated learning for remote sensing image classification | Anh-Kiet Duong et.al. | 2403.13575 | null |
| 2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | link |
| 2024-03-20 | Building Optimal Neural Architectures using Interpretable Knowledge | Keith G. Mills et.al. | 2403.13293 | link |
| 2024-03-19 | LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images | Jing Zhang et.al. | 2403.13171 | null |
| 2024-03-19 | Improved EATFormer: A Vision Transformer for Medical Image Classification | Yulong Shisu et.al. | 2403.13167 | null |
| 2024-03-19 | SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification | Yuexi Du et.al. | 2403.13148 | link |
| 2024-03-19 | Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs | Raphael Norman-Tenazas et.al. | 2403.13105 | null |
| 2024-03-19 | Investigating Text Shortening Strategy in BERT: Truncation vs Summarization | Mirza Alim Mutasodirin et.al. | 2403.12799 | link |
| 2024-03-18 | Posterior Uncertainty Quantification in Neural Networks using Data Augmentation | Luhuan Wu et.al. | 2403.12729 | link |
| 2024-03-19 | SEVEN: Pruning Transformer Model by Reserving Sentinels | Jinying Xiao et.al. | 2403.12688 | link |
| 2024-03-19 | Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service | Mirza Alim Mutasodirin et.al. | 2403.12563 | null |
| 2024-03-19 | Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification | Yi Lin et.al. | 2403.12537 | null |
| 2024-03-19 | CrossTune: Black-Box Few-Shot Classification with Label Enhancement | Danqing Luo et.al. | 2403.12468 | null |
| 2024-03-18 | Generalizing deep learning models for medical image classification | Matta Sarah et.al. | 2403.12167 | null |
| 2024-03-19 | Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | K. P. Santoso et.al. | 2403.12009 | null |
| 2024-03-18 | High-energy physics image classification: A Survey of Jet Applications | Hamza Kheddar et.al. | 2403.11934 | null |
| 2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675 | null |
| 2024-03-18 | Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2403.11530 | link |
| 2024-03-18 | Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting | Mingkui Tan et.al. | 2403.11491 | null |
| 2024-03-17 | Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability | Haiyang Shi et.al. | 2403.11331 | null |
| 2024-03-17 | A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | Hetvi Waghela et.al. | 2403.11297 | null |
| 2024-03-17 | Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation | Silvia Corbara et.al. | 2403.11265 | null |
| 2024-03-17 | Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification | Shahabedin Nabavi et.al. | 2403.11226 | null |
| 2024-03-16 | Forward Learning of Graph Neural Networks | Namyong Park et.al. | 2403.11004 | link |
| 2024-03-16 | Understanding Robustness of Visual State Space Models for Image Classification | Chengbin Du et.al. | 2403.10935 | null |
| 2024-03-16 | Automatic location detection based on deep learning | Anjali Karangiya et.al. | 2403.10912 | null |
| 2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635 | link |
| 2024-03-14 | XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization | Yequan Bie et.al. | 2403.09410 | null |
| 2024-03-14 | ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization | Aleksandr Matsun et.al. | 2403.09400 | null |
| 2024-03-14 | A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification | Sheng-Yao Wu et.al. | 2403.09318 | null |
| 2024-03-14 | CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification | Yiming Ma et.al. | 2403.09281 | null |
| 2024-03-14 | Are Vision Language Models Texture or Shape Biased and Can We Steer Them? | Paul Gavrikov et.al. | 2403.09193 | link |
| 2024-03-14 | Randomized Principal Component Analysis for Hyperspectral Image Classification | Mustafa Ustuner et.al. | 2403.09117 | null |
| 2024-03-14 | CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification | Hyunkyung Han et.al. | 2403.09108 | link |
| 2024-03-14 | The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Qinyu Zhao et.al. | 2403.09037 | link |
| 2024-03-13 | PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning | Qifeng Zhou et.al. | 2403.08967 | null |
| 2024-03-13 | DAM: Dynamic Adapter Merging for Continual Video QA Learning | Feng Cheng et.al. | 2403.08755 | link |
| 2024-03-13 | Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification | Yuxing Han et.al. | 2403.08580 | null |
| 2024-03-13 | HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers | Francesco Dibitonto et.al. | 2403.08536 | link |
| 2024-03-13 | Pig aggression classification using CNN, Transformers and Recurrent Networks | Junior Silva Souza et.al. | 2403.08528 | null |
| 2024-03-13 | Reduced Jeffries-Matusita distance: A Novel Loss Function to Improve Generalization Performance of Deep Classification Models | Mohammad Lashkari et.al. | 2403.08408 | null |
| 2024-03-13 | Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification | Shuhan Li et.al. | 2403.08407 | null |
| 2024-03-13 | Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks | Khondoker Murad Hossain et.al. | 2403.08208 | null |
| 2024-03-13 | Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks | Fuzhi Wu et.al. | 2403.08157 | link |
| 2024-03-12 | Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection | Tharindu Kumarage et.al. | 2403.08035 | null |
| 2024-03-13 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721 | link |
| 2024-03-12 | FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification | Yijin Huang et.al. | 2403.07576 | null |
| 2024-03-12 | Backdoor Attack with Mode Mixture Latent Modification | Hongwei Zhang et.al. | 2403.07463 | null |
| 2024-03-12 | In-context learning enables multimodal large language models to classify cancer pathology images | Dyke Ferber et.al. | 2403.07407 | null |
| 2024-03-12 | Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning | Mark D. McDonnell et.al. | 2403.07356 | null |
| 2024-03-12 | How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance | Hongkang Li et.al. | 2403.07310 | null |
| 2024-03-12 | A Bayesian Approach to OOD Robustness in Image Classification | Prakhar Kaushik et.al. | 2403.07277 | link |
| 2024-03-11 | LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations | Mohammad Alkhalefi et.al. | 2403.06813 | null |
| 2024-03-11 | Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification | Shuai Li et.al. | 2403.06798 | null |
| 2024-03-11 | Leveraging Internal Representations of Model for Magnetic Image Classification | Adarsh N L et.al. | 2403.06797 | null |
| 2024-03-11 | Shortcut Learning in Medical Image Segmentation | Manxi Lin et.al. | 2403.06748 | null |
| 2024-03-11 | Active Generation for Image Classification | Tao Huang et.al. | 2403.06517 | null |
| 2024-03-11 | Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu et.al. | 2403.06414 | null |
| 2024-03-11 | ‘One size doesn’t fit all’: Learning how many Examples to use for In-Context Learning for Improved Text Classification | Manish Chandra et.al. | 2403.06402 | null |
| 2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
| 2024-03-10 | Bayesian Random Semantic Data Augmentation for Medical Image Classification | Yaoyao Zhu et.al. | 2403.06138 | link |
| 2024-03-10 | Universal Debiased Editing for Fair Medical Image Classification | Ruinan Jin et.al. | 2403.06104 | null |
| 2024-03-08 | Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets | Lorenzo Brigato et.al. | 2403.05532 | null |
| 2024-03-08 | Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation | Yu Han et.al. | 2403.05388 | null |
| 2024-03-08 | The Impact of Quantization on the Robustness of Transformer-based Text Classifiers | Seyed Parsa Neshaei et.al. | 2403.05365 | null |
| 2024-03-08 | Multiple Instance Learning with random sampling for Whole Slide Image Classification | H. Keshvarikhojasteh et.al. | 2403.05351 | null |
| 2024-03-08 | Learning Expressive And Generalizable Motion Features For Face Forgery Detection | Jingyi Zhang et.al. | 2403.05172 | null |
| 2024-03-08 | Defending Against Unforeseen Failure Modes with Latent Adversarial Training | Stephen Casper et.al. | 2403.05030 | link |
| 2024-03-07 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
| 2024-03-07 | T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers | Mariano V. Ntrougkas et.al. | 2403.04523 | link |
| 2024-03-07 | Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging | Dovile Juodelyte et.al. | 2403.04484 | link |
| 2024-03-07 | Advancing Biomedical Text Mining with Community Challenges | Hui Zong et.al. | 2403.04261 | null |
| 2024-03-07 | Scalable On-Chip Optical Linear Processing Unit Using a Single Thin-Film Lithium Niobate Ring Modulator | Zhaoang Deng et.al. | 2403.04216 | null |
| 2024-03-07 | Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models | Evelyn Mannix et.al. | 2403.04125 | null |
| 2024-03-07 | Privacy-preserving Fine-tuning of Large Language Models through Flatness | Tiejin Chen et.al. | 2403.04124 | null |
| 2024-03-06 | MedMamba: Vision Mamba for Medical Image Classification | Yubiao Yue et.al. | 2403.03849 | link |
| 2024-03-06 | On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder | Tingxu Han et.al. | 2403.03846 | link |
| 2024-03-06 | RADIA – Radio Advertisement Detection with Intelligent Analytics | Jorge Álvarez et.al. | 2403.03538 | null |
| 2024-03-06 | Inverse-Free Fast Natural Gradient Descent Method for Deep Learning | Xinwei Ou et.al. | 2403.03473 | null |
| 2024-03-06 | Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN | Biswadeep Chakraborty et.al. | 2403.03409 | null |
| 2024-03-05 | RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules | Miaomiao Li et.al. | 2403.02932 | link |
| 2024-03-05 | Demonstrating Mutual Reinforcement Effect through Information Flow | Chengguang Gan et.al. | 2403.02902 | null |
| 2024-03-05 | Quantum Mixed-State Self-Attention Network | Fu Chen et.al. | 2403.02871 | null |
| 2024-03-05 | SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix | Gayathri C et.al. | 2403.02833 | null |
| 2024-03-05 | SGD with Partial Hessian for Deep Neural Networks Optimization | Ying Sun et.al. | 2403.02681 | link |
| 2024-03-05 | G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth | Juan Zou et.al. | 2403.02667 | null |
| 2024-03-05 | Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad | Sayantan Choudhury et.al. | 2403.02648 | link |
| 2024-03-05 | Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use | Imad Eddine Toubal et.al. | 2403.02626 | null |
| 2024-03-04 | When do Convolutional Neural Networks Stop Learning? | Sahan Ahmad et.al. | 2403.02473 | link |
| 2024-03-04 | NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function | Abdullah Nazhat Abdullah et.al. | 2403.02411 | link |
| 2024-03-02 | Can a Confident Prior Replace a Cold Posterior? | Martin Marek et.al. | 2403.01272 | link |
| 2024-03-02 | Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery | Pedro H. V. Valois et.al. | 2403.01183 | null |
| 2024-03-02 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu et.al. | 2403.01156 | null |
| 2024-03-02 | ELA: Efficient Local Attention for Deep Convolutional Neural Networks | Wei Xu et.al. | 2403.01123 | null |
| 2024-03-01 | Margin Discrepancy-based Adversarial Training for Multi-Domain Text Classification | Yuan Wu et.al. | 2403.00888 | null |
| 2024-03-01 | Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment | Margherita Martorana et.al. | 2403.00884 | null |
| 2024-03-01 | SURE: SUrvey REcipes for building reliable and robust deep networks | Yuting Li et.al. | 2403.00543 | link |
| 2024-03-01 | Invariant Test-Time Adaptation for Vision-Language Model Generalization | Huan Ma et.al. | 2403.00376 | null |
| 2024-02-29 | TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision | Yunyi Zhang et.al. | 2403.00165 | null |
| 2024-02-29 | Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance | Huakun Shen et.al. | 2402.19401 | null |
| 2024-02-29 | Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification | Delfina Sol Martinez Pandiani et.al. | 2402.19339 | null |
| 2024-02-29 | Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Hao Li et.al. | 2402.19326 | null |
| 2024-02-29 | Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation | Fahimeh Hosseini Noohdani et.al. | 2402.18919 | null |
| 2024-02-29 | Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification | Zihan Wang et.al. | 2402.18825 | link |
| 2024-02-28 | Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance | Indu Panigrahi et.al. | 2402.18742 | link |
| 2024-02-28 | Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains | Hafiz Tiomoko Ali et.al. | 2402.18614 | null |
| 2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
| 2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
| 2024-02-29 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
| 2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | Santosh Thoduka et.al. | 2402.18319 | null |
| 2024-02-28 | Classes Are Not Equal: An Empirical Study on Image Recognition Fairness | Jiequan Cui et.al. | 2402.18133 | null |
| 2024-02-27 | Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Yiwei Lu et.al. | 2402.17710 | null |
| 2024-02-27 | SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Mohammed Q. Alkhatib et.al. | 2402.17672 | link |
| 2024-02-27 | **Predict the Next Word: |
Evgenia Ilia et.al. | 2402.17527 | null |
| 2024-02-27 | Scaling Supervised Local Learning with Augmented Auxiliary Networks | Chenxiang Ma et.al. | 2402.17318 | link |
| 2024-02-26 | Offline Writer Identification Using Convolutional Neural Network Activation Features | Vincent Christlein et.al. | 2402.17029 | null |
(<a href=../README.md>back to main</a>)