Classification - 2024-06
| Publish Date | Title | Authors | arXiv ID | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2024-06-28 | Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes | Dmitry Demidov et.al. | 2406.19814 | translate | read | link |
| 2024-06-27 | Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads | Ali Khaleghi Rahimian et.al. | 2406.19391 | translate | read | link |
| 2024-06-27 | Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation | Yushun Tang et.al. | 2406.19341 | translate | read | null |
| 2024-06-27 | Spiking Convolutional Neural Networks for Text Classification | Changze Lv et.al. | 2406.19230 | translate | read | link |
| 2024-06-27 | Adaptive Stochastic Weight Averaging | Caglar Demir et.al. | 2406.19092 | translate | read | link |
| 2024-06-27 | FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity | Zhaobin Sun et.al. | 2406.18995 | translate | read | link |
| 2024-06-26 | Detecting Machine-Generated Texts: Not Just “AI vs Humans” and Explainability is Complicated | Jiazhou Ji et.al. | 2406.18259 | translate | read | null |
| 2024-06-26 | ViT-1.58b: Mobile Vision Transformers in the 1-bit Era | Zhengqing Yuan et.al. | 2406.18051 | translate | read | null |
| 2024-06-25 | Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation | Tushar Prasanna Swaminathan et.al. | 2406.17749 | translate | read | link |
| 2024-06-25 | Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Arijit Sehanobish et.al. | 2406.17740 | translate | read | null |
| 2024-06-25 | BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Zeinab Sherkatghanad et.al. | 2406.17640 | translate | read | link |
| 2024-06-26 | Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Sedigheh Eslami et.al. | 2406.17639 | translate | read | null |
| 2024-06-25 | Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels | Nicholas Pangakis et.al. | 2406.17633 | translate | read | null |
| 2024-06-25 | Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Huiyao Chen et.al. | 2406.17534 | translate | read | link |
| 2024-06-25 | TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | Joshua Niemeijer et.al. | 2406.17473 | translate | read | null |
| 2024-06-25 | Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | Jintao Yan et.al. | 2406.17470 | translate | read | null |
| 2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | translate | read | null |
| 2024-06-25 | Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Peng Huang et.al. | 2406.17338 | translate | read | null |
| 2024-06-24 | Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings | Andrea Posada et.al. | 2406.16611 | translate | read | link |
| 2024-06-24 | Improving robustness to corruptions with multiplicative weight perturbations | Trung Trinh et.al. | 2406.16540 | translate | read | null |
| 2024-06-24 | UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | Alvaro Lopez Pellicer et.al. | 2406.16501 | translate | read | null |
| 2024-06-24 | Improving Quaternion Neural Networks with Quaternionic Activation Functions | Johannes Pöppelbaum et.al. | 2406.16481 | translate | read | null |
| 2024-06-24 | Learning in Wilson-Cowan model for metapopulation | Raffaele Marino et.al. | 2406.16453 | translate | read | link |
| 2024-06-24 | Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Sai Ganesh et.al. | 2406.16383 | translate | read | null |
| 2024-06-24 | Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia et.al. | 2406.16293 | translate | read | null |
| 2024-06-23 | Jacobian Descent for Multi-Objective Optimization | Pierre Quinton et.al. | 2406.16232 | translate | read | null |
| 2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | translate | read | null |
| 2024-06-22 | PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | Alvaro Lopez Pellicer et.al. | 2406.15921 | translate | read | null |
| 2024-06-21 | Retrieval Augmented Zero-Shot Text Classification | Tassallah Abdullahi et.al. | 2406.15241 | translate | read | null |
| 2024-06-21 | DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Yingying Fang et.al. | 2406.15182 | translate | read | null |
| 2024-06-21 | This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Kerol Djoumessi et.al. | 2406.15168 | translate | read | link |
| 2024-06-21 | Hierarchical thematic classification of major conference proceedings | Arsentii Kuzmin et.al. | 2406.14983 | translate | read | null |
| 2024-06-21 | Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Minjong Cheon et.al. | 2406.14916 | translate | read | link |
| 2024-06-21 | MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning | Jiali Cheng et.al. | 2406.14796 | translate | read | null |
| 2024-06-20 | Depth $F_1$: Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability | Parker Seegmiller et.al. | 2406.14695 | translate | read | null |
| 2024-06-20 | Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning | Niccolò Marini et.al. | 2406.14351 | translate | read | null |
| 2024-06-20 | Self-supervised Interpretable Concept-based Models for Text Classification | Francesco De Santis et.al. | 2406.14335 | translate | read | null |
| 2024-06-20 | Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization | Tanapat Ratchatorn et.al. | 2406.14329 | translate | read | null |
| 2024-06-20 | Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | Mohamed Fadhlallah Guerri et.al. | 2406.14120 | translate | read | null |
| 2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | translate | read | link |
| 2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | translate | read | null |
| 2024-06-20 | Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | Tim Tsz-Kit Lau et.al. | 2406.13936 | translate | read | null |
| 2024-06-19 | WATT: Weight Average Test-Time Adaption of CLIP | David Osowiechi et.al. | 2406.13875 | translate | read | link |
| 2024-06-19 | CNN Based Flank Predictor for Quadruped Animal Species | Vanessa Suessle et.al. | 2406.13588 | translate | read | null |
| 2024-06-19 | Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations | Manjunath Mulimani et.al. | 2406.13386 | translate | read | null |
| 2024-06-18 | LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Jinuk Kim et.al. | 2406.12837 | translate | read | link |
| 2024-06-18 | Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Nikolas Koutsoubis et.al. | 2406.12815 | translate | read | link |
| 2024-06-18 | Online Anchor-based Training for Image Classification Tasks | Maria Tzelepi et.al. | 2406.12662 | translate | read | null |
| 2024-06-18 | Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation | Branislav Pecher et.al. | 2406.12471 | translate | read | null |
| 2024-06-18 | GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Haoze Wu et.al. | 2406.12375 | translate | read | null |
| 2024-06-18 | What Did I Do Wrong? Quantifying LLMs’ Sensitivity and Consistency to Prompt Engineering | Federico Errica et.al. | 2406.12334 | translate | read | null |
| 2024-06-18 | Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification | Zehui Liao et.al. | 2406.12293 | translate | read | null |
| 2024-06-18 | Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics | Hyojin Kim et.al. | 2406.12258 | translate | read | null |
| 2024-06-19 | MiSuRe is all you need to explain your image segmentation | Syed Nouman Hasany et.al. | 2406.12173 | translate | read | null |
| 2024-06-17 | Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation | Hamidreza Rouzegar et.al. | 2406.12114 | translate | read | link |
| 2024-06-17 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Lei Zhu et.al. | 2406.11837 | translate | read | link |
| 2024-06-17 | PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification | Magdalena Trędowicz et.al. | 2406.11443 | translate | read | null |
| 2024-06-17 | Cross-domain Open-world Discovery | Shuo Wen et.al. | 2406.11422 | translate | read | link |
| 2024-06-17 | BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models | Xuefeng Hu et.al. | 2406.11309 | translate | read | null |
| 2024-06-17 | An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers | Ashim Gupta et.al. | 2406.11307 | translate | read | null |
| 2024-06-17 | Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng et.al. | 2406.11115 | translate | read | null |
| 2024-06-16 | Fine-grained Classes and How to Find Them | Matej Grcić et.al. | 2406.11070 | translate | read | link |
| 2024-06-16 | Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality | Liwei Che et.al. | 2406.11048 | translate | read | null |
| 2024-06-16 | Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval | Rohan Chavan et.al. | 2406.11029 | translate | read | link |
| 2024-06-16 | Universal Cross-Lingual Text Classification | Riya Savant et.al. | 2406.11028 | translate | read | null |
| 2024-06-14 | UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner | Dongchao Yang et.al. | 2406.10056 | translate | read | null |
| 2024-06-14 | Comparison of fine-tuning strategies for transfer learning in medical image classification | Ana Davila et.al. | 2406.10050 | translate | read | null |
| 2024-06-14 | Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last | Guy Hacohen et.al. | 2406.09935 | translate | read | null |
| 2024-06-13 | MirrorCheck: Efficient Adversarial Defense for Vision-Language Models | Samar Fares et.al. | 2406.09250 | translate | read | null |
| 2024-06-13 | Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder et.al. | 2406.09206 | translate | read | null |
| 2024-06-13 | Large-Scale Evaluation of Open-Set Image Classification Techniques | Halil Bisgin et.al. | 2406.09112 | translate | read | link |
| 2024-06-13 | LaCoOT: Layer Collapse through Optimal Transport | Victor Quétu et.al. | 2406.08933 | translate | read | null |
| 2024-06-13 | The Penalized Inverse Probability Measure for Conformal Classification | Paul Melki et.al. | 2406.08884 | translate | read | null |
| 2024-06-13 | Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Maor Dikter et.al. | 2406.08840 | translate | read | link |
| 2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | translate | read | null |
| 2024-06-12 | Fine-Tuned ‘Small’ LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification | Martin Juan José Bucher et.al. | 2406.08660 | translate | read | null |
| 2024-06-12 | Intelligent Multi-View Test Time Augmentation | Efe Ozturk et.al. | 2406.08593 | translate | read | null |
| 2024-06-12 | Transformation-Dependent Adversarial Attacks | Yaoteng Tan et.al. | 2406.08443 | translate | read | null |
| 2024-06-12 | AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | Yitao Xu et.al. | 2406.08298 | translate | read | null |
| 2024-06-12 | DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jordy Van Landeghem et.al. | 2406.08226 | translate | read | null |
| 2024-06-12 | Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor | Yongjie Si et.al. | 2406.08122 | translate | read | null |
| 2024-06-12 | Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network | Yanxiong Li et.al. | 2406.08119 | translate | read | null |
| 2024-06-12 | A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Lixian Zhang et.al. | 2406.08079 | translate | read | null |
| 2024-06-12 | Adversarial Evasion Attack Efficiency against Large Language Models | João Vitorino et.al. | 2406.08050 | translate | read | null |
| 2024-06-12 | Accurate Explanation Model for Image Classifiers using Class Association Embedding | Ruitao Xie et.al. | 2406.07961 | translate | read | link |
| 2024-06-12 | Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection | Jie Feng et.al. | 2406.07949 | translate | read | null |
| 2024-06-12 | Small Scale Data-Free Knowledge Distillation | He Liu et.al. | 2406.07876 | translate | read | link |
| 2024-06-11 | fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions | Alireza Afzal Aghaei et.al. | 2406.07456 | translate | read | link |
| 2024-06-11 | Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Challapalli Phanindra Revanth et.al. | 2406.07332 | translate | read | null |
| 2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | translate | read | null |
| 2024-06-11 | EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels | Shuqi Zhu et.al. | 2406.07151 | translate | read | link |
| 2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | translate | read | null |
| 2024-06-11 | DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification | Jiamu Sheng et.al. | 2406.07050 | translate | read | null |
| 2024-06-11 | Fairness-Aware Meta-Learning via Nash Bargaining | Yi Zeng et.al. | 2406.07029 | translate | read | null |
| 2024-06-11 | Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models | Zhenyi Lu et.al. | 2406.07001 | translate | read | link |
| 2024-06-11 | Scaling up masked audio encoder learning for general audio classification | Heinrich Dinkel et.al. | 2406.06992 | translate | read | null |
| 2024-06-10 | Multi-Objective Neural Architecture Search for In-Memory Computing | Md Hasibul Amin et.al. | 2406.06746 | translate | read | null |
| 2024-06-10 | Robust Latent Representation Tuning for Image-text Classification | Hao Sun et.al. | 2406.06048 | translate | read | null |
| 2024-06-09 | Contrastive Learning from Synthetic Audio Doppelgangers | Manuel Cherep et.al. | 2406.05923 | translate | read | null |
| 2024-06-09 | Scaling Graph Convolutions for Mobile Vision | William Avery et.al. | 2406.05850 | translate | read | link |
| 2024-06-09 | Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification | Yuxin Hong et.al. | 2406.05677 | translate | read | null |
| 2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | translate | read | link |
| 2024-06-08 | Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification | Yunhe Gao et.al. | 2406.05596 | translate | read | null |
| 2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | translate | read | link |
| 2024-06-07 | A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification | Christian Giannetti et.al. | 2406.05096 | translate | read | null |
| 2024-06-07 | Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations | Benjamin Fresz et.al. | 2406.05068 | translate | read | link |
| 2024-06-07 | REP: Resource-Efficient Prompting for On-device Continual Learning | Sungho Jeon et.al. | 2406.04772 | translate | read | null |
| 2024-06-07 | AICoderEval: Improving AI Domain Code Generation of Large Language Models | Yinghui Xia et.al. | 2406.04712 | translate | read | null |
| 2024-06-07 | Cooperative Meta-Learning with Gradient Augmentation | Jongyun Shin et.al. | 2406.04639 | translate | read | link |
| 2024-06-06 | OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference | Dujian Ding et.al. | 2406.04508 | translate | read | null |
| 2024-06-06 | Can Language Models Use Forecasting Strategies? | Sarah Pratt et.al. | 2406.04446 | translate | read | null |
| 2024-06-06 | Parameter-Inverted Image Pyramid Networks | Xizhou Zhu et.al. | 2406.04330 | translate | read | link |
| 2024-06-07 | BEADs: Bias Evaluation Across Domains | Shaina Raza et.al. | 2406.04220 | translate | read | null |
| 2024-06-06 | What Do Language Models Learn in Context? The Structured Task Hypothesis | Jiaoda Li et.al. | 2406.04216 | translate | read | null |
| 2024-06-06 | Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness | Lars Hillebrand et.al. | 2406.04156 | translate | read | link |
| 2024-06-07 | ReDistill: Residual Encoded Distillation for Peak Memory Reduction | Fang Chen et.al. | 2406.03744 | translate | read | null |
| 2024-06-06 | LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification | Chun Liu et.al. | 2406.03725 | translate | read | link |
| 2024-06-05 | Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review | Sonia Bbouzidi et.al. | 2406.03478 | translate | read | null |
| 2024-06-05 | IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models | David Ifeoluwa Adelani et.al. | 2406.03368 | translate | read | null |
| 2024-06-05 | Audio Mamba: Bidirectional State Space Model for Audio Representation Learning | Mehmet Hamza Erol et.al. | 2406.03344 | translate | read | link |
| 2024-06-05 | FusionBench: A Comprehensive Benchmark of Deep Model Fusion | Anke Tang et.al. | 2406.03280 | translate | read | null |
| 2024-06-05 | VWise: A novel benchmark for evaluating scene classification for vehicular applications | Pedro Azevedo et.al. | 2406.03273 | translate | read | null |
| 2024-06-05 | Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Erik Landolsi et.al. | 2406.03146 | translate | read | link |
| 2024-06-05 | Exploiting LMM-based knowledge for image classification tasks | Maria Tzelepi et.al. | 2406.03071 | translate | read | null |
| 2024-06-04 | Randomized Geometric Algebra Methods for Convex Neural Networks | Yifei Wang et.al. | 2406.02806 | translate | read | null |
| 2024-06-04 | DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | Chi-Jui Chang et.al. | 2406.02468 | translate | read | null |
| 2024-06-04 | GrootVL: Tree Topology is All You Need in State Space Model | Yicheng Xiao et.al. | 2406.02395 | translate | read | link |
| 2024-06-04 | Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification | Kwokho Ng et.al. | 2406.02229 | translate | read | null |
| 2024-06-03 | Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) | Zane Durante et.al. | 2406.01662 | translate | read | link |
| 2024-06-03 | CoLa-DCE – Concept-guided Latent Diffusion Counterfactual Explanations | Franz Motzkus et.al. | 2406.01649 | translate | read | null |
| 2024-06-03 | Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | Yuncong Zuo et.al. | 2406.01439 | translate | read | null |
| 2024-06-03 | Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | Firas Khader et.al. | 2406.01314 | translate | read | null |
| 2024-06-03 | Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE | Jiaxu Liu et.al. | 2406.01282 | translate | read | null |
| 2024-06-04 | MultiMax: Sparse and Multi-Modal Attention Learning | Yuxuan Zhou et.al. | 2406.01189 | translate | read | link |
| 2024-06-03 | Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling | Wrick Talukdar et.al. | 2406.01096 | translate | read | null |
(<a href="../Classification.md">back to Classification</a>)