Transfer Learning - 2026-03
Transfer Learning - 2026-03
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2026-03-31 | ContextClaim: A Context-Driven Paradigm for Verifiable Claim Detection | Yufeng Li et.al. | 2603.30025 | translate | read | null |
| 2026-03-31 | SurgTEMP: Temporal-Aware Surgical Video Question Answering with Text-guided Visual Memory for Laparoscopic Cholecystectomy | Shi Li et.al. | 2603.29962 | translate | read | null |
| 2026-03-31 | ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules | Jonas Landsgesell et.al. | 2603.29928 | translate | read | null |
| 2026-03-31 | Task Scarcity and Label Leakage in Relational Transfer Learning | Francisco Galuppo Azevedo et.al. | 2603.29914 | translate | read | null |
| 2026-03-31 | FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish | Daban Q. Jaff et.al. | 2603.29892 | translate | read | null |
| 2026-03-31 | Curvature-Guided LoRA: Steering in the pretrained NTK subspace | Frédéric Zheng et.al. | 2603.29824 | translate | read | null |
| 2026-03-31 | ENEIDE: A High Quality Silver Standard Dataset for Named Entity Recognition and Linking in Historical Italian | Cristian Santini et.al. | 2603.29801 | translate | read | null |
| 2026-03-31 | Training-Free Dynamic Upcycling of Expert Language Models | Eros Fanì et.al. | 2603.29765 | translate | read | null |
| 2026-03-31 | One-for-All: A Lightweight Stabilized and Parameter-Efficient Pre-trained LLM for Time Series Forecasting | Prasanjit Dey et.al. | 2603.29756 | translate | read | null |
| 2026-03-31 | Drift-Aware Continual Tokenization for Generative Recommendation | Yuebo Feng et.al. | 2603.29705 | translate | read | null |
| 2026-03-31 | FED-Bench: A Cross-Granular Benchmark for Disentangled Evaluation of Facial Expression Editing | Fengjian Xue et.al. | 2603.29697 | translate | read | null |
| 2026-03-31 | CoRe-DA: Contrastive Regression for Unsupervised Domain Adaptation in Surgical Skill Assessment | Dimitrios Anastasiou et.al. | 2603.29666 | translate | read | null |
| 2026-03-31 | A review on the use of complex networks in science education research | Paula Tuzón et.al. | 2603.29663 | translate | read | null |
| 2026-03-31 | STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer | Andrea DeMarco et.al. | 2603.29660 | translate | read | null |
| 2026-03-31 | 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management | Jiao Chen et.al. | 2603.29656 | translate | read | null |
| 2026-03-31 | Self-Supervised Federated Learning under Data Heterogeneity for Label-Scarce Diatom Classification | Mingkun Tan et.al. | 2603.29633 | translate | read | null |
| 2026-03-31 | BigEarthNet.txt: A Large-Scale Multi-Sensor Image-Text Dataset and Benchmark for Earth Observation | Johann-Ludwig Herzog et.al. | 2603.29630 | translate | read | null |
| 2026-03-31 | FlowID : Enhancing Forensic Identification with Latent Flow-Matching Models | Jules Ripoll et.al. | 2603.29591 | translate | read | null |
| 2026-03-31 | Transfer Learning for Moderate-Dimensional Ridge-Regularized Robust Linear Regression | Lingfeng Lyu et.al. | 2603.29575 | translate | read | null |
| 2026-03-31 | Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics | Alain Vázquez et.al. | 2603.29518 | translate | read | null |
| 2026-03-31 | MemFactory: Unified Inference & Training Framework for Agent Memory | Ziliang Guo et.al. | 2603.29493 | translate | read | null |
| 2026-03-31 | Calibrated Confidence Expression for Radiology Report Generation | David Bani-Harouni et.al. | 2603.29492 | translate | read | null |
| 2026-03-31 | 30-meter Land Surface Temperature from Landsat via Progressive Self-Training Downscaling | Huanfeng Shen et.al. | 2603.29478 | translate | read | null |
| 2026-03-31 | Survival In-Context: Prior-fitted In-context Learning Tabular Foundation Model for Survival Analysis | Dmitrii Seletkov et.al. | 2603.29475 | translate | read | null |
| 2026-03-31 | An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms | Nils Grünefeld et.al. | 2603.29466 | translate | read | null |
| 2026-03-31 | Few-shot Writer Adaptation via Multimodal In-Context Learning | Tom Simon et.al. | 2603.29450 | translate | read | null |
| 2026-03-31 | AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models | Yubo Cui et.al. | 2603.29410 | translate | read | null |
| 2026-03-31 | CIPHER: Counterfeit Image Pattern High-level Examination via Representation | Kyeonghun Kim et.al. | 2603.29356 | translate | read | null |
| 2026-03-31 | Open Machine Translation for Esperanto | Ona de Gibert et.al. | 2603.29345 | translate | read | null |
| 2026-03-31 | Mexican Burrowing Toads as gravitational wave detectors | Frederic V. Hessman et.al. | 2603.29334 | translate | read | null |
| 2026-03-31 | HSFM: Hard-Set-Guided Feature-Space Meta-Learning for Robust Classification under Spurious Correlations | Aryan Yazdan Parast et.al. | 2603.29313 | translate | read | null |
| 2026-03-31 | Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus | Huan Zhang et.al. | 2603.29292 | translate | read | null |
| 2026-03-31 | PRISM: A Multi-View Multi-Capability Retail Video Dataset for Embodied Vision-Language Models | Amirreza Rouhi et.al. | 2603.29281 | translate | read | null |
| 2026-03-31 | From Physics to Surrogate Intelligence: A Unified Electro-Thermo-Optimization Framework for TSV Networks | Mohamed Gharib et.al. | 2603.29268 | translate | read | null |
| 2026-03-31 | Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding | Jingqi Xu et.al. | 2603.29258 | translate | read | null |
| 2026-03-31 | Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs | Zhuowen Liang et.al. | 2603.29232 | translate | read | null |
| 2026-03-31 | SiPaKosa: A Comprehensive Corpus of Canonical and Classical Buddhist Texts in Sinhala and Pali | Ranidu Gurusinghe et.al. | 2603.29221 | translate | read | null |
| 2026-03-31 | Segmentation of Gray Matters and White Matters from Brain MRI data | Chang Sun et.al. | 2603.29171 | translate | read | null |
| 2026-03-31 | Dual-Imbalance Continual Learning for Real-World Food Recognition | Xiaoyan Zhang et.al. | 2603.29133 | translate | read | null |
| 2026-03-31 | Efficient Bilevel Optimization with KFAC-Based Hypergradients | Disen Liao et.al. | 2603.29108 | translate | read | null |
| 2026-03-31 | Exploring non-trivial band structure and spin polarizations in $d$ -wave altermagnets tailored by anisotropic optical fields | Andrii Iurov et.al. | 2603.29106 | translate | read | null |
| 2026-03-31 | TrajectoryMover: Generative Movement of Object Trajectories in Videos | Kiran Chhatre et.al. | 2603.29092 | translate | read | null |
| 2026-03-31 | IQRA 2026: Interspeech Challenge on Automatic Assessment Pronunciation for Modern Standard Arabic (MSA) | Yassine El Kheir et.al. | 2603.29087 | translate | read | null |
| 2026-03-30 | Expectation Error Bounds for Transfer Learning in Linear Regression and Linear Neural Networks | Meitong Liu et.al. | 2603.28739 | translate | read | null |
| 2026-03-30 | See it to Place it: Evolving Macro Placements with Vision-Language Models | Ikechukwu Uchendu et.al. | 2603.28733 | translate | read | null |
| 2026-03-30 | SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning | Philip Schroeder et.al. | 2603.28730 | translate | read | null |
| 2026-03-30 | Can Hierarchical Cross-Modal Fusion Predict Human Perception of AI Dubbed Content? | Ashwini Dasare et.al. | 2603.28717 | translate | read | null |
| 2026-03-30 | EpiScreen: Early Epilepsy Detection from Electronic Health Records with Large Language Models | Shuang Zhou et.al. | 2603.28698 | translate | read | null |
| 2026-03-30 | From $α$ decay to cluster decay: an extreme case of transfer learning | Yinu Zhang et.al. | 2603.28628 | translate | read | null |
| 2026-03-30 | TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark | Hannes Mareen et.al. | 2603.28613 | translate | read | null |
| 2026-03-30 | Unsafe2Safe: Controllable Image Anonymization for Downstream Utility | Mih Dinh et.al. | 2603.28605 | translate | read | null |
| 2026-03-30 | Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems | Iman Sharifi et.al. | 2603.28561 | translate | read | null |
| 2026-03-30 | GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum | Shuwen Xu et.al. | 2603.28533 | translate | read | null |
| 2026-03-30 | JW-VL: A Vision-Language Model for Solar Physics | Mingfu Shao et.al. | 2603.28504 | translate | read | null |
| 2026-03-30 | INSID3: Training-Free In-Context Segmentation with DINOv3 | Claudia Cuttano et.al. | 2603.28480 | translate | read | null |
| 2026-03-30 | CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains | Wenhan Wang et.al. | 2603.28474 | translate | read | null |
| 2026-03-30 | HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention | Yufei Xu et.al. | 2603.28458 | translate | read | null |
| 2026-03-30 | COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game | Alkis Sygkounas et.al. | 2603.28386 | translate | read | null |
| 2026-03-30 | AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation | Milton Zhou et.al. | 2603.28366 | translate | read | null |
| 2026-03-30 | Physics-Informed Neural Networks for Predicting Hydrogen Sorption in Geological Formations: Thermodynamically Constrained Deep Learning Integrating Classical Adsorption Theory | Mohammad Nooraiepour et.al. | 2603.28328 | translate | read | null |
| 2026-03-30 | LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models | Chanyoung Kim et.al. | 2603.28301 | translate | read | null |
| 2026-03-30 | DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis | Kun Tang et.al. | 2603.28297 | translate | read | null |
| 2026-03-30 | Learning from imperfect quantum data via unsupervised domain adaptation with classical shadows | Kosuke Ito et.al. | 2603.28294 | translate | read | null |
| 2026-03-30 | Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights | Eneko Valero et.al. | 2603.28263 | translate | read | null |
| 2026-03-30 | DongYuan: An LLM-Based Framework for Integrative Chinese and Western Medicine Spleen-Stomach Disorders Diagnosis | Hua Li et.al. | 2603.28191 | translate | read | null |
| 2026-03-30 | A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps | Xuanlong Yu et.al. | 2603.28182 | translate | read | null |
| 2026-03-30 | ToLL: Topological Layout Learning with Structural Multi-view Augmentation for 3D Scene Graph Pretraining | Yucheng Huang et.al. | 2603.28178 | translate | read | null |
| 2026-03-30 | From Reviews to Requirements: Can LLMs Generate Human-Like User Stories? | Shadman Sakib et.al. | 2603.28163 | translate | read | null |
| 2026-03-30 | Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data | Junghoon Justin Park et.al. | 2603.28122 | translate | read | null |
| 2026-03-30 | Compressing Code Context for LLM-based Issue Resolution | Haoxiang Jia et.al. | 2603.28119 | translate | read | null |
| 2026-03-30 | $AutoDrive\text{-}P^3$ : Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning | Yuqi Ye et.al. | 2603.28116 | translate | read | null |
| 2026-03-30 | Can Large Language Models be a Cardinality Estimator? An Empirical study | Liangzu Liu et.al. | 2603.28080 | translate | read | null |
| 2026-03-30 | Is One-Shot In-Context Learning Helpful for Data Selection in Task-Specific Fine-Tuning of Multimodal LLMs? | Xiao An et.al. | 2603.28058 | translate | read | null |
| 2026-03-30 | Reducing Oracle Feedback with Vision-Language Embeddings for Preference-Based RL | Udita Ghosh et.al. | 2603.28053 | translate | read | null |
| 2026-03-30 | Event6D: Event-based Novel Object 6D Pose Tracking | Jae-Young Kang et.al. | 2603.28045 | translate | read | null |
| 2026-03-30 | Seeing the Unseen: Rethinking Illicit Promotion Detection with In-Context Learning | Sangyi Wu et.al. | 2603.28043 | translate | read | null |
| 2026-03-30 | Transfer Learning for an Endangered Slavic Variety: Dependency Parsing in Pomak Across Contact-Shaped Dialects | Sercan Karakaş et.al. | 2603.28033 | translate | read | null |
| 2026-03-30 | Efficient Domain Adaptation for Text Line Recognition via Decoupled Language Models | Arundhathi Dev et.al. | 2603.28028 | translate | read | null |
| 2026-03-30 | Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement | Jingze Su et.al. | 2603.28027 | translate | read | null |
| 2026-03-30 | SegRGB-X: General RGB-X Semantic Segmentation Model | Jiong Liu et.al. | 2603.28023 | translate | read | null |
| 2026-03-30 | A Comparative Study of Molecular Dynamics Approaches for Simulating Ionic Conductivity in Solid Lithium Electrolytes | Dounia Shaaban Kabakibo et.al. | 2603.28012 | translate | read | null |
| 2026-03-30 | FedDES: Graph-Based Dynamic Ensemble Selection for Personalized Federated Learning | Brianna Mueller et.al. | 2603.28006 | translate | read | null |
| 2026-03-30 | CLIP-AUTT: Test-Time Personalization with Action Unit Prompting for Fine-Grained Video Emotion Recognition | Muhammad Osama Zeeshan et.al. | 2603.27999 | translate | read | null |
| 2026-03-30 | UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection | Hongjing Wu et.al. | 2603.27995 | translate | read | null |
| 2026-03-30 | On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR | Ganesh Pavan Kartikeya Bharadwaj Kolluri et.al. | 2603.27981 | translate | read | null |
| 2026-03-30 | Principal Prototype Analysis on Manifold for Interpretable Reinforcement Learning | Bodla Krishna Vamshi et.al. | 2603.27971 | translate | read | null |
| 2026-03-30 | Learning Multi-View Spatial Reasoning from Cross-View Relations | Suchae Jeong et.al. | 2603.27967 | translate | read | null |
| 2026-03-30 | GEAKG: Generative Executable Algorithm Knowledge Graphs | Camilo Chacón Sartori et.al. | 2603.27922 | translate | read | null |
| 2026-03-30 | ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing | Edward J. Yoon et.al. | 2603.27914 | translate | read | null |
| 2026-03-25 | Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA | Saahil Mathur et.al. | 2603.24580 | translate | read | null |
| 2026-03-25 | VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models | Qijia He et.al. | 2603.24575 | translate | read | null |
| 2026-03-25 | Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction | Haresh Rengaraj Rajamohan et.al. | 2603.24562 | translate | read | null |
| 2026-03-25 | Boosting LLMs for Mutation Generation | Bo Wang et.al. | 2603.24560 | translate | read | null |
| 2026-03-25 | LensWalk: Agentic Video Understanding by Planning How You See in Videos | Keliang Li et.al. | 2603.24558 | translate | read | null |
| 2026-03-25 | UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience | Zichuan Lin et.al. | 2603.24533 | translate | read | null |
| 2026-03-25 | TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models | Yushi Guan et.al. | 2603.24518 | translate | read | null |
| 2026-03-25 | Fine-tuning universal machine learning potentials for transition state search in surface catalysis | Raffaele Cheula et.al. | 2603.24482 | translate | read | null |
| 2026-03-25 | Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufacturing and Usage Variability | Samuel Filgueira da Silva et.al. | 2603.24475 | translate | read | null |
| 2026-03-25 | The Gait Signature of Frailty: Transfer Learning based Deep Gait Models for Scalable Frailty Assessment | Laura McDaniel et.al. | 2603.24434 | translate | read | null |
| 2026-03-25 | PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation | Manoj Balaji Jagadeeshan et.al. | 2603.24413 | translate | read | null |
| 2026-03-25 | Causal Transfer in Medical Image Analysis | Mohammed M. Abdelsamea et.al. | 2603.24388 | translate | read | null |
| 2026-03-25 | Improving Lean4 Autoformalization via Cycle Consistency Fine-tuning | Arsen Shebzukhov et.al. | 2603.24372 | translate | read | null |
| 2026-03-25 | Aluminum solidification and nanopolycrystal deformation via a Graph Neural Network Potential and Million-Atom Simulations | Ian Störmer et.al. | 2603.24360 | translate | read | null |
| 2026-03-25 | Le MuMo JEPA: Multi-Modal Self-Supervised Representation Learning with Learnable Fusion Tokens | Ciem Cornelissen et.al. | 2603.24327 | translate | read | null |
| 2026-03-25 | Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions | Shiqin Wang et.al. | 2603.24322 | translate | read | null |
| 2026-03-25 | Samasāmayik: A Parallel Dataset for Hindi-Sanskrit Machine Translation | N J Karthika et.al. | 2603.24307 | translate | read | null |
| 2026-03-25 | Cost-Sensitive Neighborhood Aggregation for Heterophilous Graphs: When Does Per-Edge Routing Help? | Eyal Weiss et.al. | 2603.24291 | translate | read | null |
| 2026-03-25 | ScrollScape: Unlocking 32K Image Generation With Video Diffusion Priors | Haodong Yu et.al. | 2603.24270 | translate | read | null |
| 2026-03-25 | Forecasting with Guidance: Representation-Level Supervision for Time Series Forecasting | Jiacheng Wang et.al. | 2603.24262 | translate | read | null |
| 2026-03-25 | Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition | Aleix Sant et.al. | 2603.24242 | translate | read | null |
| 2026-03-25 | RVLM: Recursive Vision-Language Models with Adaptive Depth | Nicanor Mayumu et.al. | 2603.24224 | translate | read | null |
| 2026-03-25 | Variation is the Norm: Embracing Sociolinguistics in NLP | Anne-Marie Lutgen et.al. | 2603.24222 | translate | read | null |
| 2026-03-25 | Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage | Faiz Taleb et.al. | 2603.24213 | translate | read | null |
| 2026-03-25 | SumRank: Aligning Summarization Models for Long-Document Listwise Reranking | Jincheng Feng et.al. | 2603.24204 | translate | read | null |
| 2026-03-25 | A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula | Cansu Sancaktar et.al. | 2603.24202 | translate | read | null |
| 2026-03-25 | RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution | Yushuai Song et.al. | 2603.24198 | translate | read | null |
| 2026-03-25 | MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare | Shubham Kumar Nigam et.al. | 2603.24132 | translate | read | null |
| 2026-03-25 | Alignment Reduces Expressed but Not Encoded Gender Bias: A Unified Framework and Study | Nour Bouchouchi et.al. | 2603.24125 | translate | read | null |
| 2026-03-25 | LGTM: Training-Free Light-Guided Text-to-Image Diffusion Model via Initial Noise Manipulation | Ryugo Morita et.al. | 2603.24086 | translate | read | null |
| 2026-03-25 | SOMA: Strategic Orchestration and Memory-Augmented System for Vision-Language-Action Model Robustness via In-Context Adaptation | Zhuoran Li et.al. | 2603.24060 | translate | read | null |
| 2026-03-25 | AD-Reasoning: Multimodal Guideline-Guided Reasoning for Alzheimer’s Disease Diagnosis | Qiuhui Chen et.al. | 2603.24059 | translate | read | null |
| 2026-03-25 | MoE-Sieve: Routing-Guided LoRA for Efficient MoE Fine-Tuning | Andrea Manzoni et.al. | 2603.24044 | translate | read | null |
| 2026-03-25 | Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale | Chinmay Soni et.al. | 2603.24023 | translate | read | null |
| 2026-03-25 | GARP-EFM: Improving Foundation Models with Revealed Preference Structure | Victor H. Aguiar et.al. | 2603.23993 | translate | read | null |
| 2026-03-25 | Can we generate portable representations for clinical time series data using LLMs? | Zongliang Ji et.al. | 2603.23987 | translate | read | null |
| 2026-03-25 | PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning | Yankai Wang et.al. | 2603.23957 | translate | read | null |
| 2026-03-25 | VOLMO: Versatile and Open Large Models for Ophthalmology | Zhenyue Qin et.al. | 2603.23953 | translate | read | null |
| 2026-03-25 | DP^2-VL: Private Photo Dataset Protection by Data Poisoning for Vision-Language Models | Hongyi Miao et.al. | 2603.23925 | translate | read | null |
| 2026-03-25 | Can VLMs Reason Robustly? A Neuro-Symbolic Investigation | Weixin Chen et.al. | 2603.23867 | translate | read | null |
| 2026-03-25 | Perturbation: A simple and efficient adversarial tracer for representation learning in language models | Joshua Rozner et.al. | 2603.23821 | translate | read | null |
| 2026-03-24 | Retinal Disease Classification from Fundus Images using CNN Transfer Learning | Ali Akram et.al. | 2603.23785 | translate | read | null |
| 2026-03-24 | Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models | Kuepon Aueawatthanaphisut et.al. | 2603.23783 | translate | read | null |
| 2026-03-24 | Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters | Nan Cui et.al. | 2603.23780 | translate | read | null |
| 2026-03-24 | Thermally inflated accretors in post-mass transfer binaries: Abell 35 and its class revisited | Soumyadeep Bhattacharjee et.al. | 2603.23756 | translate | read | null |
| 2026-03-24 | Detection and Classification of (Pre)Cancerous Cells in Pap Smears: An Ensemble Strategy for the RIVA Cervical Cytology Challenge | Lautaro Kogan et.al. | 2603.23742 | translate | read | null |
| 2026-03-24 | Wasserstein Parallel Transport for Predicting the Dynamics of Statistical Systems | Tristan Luca Saidi et.al. | 2603.23736 | translate | read | null |
| 2026-03-24 | An Adapter-free Fine-tuning Approach for Tuning 3D Foundation Models | Sneha Paul et.al. | 2603.23730 | translate | read | null |
| 2026-03-24 | The Diminishing Returns of Early-Exit Decoding in Modern LLMs | Rui Wei et.al. | 2603.23701 | translate | read | null |
| 2026-03-24 | MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis | Wei Sun et.al. | 2603.23580 | translate | read | null |
| 2026-03-24 | Dual-Criterion Curriculum Learning: Application to Temporal Data | Gaspard Abel et.al. | 2603.23573 | translate | read | null |
| 2026-03-24 | ConceptCoder: Improve Code Reasoning via Concept Learning | Md Mahbubur Rahman et.al. | 2603.23470 | translate | read | null |
| 2026-03-24 | DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection | Gautam Rajendrakumar Gare et.al. | 2603.23455 | translate | read | null |
| 2026-03-24 | A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving | Giacomo Avanzi et.al. | 2603.23387 | translate | read | null |
| 2026-03-24 | ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images | Yunfeng Wu et.al. | 2603.23326 | translate | read | null |
| 2026-03-24 | SafeSeek: Universal Attribution of Safety Circuits in Language Models | Miao Yu et.al. | 2603.23268 | translate | read | null |
| 2026-03-24 | The NCS-Model: A seismic foundation model trained on the Norwegian repository of public data | Alba Ordonez et.al. | 2603.23211 | translate | read | null |
| 2026-03-24 | GSwap: Realistic Head Swapping with Dynamic Neural Gaussian Field | Jingtao Zhou et.al. | 2603.23168 | translate | read | null |
| 2026-03-24 | Automatic Segmentation of 3D CT scans with SAM2 using a zero-shot approach | Miquel Lopez Escoriza et.al. | 2603.23116 | translate | read | null |
| 2026-03-24 | PolarAPP: Beyond Polarization Demosaicking for Polarimetric Applications | Yidong Luo et.al. | 2603.23071 | translate | read | null |
| 2026-03-24 | MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding | Basit Alawode et.al. | 2603.23067 | translate | read | null |
| 2026-03-24 | MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates | Zikang Huang et.al. | 2603.23048 | translate | read | null |
| 2026-03-24 | Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation | Julian Oestreich et.al. | 2603.23047 | translate | read | null |
| 2026-03-24 | Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment | Guoyang Zhao et.al. | 2603.23034 | translate | read | null |
| 2026-03-24 | Fine-tuning of universal machine-learning interatomic potentials for 2D high-entropy alloys | Chun Zhou et.al. | 2603.23029 | translate | read | null |
| 2026-03-24 | Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation | Nils A. Herrmann et.al. | 2603.22985 | translate | read | null |
| 2026-03-24 | PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference | Qirui Wang et.al. | 2603.22943 | translate | read | null |
| 2026-03-24 | Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning | Anshul Solanki et.al. | 2603.22942 | translate | read | null |
| 2026-03-24 | EVA: Efficient Reinforcement Learning for End-to-End Video Agent | Yaolun Zhang et.al. | 2603.22918 | translate | read | null |
| 2026-03-24 | When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse | Yihuan Huang et.al. | 2603.22915 | translate | read | null |
| 2026-03-24 | EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction | Yixuan Wang et.al. | 2603.22910 | translate | read | null |
| 2026-03-24 | Dual-Teacher Distillation with Subnetwork Rectification for Black-Box Domain Adaptation | Zhe Zhang et.al. | 2603.22908 | translate | read | null |
| 2026-03-24 | VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents | Pengsen Liu et.al. | 2603.22892 | translate | read | null |
| 2026-03-24 | Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories | Yang Li et.al. | 2603.22869 | translate | read | null |
| 2026-03-24 | TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment | Chunxia Qin et.al. | 2603.22819 | translate | read | null |
| 2026-03-24 | Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting | Shuojue Yang et.al. | 2603.22792 | translate | read | null |
| 2026-03-24 | KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao | Zhi Sun et.al. | 2603.22779 | translate | read | null |
| 2026-03-24 | AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model | Yagizhan Bilal Durak et.al. | 2603.22777 | translate | read | null |
| 2026-03-24 | DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona | Janghyeok Choi et.al. | 2603.22765 | translate | read | null |
| 2026-03-24 | KALAVAI: Predicting When Independent Specialist Fusion Works – A Quantitative Model for Post-Hoc Cooperative LLM Training | Ramchand Kumaresan et.al. | 2603.22755 | translate | read | null |
| 2026-03-24 | Multitask-Informed Prior for In-Context Learning on Tabular Data: Application to Steel Property Prediction | Dimitrios Sinodinos et.al. | 2603.22738 | translate | read | null |
| 2026-03-24 | The Interspeech 2026 Audio Encoder Capability Challenge for Large Audio Language Models | Heinrich Dinkel et.al. | 2603.22728 | translate | read | null |
| 2026-03-24 | Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval | Bushra Sabir et.al. | 2603.22717 | translate | read | null |
| 2026-03-24 | Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents | Xinyi Zhang et.al. | 2603.22708 | translate | read | null |
| 2026-03-24 | How Far Can VLMs Go for Visual Bug Detection? Studying 19,738 Keyframes from 41 Hours of Gameplay Videos | Wentao Lu et.al. | 2603.22706 | translate | read | null |
| 2026-03-24 | WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment | Tzu-Ti Wei et.al. | 2603.22690 | translate | read | null |
| 2026-03-24 | GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning | Jiayin Sun et.al. | 2603.22687 | translate | read | null |
| 2026-03-24 | Designing a Meta-Reflective Dashboard for Instructor Insight into Student-AI Interactions | Boxuan Ma et.al. | 2603.22674 | translate | read | null |
| 2026-03-24 | A mechanism for nonmonotonic $T_{c,max}(n)$ in multilayer cuprates | Pavel Kornilovitch et.al. | 2603.22662 | translate | read | null |
| 2026-03-24 | Generalizing Dynamics Modeling More Easily from Representation Perspective | Yiming Wang et.al. | 2603.22655 | translate | read | null |
| 2026-03-23 | Transfer learning via interpolating structures | T. A. Dardeno et.al. | 2603.22621 | translate | read | null |
| 2026-03-23 | TrajLoom: Dense Future Trajectory Generation from Video | Zewei Zhang et.al. | 2603.22606 | translate | read | null |
| 2026-03-23 | A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks | Anish Saha et.al. | 2603.22586 | translate | read | null |
| 2026-03-23 | CanViT: Toward Active-Vision Foundation Models | Yohaï-Eliel Berreby et.al. | 2603.22570 | translate | read | null |
| 2026-03-23 | Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling | Young Hyun Cho et.al. | 2603.22563 | translate | read | null |
| 2026-03-23 | Quantum Tunneling of Primordial Black Holes to White Holes: Rates, Constraints, and Implications for Fast Radio Bursts | Christopher Ewasiuk et.al. | 2603.22516 | translate | read | null |
| 2026-03-23 | Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games | Clément Hongler et.al. | 2603.22479 | translate | read | null |
| 2026-03-23 | Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs | Haoming Meng et.al. | 2603.22446 | translate | read | null |
| 2026-03-23 | FAAR: Format-Aware Adaptive Rounding for NVFP4 | Hanglin Li et.al. | 2603.22370 | translate | read | null |
| 2026-03-23 | MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives | Xiang Li et.al. | 2603.22364 | translate | read | null |
| 2026-03-22 | Personalized Federated Sequential Recommender | Yicheng Di et.al. | 2603.22349 | translate | read | null |
| 2026-03-20 | A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection | Lijie Zhou et.al. | 2603.22313 | translate | read | null |
| 2026-03-23 | 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing | Haoyu Zhen et.al. | 2603.22279 | translate | read | null |
| 2026-03-23 | Development and large-scale benchmarks of a protein-ligand absolute binding free energy toolkit | Yu Liu et.al. | 2603.22274 | translate | read | null |
| 2026-03-23 | GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning | Yixuan Luo et.al. | 2603.22270 | translate | read | null |
| 2026-03-23 | Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models | Tom Biskupski et.al. | 2603.22214 | translate | read | null |
| 2026-03-23 | Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation | Ireh Kim et.al. | 2603.22186 | translate | read | null |
| 2026-03-23 | Revisiting Quantum Code Generation: Where Should Domain Knowledge Live? | Oscar Novo et.al. | 2603.22184 | translate | read | null |
| 2026-03-23 | DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment | Xin Cai et.al. | 2603.22125 | translate | read | null |
| 2026-03-23 | Overcoming sampling limitations using machine-learned interatomic potentials: the case of water-in-salt electrolytes | Luca Brugnoli et.al. | 2603.22099 | translate | read | null |
| 2026-03-23 | AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing | Peter Pak et.al. | 2603.22017 | translate | read | null |
| 2026-03-23 | SecureBreak – A dataset towards safe and secure models | Marco Arazzi et.al. | 2603.21975 | translate | read | null |
| 2026-03-23 | Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning | Ulugbek Shernazarov et.al. | 2603.21970 | translate | read | null |
| 2026-03-23 | Chronological Contrastive Learning: Few-Shot Progression Assessment in Irreversible Diseases | Clemens Watzenböck et.al. | 2603.21935 | translate | read | null |
| 2026-03-23 | SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation for Medical Image Segmentation | Linkuan Zhou et.al. | 2603.21904 | translate | read | null |
| 2026-03-23 | Adaptive Federated Fine-Tuning of Self-Supervised Speech Representations | Xin Guo et.al. | 2603.21888 | translate | read | null |
| 2026-03-23 | ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval | Zhuocheng Zhang et.al. | 2603.21886 | translate | read | null |
| 2026-03-23 | Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation | Donald Shenaj et.al. | 2603.21884 | translate | read | null |
| 2026-03-23 | Deep S2P: Integrating Learning Based Stereo Matching Into the Satellite Stereo Pipeline | Elías Masquil et.al. | 2603.21882 | translate | read | null |
| 2026-03-23 | CoRA: Boosting Time Series Foundation Models for Multivariate Forecasting through Correlation-aware Adapter | Hanyin Cheng et.al. | 2603.21828 | translate | read | null |
| 2026-03-23 | Timing In stand-up Comedy: Text, Audio, Laughter, Kinesics (TIC-TALK): Pipeline and Database for the Multimodal Study of Comedic Timing | Yaelle Zribi et.al. | 2603.21803 | translate | read | null |
| 2026-03-23 | Fine-tuning of light-time effect in triple systems | David Vokrouhlický et.al. | 2603.21799 | translate | read | null |
| 2026-03-23 | Benchmarking Recurrent Event-Based Object Detection for Industrial Multi-Class Recognition on MTEvent | Lokeshwaran Manohar et.al. | 2603.21787 | translate | read | null |
| 2026-03-23 | SHARP: Spectrum-aware Highly-dynamic Adaptation for Resolution Promotion in Remote Sensing Synthesis | Bingxuan Zhao et.al. | 2603.21783 | translate | read | null |
| 2026-03-23 | Getting to the Point: Why Pointing Improves LVLMs | Simone Alghisi et.al. | 2603.21746 | translate | read | null |
| 2026-03-23 | EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning | Andreas Sauter et.al. | 2603.21728 | translate | read | null |
| 2026-03-23 | CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning | Shuo Wang et.al. | 2603.21725 | translate | read | null |
| 2026-03-23 | OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging | Meilin Liu et.al. | 2603.21660 | translate | read | null |
| 2026-03-23 | mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT | Woosung Koh et.al. | 2603.21606 | translate | read | null |
| 2026-03-23 | TERS-ABNet: A Deep Learning Approach for Automated Single-Molecule Structure Reconstruction with Atomic Precision from TERS Mapping | Jie Cui et.al. | 2603.21579 | translate | read | null |
| 2026-03-23 | Mind over Space: Can Multimodal Large Language Models Mentally Navigate? | Qihui Zhu et.al. | 2603.21577 | translate | read | null |
| 2026-03-23 | Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs | Efthymios Tsaprazlis et.al. | 2603.21573 | translate | read | null |
| 2026-03-23 | CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation | Mohammad Eslami et.al. | 2603.21566 | translate | read | null |
| 2026-03-23 | SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification | Migyeong Kang et.al. | 2603.21529 | translate | read | null |
| 2026-03-23 | Learning Inflation Narratives from Reddit: How Lightweight LLMs Reveal Forward-Looking Economic Signals | Ryuichi Saito et.al. | 2603.21501 | translate | read | null |
| 2026-03-23 | EpiMask: Leveraging Epipolar Distance Based Masks in Cross-Attention for Satellite Image Matching | Rahul Deshmukh et.al. | 2603.21463 | translate | read | null |
| 2026-03-22 | DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation | Shuai Wang et.al. | 2603.21430 | translate | read | null |
| 2026-03-22 | Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs | Mariela M. Nina et.al. | 2603.21418 | translate | read | null |
| 2026-03-22 | Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures | Gregory M. Ruddell et.al. | 2603.21415 | translate | read | null |
| 2026-03-22 | Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation | Nikolay Kormushev et.al. | 2603.21386 | translate | read | null |
| 2026-03-22 | PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost | Junkeun Yi et.al. | 2603.21383 | translate | read | null |
| 2026-03-22 | Privacy-Preserving Federated Action Recognition via Differentially Private Selective Tuning and Efficient Communication | Idris Zakariyya et.al. | 2603.21305 | translate | read | null |
| 2026-03-22 | Enhancing reasoning accuracy in large language models during inference time | Vinay Sharma et.al. | 2603.21301 | translate | read | null |
| 2026-03-22 | Sonny: Breaking the Compute Wall in Medium-Range Weather Forecasting | Minjong Cheon et.al. | 2603.21284 | translate | read | null |
| 2026-03-22 | Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity | Zihan Fang et.al. | 2603.21276 | translate | read | null |
| 2026-03-22 | Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis | Tian Xia et.al. | 2603.21213 | translate | read | null |
| 2026-03-22 | Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows | Janne Perini et.al. | 2603.21210 | translate | read | null |
| 2026-03-22 | Context Selection for Hypothesis and Statistical Evidence Extraction from Full-Text Scientific Articles | Sai Koneru et.al. | 2603.21193 | translate | read | null |
| 2026-03-22 | Reward Sharpness-Aware Fine-Tuning for Diffusion Models | Kwanyoung Kim et.al. | 2603.21175 | translate | read | null |
| 2026-03-22 | Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues | Wenjin Hou et.al. | 2603.21138 | translate | read | null |
| 2026-03-22 | Diffusion-based Probabilistic Air Quality Forecasting with Mechanistic Insight | Ao Ding et.al. | 2603.21131 | translate | read | null |
| 2026-03-22 | Frequency Switching Mechanism for Parameter-E!cient Multi-Task Learning | Shih-Wen Liu et.al. | 2603.21111 | translate | read | null |
| 2026-03-22 | Learning Progressive Adaptation for Multi-Modal Tracking | He Wang et.al. | 2603.21100 | translate | read | null |
| 2026-03-22 | Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems | Haidong Wang et.al. | 2603.21097 | translate | read | null |
| 2026-03-22 | Mixture of Chapters: Scaling Learnt Memory in Transformers | Tasmay Pankaj Tibrewal et.al. | 2603.21096 | translate | read | null |
| 2026-03-22 | Representation-Level Adversarial Regularization for Clinically Aligned Multitask Thyroid Ultrasound Assessment | Dina Salama et.al. | 2603.21095 | translate | read | null |
| 2026-03-22 | Hierarchical Text-Guided Brain Tumor Segmentation via Sub-Region-Aware Prompts | Bahram Mohammadi et.al. | 2603.21083 | translate | read | null |
| 2026-03-22 | CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models | Nan Zhou et.al. | 2603.21077 | translate | read | null |
| 2026-03-22 | A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors | Gia-Bao Doan et.al. | 2603.21048 | translate | read | null |
| 2026-03-22 | Statistical Learning for Latent Embedding Alignment with Application to Brain Encoding and Decoding | Shuoxun Xu et.al. | 2603.21042 | translate | read | null |
| 2026-03-22 | KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph | Ye Tian et.al. | 2603.21029 | translate | read | null |
| 2026-03-22 | ECI: Effective Contrastive Information to Evaluate Hard-Negatives | Aarush Sinha et.al. | 2603.20990 | translate | read | null |
| 2026-03-22 | Consistent but Dangerous: Per-Sample Safety Classification Reveals False Reliability in Medical Vision-Language Models | Binesh Sadanandan et.al. | 2603.20985 | translate | read | null |
| 2026-03-21 | Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification | Kemal Kirtac et.al. | 2603.20965 | translate | read | null |
| 2026-03-21 | User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction | Yuren Hao et.al. | 2603.20939 | translate | read | null |
| 2026-03-21 | Restoring Neural Network Plasticity for Faster Transfer Learning | Xander Coetzer et.al. | 2603.20860 | translate | read | null |
| 2026-03-21 | A Knowledge-Informed Pretrained Model for Causal Discovery | Wenbo Xu et.al. | 2603.20842 | translate | read | null |
| 2026-03-21 | Less is More in Semantic Space: Intrinsic Decoupling via Clifford-M for Fundus Image Classification | Yifeng Zheng et.al. | 2603.20806 | translate | read | null |
| 2026-03-21 | MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction | Jiaxin Cheng et.al. | 2603.20782 | translate | read | null |
| 2026-03-21 | Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping | Sunghyun Park et.al. | 2603.20755 | translate | read | null |
| 2026-03-21 | Optically Activated Superconductivity in MgB2 via Electroluminescent GaP Inhomogeneous Phase | Yao Qi et.al. | 2603.20719 | translate | read | null |
| 2026-03-21 | Decoupling Numerical and Structural Parameters: An Empirical Study on Adaptive Genetic Algorithms via Deep Reinforcement Learning for the Large-Scale TSP | Hongyu Wang et.al. | 2603.20702 | translate | read | null |
| 2026-03-21 | Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs | Huan Zheng et.al. | 2603.20698 | translate | read | null |
| 2026-03-21 | E-SocialNav: Efficient Socially Compliant Navigation with Language Models | Ling Xiao et.al. | 2603.20664 | translate | read | null |
| 2026-03-21 | A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation | Ling Xiao et.al. | 2603.20648 | translate | read | null |
| 2026-03-21 | ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework | Guanzhou Chen et.al. | 2603.20644 | translate | read | null |
| 2026-03-21 | Optimal low-rank stochastic gradient estimation for LLM training | Zehao Li et.al. | 2603.20632 | translate | read | null |
| 2026-03-21 | GHOST: Ground-projected Hypotheses from Observed Structure-from-Motion Trajectories | Tomasz Frelek et.al. | 2603.20583 | translate | read | null |
| 2026-03-20 | When Negation Is a Geometry Problem in Vision-Language Models | Fawaz Sammani et.al. | 2603.20554 | translate | read | null |
| 2026-03-20 | Grounded Chess Reasoning in Language Models via Master Distillation | Zhenwei Tang et.al. | 2603.20510 | translate | read | null |
| 2026-03-20 | AE-LLM: Adaptive Efficiency Optimization for Large Language Models | Kaito Tanaka et.al. | 2603.20492 | translate | read | null |
| 2026-03-20 | Time-Reversed BSDEs for Accurate Gradient Estimation in Diffusion Models | Yuhang Mei et.al. | 2603.20455 | translate | read | null |
| 2026-03-20 | Leveraging Natural Language Processing and Machine Learning for Evidence-Based Food Security Policy Decision-Making in Data-Scarce Making | Karan Kumar Singh et.al. | 2603.20425 | translate | read | null |
| 2026-03-20 | Meta-Learning for Repeated Bayesian Persuasion | Ata Poyraz Turna et.al. | 2603.20408 | translate | read | null |
| 2026-03-20 | FAAR: Efficient Frequency-Aware Multi-Task Fine-Tuning via Automatic Rank Selection | Maxime Fontana et.al. | 2603.20403 | translate | read | null |
| 2026-03-20 | From Cross-Validation to SURE: Asymptotic Risk of Tuned Regularized Estimators | Karun Adusumilli et.al. | 2603.20388 | translate | read | null |
| 2026-03-20 | Multi-Stage Fine-Tuning of Pathology Foundation Models with Head-Diverse Ensembling for White Blood Cell Classification | Antony Gitau et.al. | 2603.20383 | translate | read | null |
| 2026-03-20 | Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms | Oleksii Bychkov et.al. | 2603.20333 | translate | read | null |
| 2026-03-20 | Prompt-Free Lightweight SAM Adaptation for Histopathology Nuclei Segmentation with Strong Cross-Dataset Generalization | Muhammad Hassan Maqsood et.al. | 2603.20326 | translate | read | null |
| 2026-03-19 | Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges | Hong-Hanh Nguyen-Le et.al. | 2603.20304 | translate | read | null |
| 2026-03-17 | OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis | Zhuofeng Li et.al. | 2603.20278 | translate | read | null |
| 2026-03-17 | Me, Myself, and $π$ : Evaluating and Explaining LLM Introspection | Atharv Naphade et.al. | 2603.20276 | translate | read | null |
| 2026-03-20 | Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models | Sai Koneru et.al. | 2603.20162 | translate | read | null |
| 2026-03-20 | An Agentic Multi-Agent Architecture for Cybersecurity Risk Management | Ravish Gupta et.al. | 2603.20131 | translate | read | null |
| 2026-03-20 | Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning | Jiajie Li et.al. | 2603.20116 | translate | read | null |
| 2026-03-20 | An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models | Yuming Feng et.al. | 2603.20100 | translate | read | null |
| 2026-03-20 | Predicting States of Understanding in Explanatory Interactions Using Cognitive Load-Related Linguistic Cues | Yu Wang et.al. | 2603.20079 | translate | read | null |
| 2026-03-20 | Fine-tuning Timeseries Predictors Using Reinforcement Learning | Hugo Cazaux et.al. | 2603.20063 | translate | read | null |
| 2026-03-20 | Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features | Zheng Gao et.al. | 2603.20012 | translate | read | null |
| 2026-03-20 | Monte Carlo conformal prediction for quantifying uncertainty in radio galaxy classification under ambiguous ground truth | Alex Walls et.al. | 2603.20000 | translate | read | null |
| 2026-03-20 | SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia | Zhixiang Lu et.al. | 2603.19931 | translate | read | null |
| 2026-03-20 | High-energy neutrino flux from SN2024ggi: constraints from semi-analytic modeling of its post-explosive emission | M. Buccheri et.al. | 2603.19919 | translate | read | null |
| 2026-03-20 | Integrating Meta-Features with Knowledge Graph Embeddings for Meta-Learning | Antonis Klironomos et.al. | 2603.19888 | translate | read | null |
| 2026-03-20 | SIMPLER: Efficient Foundation Model Adaptation via Similarity-Guided Layer Pruning for Earth Observation | Víctor Barreiro et.al. | 2603.19873 | translate | read | null |
| 2026-03-20 | MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment | Jiyao Liu et.al. | 2603.19863 | translate | read | null |
| 2026-03-20 | Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision | Jiyeong Kim et.al. | 2603.19807 | translate | read | null |
| 2026-03-20 | Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling | Yao Yao et.al. | 2603.19794 | translate | read | null |
| 2026-03-20 | Invariant quantile regression for heterogeneous environments | Bo Fu et.al. | 2603.19745 | translate | read | null |
| 2026-03-20 | FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment | Kewen Zhu et.al. | 2603.19741 | translate | read | null |
| 2026-03-20 | Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification | Baoding He et.al. | 2603.19715 | translate | read | null |
| 2026-03-20 | AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation | Jingcao Xu et.al. | 2603.19710 | translate | read | null |
| 2026-03-20 | Demographic-Aware Self-Supervised Anomaly Detection Pretraining for Equitable Rare Cardiac Diagnosis | Chaoqin Huang et.al. | 2603.19695 | translate | read | null |
| 2026-03-20 | A Subgoal-driven Framework for Improving Long-Horizon LLM Agents | Taiyi Wang et.al. | 2603.19685 | translate | read | null |
| 2026-03-20 | Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification | Kunlun Xu et.al. | 2603.19678 | translate | read | null |
| 2026-03-20 | PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization | Renhong Huang et.al. | 2603.19649 | translate | read | null |
| 2026-03-20 | UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer | Caiyi Sun et.al. | 2603.19637 | translate | read | null |
| 2026-03-20 | Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL | Xuhan Tong et.al. | 2603.19611 | translate | read | null |
| 2026-03-20 | CeRLP: A Cross-embodiment Robot Local Planning Framework for Visual Navigation | Haoyu Xi et.al. | 2603.19602 | translate | read | null |
| 2026-03-20 | CO-EVOLVE: Bidirectional Co-Evolution of Graph Structure and Semantics for Heterophilous Learning | Jinming Xing et.al. | 2603.19596 | translate | read | null |
| 2026-03-20 | Evolving Embodied Intelligence: Graph Neural Network–Driven Co-Design of Morphology and Control in Soft Robotics | Jianqiang Wang et.al. | 2603.19582 | translate | read | null |
| 2026-03-20 | PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning | Tianmeng Hu et.al. | 2603.19579 | translate | read | null |
| 2026-03-20 | MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation | Kaixin Cai et.al. | 2603.19575 | translate | read | null |
| 2026-03-20 | Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search | Haoyu Zhang et.al. | 2603.19563 | translate | read | null |
| 2026-03-20 | Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers | Yijiang Li et.al. | 2603.19544 | translate | read | null |
| 2026-03-19 | Learning to Disprove: Formal Counterexample Generation with Large Language Models | Zenan Li et.al. | 2603.19514 | translate | read | null |
| 2026-03-19 | Teaching an Agent to Sketch One Part at a Time | Xiaodan Du et.al. | 2603.19500 | translate | read | null |
| 2026-03-19 | ICLAD: In-Context Learning for Unified Tabular Anomaly Detection Across Supervision Regimes | Jack Yi Wei et.al. | 2603.19497 | translate | read | null |
| 2026-03-19 | Any-Subgroup Equivariant Networks via Symmetry Breaking | Abhinav Goel et.al. | 2603.19486 | translate | read | null |
| 2026-03-19 | Instruction-Free Tuning of Large Vision Language Models for Medical Instruction Following | Myeongkyun Kang et.al. | 2603.19482 | translate | read | null |
| 2026-03-19 | Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids | Lucas Ferraz et.al. | 2603.19473 | translate | read | null |
| 2026-03-19 | Listen First, Then Answer: Timestamp-Grounded Speech Reasoning | Jihoon Jeong et.al. | 2603.19468 | translate | read | null |
| 2026-03-19 | ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models | Thomas De Min et.al. | 2603.19466 | translate | read | null |
| 2026-03-19 | Hyperagents | Jenny Zhang et.al. | 2603.19461 | translate | read | null |
| 2026-03-19 | In-the-Wild Camouflage Attack on Vehicle Detectors through Controllable Image Editing | Xiao Fang et.al. | 2603.19456 | translate | read | null |
| 2026-03-19 | Pseudo-Labeling for Unsupervised Domain Adaptation with Kernel GLMs | Nathan Weill et.al. | 2603.19422 | translate | read | null |
| 2026-03-19 | TuLaBM: Tumor-Biased Latent Bridge Matching for Contrast-Enhanced MRI Synthesis | Atharva Rege et.al. | 2603.19386 | translate | read | null |
| 2026-03-17 | Target Concept Tuning Improves Extreme Weather Forecasting | Shijie Ren et.al. | 2603.19325 | translate | read | null |
| 2026-03-19 | SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing | Xinyao Zhang et.al. | 2603.19228 | translate | read | null |
| 2026-03-19 | RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing | Yue Gong et.al. | 2603.19206 | translate | read | null |
| 2026-03-19 | How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation | Ke-Han Lu et.al. | 2603.19195 | translate | read | null |
| 2026-03-19 | Sparse Autoencoders Reveal Interpretable and Steerable Features in VLA Models | Aiden Swann et.al. | 2603.19183 | translate | read | null |
| 2026-03-19 | ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis | Zhan Jin et.al. | 2603.19169 | translate | read | null |
| 2026-03-19 | ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation | Kwanyoung Lee et.al. | 2603.19157 | translate | read | null |
| 2026-03-19 | Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection | Ruilin Li et.al. | 2603.19145 | translate | read | null |
| 2026-03-19 | From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models | Zhuofan Li et.al. | 2603.19131 | translate | read | null |
| 2026-03-19 | TAU-R1: Visual Language Model for Traffic Anomaly Understanding | Yuqiang Lin et.al. | 2603.19098 | translate | read | null |
| 2026-03-19 | Adaptive Nonlinear Data Assimilation through P-Spline Triangular Measure Transport | Berent Å. S. Lunde et.al. | 2603.19058 | translate | read | null |
| 2026-03-19 | MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models | Chenyang Gu et.al. | 2603.19044 | translate | read | null |
| 2026-03-19 | Security awareness in LLM agents: the NDAI zone case | Enrico Bottazzi et.al. | 2603.19011 | translate | read | null |
| 2026-03-19 | CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think | Zening Sun et.al. | 2603.18991 | translate | read | null |
| 2026-03-19 | An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction | Ezekiel Nii Noye Nortey et.al. | 2603.18927 | translate | read | null |
| 2026-03-19 | Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders | Yana Veitsman et.al. | 2603.18863 | translate | read | null |
| 2026-03-19 | BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding | Xiucheng Wang et.al. | 2603.18855 | translate | read | null |
| 2026-03-19 | Seasoning Generative Models for a Generalization Aftertaste | Hisham Husain et.al. | 2603.18817 | translate | read | null |
| 2026-03-19 | Functional Subspace Watermarking for Large Language Models | Zikang Ding et.al. | 2603.18793 | translate | read | null |
| 2026-03-19 | SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction | Vsevolod Skorokhodov et.al. | 2603.18774 | translate | read | null |
| 2026-03-19 | Automatic Configuration of LLM Post-Training Pipelines | Channe Chwa et.al. | 2603.18773 | translate | read | null |
| 2026-03-19 | ProCal: Probability Calibration for Neighborhood-Guided Source-Free Domain Adaptation | Ying Zheng et.al. | 2603.18764 | translate | read | null |
| 2026-03-19 | DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection | Haochen Li et.al. | 2603.18757 | translate | read | null |
| 2026-03-19 | Multimodal Model for Computational Pathology:Representation Learning and Image Compression | Peihang Wu et.al. | 2603.18660 | translate | read | null |
| 2026-03-19 | Balanced Thinking: Improving Chain of Thought Training in Vision Language Models | Shaked Perek et.al. | 2603.18656 | translate | read | null |
| 2026-03-19 | DeePAW: A universal machine learning model for orbital-free ab initio calculations | Tianhao Su et.al. | 2603.18650 | translate | read | null |
| 2026-03-19 | A Comparative Empirical Study of Catastrophic Forgetting Mitigation in Sequential Task Adaptation for Continual Natural Language Processing Systems | Aram Abrahamyan et.al. | 2603.18641 | translate | read | null |
| 2026-03-19 | MOSAIC: Multi-Objective Slice-Aware Iterative Curation for Alignment | Yipu Dou et.al. | 2603.18637 | translate | read | null |
| 2026-03-19 | SwiftGS: Episodic Priors for Immediate Satellite Surface Recovery | Rong Fu et.al. | 2603.18634 | translate | read | null |
| 2026-03-19 | SQL-Commenter: Aligning Large Language Models for SQL Comment Generation with Direct Preference Optimization | Lei Yu et.al. | 2603.18606 | translate | read | null |
| 2026-03-19 | Radiation damping of the soliton internal mode in 1D quadratic Klein-Gordon equation | Piotr Bizoń et.al. | 2603.18605 | translate | read | null |
| 2026-03-19 | Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection | Yongwei Jiang et.al. | 2603.18541 | translate | read | null |
| 2026-03-19 | Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds | Andrew Choi et.al. | 2603.18532 | translate | read | null |
| 2026-03-19 | 3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model | Hyun-kyu Ko et.al. | 2603.18524 | translate | read | null |
| 2026-03-19 | Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models | Liwei Che et.al. | 2603.18523 | translate | read | null |
| 2026-03-19 | Foundations and Architectures of Artificial Intelligence for Motor Insurance | Teerapong Panboonyuen et.al. | 2603.18508 | translate | read | null |
| 2026-03-19 | Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning | Jooyoung Kim et.al. | 2603.18495 | translate | read | null |
| 2026-03-19 | Robust Near-Critical Dynamics in Heavy-Tailed Neural Networks | Ryota Kojima et.al. | 2603.18478 | translate | read | null |
| 2026-03-19 | Interpretable Prostate Cancer Detection using a Small Cohort of MRI Images | Vahid Monfared et.al. | 2603.18460 | translate | read | null |
| 2026-03-19 | SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning | Minjun Kim et.al. | 2603.18423 | translate | read | null |
| 2026-03-19 | Mind the Rarities: Can Rare Skin Diseases Be Reliably Diagnosed via Diagnostic Reasoning? | Yang Liu et.al. | 2603.18418 | translate | read | null |
| 2026-03-19 | Where are the Hidden Gems? Applying Transformer Models for Design Discussion Detection | Lawrence Arkoh et.al. | 2603.18393 | translate | read | null |
| 2026-03-19 | PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching | Ruishuo Chen et.al. | 2603.18363 | translate | read | null |
| 2026-03-18 | Synthetic Data Generation for Training Diversified Commonsense Reasoning Models | Tianhui Zhang et.al. | 2603.18361 | translate | read | null |
| 2026-03-18 | Learning to Reason with Curriculum I: Provable Benefits of Autocurriculum | Nived Rajaraman et.al. | 2603.18325 | translate | read | null |
| 2026-03-18 | Approximate Subgraph Matching with Neural Graph Representations and Reinforcement Learning | Kaiyang Li et.al. | 2603.18314 | translate | read | null |
| 2026-03-18 | CycleCap: Improving VLMs Captioning Performance via Self-Supervised Cycle Consistency Fine-Tuning | Marios Krestenitis et.al. | 2603.18282 | translate | read | null |
| 2026-03-18 | EDM-ARS: A Domain-Specific Multi-Agent System for Automated Educational Data Mining Research | Chenguang Pan et.al. | 2603.18273 | translate | read | null |
| 2026-03-18 | Retrieval-Augmented LLM Agents: Learning to Learn from Experience | Thomas Palmeira Ferraz et.al. | 2603.18272 | translate | read | null |
| 2026-03-18 | R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation | Naoki Morihira et.al. | 2603.18202 | translate | read | null |
| 2026-03-18 | TeachingCoach: A Fine-Tuned Scaffolding Chatbot for Instructional Guidance to Instructors | Isabel Molnar et.al. | 2603.18189 | translate | read | null |
| 2026-03-18 | VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events | Mohammad Qazim Bhat et.al. | 2603.18178 | translate | read | null |
| 2026-03-18 | Intellectual Stewardship: Re-adapting Human Minds for Creative Knowledge Work in the Age of AI | Jianwei Zhang et.al. | 2603.18117 | translate | read | null |
| 2026-03-18 | Transfer Learning for Contextual Joint Assortment-Pricing under Cross-Market Heterogeneity | Elynn Chen et.al. | 2603.18114 | translate | read | null |
| 2026-03-18 | VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models | Hefei Xu et.al. | 2603.18113 | translate | read | null |
| 2026-03-18 | Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters | Mohammed Rahman Sherif Khan Mohammad et.al. | 2603.18101 | translate | read | null |
| 2026-03-18 | Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner | Hao Ma et.al. | 2603.18088 | translate | read | null |
| 2026-03-18 | Probabilistic Federated Learning on Uncertain and Heterogeneous Data with Model Personalization | Ratun Rahman et.al. | 2603.18083 | translate | read | null |
| 2026-03-18 | Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augmentation and Robust Noise Reduction | Yi Yu et.al. | 2603.18074 | translate | read | null |
| 2026-03-18 | Continually self-improving AI | Zitong Yang et.al. | 2603.18073 | translate | read | null |
| 2026-03-18 | CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention | Zhongzhu Zhou et.al. | 2603.17946 | translate | read | null |
| 2026-03-18 | ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws | Xuyang Cao et.al. | 2603.17945 | translate | read | null |
| 2026-03-18 | RoboForge: Physically Optimized Text-guided Whole-Body Locomotion for Humanoids | Xichen Yuan et.al. | 2603.17927 | translate | read | null |
| 2026-03-18 | Training Diffusion Language Models for Black-Box Optimization | Zipeng Sun et.al. | 2603.17919 | translate | read | null |
| 2026-03-18 | Only relative ranks matter in weight-clustered large language models | Borja Aizpurua et.al. | 2603.17917 | translate | read | null |
| 2026-03-18 | RHYME-XT: A Neural Operator for Spatiotemporal Control Systems | Marijn Ruiter et.al. | 2603.17867 | translate | read | null |
| 2026-03-18 | M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking | Qiangqiang Wu et.al. | 2603.17813 | translate | read | null |
| 2026-03-18 | ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation | Dmitriy Rivkin et.al. | 2603.17812 | translate | read | null |
| 2026-03-18 | Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime | Haiyu Yang et.al. | 2603.17782 | translate | read | null |
| 2026-03-18 | Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs | Yuxin Liu et.al. | 2603.17761 | translate | read | null |
| 2026-03-18 | Bosonic quantum mixtures with competing interactions: quantum liquid droplets and supersolids | Sarah Hirthe et.al. | 2603.17745 | translate | read | null |
| 2026-03-18 | From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving | A. Humnabadkar et.al. | 2603.17714 | translate | read | null |
| 2026-03-18 | Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation | Haocheng Li et.al. | 2603.17705 | translate | read | null |
| 2026-03-18 | DeepCORO-CLIP: A Multi-View Foundation Model for Comprehensive Coronary Angiography Video-Text Analysis and External Validation | Sarra Harrabi et.al. | 2603.17675 | translate | read | null |
| 2026-03-18 | Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards | Philipp Normann et.al. | 2603.17673 | translate | read | null |
| 2026-03-18 | FINER: MLLMs Hallucinate under Fine-grained Negative Queries | Rui Xiao et.al. | 2603.17662 | translate | read | null |
| 2026-03-18 | Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment | Yaze Zhao et.al. | 2603.17655 | translate | read | null |
| 2026-03-18 | From Isolated Scoring to Collaborative Ranking: A Comparison-Native Framework for LLM-Based Paper Evaluation | Pujun Zheng et.al. | 2603.17588 | translate | read | null |
| 2026-03-18 | LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation | Mohammad Robaitul Islam Bhuiyan et.al. | 2603.17576 | translate | read | null |
| 2026-03-18 | KA2L: A Knowledge-Aware Active Learning Framework for LLMs | Haoxuan Yin et.al. | 2603.17566 | translate | read | null |
| 2026-03-18 | In Trust We Survive: Emergent Trust Learning | Qianpu Chen et.al. | 2603.17564 | translate | read | null |
| 2026-03-18 | Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition | Yuxiang Mei et.al. | 2603.17558 | translate | read | null |
| 2026-03-18 | Prompt-Free Universal Region Proposal Network | Qihong Tang et.al. | 2603.17554 | translate | read | null |
| 2026-03-18 | Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models | Linghao Zhang et.al. | 2603.17541 | translate | read | null |
| 2026-03-18 | Anisotropic Permeability Tensor Prediction from Porous Media Microstructure via Physics-Informed Progressive Transfer Learning with Hybrid CNN-Transformer | Mohammad Nooraiepour et.al. | 2603.17532 | translate | read | null |
| 2026-03-18 | Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions | Madhav S. Baidya et.al. | 2603.17522 | translate | read | null |
| 2026-03-18 | EI: Early Intervention for Multimodal Imaging based Disease Recognition | Qijie Wei et.al. | 2603.17514 | translate | read | null |
| 2026-03-18 | QuantFL: Sustainable Federated Learning for Edge IoT via Pre-Trained Model Quantisation | Charuka Herath et.al. | 2603.17507 | translate | read | null |
| 2026-03-18 | Inducing Epistemological Humility in Large Language Models: A Targeted SFT Approach to Reducing Hallucination | Cem Uluoglakci et.al. | 2603.17504 | translate | read | null |
| 2026-03-18 | Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning | Zelin Zang et.al. | 2603.17474 | translate | read | null |
| 2026-03-18 | VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation | Junyoung Kim et.al. | 2603.17450 | translate | read | null |
| 2026-03-18 | SHIFT: Motion Alignment in Video Diffusion Models with Adversarial Hybrid Fine-Tuning | Xi Ye et.al. | 2603.17426 | translate | read | null |
| 2026-03-18 | Mutually Causal Semantic Distillation Network for Zero-Shot Learning | Shiming Chen et.al. | 2603.17412 | translate | read | null |
| 2026-03-18 | Harnessing the Power of Foundation Models for Accurate Material Classification | Qingran Lin et.al. | 2603.17390 | translate | read | null |
| 2026-03-18 | Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures | Risham Sidhu et.al. | 2603.17333 | translate | read | null |
| 2026-03-18 | Ruyi2.5 Technical Report | Huan Song et.al. | 2603.17311 | translate | read | null |
| 2026-03-18 | Variational Rectification Inference for Learning with Noisy Labels | Haoliang Sun et.al. | 2603.17255 | translate | read | null |
| 2026-03-18 | ListK: Semantic ORDER BY and LIMIT K with Listwise Prompting | Jason Shin et.al. | 2603.17223 | translate | read | null |
| 2026-03-17 | SA-CycleGAN-2.5D: Self-Attention CycleGAN with Tri-Planar Context for Multi-Site MRI Harmonization | Ishrith Gowda et.al. | 2603.17219 | translate | read | null |
| 2026-03-17 | Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text | Federico Albanese et.al. | 2603.17217 | translate | read | null |
| 2026-03-17 | Integration of local and global surrogates for failure probability estimation | Audrey Gaymann et.al. | 2603.17211 | translate | read | null |
| 2026-03-17 | A scalable neural bundle map for multiphysics prediction in lithium-ion battery across varying configurations | Zhiwei Zhao et.al. | 2603.17209 | translate | read | null |
| 2026-03-17 | SYMDIREC: A Neuro-Symbolic Divide-Retrieve-Conquer Framework for Enhanced RTL Synthesis and Summarization | Prashanth Vijayaraghavan et.al. | 2603.17208 | translate | read | null |
| 2026-03-17 | Tabular LLMs for Interpretable Few-Shot Alzheimer’s Disease Prediction with Multimodal Biomedical Data | Sophie Kearney et.al. | 2603.17191 | translate | read | null |
| 2026-03-17 | MetaClaw: Just Talk – An Agent That Meta-Learns and Evolves in the Wild | Peng Xia et.al. | 2603.17187 | translate | read | null |
| 2026-03-17 | How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment | Rebecca Ansell et.al. | 2603.17169 | translate | read | null |
| 2026-03-17 | Personalized Fall Detection by Balancing Data with Selective Feedback Using Contrastive Learning | Awatif Yasmin et.al. | 2603.17148 | translate | read | null |
| 2026-03-17 | REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge | Yasi Zhang et.al. | 2603.17145 | translate | read | null |
| 2026-03-17 | SENSE: Efficient EEG-to-Text via Privacy-Preserving Semantic Retrieval | Akshaj Murhekar et.al. | 2603.17109 | translate | read | null |
| 2026-03-17 | LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience | Nafis Fuad et.al. | 2603.17108 | translate | read | null |
| 2026-03-17 | Evaluating LLM-Simulated Conversations in Modeling Inconsistent and Uncollaborative Behaviors in Human Social Interaction | Ryo Kamoi et.al. | 2603.17094 | translate | read | null |
| 2026-03-17 | SLowRL: Safe Low-Rank Adaptation Reinforcement Learning for Locomotion | Elham Daneshmand et.al. | 2603.17092 | translate | read | null |
| 2026-03-17 | ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models | M. Arda Aydın et.al. | 2603.17079 | translate | read | null |
| 2026-03-17 | TeleDex: Accessible Dexterous Teleoperation | Omar Rayyan et.al. | 2603.17065 | translate | read | null |
| 2026-03-17 | Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models | Songchun Zhang et.al. | 2603.17051 | translate | read | null |
| 2026-03-17 | Rewarding DINO: Predicting Dense Rewards with Vision Foundation Models | Pierre Krack et.al. | 2603.16978 | translate | read | null |
| 2026-03-17 | Hybrid Classical-Quantum Transfer Learning with Noisy Quantum Circuits | D. Martín-Pérez et.al. | 2603.16973 | translate | read | null |
| 2026-03-17 | Efficient Reasoning on the Edge | Yelysei Bondarenko et.al. | 2603.16867 | translate | read | null |
| 2026-03-17 | MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation | Abhay Deshpande et.al. | 2603.16861 | translate | read | null |
| 2026-03-17 | DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models | Emily Yue-Ting Jia et.al. | 2603.16860 | translate | read | null |
| 2026-03-17 | Dynamic Meta-Layer Aggregation for Byzantine-Robust Federated Learning | Reek Das et.al. | 2603.16846 | translate | read | null |
| 2026-03-17 | Internalizing Agency from Reflective Experience | Rui Ge et.al. | 2603.16843 | translate | read | null |
| 2026-03-17 | Learning to Present: Inverse Specification Rewards for Agentic Slide Generation | Karthik Ragunath Ananda Kumar et.al. | 2603.16839 | translate | read | null |
| 2026-03-17 | Anticipatory Planning for Multimodal AI Agents | Yongyuan Liang et.al. | 2603.16777 | translate | read | null |
| 2026-03-17 | Probing Cultural Signals in Large Language Models through Author Profiling | Valentin Lafargue et.al. | 2603.16749 | translate | read | null |
| 2026-03-17 | MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning | Min Zeng et.al. | 2603.16738 | translate | read | null |
| 2026-03-17 | Confusion-Aware Spectral Regularizer for Long-Tailed Recognition | Ziquan Zhu et.al. | 2603.16732 | translate | read | null |
| 2026-03-17 | Search2Motion: Training-Free Object-Level Motion Control via Attention-Consensus Search | Sainan Liu et.al. | 2603.16711 | translate | read | null |
| 2026-03-17 | Self-Aware Markov Models for Discrete Reasoning | Gregor Kornhardt et.al. | 2603.16661 | translate | read | null |
| 2026-03-17 | Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings? | Aishwarya Ramasethu et.al. | 2603.16660 | translate | read | null |
| 2026-03-17 | Machines acquire scientific taste from institutional traces | Ziqin Gong et.al. | 2603.16659 | translate | read | null |
| 2026-03-17 | Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models | Xiaojie Gu et.al. | 2603.16654 | translate | read | null |
| 2026-03-17 | FlowComposer: Composable Flows for Compositional Zero-Shot Learning | Zhenqi He et.al. | 2603.16641 | translate | read | null |
| 2026-03-17 | Ligand-Controlled Phonon Dynamics in CsPbBr3 Nanocrystals Revealed by Machine-Learned Interatomic Potentials | Seungjun Cha et.al. | 2603.16631 | translate | read | null |
| 2026-03-17 | Deep Tabular Representation Corrector | Hangting Ye et.al. | 2603.16569 | translate | read | null |
| 2026-03-17 | UrbanFlow-3K: A Dataset of 3,000 Lattice-Boltzmann Simulations of Random Building Layouts | Hojin Lee et.al. | 2603.16554 | translate | read | null |
| 2026-03-17 | CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation | Mahmoud Ibrahim et.al. | 2603.16551 | translate | read | null |
| 2026-03-17 | SAMSEM – A Generic and Scalable Approach for IC Metal Line Segmentation | Christian Gehrmann et.al. | 2603.16548 | translate | read | null |
| 2026-03-17 | Exploring different approaches to customize language models for domain-specific text-to-code generation | Luís Freire et.al. | 2603.16526 | translate | read | null |
| 2026-03-17 | Bridging the High-Frequency Data Gap: A Millisecond-Resolution Network Dataset for Advancing Time Series Foundation Models | Subina Khanal et.al. | 2603.16497 | translate | read | null |
| 2026-03-17 | GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models | Jiaxin Zhang et.al. | 2603.16461 | translate | read | null |
| 2026-03-17 | Evo-Retriever: LLM-Guided Curriculum Evolution with Viewpoint-Pathway Collaboration for Multimodal Document Retrieval | Weiqing Li et.al. | 2603.16455 | translate | read | null |
| 2026-03-17 | MFTune: An Efficient Multi-fidelity Framework for Spark SQL Configuration Tuning | Beicheng Xu et.al. | 2603.16450 | translate | read | null |
| 2026-03-17 | An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU | Ruijia Yang et.al. | 2603.16428 | translate | read | null |
| 2026-03-17 | IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time | Zhenghua Bao et.al. | 2603.16415 | translate | read | null |
| 2026-03-17 | DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification | Stathis Galanakis et.al. | 2603.16392 | translate | read | null |
| 2026-03-17 | InViC: Intent-aware Visual Cues for Medical Visual Question Answering | Zhisong Wang et.al. | 2603.16372 | translate | read | null |
| 2026-03-17 | Toward Experimentation-as-a-Service in 5G/6G: The Plaza6G Prototype for AI-Assisted Trials | Sergio Barrachina-Muñoz et.al. | 2603.16356 | translate | read | null |
| 2026-03-17 | PashtoCorp: A 1.25-Billion-Word Corpus, Evaluation Suite, and Reproducible Pipeline for Low-Resource Language Development | Hanif Rahman et.al. | 2603.16354 | translate | read | null |
| 2026-03-17 | Tuning Cu/Diamond Interfacial Thermal Conductance via Nitrogen-Termination Engineering | Guang Yang et.al. | 2603.16347 | translate | read | null |
| 2026-03-17 | SpikeCLR: Contrastive Self-Supervised Learning for Few-Shot Event-Based Vision using Spiking Neural Networks | Maxime Vaillant et.al. | 2603.16338 | translate | read | null |
| 2026-03-17 | Attention-guided Evidence Grounding for Spoken Question Answering | Ke Yang et.al. | 2603.16292 | translate | read | null |
| 2026-03-17 | Laya: A LeJEPA Approach to EEG via Latent Prediction over Reconstruction | Saarang Panchavati et.al. | 2603.16281 | translate | read | null |
| 2026-03-17 | Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus | Martina Simonotti et.al. | 2603.16258 | translate | read | null |
| 2026-03-17 | Enabling Dynamic Tracking in Vision-Language-Action Models via Time-Discrete and Time-Continuous Velocity Feedforward | Johannes Hechtl et.al. | 2603.16218 | translate | read | null |
| 2026-03-17 | Generative AI for Quantum Circuits and Quantum Code: A Technical Review and Taxonomy | Juhani Merilehto et.al. | 2603.16216 | translate | read | null |
| 2026-03-17 | Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning | Yongyu Mu et.al. | 2603.16206 | translate | read | null |
| 2026-03-17 | ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control | Haozhe Jia et.al. | 2603.16188 | translate | read | null |
| 2026-03-17 | Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift | Camille Jimenez Cortes et.al. | 2603.16185 | translate | read | null |
| 2026-03-17 | Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR | Quy-Anh Dang et.al. | 2603.16184 | translate | read | null |
| 2026-03-17 | The Finetuner’s Fallacy: When to Pretrain with Your Finetuning Data | Christina Baek et.al. | 2603.16177 | translate | read | null |
| 2026-03-17 | HIPO: Instruction Hierarchy via Constrained Reinforcement Learning | Keru Chen et.al. | 2603.16152 | translate | read | null |
| 2026-03-17 | Communication-Aware Multi-Agent Reinforcement Learning for Decentralized Cooperative UAV Deployment | Enguang Fan et.al. | 2603.16141 | translate | read | null |
| 2026-03-17 | Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training | Peng Sun et.al. | 2603.16139 | translate | read | null |
| 2026-03-12 | HumDex:Humanoid Dexterous Manipulation Made Easy | Liang Heng et.al. | 2603.12260 | translate | read | null |
| 2026-03-12 | SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning | Ziyu Chen et.al. | 2603.12249 | translate | read | null |
| 2026-03-12 | Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models | Samy Jelassi et.al. | 2603.12248 | translate | read | null |
| 2026-03-12 | Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights | Yulu Gan et.al. | 2603.12228 | translate | read | null |
| 2026-03-12 | Interpreting Contrastive Embeddings in Specific Domains with Fuzzy Rules | Javier Fumanal-Idocin et.al. | 2603.12227 | translate | read | null |
| 2026-03-12 | UniMotion: Self-Supervised Learning for Cross-Domain IMU Motion Recognition | Prerna Khanna et.al. | 2603.12218 | translate | read | null |
| 2026-03-12 | Real-World Point Tracking with Verifier-Guided Pseudo-Labeling | Görkay Aydemir et.al. | 2603.12217 | translate | read | null |
| 2026-03-12 | BehaviorVLM: Unified Finetuning-Free Behavioral Understanding with Vision-Language Reasoning | Jingyang Ke et.al. | 2603.12176 | translate | read | null |
| 2026-03-12 | To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times | Thomas Hikaru Clark et.al. | 2603.12105 | translate | read | null |
| 2026-03-12 | Resource-Efficient Iterative LLM-Based NAS with Feedback Memory | Xiaojie Gu et.al. | 2603.12091 | translate | read | null |
| 2026-03-12 | EmbTracker: Traceable Black-box Watermarking for Federated Language Models | Haodong Zhao et.al. | 2603.12089 | translate | read | null |
| 2026-03-12 | Just Use XML: Revisiting Joint Translation and Label Projection | Thennal D K et.al. | 2603.12021 | translate | read | null |
| 2026-03-12 | Can RL Improve Generalization of LLM Agents? An Empirical Study | Zhiheng Xi et.al. | 2603.12011 | translate | read | null |
| 2026-03-12 | BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs | Ilias Aarab et.al. | 2603.11991 | translate | read | null |
| 2026-03-12 | Ada3Drift: Adaptive Training-Time Drifting for One-Step 3D Visuomotor Robotic Manipulation | Chongyang Xu et.al. | 2603.11984 | translate | read | null |
| 2026-03-12 | PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents | Minjia Wang et.al. | 2603.11955 | translate | read | null |
| 2026-03-12 | Resurfacing Paralinguistic Awareness in Large Audio Language Models | Hao Yang et.al. | 2603.11947 | translate | read | null |
| 2026-03-12 | Prototype-Based Knowledge Guidance for Fine-Grained Structured Radiology Reporting | Chantal Pellegrini et.al. | 2603.11938 | translate | read | null |
| 2026-03-12 | MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices? | Xingze Zou et.al. | 2603.11935 | translate | read | null |
| 2026-03-12 | Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language | Remigiusz Kinas et.al. | 2603.11881 | translate | read | null |
| 2026-03-12 | DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining | Yutong Yan et.al. | 2603.11838 | translate | read | null |
| 2026-03-12 | Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding | Jiahao Li et.al. | 2603.11831 | translate | read | null |
| 2026-03-12 | RADAR: Closed-Loop Robotic Data Generation via Semantic Planning and Autonomous Causal Environment Reset | Yongzhong Wang et.al. | 2603.11811 | translate | read | null |
| 2026-03-12 | OSM-based Domain Adaptation for Remote Sensing VLMs | Stefan Maria Ailuro et.al. | 2603.11804 | translate | read | null |
| 2026-03-12 | Large Language Models for Biomedical Article Classification | Jakub Proboszcz et.al. | 2603.11780 | translate | read | null |
| 2026-03-12 | Software-Hardware Binding for Protection of Sensitive Data in Embedded Software | Bernhard Fischer et.al. | 2603.11727 | translate | read | null |
| 2026-03-12 | A technology-oriented mapping of the language and translation industry: Analysing stakeholder values and their potential implication for translation pedagogy | María Isabel Rivas Ginel et.al. | 2603.11667 | translate | read | null |
| 2026-03-12 | Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning | Jiaheng Hu et.al. | 2603.11653 | translate | read | null |
| 2026-03-12 | VisDoT : Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought | Eunsoo Lee et.al. | 2603.11631 | translate | read | null |
| 2026-03-12 | Noise-aware few-shot learning through bi-directional multi-view prompt alignment | Lu Niu et.al. | 2603.11617 | translate | read | null |
| 2026-03-12 | AutoScout: Structured Optimization for Automating ML System Configuration | Jimmy Shong et.al. | 2603.11603 | translate | read | null |
| 2026-03-12 | WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing | Hui Zhang et.al. | 2603.11593 | translate | read | null |
| 2026-03-12 | Streaming Translation and Transcription Through Speech-to-Text Causal Alignment | Roman Koshkin et.al. | 2603.11578 | translate | read | null |
| 2026-03-12 | MDS-VQA: Model-Informed Data Selection for Video Quality Assessment | Jian Zou et.al. | 2603.11525 | translate | read | null |
| 2026-03-12 | Meta-generalized gradient approximation made in the Hartree gauge | Yan Oueis et.al. | 2603.11517 | translate | read | null |
| 2026-03-12 | Grammar of the Wave: Towards Explainable Multivariate Time Series Event Detection via Neuro-Symbolic VLM Agents | Sky Chenwei Wan et.al. | 2603.11479 | translate | read | null |
| 2026-03-12 | Deep Learning Network-Temporal Models For Traffic Prediction | Yufeng Xin et.al. | 2603.11475 | translate | read | null |
| 2026-03-12 | Prediction-Oriented Transfer Learning for Survival Analysis | Yu Gu et.al. | 2603.11465 | translate | read | null |
| 2026-03-12 | Enhancing Lightweight Vision Language Models through Group Competitive Learning for Socially Compliant Navigation | Xinyu Zhang et.al. | 2603.11447 | translate | read | null |
| 2026-03-12 | ZTab: Domain-based Zero-shot Annotation for Table Columns | Ehsan Hoseinzade et.al. | 2603.11436 | translate | read | null |
| 2026-03-12 | BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion | Varun Iyer et.al. | 2603.11415 | translate | read | null |
| 2026-03-12 | Speak or Stay Silent: Context-Aware Turn-Taking in Multi-Party Dialogue | Kratika Bhagtani et.al. | 2603.11409 | translate | read | null |
| 2026-03-12 | Reproducible Synthetic Clinical Letters for Seizure Frequency Information Extraction | Yujian Gan et.al. | 2603.11407 | translate | read | null |
| 2026-03-12 | BEACON: Budget-Aware Entity Matching Across Domains (Extended Technical Report) | Nicholas Pulsone et.al. | 2603.11391 | translate | read | null |
| 2026-03-12 | Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment | Zhiyu Xue et.al. | 2603.11388 | translate | read | null |
| 2026-03-11 | CP violation in two meson tau decays | Daniel A. López Aguilar et.al. | 2603.11348 | translate | read | null |
| 2026-03-11 | Meta-Reinforcement Learning with Self-Reflection for Agentic Search | Teng Xiao et.al. | 2603.11327 | translate | read | null |
| 2026-03-11 | Temporal Text Classification with Large Language Models | Nishat Raihan et.al. | 2603.11295 | translate | read | null |
| 2026-03-11 | Vector Higgs-Portal Dark Matter: How UV Completion Reopens Viable Parameter Space | Halim Shaikh et.al. | 2603.11233 | translate | read | null |
| 2026-03-11 | ExecVerify: White-Box RL with Verifiable Stepwise Rewards for Code Execution Reasoning | Lingxiao Tang et.al. | 2603.11226 | translate | read | null |
| 2026-03-11 | A Simple Efficiency Incremental Learning Framework via Vision-Language Model with Nonlinear Multi-Adapters | Haihua Luo et.al. | 2603.11211 | translate | read | null |
| 2026-03-11 | Representation Finetuning for Continual Learning | Haihua Luo et.al. | 2603.11201 | translate | read | null |
| 2026-03-11 | DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning | Hanxu Hu et.al. | 2603.11193 | translate | read | null |
| 2026-03-11 | Enhancing Value Alignment of LLMs with Multi-agent system and Combinatorial Fusion | Yuanhong Wu et.al. | 2603.11126 | translate | read | null |
| 2026-03-11 | Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information | Ben Halstead et.al. | 2603.11094 | translate | read | null |
| 2026-03-09 | Matlantis-PFP v8: Universal Machine Learning Interatomic Potential with Better Experimental Agreements via r2SCAN Functional | Chikashi Shinagawa et.al. | 2603.11063 | translate | read | null |
| 2026-03-11 | V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation | Yan-Bo Lin et.al. | 2603.11042 | translate | read | null |
| 2026-03-11 | Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons | Theo Schwider et.al. | 2603.11000 | translate | read | null |
| 2026-03-11 | TreeON: Reconstructing 3D Tree Point Clouds from Orthophotos and Heightmaps | Angeliki Grammatikaki et.al. | 2603.10996 | translate | read | null |
| 2026-03-11 | Med-DualLoRA: Local Adaptation of Foundation Models for 3D Cardiac MRI | Joan Perramon-Llussà et.al. | 2603.10967 | translate | read | null |
| 2026-03-11 | Training-Free Multi-Step Inference for Target Speaker Extraction | Zhenghai You et.al. | 2603.10921 | translate | read | null |
| 2026-03-11 | When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS | Anupam Purwar et.al. | 2603.10904 | translate | read | null |
| 2026-03-11 | From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers | Ayan Sengupta et.al. | 2603.10877 | translate | read | null |
| 2026-03-11 | Bilevel Layer-Positioning LoRA for Real Image Dehazing | Yan Zhang et.al. | 2603.10872 | translate | read | null |
| 2026-03-11 | UltrasoundAgents: Hierarchical Multi-Agent Evidence-Chain Reasoning for Breast Ultrasound Diagnosis | Yali Zhu et.al. | 2603.10852 | translate | read | null |
| 2026-03-11 | Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis | Yujie Zheng et.al. | 2603.10846 | translate | read | null |
| 2026-03-11 | Evaluating Few-Shot Pill Recognition Under Visual Domain Shift | W. I. Chu et.al. | 2603.10833 | translate | read | null |
| 2026-03-11 | ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning | Xiaofeng Lin et.al. | 2603.10823 | translate | read | null |
| 2026-03-11 | The Quadratic Geometry of Flow Matching: Semantic Granularity Alignment for Text-to-Image Synthesis | Zhinan Xiong et.al. | 2603.10785 | translate | read | null |
| 2026-03-11 | MAVEN: A Meta-Reinforcement Learning Framework for Varying-Dynamics Expertise in Agile Quadrotor Maneuvers | Jin Zhou et.al. | 2603.10714 | translate | read | null |
| 2026-03-11 | RandMark: On Random Watermarking of Visual Foundation Models | Anna Chistyakova et.al. | 2603.10695 | translate | read | null |
| 2026-03-11 | HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement | Stefanos Pasios et.al. | 2603.10604 | translate | read | null |
| 2026-03-11 | Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents | Yuanhao Li et.al. | 2603.10564 | translate | read | null |
| 2026-03-11 | PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation | Yuchen Liu et.al. | 2603.10560 | translate | read | null |
| 2026-03-11 | In-Memory ADC-Based Nonlinear Activation Quantization for Efficient In-Memory Computing | Shuai Dong et.al. | 2603.10540 | translate | read | null |
| 2026-03-11 | DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime | Julian Lorenz et.al. | 2603.10538 | translate | read | null |
| 2026-03-11 | IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs | Chuan Guo et.al. | 2603.10521 | translate | read | null |
| 2026-03-11 | Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement | Xin Huang et.al. | 2603.10519 | translate | read | null |
| 2026-03-11 | COHORT: Hybrid RL for Collaborative Large DNN Inference on Multi-Robot Systems Under Real-Time Constraints | Mohammad Saeid Anwar et.al. | 2603.10436 | translate | read | null |
| 2026-03-11 | Domain-Adaptive Health Indicator Learning with Degradation-Stage Synchronized Sampling and Cross-Domain Autoencoder | Jungho Choo et.al. | 2603.10430 | translate | read | null |
| 2026-03-11 | GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning | Ruiheng Liu et.al. | 2603.10370 | translate | read | null |
| 2026-03-11 | Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck | Hongbin Zhang et.al. | 2603.10351 | translate | read | null |
| 2026-03-11 | Multi-Modal Intelligent Channel Modeling: From Fine-tuned LLMs to Pre-trained Foundation Models | Lu Bai et.al. | 2603.10343 | translate | read | null |
| 2026-03-11 | Regime-aware financial volatility forecasting via in-context learning | Saba Asaad et.al. | 2603.10299 | translate | read | null |
| 2026-03-11 | GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification | Mayur Choudhary et.al. | 2603.10298 | translate | read | null |
| 2026-03-10 | GR-SAP: Generative Replay for Safety Alignment Preservation during Fine-Tuning | Zhouxiang Fang et.al. | 2603.10243 | translate | read | null |
| 2026-03-10 | Why Does It Look There? Structured Explanations for Image Classification | Jiarui Li et.al. | 2603.10234 | translate | read | null |
| 2026-03-10 | Sabiá-4 Technical Report | Thiago Laitz et.al. | 2603.10213 | translate | read | null |
| 2026-03-10 | Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models | Eric Yocam et.al. | 2603.10195 | translate | read | null |
| 2026-03-10 | Video-Based Reward Modeling for Computer-Use Agents | Linxin Song et.al. | 2603.10178 | translate | read | null |
| 2026-03-10 | ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning | Ruizhong Qiu et.al. | 2603.10160 | translate | read | null |
| 2026-03-10 | Bias in Universal Machine-Learned Interatomic Potentials and its Effects on Fine-Tuning | Nicolas Wong et.al. | 2603.10159 | translate | read | null |
| 2026-03-10 | A Survey of Weight Space Learning: Understanding, Representation, and Generation | Xiaolong Han et.al. | 2603.10090 | translate | read | null |
| 2026-03-10 | Amnesia: Adversarial Semantic Layer Specific Activation Steering in Large Language Models | Ali Raza et.al. | 2603.10080 | translate | read | null |
| 2026-03-10 | ADVERSA: Measuring Multi-Turn Guardrail Degradation and Judge Reliability in Large Language Models | Harry Owiredu-Ashley et.al. | 2603.10068 | translate | read | null |
| 2026-03-06 | Evaluating Generalization Mechanisms in Autonomous Cyber Attack Agents | Ondřej Lukáš et.al. | 2603.10041 | translate | read | null |
| 2026-03-10 | TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation | William Shen et.al. | 2603.09971 | translate | read | null |
| 2026-03-10 | ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare | Freeman Cheng et.al. | 2603.09968 | translate | read | null |
| 2026-03-10 | Towards a Neural Debugger for Python | Maximilian Beck et.al. | 2603.09951 | translate | read | null |
| 2026-03-10 | Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions | Mingyang Song et.al. | 2603.09938 | translate | read | null |
| 2026-03-10 | Unsupervised Domain Adaptation with Target-Only Margin Disparity Discrepancy | Gauthier Miralles et.al. | 2603.09932 | translate | read | null |
| 2026-03-10 | Three phases of odd robotic active matter | Fan Bo et.al. | 2603.09897 | translate | read | null |
| 2026-03-10 | Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports | Yuchen Yang et.al. | 2603.09896 | translate | read | null |
| 2026-03-10 | MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning | Yiyang Lu et.al. | 2603.09892 | translate | read | null |
| 2026-03-10 | LCA: Local Classifier Alignment for Continual Learning | Tung Tran et.al. | 2603.09888 | translate | read | null |
| 2026-03-10 | CarbonBench: A Global Benchmark for Upscaling of Carbon Fluxes Using Zero-Shot Learning | Aleksei Rozanov et.al. | 2603.09868 | translate | read | null |
| 2026-03-10 | GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection | Kai Yao et.al. | 2603.09865 | translate | read | null |
| 2026-03-10 | RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation | Haobo Zhang et.al. | 2603.09843 | translate | read | null |
| 2026-03-10 | EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting | Maria Kunilovskaya et.al. | 2603.09785 | translate | read | null |
| 2026-03-10 | Beyond Fine-Tuning: Robust Food Entity Linking under Ontology Drift with FoodOntoRAG | Jan Drole et.al. | 2603.09758 | translate | read | null |
| 2026-03-10 | Let’s Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments | Haoyuan Li et.al. | 2603.09740 | translate | read | null |
| 2026-03-10 | RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation | Sihong Wu et.al. | 2603.09723 | translate | read | null |
| 2026-03-10 | Finetuning a Text-to-Audio Model for Room Impulse Response Generation | Kirak Kim et.al. | 2603.09708 | translate | read | null |
| 2026-03-10 | TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering | Luca Carlini et.al. | 2603.09696 | translate | read | null |
| 2026-03-10 | ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning | Davit Melikidze et.al. | 2603.09692 | translate | read | null |
| 2026-03-10 | ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling | Dechuan Teng et.al. | 2603.09691 | translate | read | null |
| 2026-03-10 | On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning | Muhammad Ahmad et.al. | 2603.09684 | translate | read | null |
| 2026-03-10 | EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages | Aman Sharma et.al. | 2603.09678 | translate | read | null |
| 2026-03-10 | Learning the Hierarchical Organization in Brain Network for Brain Disorder Diagnosis | Jingfeng Tang et.al. | 2603.09606 | translate | read | null |
| 2026-03-10 | Build, Borrow, or Just Fine-Tune? A Political Scientist’s Guide to Choosing NLP Models | Shreyas Meher et.al. | 2603.09595 | translate | read | null |
| 2026-03-10 | Routing without Forgetting | Alessio Masano et.al. | 2603.09576 | translate | read | null |
| 2026-03-10 | GeoAlignCLIP: Enhancing Fine-Grained Vision-Language Alignment in Remote Sensing via Multi-Granular Consistency Learning | Xiao Yang et.al. | 2603.09566 | translate | read | null |
| 2026-03-10 | RESBev: Making BEV Perception More Robust | Lifeng Zhuo et.al. | 2603.09529 | translate | read | null |
| 2026-03-10 | Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation | Luxi Lin et.al. | 2603.09527 | translate | read | null |
| 2026-03-10 | TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge | Run Wang et.al. | 2603.09511 | translate | read | null |
| 2026-03-10 | Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation | Won Shik Jang et.al. | 2603.09506 | translate | read | null |
| 2026-03-10 | Evolving Prompt Adaptation for Vision-Language Models | Enming Zhang et.al. | 2603.09493 | translate | read | null |
| 2026-03-10 | MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning | Xiang Yuan et.al. | 2603.09478 | translate | read | null |
| 2026-03-10 | An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse | Yuan Cao et.al. | 2603.09463 | translate | read | null |
| 2026-03-10 | MetaDAT: Generalizable Trajectory Prediction via Meta Pre-training and Data-Adaptive Test-Time Updating | Yuning Wang et.al. | 2603.09419 | translate | read | null |
| 2026-03-10 | SPAARS: Safer RL Policy Alignment through Abstract Exploration and Refined Exploitation of Action Space | Swaminathan S K et.al. | 2603.09378 | translate | read | null |
| 2026-03-10 | MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification | Nikola Jovišić et.al. | 2603.09374 | translate | read | null |
| 2026-03-10 | TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection | Xiong Zhang et.al. | 2603.09349 | translate | read | null |
| 2026-03-10 | IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework | Feiyu Wang et.al. | 2603.09312 | translate | read | null |
| 2026-03-10 | Paralinguistic Emotion-Aware Validation Timing Detection in Japanese Empathetic Spoken Dialogue | Zi Haur Pang et.al. | 2603.09307 | translate | read | null |
| 2026-03-10 | CORAL: Scalable Multi-Task Robot Learning via LoRA Experts | Yuankai Luo et.al. | 2603.09298 | translate | read | null |
| 2026-03-10 | ForgeDreamer: Industrial Text-to-3D Generation with Multi-Expert LoRA and Cross-View Hypergraph | Junhao Cai et.al. | 2603.09266 | translate | read | null |
| 2026-03-10 | Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning | Kanishkha Jaisankar et.al. | 2603.09255 | translate | read | null |
| 2026-03-10 | Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness | Ding Linghu et.al. | 2603.09231 | translate | read | null |
| 2026-03-10 | Non-Hermitian-induced higher-order topological phases in acoustic fractal lattices | Shuanghuizhi Li et.al. | 2603.09186 | translate | read | null |
| 2026-03-10 | DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval | Taegyeong Lee et.al. | 2603.09185 | translate | read | null |
| 2026-03-10 | ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video | Haoran Yang et.al. | 2603.09170 | translate | read | null |
| 2026-03-10 | Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety | Trent R Northen et.al. | 2603.09154 | translate | read | null |
| 2026-03-10 | RTFDNet: Fusion-Decoupling for Robust RGB-T Segmentation | Kunyu Tan et.al. | 2603.09149 | translate | read | null |
| 2026-03-10 | DexHiL: A Human-in-the-Loop Framework for Vision-Language-Action Model Post-Training in Dexterous Manipulation | Yifan Han et.al. | 2603.09121 | translate | read | null |
| 2026-03-10 | Probabilistic Hysteresis Factor Prediction for Electric Vehicle Batteries with Graphite Anodes Containing Silicon | Runyao Yu et.al. | 2603.09103 | translate | read | null |
| 2026-03-10 | OmniEdit: A Training-free framework for Lip Synchronization and Audio-Visual Editing | Lixiang Lin et.al. | 2603.09084 | translate | read | null |
| 2026-03-10 | Learning Adaptive LLM Decoding | Chloe H. Su et.al. | 2603.09065 | translate | read | null |
| 2026-03-09 | The Coupling Within: Flow Matching via Distilled Normalizing Flows | David Berthelot et.al. | 2603.09014 | translate | read | null |
| 2026-03-09 | Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning | Juming Xiong et.al. | 2603.08999 | translate | read | null |
| 2026-03-09 | A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations | Joshua Castillo et.al. | 2603.08954 | translate | read | null |
| 2026-03-09 | BiCLIP: Domain Canonicalization via Structured Geometric Transformation | Pranav Mantini et.al. | 2603.08942 | translate | read | null |
| 2026-03-09 | Quantifying Memorization and Privacy Risks in Genomic Language Models | Alexander Nemecek et.al. | 2603.08913 | translate | read | null |
| 2026-03-09 | APPLV: Adaptive Planner Parameter Learning from Vision-Language-Action Model | Yuanjie Lu et.al. | 2603.08862 | translate | read | null |
| 2026-03-09 | A Lightweight Multi-Cancer Tumor Localization Framework for Deployable Digital Pathology | Brian Isett et.al. | 2603.08844 | translate | read | null |
| 2026-03-09 | Fish Audio S2 Technical Report | Shijia Liao et.al. | 2603.08823 | translate | read | null |
| 2026-03-09 | HMR-1: Hierarchical Massage Robot with Vision-Language-Model for Embodied Healthcare | Rongtao Xu et.al. | 2603.08817 | translate | read | null |
| 2026-03-09 | Multi-level meta-reinforcement learning with skill-based curriculum | Sichen Yang et.al. | 2603.08773 | translate | read | null |
| 2026-03-09 | Structural Causal Bottleneck Models | Simon Bing et.al. | 2603.08682 | translate | read | null |
| 2026-03-09 | Group Entropies and Mirror Duality: A Class of Flexible Mirror Descent Updates for Machine Learning | Andrzej Cichocki et.al. | 2603.08651 | translate | read | null |
| 2026-03-09 | Grow, Don’t Overwrite: Fine-tuning Without Forgetting | Dyah Adila et.al. | 2603.08647 | translate | read | null |
| 2026-03-09 | A Deep Learning Framework for Amplitude Generation of Generic EMRIs | Yan-bo Zeng et.al. | 2603.08635 | translate | read | null |
| 2026-03-09 | RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback | Xiaoying Zhang et.al. | 2603.08561 | translate | read | null |
| 2026-03-09 | SecAgent: Efficient Mobile GUI Agent with Semantic Context | Yiping Xie et.al. | 2603.08533 | translate | read | null |
| 2026-03-09 | AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models | Xiaoquan Sun et.al. | 2603.08519 | translate | read | null |
| 2026-03-09 | Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models | Heng Zhou et.al. | 2603.08497 | translate | read | null |
| 2026-03-09 | Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images | Qishun Yang et.al. | 2603.08486 | translate | read | null |
| 2026-03-09 | Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck | Fabio Valerio Massoli et.al. | 2603.08462 | translate | read | null |
| 2026-03-09 | Alfa: Attentive Low-Rank Filter Adaptation for Structure-Aware Cross-Domain Personalized Gaze Estimation | He-Yen Hsieh et.al. | 2603.08445 | translate | read | null |
| 2026-03-09 | Can Vision-Language Models Solve the Shell Game? | Tiedong Liu et.al. | 2603.08436 | translate | read | null |
| 2026-03-09 | Tactile Recognition of Both Shapes and Materials with Automatic Feature Optimization-Enabled Meta Learning | Hongliang Zhao et.al. | 2603.08423 | translate | read | null |
| 2026-03-09 | Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems | Théo Zangato et.al. | 2603.08418 | translate | read | null |
| 2026-03-09 | Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation | Zekun Li et.al. | 2603.08364 | translate | read | null |
| 2026-03-09 | Amortized Phylodynamic Inference with Neural Bayes Estimators and Recursive Neural Networks | Alexander E. Zarebski et.al. | 2603.08345 | translate | read | null |
| 2026-03-09 | CORE-Acu: Structured Reasoning Traces and Knowledge Graph Safety Verification for Acupuncture Clinical Decision Support | Liuyi Xu et.al. | 2603.08321 | translate | read | null |
| 2026-03-09 | Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness | Yehonatan Elisha et.al. | 2603.08309 | translate | read | null |
| 2026-03-09 | Novel Semantic Prompting for Zero-Shot Action Recognition | Salman Iqbal et.al. | 2603.08289 | translate | read | null |
| 2026-03-09 | Using Multimodal and Language-Agnostic Sentence Embeddings for Abstractive Summarization | Chaimae Chellaf et.al. | 2603.08282 | translate | read | null |
| 2026-03-09 | NCL-UoR at SemEval-2026 Task 5: Embedding-Based Methods, Fine-Tuning, and LLMs for Word Sense Plausibility Rating | Tong Wu et.al. | 2603.08256 | translate | read | null |
| 2026-03-09 | Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data | Pol Buitrago et.al. | 2603.08249 | translate | read | null |
| 2026-03-09 | Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks | Pol Buitrago et.al. | 2603.08231 | translate | read | null |
| 2026-03-09 | DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining | Shangeth Rajaa et.al. | 2603.08216 | translate | read | null |
| 2026-03-09 | E0 transition strengths as a tool to constraint model parameters. Application to even-even Xe isotopes | P. Martin-Higueras et.al. | 2603.08197 | translate | read | null |
| 2026-03-09 | AutoAdapt: An Automated Domain Adaptation Framework for LLMs | Sidharth Sinha et.al. | 2603.08181 | translate | read | null |
| 2026-03-09 | Is continuous CoT better suited for multi-lingual reasoning? | Ali Hamza Bashir et.al. | 2603.08177 | translate | read | null |
| 2026-03-09 | RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs | Zhijun Wang et.al. | 2603.08166 | translate | read | null |
| 2026-03-09 | Language-Invariant Multilingual Speaker Verification for the TidyVoice 2026 Challenge | Ze Li et.al. | 2603.08092 | translate | read | null |
| 2026-03-09 | From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation | Yudai Noda et.al. | 2603.08086 | translate | read | null |
| 2026-03-09 | In-Context Reinforcement Learning for Tool Use in Large Language Models | Yaoqi Ye et.al. | 2603.08068 | translate | read | null |
| 2026-03-09 | Adversarial Domain Adaptation Enables Knowledge Transfer Across Heterogeneous RNA-Seq Datasets | Kevin Dradjat et.al. | 2603.08062 | translate | read | null |
| 2026-03-09 | Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor | Jiayu Huang et.al. | 2603.08058 | translate | read | null |
| 2026-03-09 | CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling | Dengcan Liu et.al. | 2603.08035 | translate | read | null |
| 2026-03-09 | FedMomentum: Preserving LoRA Training Momentum in Federated Fine-Tuning | Peishen Yan et.al. | 2603.08014 | translate | read | null |
| 2026-03-09 | It’s Time to Get It Right: Improving Analog Clock Reading and Clock-Hand Spatial Reasoning in Vision-Language Models | Jaeha Choi et.al. | 2603.08011 | translate | read | null |
| 2026-03-09 | SGG-R $^{\rm 3}$ : From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation | Jiaye Feng et.al. | 2603.07961 | translate | read | null |
| 2026-03-09 | Unsupervised Domain Adaptation for Audio Deepfake Detection with Modular Statistical Transformations | Urawee Thani et.al. | 2603.07935 | translate | read | null |
| 2026-03-09 | IMSE: Intrinsic Mixture of Spectral Experts Fine-tuning for Test-Time Adaptation | Sunghyun Baek et.al. | 2603.07926 | translate | read | null |
| 2026-03-09 | Robust Transfer Learning with Side Information | Akram S. Awad et.al. | 2603.07921 | translate | read | null |
| 2026-03-09 | Ares: Adaptive Reasoning Effort Selection for Efficient LLM Agents | Jingbo Yang et.al. | 2603.07915 | translate | read | null |
| 2026-03-09 | NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving | Ximeng Tao et.al. | 2603.07901 | translate | read | null |
| 2026-03-09 | SMGI: A Structural Theory of General Artificial Intelligence | Aomar Osmani et.al. | 2603.07896 | translate | read | null |
| 2026-03-09 | MINT: Molecularly Informed Training with Spatial Transcriptomics Supervision for Pathology Foundation Models | Minsoo Lee et.al. | 2603.07895 | translate | read | null |
| 2026-03-09 | Choose What to Observe: Task-Aware Semantic-Geometric Representations for Visuomotor Policy | Haoran Ding et.al. | 2603.07875 | translate | read | null |
| 2026-03-09 | SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans | Hansi Zeng et.al. | 2603.07853 | translate | read | null |
| 2026-03-08 | Training-free Temporal Object Tracking in Surgical Videos | Subhadeep Koley et.al. | 2603.07839 | translate | read | null |
| 2026-03-08 | Transferable Optimization Network for Cross-Domain Image Reconstruction | Yunmei Chen et.al. | 2603.07831 | translate | read | null |
| 2026-03-08 | Benchmarking Large Language Models for Quebec Insurance: From Closed-Book to Retrieval-Augmented Generation | David Beauchemin et.al. | 2603.07825 | translate | read | null |
| 2026-03-08 | ProgAgent:A Continual RL Agent with Progress-Aware Rewards | Jinzhou Tan et.al. | 2603.07784 | translate | read | null |
| 2026-03-08 | Meta-PINNs: Meta-Learning Enhanced Physics-Informed Machine Learning Framework for Turbomachinery Flow Predictions under Varying Operation Conditions | Yuling Han et.al. | 2603.07740 | translate | read | null |
| 2026-03-08 | PARSE: Part-Aware Relational Spatial Modeling | Yinuo Bai et.al. | 2603.07704 | translate | read | null |
| 2026-03-08 | Compressed-Domain-Aware Online Video Super-Resolution | Yuhang Wang et.al. | 2603.07694 | translate | read | null |
| 2026-03-08 | Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence | Yuanyuan Gao et.al. | 2603.07660 | translate | read | null |
| 2026-03-08 | Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics | Abdeldjalil Taibi et.al. | 2603.07645 | translate | read | null |
| 2026-03-08 | Duala: Dual-Level Alignment of Subjects and Stimuli for Cross-Subject fMRI Decoding | Shumeng Li et.al. | 2603.07625 | translate | read | null |
| 2026-03-08 | MetaSort: An Accelerated Approach for Non-uniform Compression and Few-shot Classification of Neural Spike Waveforms | Luca M. Meyer et.al. | 2603.07602 | translate | read | null |
| 2026-03-08 | Revisiting the LiRA Membership Inference Attack Under Realistic Assumptions | Najeeb Jebreel et.al. | 2603.07567 | translate | read | null |
| 2026-03-08 | Learning the APT Kill Chain: Temporal Reasoning over Provenance Data for Attack Stage Estimation | Trung V. Phan et.al. | 2603.07560 | translate | read | null |
| 2026-03-08 | Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR | Rishikesh Kumar Sharma et.al. | 2603.07554 | translate | read | null |
| 2026-03-08 | Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data | Thanathai Lertpetchpun et.al. | 2603.07534 | translate | read | null |
| 2026-03-08 | TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning | Mingyue Cheng et.al. | 2603.07528 | translate | read | null |
| 2026-03-08 | One-for-All Model Initialization with Frequency-Domain Knowledge | Jianlu Shen et.al. | 2603.07523 | translate | read | null |
| 2026-03-08 | InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills | Dayang Liang et.al. | 2603.07516 | translate | read | null |
| 2026-03-08 | Brexit Means Brexit: Selection Bias, Echo Chambers, and Entrenched Opinion on Reddit | Marian-Andrei Rizoiu et.al. | 2603.07509 | translate | read | null |
| 2026-03-08 | FedEU: Evidential Uncertainty-Driven Federated Fine-Tuning of Vision Foundation Models for Remote Sensing Image Segmentation | Xiaokang Zhang et.al. | 2603.07468 | translate | read | null |
| 2026-03-08 | Trusting What You Cannot See: Auditable Fine-Tuning and Inference for Proprietary AI | Heng Jin et.al. | 2603.07466 | translate | read | null |
| 2026-03-08 | Classifying Novel 3D-Printed Objects without Retraining: Towards Post-Production Automation in Additive Manufacturing | Fanis Mathioulakis et.al. | 2603.07465 | translate | read | null |
| 2026-03-08 | Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection | Rui Ding et.al. | 2603.07464 | translate | read | null |
| 2026-03-08 | SIGMAE: A Spectral-Index-Guided Foundation Model for Multispectral Remote Sensing | Xiaokang Zhang et.al. | 2603.07463 | translate | read | null |
| 2026-03-08 | Dial: A Knowledge-Grounded Dialect-Specific NL2SQL System | Xiang Zhang et.al. | 2603.07449 | translate | read | null |
| 2026-03-08 | Few Tokens, Big Leverage: Preserving Safety Alignment by Constraining Safety Tokens during Fine-tuning | Guoli Wang et.al. | 2603.07445 | translate | read | null |
| 2026-03-08 | Med-Evo: Test-time Self-evolution for Medical Multimodal Large Language Models | Dunyuan Xu et.al. | 2603.07443 | translate | read | null |
| 2026-03-08 | Generalization in Online Reinforcement Learning for Mobile Agents | Li Gu et.al. | 2603.07432 | translate | read | null |
| 2026-03-08 | Adaptive Capacity Allocation for Vision Language Action Fine-tuning | Donghoon Kim et.al. | 2603.07404 | translate | read | null |
| 2026-03-08 | AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions | Jihyoung Jang et.al. | 2603.07394 | translate | read | null |
| 2026-03-08 | Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval | Rian Atri et.al. | 2603.07390 | translate | read | null |
| 2026-03-07 | RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts | Darya Kharlamova et.al. | 2603.07366 | translate | read | null |
| 2026-03-07 | The Third Ambition: Artificial Intelligence and the Science of Human Behavior | W. Russell Neuman et.al. | 2603.07329 | translate | read | null |
| 2026-03-07 | Faster-HEAL: An Efficient and Privacy-Preserving Collaborative Perception Framework for Heterogeneous Autonomous Vehicles | Armin Maleki et.al. | 2603.07314 | translate | read | null |
| 2026-03-07 | AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery | Nilesh Jain et.al. | 2603.07300 | translate | read | null |
| 2026-03-07 | MAviS: A Multimodal Conversational Assistant For Avian Species | Yevheniia Kryklyvets et.al. | 2603.07294 | translate | read | null |
| 2026-03-07 | Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin | Po-Chun Hsu et.al. | 2603.07286 | translate | read | null |
| 2026-03-07 | VisualDeltas: Learning Preferences from Visual Quality Perturbations | Hailiang Huang et.al. | 2603.07272 | translate | read | null |
| 2026-03-07 | How to Steal Reasoning Without Reasoning Traces | Tingwei Zhang et.al. | 2603.07267 | translate | read | null |
| 2026-03-07 | Learning When to Cooperate Under Heterogeneous Goals | Max Taylor-Davies et.al. | 2603.07253 | translate | read | null |
| 2026-03-07 | FabricGen: Microstructure-Aware Woven Fabric Generation | Yingjie Tang et.al. | 2603.07240 | translate | read | null |
| 2026-03-07 | $\textbf{Re}^{2}$ : Unlocking LLM Reasoning via Reinforcement Learning with Re-solving | Pinzheng Wang et.al. | 2603.07197 | translate | read | null |
| 2026-03-07 | FreeFly-Thinking : Aligning Chain-of-Thought Reasoning with Continuous UAV Navigation | Jiaxu Zhou et.al. | 2603.07181 | translate | read | null |
| 2026-03-07 | ACD-U: Asymmetric co-teaching with machine unlearning for robust learning with noisy labels | Reo Fukunaga et.al. | 2603.07166 | translate | read | null |
| 2026-03-07 | Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language | Yoshiki Tanaka et.al. | 2603.07138 | translate | read | null |
| 2026-03-07 | Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers | Tao Shi et.al. | 2603.07122 | translate | read | null |
| 2026-03-07 | NuNext: Reframing Nucleus Detection as Next-Point Detection | Zhongyi Shui et.al. | 2603.07098 | translate | read | null |
| 2026-03-07 | Facial Expression Generation Aligned with Human Preference for Natural Dyadic Interaction | Xu Chen et.al. | 2603.07093 | translate | read | null |
| 2026-03-07 | Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0 | Ha Vo et.al. | 2603.07091 | translate | read | null |
| 2026-03-07 | mAVE: A Watermark for Joint Audio-Visual Generation Models | Luyang Si et.al. | 2603.07090 | translate | read | null |
| 2026-03-07 | Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR | Muhammad Khalifa et.al. | 2603.07084 | translate | read | null |
| 2026-03-07 | Bi-directional digital twin prototype anchoring with multi-periodicity learning for few-shot fault diagnosis | Pengcheng Xia et.al. | 2603.07054 | translate | read | null |
| 2026-03-07 | Language-Aware Distillation for Multilingual Instruction-Following Speech LLMs with ASR-Only Supervision | Shreyas Gopal et.al. | 2603.07025 | translate | read | null |
| 2026-03-07 | Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment | Junming Liu et.al. | 2603.07023 | translate | read | null |
| 2026-03-07 | OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation | Leilei Wang et.al. | 2603.07022 | translate | read | null |
| 2026-03-07 | AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge | Karen Zhou et.al. | 2603.07019 | translate | read | null |
| 2026-03-07 | TrajPred: Trajectory-Conditioned Joint Embedding Prediction for Surgical Instrument-Tissue Interaction Recognition in Vision-Language Models | Jiajun Cheng et.al. | 2603.06999 | translate | read | null |
| 2026-03-07 | Perception-Aware Multimodal Spatial Reasoning from Monocular Images | Yanchun Cheng et.al. | 2603.06985 | translate | read | null |
| 2026-03-07 | Diffusion Controller: Framework, Algorithms and Parameterization | Tong Yang et.al. | 2603.06981 | translate | read | null |
| 2026-03-07 | Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards | Xin Zhang et.al. | 2603.06958 | translate | read | null |
| 2026-03-06 | Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping | Zhenyu Lei et.al. | 2603.06923 | translate | read | null |
| 2026-03-06 | A Dynamic Self-Evolving Extraction System | Moin Amin-Naseri et.al. | 2603.06915 | translate | read | null |
| 2026-03-06 | MedInjection-FR: Exploring the Role of Native, Synthetic, and Translated Data in Biomedical Instruction Tuning | Ikram Belmadani et.al. | 2603.06905 | translate | read | null |
| 2026-03-06 | Symmetry-Constrained Language-Guided Program Synthesis for Discovering Governing Equations from Noisy and Partial Observations | Mirza Samad Ahmed Baig et.al. | 2603.06869 | translate | read | null |
| 2026-03-06 | Learning-Based Robust Control: Unifying Exploration and Distributional Robustness for Reliable Robotics via Free Energy | Hozefa Jesawada et.al. | 2603.06831 | translate | read | null |
| 2026-03-06 | “Dark Triad” Model Organisms of Misalignment: Narrow Fine-Tuning Mirrors Human Antisocial Behavior | Roshni Lulla et.al. | 2603.06816 | translate | read | null |
| 2026-03-06 | Metalearning traffic assignment for network disruptions with graph convolutional neural networks | Serio Agriesti et.al. | 2603.06763 | translate | read | null |
| 2026-03-06 | Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment | Xiaoyang Hou et.al. | 2603.06748 | translate | read | null |
| 2026-03-06 | Improved Constrained Generation by Bridging Pretrained Generative Models | Xiaoxuan Liang et.al. | 2603.06742 | translate | read | null |
| 2026-03-06 | Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment | Jingyuan Feng et.al. | 2603.06727 | translate | read | null |
| 2026-03-05 | SIQA: Toward Reliable Scientific Image Quality Assessment | Wenzhe Li et.al. | 2603.06700 | translate | read | null |
| 2026-03-06 | SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning | Alejandra Perez et.al. | 2603.06570 | translate | read | null |
| 2026-03-06 | EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking | Fangrui Zhu et.al. | 2603.06561 | translate | read | null |
| 2026-03-06 | Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning | Yuchen Zhang et.al. | 2603.06505 | translate | read | null |
| 2026-03-06 | COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics | Kartik Sharma et.al. | 2603.06495 | translate | read | null |
| 2026-03-06 | NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches | Ethan Smith et.al. | 2603.06492 | translate | read | null |
| 2026-03-06 | History-Conditioned Spatio-Temporal Visual Token Pruning for Efficient Vision-Language Navigation | Qitong Wang et.al. | 2603.06480 | translate | read | null |
| 2026-03-06 | Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement | Yakov Pyotr Shkolnikov et.al. | 2603.06459 | translate | read | null |
| 2026-03-06 | Pinterest Canvas: Large-Scale Image Generation at Pinterest | Yu Wang et.al. | 2603.06453 | translate | read | null |
| 2026-03-06 | From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring | Minh Hoang Nguyen et.al. | 2603.06424 | translate | read | null |
| 2026-03-06 | Doctor or Patient? Synergizing Diarization and ASR for Code-Switched Hinglish Medical Conditions Extraction | Séverin Baroudi et.al. | 2603.06373 | translate | read | null |
| 2026-03-06 | Variable selection in linear mixed model meta-regression with suspected interaction effects – How can tree-based methods help? | Jan-Bernd Igelmann et.al. | 2603.06328 | translate | read | null |
| 2026-03-06 | Continual Adaptation for Pacific Indigenous Speech Recognition | Yang Xiao et.al. | 2603.06310 | translate | read | null |
| 2026-03-06 | Attribute Distribution Modeling and Semantic-Visual Alignment for Generative Zero-shot Learning | Haojie Pu et.al. | 2603.06281 | translate | read | null |
| 2026-03-06 | Learning to Solve Orienteering Problem with Time Windows and Variable Profits | Songqun Gao et.al. | 2603.06260 | translate | read | null |
| 2026-03-06 | DC-Merge: Improving Model Merging with Directional Consistency | Han-Chen Zhang et.al. | 2603.06242 | translate | read | null |
| 2026-03-06 | MAD: A Multimodal and Multi-perspective Affective Dataset with Hierarchical Annotations | Shengwei Guo et.al. | 2603.06206 | translate | read | null |
| 2026-03-06 | Transformer-Based Pulse Shape Discrimination in HPGe Detectors with Masked Autoencoder Pre-training | Marta Babicz et.al. | 2603.06192 | translate | read | null |
| 2026-03-06 | SpaCRD: Multimodal Deep Fusion of Histology and Spatial Transcriptomics for Cancer Region Detection | Shuailin Xue et.al. | 2603.06186 | translate | read | null |
| 2026-03-06 | CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation | Mohammed Baharoon et.al. | 2603.06183 | translate | read | null |
| 2026-03-06 | Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning | Yueying Tian et.al. | 2603.06173 | translate | read | null |
| 2026-03-06 | Multimodal Behavior Tree Generation: A Small Vision-Language Model for Robot Task Planning | Cristiano Battistini et.al. | 2603.06084 | translate | read | null |
| 2026-03-06 | FontUse: A Data-Centric Approach to Style- and Use-Case-Conditioned In-Image Typography | Xia Xin et.al. | 2603.06038 | translate | read | null |
| 2026-03-06 | EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation | Shiyuan Yang et.al. | 2603.06014 | translate | read | null |
| 2026-03-06 | RePer-360: Releasing Perspective Priors for 360 $^\circ$ Depth Estimation via Self-Modulation | Cheng Guan et.al. | 2603.05999 | translate | read | null |
| 2026-03-06 | HarvestFlex: Strawberry Harvesting via Vision-Language-Action Policy Adaptation in the Wild | Ziyang Zhao et.al. | 2603.05982 | translate | read | null |
| 2026-03-06 | An Interactive Multi-Agent System for Evaluation of New Product Concepts | Bin Xuan et.al. | 2603.05980 | translate | read | null |
| 2026-03-06 | THETA: A Textual Hybrid Embedding-based Topic Analysis Framework and AI Scientist Agent for Scalable Computational Social Science | Zhenke Duan et.al. | 2603.05972 | translate | read | null |
| 2026-03-06 | Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models | Siyuan Yang et.al. | 2603.05963 | translate | read | null |
| 2026-03-06 | Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence | Hui Yang et.al. | 2603.05960 | translate | read | null |
| 2026-03-06 | Domain-Adaptive Model Merging across Disconnected Modes | Junming Liu et.al. | 2603.05957 | translate | read | null |
| 2026-03-06 | LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution | Song Fei et.al. | 2603.05947 | translate | read | null |
| 2026-03-06 | Swooper: Learning High-Speed Aerial Grasping With a Simple Gripper | Ziken Huang et.al. | 2603.05935 | translate | read | null |
| 2026-03-06 | Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character Modeling | Chanhui Zhu et.al. | 2603.05933 | translate | read | null |
| 2026-03-06 | Addressing the Ecological Fallacy in Larger LMs with Human Context | Nikita Soni et.al. | 2603.05928 | translate | read | null |
| 2026-03-06 | Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis | Mohammad Al Ridhawi et.al. | 2603.05917 | translate | read | null |
| 2026-03-06 | Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning | Xuan Li et.al. | 2603.05900 | translate | read | null |
| 2026-03-06 | Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation | Bowen Chen et.al. | 2603.05873 | translate | read | null |
| 2026-03-06 | PatchCue: Enhancing Vision-Language Model Reasoning with Patch-Based Visual Cues | Yukun Qi et.al. | 2603.05869 | translate | read | null |
| 2026-03-06 | AnyCamVLA: Zero-Shot Camera Adaptation for Viewpoint Robust Vision-Language-Action Models | Hyeongjun Heo et.al. | 2603.05868 | translate | read | null |
| 2026-03-06 | Self-Auditing Parameter-Efficient Fine-Tuning for Few-Shot 3D Medical Image Segmentation | Son Thai Ly et.al. | 2603.05822 | translate | read | null |
| 2026-03-06 | Activation Steering for Accent Adaptation in Speech Foundation Models | Jinuo Sun et.al. | 2603.05813 | translate | read | null |
| 2026-03-06 | Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval | Donghoon Han et.al. | 2603.05781 | translate | read | null |
| 2026-03-06 | PVminerLLM: Structured Extraction of Patient Voice from Patient-Generated Text using Large Language Models | Samah Fodeh et.al. | 2603.05776 | translate | read | null |
| 2026-03-06 | Bridging Domains through Subspace-Aware Model Merging | Levy Chaves et.al. | 2603.05768 | translate | read | null |
| 2026-03-05 | MIRACL: A Diverse Meta-Reinforcement Learning for Multi-Objective Multi-Echelon Combinatorial Supply Chain Optimisation | Rifny Rachman et.al. | 2603.05760 | translate | read | null |
| 2026-03-05 | NERdME: a Named Entity Recognition Dataset for Indexing Research Artifacts in Code Repositories | Genet Asefa Gesese et.al. | 2603.05750 | translate | read | null |
| 2026-03-05 | From Phase Grounding to Intelligent Surgical Narratives | Ethan Peterson et.al. | 2603.05732 | translate | read | null |
| 2026-03-05 | Unsupervised domain adaptation for radioisotope identification in gamma spectroscopy | Peter Lalor et.al. | 2603.05719 | translate | read | null |
| 2026-03-05 | Parallelization Strategies for Dense LLM Deployment: Navigating Through Application-Specific Tradeoffs and Bottlenecks | Burak Topcu et.al. | 2603.05692 | translate | read | null |
| 2026-03-05 | FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation | Hung Nguyen Huy et.al. | 2603.05690 | translate | read | null |
| 2026-03-05 | Latent space design of interatomic potentials | Susan R. Atlas et.al. | 2603.05655 | translate | read | null |
| 2026-03-05 | Koopman Regularized Deep Speech Disentanglement for Speaker Verification | Nikos Chazaridis et.al. | 2603.05577 | translate | read | null |
| 2026-03-05 | Task Parameter Extrapolation via Learning Inverse Tasks from Forward Demonstrations | Serdar Bahar et.al. | 2603.05576 | translate | read | null |
| 2026-03-05 | Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation | Helena Casademunt et.al. | 2603.05494 | translate | read | null |
| 2026-03-05 | Observing and Controlling Features in Vision-Language-Action Models | Hugo Buurmeijer et.al. | 2603.05487 | translate | read | null |
| 2026-03-05 | NCTB-QA: A Large-Scale Bangla Educational Question Answering Dataset and Benchmarking Performance | Abrar Eyasir et.al. | 2603.05462 | translate | read | null |
| 2026-03-05 | High-Pressure Inelastic Neutron Spectroscopy: A true test of Machine-Learned Interatomic Potential energy landscapes | Jeff Armstrong et.al. | 2603.05442 | translate | read | null |
| 2026-03-05 | An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs | Deshan Sumanathilaka et.al. | 2603.05400 | translate | read | null |
| 2026-03-05 | OpenFrontier: General Navigation with Visual-Language Grounded Frontiers | Esteban Padilla et.al. | 2603.05377 | translate | read | null |
| 2026-03-05 | DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning | Mohammad Mahdi Moradi et.al. | 2603.05357 | translate | read | null |
| 2026-03-05 | Exploring the potential and limitations of Model Merging for Multi-Domain Adaptation in ASR | Carlos Carvalho et.al. | 2603.05354 | translate | read | null |
| 2026-03-05 | PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration | Mohammad Javad Ranjbar Kalahroodi et.al. | 2603.05314 | translate | read | null |
| 2026-03-05 | Iterative On-Policy Refinement of Hierarchical Diffusion Policies for Language-Conditioned Manipulation | Clemence Grislain et.al. | 2603.05291 | translate | read | null |
| 2026-03-05 | VietJobs: A Vietnamese Job Advertisement Dataset | Hieu Pham Dinh et.al. | 2603.05262 | translate | read | null |
| 2026-03-05 | Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum | Shan Ning et.al. | 2603.05256 | translate | read | null |
| 2026-03-05 | Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning | Zhenyu Zhang et.al. | 2603.05235 | translate | read | null |
| 2026-03-05 | BabAR: from phoneme recognition to developmental measures of young children’s speech production | Marvin Lavechin et.al. | 2603.05213 | translate | read | null |
| 2026-03-05 | Stable-LoRA: Stabilizing Feature Learning of Low-Rank Adaptation | Yize Wu et.al. | 2603.05204 | translate | read | null |
| 2026-03-05 | Mario: Multimodal Graph Reasoning with Large Language Models | Yuanfu Sun et.al. | 2603.05181 | translate | read | null |
| 2026-03-05 | SRasP: Self-Reorientation Adversarial Style Perturbation for Cross-Domain Few-Shot Learning | Wenqian Li et.al. | 2603.05135 | translate | read | null |
| 2026-03-05 | LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting | Yewen Li et.al. | 2603.05134 | translate | read | null |
| 2026-03-05 | TW-Sound580K: A Regional Audio-Text Dataset with Verification-Guided Curation for Localized Audio-Language Modeling | Hao-Hui Xie et.al. | 2603.05094 | translate | read | null |
| 2026-03-05 | MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer | Juntong Fang et.al. | 2603.05078 | translate | read | null |
| 2026-03-05 | CLIP-driven Zero-shot Learning with Ambiguous Labels | Jinfu Fan et.al. | 2603.05053 | translate | read | null |
| 2026-03-05 | Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination | Hyuntae Park et.al. | 2603.05040 | translate | read | null |
| 2026-03-05 | Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model | Yulong Shi et.al. | 2603.05012 | translate | read | null |
| 2026-03-05 | Lightweight and Scalable Transfer Learning Framework for Load Disaggregation | L. E. Garcia-Marrero et.al. | 2603.04998 | translate | read | null |
| 2026-03-05 | ThaiSafetyBench: Assessing Language Model Safety in Thai Cultural Contexts | Trapoom Ukarapol et.al. | 2603.04992 | translate | read | null |
| 2026-03-05 | 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding | Xiongkun Linghu et.al. | 2603.04976 | translate | read | null |
| 2026-03-05 | Functionality-Oriented LLM Merging on the Fisher–Rao Manifold | Jiayu Wang et.al. | 2603.04972 | translate | read | null |
| 2026-03-05 | Replaying pre-training data improves fine-tuning | Suhas Kotha et.al. | 2603.04964 | translate | read | null |
| 2026-03-05 | Adaptive Prototype-based Interpretable Grading of Prostate Cancer | Riddhasree Bhattacharyya et.al. | 2603.04947 | translate | read | null |
| 2026-03-05 | AILS-NTUA at SemEval-2026 Task 3: Efficient Dimensional Aspect-Based Sentiment Analysis | Stavros Gazetas et.al. | 2603.04933 | translate | read | null |
| 2026-03-05 | Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs | Lianyu Wang et.al. | 2603.04896 | translate | read | null |
| 2026-03-05 | Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness | Ruichen Xu et.al. | 2603.04881 | translate | read | null |
| 2026-03-05 | K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation | Mingxuan Mu et.al. | 2603.04868 | translate | read | null |
| 2026-03-05 | Focus Then Listen: Exploring Plug-and-Play Audio Enhancer for Noise-Robust Large Audio Language Models | Han Yin et.al. | 2603.04862 | translate | read | null |
| 2026-03-05 | Causally Robust Reward Learning from Reason-Augmented Preference Feedback | Minjune Hwang et.al. | 2603.04861 | translate | read | null |
| 2026-03-05 | Osmosis Distillation: Model Hijacking with the Fewest Samples | Yuchen Shi et.al. | 2603.04859 | translate | read | null |
| 2026-03-05 | Beyond Text: Aligning Vision and Language for Multimodal E-Commerce Retrieval | Qujiaheng Zhang et.al. | 2603.04836 | translate | read | null |
| 2026-03-05 | Missingness Bias Calibration in Feature Attribution Explanations | Shailesh Sridhar et.al. | 2603.04831 | translate | read | null |
| 2026-03-05 | From Unfamiliar to Familiar: Detecting Pre-training Data via Gradient Deviations in Large Language Models | Ruiqi Zhang et.al. | 2603.04828 | translate | read | null |
| 2026-03-05 | VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment | Jiawei Chen et.al. | 2603.04822 | translate | read | null |
| 2026-03-05 | On the Strengths and Weaknesses of Data for Open-set Embodied Assistance | Pradyumna Tambwekar et.al. | 2603.04819 | translate | read | null |
| 2026-03-05 | Meta-D: Metadata-Aware Architectures for Brain Tumor Analysis and Missing-Modality Segmentation | SangHyuk Kim et.al. | 2603.04811 | translate | read | null |
| 2026-03-05 | WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech | Aurchi Chowdhury et.al. | 2603.04809 | translate | read | null |
| 2026-03-05 | Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction | Xingwu Chen et.al. | 2603.04783 | translate | read | null |
| 2026-03-05 | DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction | Shiyu Zhang et.al. | 2603.04770 | translate | read | null |
| 2026-03-05 | Adaptive Policy Switching of Two-Wheeled Differential Robots for Traversing over Diverse Terrains | Haruki Izawa et.al. | 2603.04761 | translate | read | null |
| 2026-03-05 | When Priors Backfire: On the Vulnerability of Unlearnable Examples to Pretraining | Zhihao Li et.al. | 2603.04731 | translate | read | null |
| 2026-03-05 | Detection of Illicit Content on Online Marketplaces using Large Language Models | Quoc Khoa Tran et.al. | 2603.04707 | translate | read | null |
| 2026-03-05 | Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Models Using Multi-Dataset Embeddings | Lyle Regenwetter et.al. | 2603.04692 | translate | read | null |
| 2026-03-04 | Improving the accuracy of physics-informed neural networks via last-layer retraining | Saad Qadeer et.al. | 2603.04672 | translate | read | null |
| 2026-03-04 | When Agents Persuade: Propaganda Generation and Mitigation in LLMs | Julia Jose et.al. | 2603.04636 | translate | read | null |
| 2026-03-04 | PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion | Mahindra Rautela et.al. | 2603.04606 | translate | read | null |
| 2026-03-04 | Adaptive Memory Admission Control for LLM Agents | Guilin Zhang et.al. | 2603.04549 | translate | read | null |
| 2026-03-04 | iScript: A Domain-Adapted Large Language Model and Benchmark for Physical Design Tcl Script Generation | Ning Xu et.al. | 2603.04476 | translate | read | null |
| 2026-03-04 | Gravitational confinement of ghost scalar fields in neutron stars | Argelia Bernal et.al. | 2603.04400 | translate | read | null |
| 2026-03-04 | bayesgrid: An Open-Source Python Tool for Generating Probabilistic Synthetic Transmission-Distribution Grids Using Bayesian Hierarchical Models | Henrique O. Caetano et.al. | 2603.04393 | translate | read | null |
| 2026-03-04 | Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks | Haoyu Liu et.al. | 2603.04364 | translate | read | null |
| 2026-03-04 | Robust Unscented Kalman Filtering via Recurrent Meta-Adaptation of Sigma-Point Weights | Kenan Majewski et.al. | 2603.04360 | translate | read | null |
| 2026-03-04 | Out-of-distribution transfer of PDE foundation models to material dynamics under extreme loading | Mahindra Rautela et.al. | 2603.04354 | translate | read | null |
| 2026-03-04 | Enhancing Authorship Attribution with Synthetic Paintings | Clarissa Loures et.al. | 2603.04343 | translate | read | null |
| 2026-03-04 | Activation Outliers in Transformer Quantization: Reproduction, Statistical Analysis, and Deployment Tradeoffs | Pranav Kumar Kaliaperumal et.al. | 2603.04308 | translate | read | null |
| 2026-03-04 | ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis | Youngwon Choi et.al. | 2603.04219 | translate | read | null |
| 2026-03-04 | PlaneCycle: Training-Free 2D-to-3D Lifting of Foundation Models Without Adapters | Yinghong Yu et.al. | 2603.04165 | translate | read | null |
| 2026-03-04 | GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning | Mingleyang Li et.al. | 2603.04158 | translate | read | null |
| 2026-03-04 | Generating Exceptional Knots and Links with Arbitrary Braiding Topology | Bin Jiang et.al. | 2603.04143 | translate | read | null |
| 2026-03-04 | A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series | Davide Gabrielli et.al. | 2603.04142 | translate | read | null |
| 2026-03-04 | Data-Aware Random Feature Kernel for Transformers | Amirhossein Farzam et.al. | 2603.04127 | translate | read | null |
| 2026-03-04 | Revisiting the Role of Foundation Models in Cell-Level Histopathological Image Analysis under Small-Patch Constraints – Effects of Training Data Scale and Blur Perturbations on CNNs and Vision Transformers | Hiroki Kagiyama et.al. | 2603.04081 | translate | read | null |
| 2026-03-04 | Monitoring Emergent Reward Hacking During Generation via Internal Activations | Patrick Wilhelm et.al. | 2603.04069 | translate | read | null |
| 2026-03-04 | Inference-Time Toxicity Mitigation in Protein Language Models | Manuel Fernández Burda et.al. | 2603.04045 | translate | read | null |
| 2026-03-04 | Force-Aware Residual DAgger via Trajectory Editing for Precision Insertion with Impedance Control | Yiou Huang et.al. | 2603.04038 | translate | read | null |
| 2026-03-04 | Who Judges the Judge? Evaluating LLM-as-a-Judge for French Medical open-ended QA | Ikram Belmadani et.al. | 2603.04033 | translate | read | null |
| 2026-03-02 | SageBwd: A Trainable Low-bit Attention | Jintao Zhang et.al. | 2603.02170 | translate | read | null |
| 2026-03-02 | Generative AI in Software Testing: Current Trends and Future Directions | Tanish Singla et.al. | 2603.02141 | translate | read | null |
| 2026-03-02 | Deep Unfolding for SIM-Assisted Multiband MU-MISO Downlink Systems | Muhammad Ibrahim et.al. | 2603.02122 | translate | read | null |
| 2026-03-02 | ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels | Xiang Zheng et.al. | 2603.02097 | translate | read | null |
| 2026-03-02 | Learning from Synthetic Data Improves Multi-hop Reasoning | Anmol Kabra et.al. | 2603.02091 | translate | read | null |
| 2026-03-02 | High-quality, high-information dataset for universal atomistic machine learning | Cesare Malosso et.al. | 2603.02089 | translate | read | null |
| 2026-03-02 | Detection-Gated Glottal Segmentation with Zero-Shot Cross-Dataset Transfer and Clinical Feature Extraction | Harikrishnan Unnikrishnan et.al. | 2603.02087 | translate | read | null |
| 2026-03-02 | $π$ -StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs | Siting Wang et.al. | 2603.02083 | translate | read | null |
| 2026-03-02 | Never Saddle for Reparameterized Steepest Descent as Mirror Flow | Tom Jacobs et.al. | 2603.02064 | translate | read | null |
| 2026-03-02 | EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training | Aleksei Dorkin et.al. | 2603.02041 | translate | read | null |
| 2026-03-02 | On-surface synthesis and aromaticity of large cyclocarbons | Lisanne Sellies et.al. | 2603.02040 | translate | read | null |
| 2026-03-02 | LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers | Fabian Schmidt et.al. | 2603.02035 | translate | read | null |
| 2026-03-02 | CHOP: Counterfactual Human Preference Labels Improve Obstacle Avoidance in Visuomotor Navigation Policies | Gershom Seneviratne et.al. | 2603.02004 | translate | read | null |
| 2026-03-02 | Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation | Jan Finke et.al. | 2603.01999 | translate | read | null |
| 2026-03-02 | CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production | Yixin Nie et.al. | 2603.01973 | translate | read | null |
| 2026-03-02 | Probabilistic Retrofitting of Learned Simulators | Cristiana Diaconu et.al. | 2603.01949 | translate | read | null |
| 2026-03-02 | CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification | Jinpeng Chen et.al. | 2603.01940 | translate | read | null |
| 2026-03-02 | VietSuperSpeech: A Large-Scale Vietnamese Conversational Speech Dataset for ASR Fine-Tuning in Chatbot, Customer Support, and Call Center Applications | Loan Do et.al. | 2603.01894 | translate | read | null |
| 2026-03-02 | Generative Visual Chain-of-Thought for Image Editing | Zijin Yin et.al. | 2603.01893 | translate | read | null |
| 2026-03-02 | Diagnosing Generalization Failures from Representational Geometry Markers | Chi-Ning Chou et.al. | 2603.01879 | translate | read | null |
| 2026-03-02 | Let the Agent Search: Autonomous Exploration Beats Rigid Workflows in Temporal Question Answering | Xufei Lv et.al. | 2603.01853 | translate | read | null |
| 2026-03-02 | Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions | Vineeth Venugopal et.al. | 2603.01834 | translate | read | null |
| 2026-03-02 | Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding | Zhiyuan Zhou et.al. | 2603.01816 | translate | read | null |
| 2026-03-02 | LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction | Nils Constantin Hellwig et.al. | 2603.01778 | translate | read | null |
| 2026-03-02 | Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning | Zichen Tian et.al. | 2603.01759 | translate | read | null |
| 2026-03-02 | Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining | Yuxuan Li et.al. | 2603.01758 | translate | read | null |
| 2026-03-02 | CA-AFP: Cluster-Aware Adaptive Federated Pruning | Om Govind Jha et.al. | 2603.01739 | translate | read | null |
| 2026-03-02 | TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training | Jinluan Yang et.al. | 2603.01714 | translate | read | null |
| 2026-03-02 | FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents | Qizheng Li et.al. | 2603.01712 | translate | read | null |
| 2026-03-02 | Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning | Haonan Jia et.al. | 2603.01696 | translate | read | null |
| 2026-03-02 | Building a Strong Instruction Language Model for a Less-Resourced Language | Domen Vreš et.al. | 2603.01691 | translate | read | null |
| 2026-03-02 | A Unified Explanation for JWST Little Red Dots and High-Redshift Low-Mass Disk-like Galaxies: Prolate Galaxies Viewed End-on vs Side-on | Yingjie Peng et.al. | 2603.01668 | translate | read | null |
| 2026-03-02 | FreeGNN: Continual Source-Free Graph Neural Network Adaptation for Renewable Energy Forecasting | Abderaouf Bahi et.al. | 2603.01657 | translate | read | null |
| 2026-03-02 | DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving | Enhui Ma et.al. | 2603.01637 | translate | read | null |
| 2026-03-02 | ToolRLA: Fine-Grained Reward Decomposition for Tool-Integrated Reinforcement Learning Alignment in Domain-Specific Agents | Pengbo Liu et.al. | 2603.01620 | translate | read | null |
| 2026-03-02 | Preference Score Distillation: Leveraging 2D Rewards to Align Text-to-3D Generation with Human Preference | Jiaqi Leng et.al. | 2603.01594 | translate | read | null |
| 2026-03-02 | SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond | Xiangyang Zhu et.al. | 2603.01589 | translate | read | null |
| 2026-03-02 | Cryo-Bench: Benchmarking Foundation Models for Cryosphere Applications | Saurabh Kaushik et.al. | 2603.01576 | translate | read | null |
| 2026-03-02 | Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models | Qiyuan Zhang et.al. | 2603.01571 | translate | read | null |
| 2026-03-02 | Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation | Yi Gu et.al. | 2603.01565 | translate | read | null |
| 2026-03-02 | Training-Free Spatio-temporal Decoupled Reasoning Video Segmentation with Adaptive Object Memory | Zhengtong Zhu et.al. | 2603.01545 | translate | read | null |
| 2026-03-02 | Retrieval, Refinement, and Ranking for Text-to-Video Generation via Prompt Optimization and Test-Time Scaling | Zillur Rahman et.al. | 2603.01509 | translate | read | null |
| 2026-03-02 | Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents | Yuxin Liu et.al. | 2603.01438 | translate | read | null |
| 2026-03-02 | DOCFORGE-BENCH: A Comprehensive Benchmark for Document Forgery Detection and Analysis | Zengqi Zhao et.al. | 2603.01433 | translate | read | null |
| 2026-03-02 | The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge | Ya Jiang et.al. | 2603.01415 | translate | read | null |
| 2026-03-02 | Naturalness and Fisher Information | James Halverson et.al. | 2603.01411 | translate | read | null |
| 2026-03-02 | TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity | Xiao Cai et.al. | 2603.01371 | translate | read | null |
| 2026-03-02 | Fed-GAME: Personalized Federated Learning with Graph Attention Mixture-of-Experts For Time-Series Forecasting | Yi Li et.al. | 2603.01363 | translate | read | null |
| 2026-03-02 | Perspective-Equivariant Fine-tuning for Multispectral Demosaicing without Ground Truth | Andrew Wang et.al. | 2603.01332 | translate | read | null |
| 2026-03-02 | MetaState: Persistent Working Memory for Discrete Diffusion Language Models | Kejing Xia et.al. | 2603.01331 | translate | read | null |
| 2026-03-01 | Catalyst-Agent: Autonomous heterogeneous catalyst screening and optimization with an LLM Agent | Achuth Chandrasekhar et.al. | 2603.01311 | translate | read | null |
| 2026-03-01 | When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains | Ahmadreza Jeddi et.al. | 2603.01301 | translate | read | link |
| 2026-03-01 | Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models | Adel Javanmard et.al. | 2603.01293 | translate | read | null |
| 2026-03-01 | Individual Turing Test: A Case Study of LLM-based Simulation Using Longitudinal Personal Data | Minghao Guo et.al. | 2603.01289 | translate | read | null |
| 2026-03-01 | Towards Policy-Adaptive Image Guardrail: Benchmark and Method | Caiyong Piao et.al. | 2603.01228 | translate | read | null |
| 2026-03-01 | Learn Hard Problems During RL with Reference Guided Fine-tuning | Yangzhen Wu et.al. | 2603.01223 | translate | read | null |
| 2026-03-01 | Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics | Victor May et.al. | 2603.01209 | translate | read | null |
| 2026-03-01 | Token-level Data Selection for Safe LLM Fine-tuning | Yanping Li et.al. | 2603.01185 | translate | read | null |
| 2026-03-01 | VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification | Abdellah Zakaria Sellam et.al. | 2603.01174 | translate | read | null |
| 2026-03-01 | FREE-Edit: Using Editing-aware Injection in Rectified Flow Models for Zero-shot Image-Driven Video Editing | Maomao Li et.al. | 2603.01164 | translate | read | null |
| 2026-03-01 | FCN-LLM: Empower LLM for Brain Functional Connectivity Network Understanding via Graph-level Multi-task Instruction Tuning | Xingcan Hu et.al. | 2603.01135 | translate | read | null |
| 2026-03-01 | GuiDINO: Rethinking Vision Foundation Model in Medical Image Segmentation | Zhuonan Liang et.al. | 2603.01115 | translate | read | null |
| 2026-03-01 | Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective | Arctanx An et.al. | 2603.01083 | translate | read | null |
| 2026-03-01 | How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning | Xiangxiang Zhang et.al. | 2603.01070 | translate | read | null |
| 2026-03-01 | Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures | Yuechen Luo et.al. | 2603.01063 | translate | read | null |
| 2026-03-01 | Thoth: Mid-Training Bridges LLMs to Time Series Understanding | Jiafeng Lin et.al. | 2603.01042 | translate | read | link |
| 2026-03-01 | Sustainable Code Generation Using Large Language Models: A Systematic Literature Review | Sabiya Banu Masthan Ali et.al. | 2603.00989 | translate | read | null |
| 2026-03-01 | Beyond the Flat Sequence: Hierarchical and Preference-Aware Generative Recommendations | Zerui Chen et.al. | 2603.00980 | translate | read | null |
| 2026-03-01 | Stabilizing Policy Optimization via Logits Convexity | Hongzhan Chen et.al. | 2603.00963 | translate | read | null |
| 2026-03-01 | Using Songs to Improve Kazakh Automatic Speech Recognition | Rustem Yeshpanov et.al. | 2603.00961 | translate | read | null |
| 2026-03-01 | Prompt Sensitivity and Answer Consistency of Small Open-Source Large Language Models on Clinical Question Answering: Implications for Low-Resource Healthcare Deployment | Shravani Hariprasad et.al. | 2603.00917 | translate | read | null |
| 2026-03-01 | pySpatial: Generating 3D Visual Programs for Zero-Shot Spatial Reasoning | Zhanpeng Luo et.al. | 2603.00905 | translate | read | null |
| 2026-03-01 | Principled Fast and Meta Knowledge Learners for Continual Reinforcement Learning | Ke Sun et.al. | 2603.00903 | translate | read | null |
| 2026-03-01 | CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning | Xinyu Zhu et.al. | 2603.00889 | translate | read | null |
| 2026-03-01 | MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains | Xuying Ning et.al. | 2603.00873 | translate | read | null |
| 2026-03-01 | MultiPUFFIN: A Multimodal Domain-Constrained Foundation Model for Molecular Property Prediction of Small Molecules | Idelfonso B. R. Nogueira et.al. | 2603.00857 | translate | read | null |
| 2026-03-01 | MedGPT-oss: Training a General-Purpose Vision-Language Model for Biomedicine | Kai Zhang et.al. | 2603.00842 | translate | read | null |
(<a href=../Transfer_Learning.md>back to Transfer Learning</a>)