LLM - 2026-03 | Paper Arxiv Daily

LLM - 2026-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2026-03-31	Reward-Based Online LLM Routing via NeuralUCB	Ming-Hua Tsai et.al.	2603.30035	translate	read	null
2026-03-31	The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction	Davide Di Gioia et.al.	2603.30031	translate	read	null
2026-03-31	Can Commercial LLMs Be Parliamentary Political Companions? Comparing LLM Reasoning Against Romanian Legislative Expuneri de Motive	Iulian Lucău et.al.	2603.30028	translate	read	null
2026-03-31	ContextClaim: A Context-Driven Paradigm for Verifiable Claim Detection	Yufeng Li et.al.	2603.30025	translate	read	null
2026-03-31	Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models	Md Saad et.al.	2603.30022	translate	read	null
2026-03-31	Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks	Chong Xiang et.al.	2603.30016	translate	read	null
2026-03-31	Performative Scenario Optimization	Quanyan Zhu et.al.	2603.29982	translate	read	null
2026-03-31	SurgTEMP: Temporal-Aware Surgical Video Question Answering with Text-guided Visual Memory for Laparoscopic Cholecystectomy	Shi Li et.al.	2603.29962	translate	read	null
2026-03-31	Think Anywhere in Code Generation	Xue Jiang et.al.	2603.29957	translate	read	null
2026-03-31	EC-Bench: Enumeration and Counting Benchmark for Ultra-Long Videos	Fumihiko Tsuchiya et.al.	2603.29943	translate	read	null
2026-03-31	Bethe Ansatz with a Large Language Model	Balázs Pozsgay et.al.	2603.29932	translate	read	null
2026-03-31	SISA: A Scale-In Systolic Array for GEMM Acceleration	Luigi Altamura et.al.	2603.29913	translate	read	null
2026-03-31	C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving	Zhihong Cui et.al.	2603.29908	translate	read	null
2026-03-31	ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation	Yinuo Liu et.al.	2603.29902	translate	read	null
2026-03-31	ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training	Rui Ai et.al.	2603.29871	translate	read	null
2026-03-31	SNEAK: Evaluating Strategic Communication and Information Leakage in Large Language Models	Adar Avsian et.al.	2603.29846	translate	read	null
2026-03-31	Compiling Code LLMs into Lightweight Executables	Jieke Shi et.al.	2603.29813	translate	read	null
2026-03-31	ENEIDE: A High Quality Silver Standard Dataset for Named Entity Recognition and Linking in Historical Italian	Cristian Santini et.al.	2603.29801	translate	read	null
2026-03-31	Training-Free Dynamic Upcycling of Expert Language Models	Eros Fanì et.al.	2603.29765	translate	read	null
2026-03-31	One-for-All: A Lightweight Stabilized and Parameter-Efficient Pre-trained LLM for Time Series Forecasting	Prasanjit Dey et.al.	2603.29756	translate	read	null
2026-03-31	AI-Programmable Wireless Connectivity: Challenges and Research Directions Toward Interactive and Immersive Industry	Haris Gacanin et.al.	2603.29752	translate	read	null
2026-03-31	Spontaneous Functional Differentiation in Large Language Models: A Brain-Like Intelligence Economy	Junjie Zhang et.al.	2603.29735	translate	read	null
2026-03-31	Measuring the metacognition of AI	Richard Servajean et.al.	2603.29693	translate	read	null
2026-03-31	KEditVis: A Visual Analytics System for Knowledge Editing of Large Language Models	Zhenning Chen et.al.	2603.29689	translate	read	null
2026-03-31	Beyond the Steeper Curve: AI-Mediated Metacognitive Decoupling and the Limits of the Dunning-Kruger Metaphor	Christopher Koch et.al.	2603.29681	translate	read	null
2026-03-31	Agenda-based Narrative Extraction: Steering Pathfinding Algorithms with Large Language Models	Brian Felipe Keith-Norambuena et.al.	2603.29661	translate	read	null
2026-03-31	An Empirical Study of Multi-Agent Collaboration for Automated Research	Yang Shen et.al.	2603.29632	translate	read	null
2026-03-31	BigEarthNet.txt: A Large-Scale Multi-Sensor Image-Text Dataset and Benchmark for Earth Observation	Johann-Ludwig Herzog et.al.	2603.29630	translate	read	null
2026-03-31	Enhancing LLM-Based Bug Reproduction for Android Apps via Pre-Assessment of Visual Effects	Xiangyang Xiao et.al.	2603.29623	translate	read	null
2026-03-31	Learning Diagnostic Reasoning for Decision Support in Toxicology	Nico Oberländer et.al.	2603.29608	translate	read	null
2026-03-31	When Can We Trust LLM Graders? Calibrating Confidence for Automated Assessment	Robinson Ferrer et.al.	2603.29559	translate	read	null
2026-03-31	Can LLM Agents Identify Spoken Dialects like a Linguist?	Tobias Bystrich et.al.	2603.29541	translate	read	null
2026-03-31	Sampling at intermediate temperatures is optimal for training large language models in protein structure prediction	L. Ghiringhelli et.al.	2603.29529	translate	read	null
2026-03-31	LLM Probe: Evaluating LLMs for Low-Resource Languages	Hailay Kidu Teklehaymanot et.al.	2603.29517	translate	read	null
2026-03-31	Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries	Luoxin Chen et.al.	2603.29500	translate	read	null
2026-03-31	Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models	Gabriel Loiseau et.al.	2603.29497	translate	read	null
2026-03-31	MemFactory: Unified Inference & Training Framework for Agent Memory	Ziliang Guo et.al.	2603.29493	translate	read	null
2026-03-31	CXLRAMSim v1.0: System-Level Exploration of CXL Memory Expander Cards	Karan Pathak et.al.	2603.29483	translate	read	null
2026-03-31	M-MiniGPT4: Multilingual VLLM Alignment via Translated Data	Seung Hun Han et.al.	2603.29467	translate	read	null
2026-03-31	An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms	Nils Grünefeld et.al.	2603.29466	translate	read	null
2026-03-31	Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods	Baoyi Zeng et.al.	2603.29454	translate	read	null
2026-03-31	SeGPruner: Semantic-Geometric Visual Token Pruner for 3D Question Answering	Wenli Li et.al.	2603.29437	translate	read	null
2026-03-31	Adversarial Prompt Injection Attack on Multimodal Large Language Models	Meiwen Ding et.al.	2603.29418	translate	read	null
2026-03-31	PRISM: PRIor from corpus Statistics for topic Modeling	Tal Ishon et.al.	2603.29406	translate	read	null
2026-03-31	ELT-Bench-Verified: Benchmark Quality Issues Underestimate AI Agent Capabilities	Christopher Zanoli et.al.	2603.29399	translate	read	null
2026-03-31	Is my model perplexed for the right reason? Contrasting LLMs’ Benchmark Behavior with Token-Level Perplexity	Zoë Prins et.al.	2603.29396	translate	read	null
2026-03-31	Assessing Multimodal Chronic Wound Embeddings with Expert Triplet Agreement	Fabian Kabus et.al.	2603.29376	translate	read	null
2026-03-31	Beyond Idealized Patients: Evaluating LLMs under Challenging Patient Behaviors in Medical Consultations	Yahan Li et.al.	2603.29373	translate	read	null
2026-03-31	AI-Generated Prior Authorization Letters: Strong Clinical Content, Weak Administrative Scaffolding	Moiz Sadiq Awan et.al.	2603.29366	translate	read	null
2026-03-31	Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus	Huan Zhang et.al.	2603.29292	translate	read	null
2026-03-31	MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network	Guozhi Qiu et.al.	2603.29291	translate	read	null
2026-03-31	Sima AIunty: Caste Audit in LLM-Driven Matchmaking	Atharva Naik et.al.	2603.29288	translate	read	null
2026-03-31	Customer Analysis and Text Generation for Small Retail Stores Using LLM-Generated Marketing Presence	Shiori Nakamura et.al.	2603.29273	translate	read	null
2026-03-31	Aligning Multimodal Sequential Recommendations via Robust Direct Preference Optimization with Sparse MoE	Hejin Huang et.al.	2603.29259	translate	read	null
2026-03-31	Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism	Tao Chen et.al.	2603.29252	translate	read	null
2026-03-31	Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs	Zhuowen Liang et.al.	2603.29232	translate	read	null
2026-03-31	Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition	Lukuang Dong et.al.	2603.29217	translate	read	null
2026-03-31	Software Vulnerability Detection Using a Lightweight Graph Neural Network	Miles Farmer et.al.	2603.29216	translate	read	null
2026-03-31	Route-Induced Density and Stability (RIDE): Controlled Intervention and Mechanism Analysis of Routing-Style Meta Prompts on LLM Internal States	Dianxing Zhang et.al.	2603.29206	translate	read	null
2026-03-31	BiMoE: Brain-Inspired Experts for EEG-Dominant Affective State Recognition	Hongyu Zhu et.al.	2603.29205	translate	read	null
2026-03-31	Developing Adaptive Context Compression Techniques for Large Language Models (LLMs) in Long-Running Interactions	Payal Fofadiya et.al.	2603.29193	translate	read	null
2026-03-31	Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping	Guan-Lun Huang et.al.	2603.29161	translate	read	null
2026-03-31	SimMOF: AI agent for Automated MOF Simulations	Jaewoong Lee et.al.	2603.29152	translate	read	null
2026-03-31	Knowledge database development by large language models for countermeasures against viruses and marine toxins	Hung N. Do et.al.	2603.29149	translate	read	null
2026-03-31	REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour	Fares Fawzi et.al.	2603.29142	translate	read	null
2026-03-31	Modernizing Ground Truth: Four Shifts Toward Improving Reliability and Validity in AI in Education	Danielle R. Thomas et.al.	2603.29141	translate	read	null
2026-03-31	SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents	Kuangshi Ai et.al.	2603.29139	translate	read	null
2026-03-31	GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification	Iordanis Fostiropoulos et.al.	2603.29112	translate	read	null
2026-03-31	VueBuds: Visual Intelligence with Wireless Earbuds	Maruchi Kim et.al.	2603.29095	translate	read	null
2026-03-31	WybeCoder: Verified Imperative Code Generation	Fabian Gloeckle et.al.	2603.29088	translate	read	null
2026-03-30	HandX: Scaling Bimanual Motion and Interaction Generation	Zimu Zhang et.al.	2603.28766	translate	read	null
2026-03-30	Adaptive Block-Scaled Data Types	Jack Cook et.al.	2603.28765	translate	read	null
2026-03-30	Rethinking Language Model Scaling under Transferable Hypersphere Optimization	Liliang Ren et.al.	2603.28743	translate	read	null
2026-03-30	SAGAI-MID: A Generative AI-Driven Middleware for Dynamic Runtime Interoperability	Oliver Aleksander Larsen et.al.	2603.28731	translate	read	null
2026-03-30	EpiScreen: Early Epilepsy Detection from Electronic Health Records with Large Language Models	Shuang Zhou et.al.	2603.28698	translate	read	null
2026-03-30	AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding	Haozhe Qi et.al.	2603.28696	translate	read	null
2026-03-30	C2RustXW: Program-Structure-Aware C-to-Rust Translation via Program Analysis and LLM	Yanyan Yan et.al.	2603.28686	translate	read	null
2026-03-30	A Techno-Economic Framework for Cost Modeling and Revenue Opportunities in Open and Programmable AI-RAN	Gabriele Gemmi et.al.	2603.28680	translate	read	null
2026-03-30	Safeguarding LLMs Against Misuse and AI-Driven Malware Using Steganographic Canaries	Md Raz et.al.	2603.28655	translate	read	null
2026-03-30	BACE: LLM-based Code Generation through Bayesian Anchored Co-Evolution of Code and Test Populations	Kaushitha Silva et.al.	2603.28653	translate	read	null
2026-03-30	The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle	Lara Russell-Lasalandra et.al.	2603.28643	translate	read	null
2026-03-30	Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning	Ziqi Miao et.al.	2603.28618	translate	read	null
2026-03-30	ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning	Huanxuan Liao et.al.	2603.28610	translate	read	null
2026-03-30	One stout to rule them all: Reconciling artificial intelligence, data science and malted alcoholic beverages	Dmitrii Usynin et.al.	2603.28607	translate	read	null
2026-03-30	Unsafe2Safe: Controllable Image Anonymization for Downstream Utility	Mih Dinh et.al.	2603.28605	translate	read	null
2026-03-30	Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection	Seyed Parsa Neshaei et.al.	2603.28596	translate	read	null
2026-03-30	MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models	Han Wang et.al.	2603.28590	translate	read	null
2026-03-30	Towards a Medical AI Scientist	Hongtao Wu et.al.	2603.28589	translate	read	null
2026-03-30	Tiered Super-Moore’s Law: Price Evolution, Production Frontiers, and Market Competition in Large Language Model Inference Services	Mingdeng Du et.al.	2603.28576	translate	read	null
2026-03-30	CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments	Yi Yu et.al.	2603.28569	translate	read	null
2026-03-30	XSPA: Crafting Imperceptible X-Shaped Sparse Adversarial Perturbations for Transferable Attacks on VLMs	Chengyin Hu et.al.	2603.28568	translate	read	null
2026-03-30	Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems	Iman Sharifi et.al.	2603.28561	translate	read	null
2026-03-30	EarlySciRev: A Dataset of Early-Stage Scientific Revisions Extracted from LaTeX Writing Traces	Léane Jourdan et.al.	2603.28515	translate	read	null
2026-03-30	Generalizable Detection of AI Generated Images with Large Models and Fuzzy Decision Tree	Fei Wu et.al.	2603.28508	translate	read	null
2026-03-30	Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification	Masnun Nuha Chowdhury et.al.	2603.28488	translate	read	null
2026-03-30	CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains	Wenhan Wang et.al.	2603.28474	translate	read	null
2026-03-30	Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models	Alkis Sygkounas et.al.	2603.28416	translate	read	null
2026-03-30	Within the MDT Room: Situated in Multidisciplinary Team-Grounded Agent Debate for Clinical Diagnosis	Peng Kuai et.al.	2603.28393	translate	read	null
2026-03-30	COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game	Alkis Sygkounas et.al.	2603.28386	translate	read	null
2026-03-30	Using Games to Learn How Large Language Models Work	Allison Chen et.al.	2603.28374	translate	read	null
2026-03-30	Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure	Camilo Chacón Sartori et.al.	2603.28371	translate	read	null
2026-03-30	AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation	Milton Zhou et.al.	2603.28366	translate	read	null
2026-03-30	SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering	Jiho Park et.al.	2603.28363	translate	read	null
2026-03-30	Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science	Yipeng Yu et.al.	2603.28361	translate	read	null
2026-03-30	A Multi-Agent Rhizomatic Pipeline for Non-Linear Literature Analysis	Julio C. Serrano. Joonas Kevari et.al.	2603.28336	translate	read	null
2026-03-30	Integrating Multimodal Large Language Model Knowledge into Amodal Completion	Heecheol Yun et.al.	2603.28333	translate	read	null
2026-03-30	Building evidence-based knowledge graphs from full-text literature for disease-specific biomedical reasoning	Chang Zong et.al.	2603.28325	translate	read	null
2026-03-30	VulnScout-C: A Lightweight Transformer for C Code Vulnerability Detection	Aymen Lassoued et.al.	2603.28309	translate	read	null
2026-03-30	Evaluating LLMs for Answering Student Questions in Introductory Programming Courses	Thomas Van Mullem et.al.	2603.28295	translate	read	null
2026-03-30	Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights	Eneko Valero et.al.	2603.28263	translate	read	null
2026-03-30	Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries	Jon-Paul Cacioli et.al.	2603.28258	translate	read	null
2026-03-30	DiffAttn: Diffusion-Based Drivers’ Visual Attention Prediction with LLM-Enhanced Semantic Reasoning	Weimin Liu et.al.	2603.28251	translate	read	null
2026-03-30	\textit{Versteasch du mi?} Computational and Socio-Linguistic Perspectives on GenAI, LLMs, and Non-Standard Language	Verena Platzgummer et.al.	2603.28213	translate	read	null
2026-03-30	Beyond Cosine Similarity: Zero-Initialized Residual Complex Projection for Aspect-Based Sentiment Analysis	Yijin Wang et.al.	2603.28205	translate	read	null
2026-03-30	ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models	Song Yu et.al.	2603.28204	translate	read	null
2026-03-30	EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling	Yujie Zhang et.al.	2603.28197	translate	read	null
2026-03-30	DongYuan: An LLM-Based Framework for Integrative Chinese and Western Medicine Spleen-Stomach Disorders Diagnosis	Hua Li et.al.	2603.28191	translate	read	null
2026-03-30	PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision	Zehua Han et.al.	2603.28183	translate	read	null
2026-03-30	From Reviews to Requirements: Can LLMs Generate Human-Like User Stories?	Shadman Sakib et.al.	2603.28163	translate	read	null
2026-03-30	Reducing Mental Workload through On-Demand Human Assistance for Physical Action Failures in LLM-based Multi-Robot Coordination	Shoichi Hasegawa et.al.	2603.28156	translate	read	null
2026-03-30	ORACAL: A Robust and Explainable Multimodal Framework for Smart Contract Vulnerability Detection with Causal Graph Enrichment	Tran Duong Minh Dai et.al.	2603.28128	translate	read	null
2026-03-30	Compressing Code Context for LLM-based Issue Resolution	Haoxiang Jia et.al.	2603.28119	translate	read	null
2026-03-30	InconLens: Interactive Visual Diagnosis of Behavioral Inconsistencies in LLM-based Agentic Systems	Shuo Yan et.al.	2603.28106	translate	read	null
2026-03-30	DELTA: A DAG-aware Efficient OCS Logical Topology Optimization Framework for AIDCs	Niangen Ye et.al.	2603.28096	translate	read	null
2026-03-30	Can Large Language Models be a Cardinality Estimator? An Empirical study	Liangzu Liu et.al.	2603.28080	translate	read	null
2026-03-30	SLOW: Strategic Logical-inference Open Workspace for Cognitive Adaptation in AI Tutoring	Yuang Wei et.al.	2603.28062	translate	read	null
2026-03-30	DAInfer+: Neurosymbolic Inference of API Specifications from Documentation via Embedding Models	Maryam Masoudian et.al.	2603.28060	translate	read	null
2026-03-30	Is One-Shot In-Context Learning Helpful for Data Selection in Task-Specific Fine-Tuning of Multimodal LLMs?	Xiao An et.al.	2603.28058	translate	read	null
2026-03-30	Meta-Harness: End-to-End Optimization of Model Harnesses	Yoonho Lee et.al.	2603.28052	translate	read	null
2026-03-30	Beyond the Answer: Decoding the Behavior of LLMs as Scientific Reasoners	Rohan Pandey et.al.	2603.28038	translate	read	null
2026-03-30	Low-Latency Edge LLM Handover via Joint KV Cache Transfer and Token Prefill	Seunghun Lee et.al.	2603.28018	translate	read	null
2026-03-30	Progressive Prompt-Guided Cross-Modal Reasoning for Referring Image Segmentation	Jiachen Li et.al.	2603.27993	translate	read	null
2026-03-30	ViviDoc: Generating Interactive Documents through Human-Agent Collaboration	Yinghao Tang et.al.	2603.27991	translate	read	null
2026-03-30	Principal Prototype Analysis on Manifold for Interpretable Reinforcement Learning	Bodla Krishna Vamshi et.al.	2603.27971	translate	read	null
2026-03-30	CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs	Yongkang Du et.al.	2603.27958	translate	read	null
2026-03-30	Artificial Intelligence in Science: Returns, Reallocation, and Reorganization	Moh Hosseinioun et.al.	2603.27956	translate	read	null
2026-03-30	EnsemJudge: Enhancing Reliability in Chinese LLM-Generated Text Detection through Diverse Model Ensembles	Zhuoshang Wang et.al.	2603.27949	translate	read	null
2026-03-30	JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding	Koki Maeda et.al.	2603.27942	translate	read	null
2026-03-30	GEAKG: Generative Executable Algorithm Knowledge Graphs	Camilo Chacón Sartori et.al.	2603.27922	translate	read	null
2026-03-30	Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey	Bhavuk Jain et.al.	2603.27918	translate	read	null
2026-03-30	ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing	Edward J. Yoon et.al.	2603.27914	translate	read	null
2026-03-25	Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini	Ruofei Du et.al.	2603.24591	translate	read	null
2026-03-25	MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination	Zhuo Li et.al.	2603.24579	translate	read	null
2026-03-25	LensWalk: Agentic Video Understanding by Planning How You See in Videos	Keliang Li et.al.	2603.24558	translate	read	null
2026-03-25	Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents	Samuel Taiwo et.al.	2603.24556	translate	read	null
2026-03-25	Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding	Conrad Borchers et.al.	2603.24535	translate	read	null
2026-03-25	UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience	Zichuan Lin et.al.	2603.24533	translate	read	null
2026-03-25	Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models	Siqi Liu et.al.	2603.24484	translate	read	null
2026-03-25	Mechanic: Sorrifier-Driven Formal Decomposition Workflow for Automated Theorem Proving	Ruichen Qiu et.al.	2603.24465	translate	read	null
2026-03-25	PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation	Manoj Balaji Jagadeeshan et.al.	2603.24413	translate	read	null
2026-03-25	AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model	Yunbo Long et.al.	2603.24402	translate	read	null
2026-03-25	3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models	Bin Yu et.al.	2603.24393	translate	read	null
2026-03-25	When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools	Xingming Li et.al.	2603.24389	translate	read	null
2026-03-25	MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization	Xiangsen Chen et.al.	2603.24382	translate	read	null
2026-03-25	LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control	Yifeng Zhang et.al.	2603.24361	translate	read	null
2026-03-25	Enhancing Efficiency and Performance in Deepfake Audio Detection through Neuron-level dropin & Neuroplasticity Mechanisms	Yupei Li et.al.	2603.24343	translate	read	null
2026-03-25	Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning	Dogan Urgun et.al.	2603.24324	translate	read	null
2026-03-25	Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition	Aleix Sant et.al.	2603.24242	translate	read	null
2026-03-25	UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking	Liren Yu et.al.	2603.24226	translate	read	null
2026-03-25	Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing	Michael Somma et.al.	2603.24221	translate	read	null
2026-03-25	Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias	Mahdi Dehghan et.al.	2603.24218	translate	read	null
2026-03-25	SumRank: Aligning Summarization Models for Long-Document Listwise Reranking	Jincheng Feng et.al.	2603.24204	translate	read	null
2026-03-25	Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search	Yulin Shen et.al.	2603.24203	translate	read	null
2026-03-25	A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula	Cansu Sancaktar et.al.	2603.24202	translate	read	null
2026-03-25	RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution	Yushuai Song et.al.	2603.24198	translate	read	null
2026-03-25	Unlocking Few-Shot Capabilities in LVLMs via Prompt Conditioning and Head Selection	Adhemar de Senneville et.al.	2603.24181	translate	read	null
2026-03-25	Towards Automated Crowdsourced Testing via Personified-LLM	Shengcheng Yu et.al.	2603.24160	translate	read	null
2026-03-25	Linking Global Science Funding to Research Publications	Jacob Aarup Dalsgaard et.al.	2603.24147	translate	read	null
2026-03-25	Sequence-aware Large Language Models for Explainable Recommendation	Gangyi Zhang et.al.	2603.24136	translate	read	null
2026-03-25	MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare	Shubham Kumar Nigam et.al.	2603.24132	translate	read	null
2026-03-25	Alignment Reduces Expressed but Not Encoded Gender Bias: A Unified Framework and Study	Nour Bouchouchi et.al.	2603.24125	translate	read	null
2026-03-25	Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization	Fei Bai et.al.	2603.24093	translate	read	null
2026-03-25	When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm	Ye Leng et.al.	2603.24079	translate	read	null
2026-03-25	ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing	Yu-Chen Kang et.al.	2603.24073	translate	read	null
2026-03-25	Enhanced Mycelium of Thought (EMoT): A Bio-Inspired Hierarchical Reasoning Architecture with Strategic Dormancy and Mnemonic Encoding	Florian Odi Stummer et.al.	2603.24065	translate	read	null
2026-03-25	SOMA: Strategic Orchestration and Memory-Augmented System for Vision-Language-Action Model Robustness via In-Context Adaptation	Zhuoran Li et.al.	2603.24060	translate	read	null
2026-03-25	FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval	Caishuang Huang et.al.	2603.24051	translate	read	null
2026-03-25	ACAVCaps: Enabling large-scale training for fine-grained and diverse audio understanding	Yadong Niu et.al.	2603.24038	translate	read	null
2026-03-25	A^3: Towards Advertising Aesthetic Assessment	Kaiyuan Ji et.al.	2603.24037	translate	read	null
2026-03-25	Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection	Sa Zhu et.al.	2603.24030	translate	read	null
2026-03-25	Thinking with Tables: Enhancing Multi-Modal Tabular Understanding via Neuro-Symbolic Reasoning	Kun-Yang Yu et.al.	2603.24004	translate	read	null
2026-03-25	Forensic Implications of Localized AI: Artifact Analysis of Ollama, LM Studio, and llama.cpp	Shariq Murtuza et.al.	2603.23996	translate	read	null
2026-03-25	Understanding the Challenges in Iterative Generative Optimization with LLMs	Allen Nie et.al.	2603.23994	translate	read	null
2026-03-25	From Untamed Black Box to Interpretable Pedagogical Orchestration: The Ensemble of Specialized LLMs Architecture for Adaptive Tutoring	Nizam Kadir et.al.	2603.23990	translate	read	null
2026-03-25	CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction	Kaize Shi et.al.	2603.23989	translate	read	null
2026-03-25	Can we generate portable representations for clinical time series data using LLMs?	Zongliang Ji et.al.	2603.23987	translate	read	null
2026-03-25	Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score	Jimyung Hong et.al.	2603.23985	translate	read	null
2026-03-25	BRIDG-Q: Barren-Plateau-Resilient Initialisation with Data-Aware LLM-Generated Quantum Circuits	Ngoc Nhi Nguyen et.al.	2603.23979	translate	read	null
2026-03-25	SilLang: Improving Gait Recognition with Silhouette Language Encoding	Ruiyi Zhan et.al.	2603.23976	translate	read	null
2026-03-25	Grounding Arabic LLMs in the Doha Historical Dictionary: Retrieval-Augmented Understanding of Quran and Hadith	Somaya Eltanbouly et.al.	2603.23972	translate	read	null
2026-03-25	Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage	Rishikesh Sahay et.al.	2603.23966	translate	read	null
2026-03-25	From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments	Lijing Luo et.al.	2603.23964	translate	read	null
2026-03-25	PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning	Yankai Wang et.al.	2603.23957	translate	read	null
2026-03-25	Towards Energy-aware Requirements Dependency Classification: Knowledge-Graph vs. Vector-Retrieval Augmented Inference with SLMs	Shreyas Patil et.al.	2603.23954	translate	read	null
2026-03-25	VOLMO: Versatile and Open Large Models for Ophthalmology	Zhenyue Qin et.al.	2603.23953	translate	read	null
2026-03-25	Argument Mining as a Text-to-Text Generation Task	Masayuki Kawarada et.al.	2603.23949	translate	read	null
2026-03-25	Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development	Zongliang Ji et.al.	2603.23937	translate	read	null
2026-03-25	Self-Distillation for Multi-Token Prediction	Guoliang Zhao et.al.	2603.23911	translate	read	null
2026-03-25	AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents	Zhixuan Bao et.al.	2603.23910	translate	read	null
2026-03-25	DUPLEX: Agentic Dual-System Planning via LLM-Driven Information Extraction	Keru Hua et.al.	2603.23909	translate	read	null
2026-03-25	SiftMoE: Similarity-Aware Energy-Efficient Expert Selection for Wireless Distributed MoE Inference	Qian Chen et.al.	2603.23888	translate	read	null
2026-03-25	Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training	Gengluo Li et.al.	2603.23885	translate	read	null
2026-03-25	POSIM: A Multi-Agent Simulation Framework for Social Media Public Opinion Evolution and Governance	Yongmao Zhang et.al.	2603.23884	translate	read	null
2026-03-25	ProcureGym: A Multi-Agent Markov Game Framework for Modeling National Volume-based Drug Procurement	Jia Wang et.al.	2603.23880	translate	read	null
2026-03-25	Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios	Li Ma et.al.	2603.23875	translate	read	null
2026-03-25	HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation	Ken Ding et.al.	2603.23871	translate	read	null
2026-03-25	APISENSOR: Robust Discovery of Web API from Runtime Traffic Logs	Yanjing Yang et.al.	2603.23852	translate	read	null
2026-03-25	VILLA: Versatile Information Retrieval From Scientific Literature Using Large LAnguage Models	Blessy Antony et.al.	2603.23849	translate	read	null
2026-03-25	PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay	Rohan Khetan et.al.	2603.23841	translate	read	null
2026-03-25	Bridging the Interpretation Gap in Accessibility Testing: Empathetic and Legal-Aware Bug Report Generation via Large Language Models	Ryoya Koyama et.al.	2603.23828	translate	read	null
2026-03-25	How Vulnerable Are Edge LLMs?	Ao Ding et.al.	2603.23822	translate	read	null
2026-03-25	How are AI agents used? Evidence from 177,000 MCP tools	Merlin Stein et.al.	2603.23802	translate	read	null
2026-03-25	Human, AI, and Hybrid Ensembles for Detection of Adaptive, RL-based Social Bots	Valerio La Gatta et.al.	2603.23796	translate	read	null
2026-03-24	Sparse Autoencoders for Interpretable Medical Image Representation Learning	Philipp Wesp et.al.	2603.23794	translate	read	null
2026-03-24	The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense	Qianlong Lan et.al.	2603.23791	translate	read	null
2026-03-24	Leveraging Large Language Models for Trustworthiness Assessment of Web Applications	Oleksandr Yarotskyi et.al.	2603.23781	translate	read	null
2026-03-24	Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters	Nan Cui et.al.	2603.23780	translate	read	null
2026-03-24	AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation Networks	Md. Kamrul Hossain et.al.	2603.23772	translate	read	null
2026-03-24	IslamicMMLU: A Benchmark for Evaluating LLMs on Islamic Knowledge	Ali Abdelaal et.al.	2603.23750	translate	read	null
2026-03-24	Exploring Self-Tracking Practices of Older Adults with CVD to Inform the Design of LLM-Enabled Health Data Sensemaking	Duosi Dai et.al.	2603.23733	translate	read	null
2026-03-24	LLMs Do Not Grade Essays Like Humans	Jerin George Mathew et.al.	2603.23714	translate	read	null
2026-03-24	The Diminishing Returns of Early-Exit Decoding in Modern LLMs	Rui Wei et.al.	2603.23701	translate	read	null
2026-03-24	Towards Leveraging LLMs to Generate Abstract Penetration Test Cases from Software Architecture	Mahdi Jafari et.al.	2603.23698	translate	read	null
2026-03-24	Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots	Licol Zeinfeld et.al.	2603.23682	translate	read	null
2026-03-24	PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation	Manjushree B. Aithal et.al.	2603.23678	translate	read	null
2026-03-24	Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models	Mohammad Saleh Vahdatpour et.al.	2603.23668	translate	read	null
2026-03-24	GTO Wizard Benchmark	Marc-Antoine Provost et.al.	2603.23660	translate	read	null
2026-03-24	Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges	Weilun Xu et.al.	2603.23659	translate	read	null
2026-03-24	Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks	Fatih Uenal et.al.	2603.23646	translate	read	null
2026-03-24	LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load	Pranay Tummalapalli et.al.	2603.23640	translate	read	null
2026-03-24	Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments	Yi Han et.al.	2603.23638	translate	read	null
2026-03-24	Detect–Repair–Verify for LLM-Generated Code: A Multi-Language, Multi-Granularity Empirical Study	Cheng Cheng et.al.	2603.23633	translate	read	null
2026-03-24	Ukrainian Visual Word Sense Disambiguation Benchmark	Yurii Laba et.al.	2603.23627	translate	read	null
2026-03-24	A Theory of LLM Information Susceptibility	Zhuo-Yang Song et.al.	2603.23626	translate	read	null
2026-03-24	Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths	Amani Maina-Kilaas et.al.	2603.23624	translate	read	null
2026-03-24	LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops	Ravin Ravi et.al.	2603.23613	translate	read	null
2026-03-24	LLMORPH: Automated Metamorphic Testing of Large Language Models	Steven Cho et.al.	2603.23611	translate	read	null
2026-03-24	Environment Maps: Structured Environmental Representations for Long-Horizon Agents	Yenchia Feng et.al.	2603.23610	translate	read	null
2026-03-24	The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations	Long Zhang et.al.	2603.23577	translate	read	null
2026-03-24	APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs	Meriem Bouzouad et.al.	2603.23575	translate	read	null
2026-03-23	Mixture of Demonstrations for Textual Graph Understanding and Question Answering	Yukun Wu et.al.	2603.23554	translate	read	null
2026-03-24	MedObvious: Exposing the Medical Moravec’s Paradox in VLMs via Clinical Triage	Ufaq Khan et.al.	2603.23501	translate	read	null
2026-03-24	Failure of contextual invariance in gender inference with large language models	Sagar Kumar et.al.	2603.23485	translate	read	null
2026-03-24	SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning	Haoyu Huang et.al.	2603.23483	translate	read	null
2026-03-24	ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains	Muhammad Khalid et.al.	2603.23482	translate	read	null
2026-03-24	UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation	Jiaying Lin et.al.	2603.23478	translate	read	null
2026-03-24	Evidence of political bias in search engines and language models before major elections	Íris Damião et.al.	2603.23474	translate	read	null
2026-03-24	ConceptCoder: Improve Code Reasoning via Concept Learning	Md Mahbubur Rahman et.al.	2603.23470	translate	read	null
2026-03-24	3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding	Yiping Chen et.al.	2603.23447	translate	read	null
2026-03-24	Evaluating LLM-Based Test Generation Under Software Evolution	Sabaat Haroon et.al.	2603.23443	translate	read	null
2026-03-24	SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling	Yiqi Zhang et.al.	2603.23414	translate	read	null
2026-03-24	Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies	Hanzhong Zhang et.al.	2603.23406	translate	read	null
2026-03-24	Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning	Jiacheng Hua et.al.	2603.23404	translate	read	null
2026-03-24	Off-Policy Value-Based Reinforcement Learning for Large Language Models	Peng-Yuan Wang et.al.	2603.23355	translate	read	null
2026-03-24	Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings	Hanjing Wang et.al.	2603.23322	translate	read	null
2026-03-24	ARGENT: Adaptive Hierarchical Image-Text Representations	Chuong Huynh et.al.	2603.23311	translate	read	null
2026-03-24	Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression	V. K. Cody Bumgardner et.al.	2603.23308	translate	read	null
2026-03-24	Designing Agentic AI-Based Screening for Portfolio Investment	Mehmet Caner et.al.	2603.23300	translate	read	null
2026-03-24	Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook	Luca Sodano et.al.	2603.23279	translate	read	null
2026-03-24	A Multimodal Framework for Human-Multi-Agent Interaction	Shaid Hasan et.al.	2603.23271	translate	read	null
2026-03-24	Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs	Wenyu Chen et.al.	2603.23269	translate	read	null
2026-03-24	SafeSeek: Universal Attribution of Safety Circuits in Language Models	Miao Yu et.al.	2603.23268	translate	read	null
2026-03-24	Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models	Nasser A Alsadhan et.al.	2603.23251	translate	read	null
2026-03-24	MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation	Yurui Chang et.al.	2603.23234	translate	read	null
2026-03-24	PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments	Shuochen Liu et.al.	2603.23231	translate	read	null
2026-03-24	I Came, I Saw, I Explained: Benchmarking Multimodal LLMs on Figurative Meaning in Memes	Shijia Zhou et.al.	2603.23229	translate	read	null
2026-03-24	Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?	Nasser A Alsadhan et.al.	2603.23219	translate	read	null
2026-03-24	Sparser, Faster, Lighter Transformer Language Models	Edoardo Cetin et.al.	2603.23198	translate	read	null
2026-03-24	ViKey: Enhancing Temporal Understanding in Videos via Visual Prompting	Yeonkyung Lee et.al.	2603.23186	translate	read	null
2026-03-24	Robust Safety Monitoring of Language Models via Activation Watermarking	Toluwani Aremu et.al.	2603.23171	translate	read	null
2026-03-24	Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models	Massimiliano Pappa et.al.	2603.23149	translate	read	null
2026-03-24	Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy	Shushanta Pudasaini et.al.	2603.23146	translate	read	null
2026-03-24	Can Language Models Pass Software Testing Certification Exams? a case study	Fitash Ul Haq et.al.	2603.23142	translate	read	null
2026-03-24	HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature	Devvrat Joshi et.al.	2603.23136	translate	read	null
2026-03-24	InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance	Dongwei Pan et.al.	2603.23132	translate	read	null
2026-03-24	SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions	Jinzhe Tu et.al.	2603.23118	translate	read	null
2026-03-24	AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection	Yangxin Yu et.al.	2603.23115	translate	read	null
2026-03-24	When Language Models Lose Their Mind: The Consequences of Brain Misalignment	Gabriele Merlin et.al.	2603.23091	translate	read	null
2026-03-24	Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure	Michael Klesel et.al.	2603.23075	translate	read	null
2026-03-24	Can an LLM Detect Instances of Microservice Infrastructure Patterns?	Carlos Eduardo Duarte et.al.	2603.23073	translate	read	null
2026-03-24	MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding	Basit Alawode et.al.	2603.23067	translate	read	null
2026-03-24	Post-Selection Distributional Model Evaluation	Amirmohammad Farzaneh et.al.	2603.23055	translate	read	null
2026-03-24	DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement	Amith Nagarajan et.al.	2603.23050	translate	read	null
2026-03-24	PCR: A Prefetch-Enhanced Cache Reuse System for Low-Latency RAG Serving	Wenfeng Wang et.al.	2603.23049	translate	read	null
2026-03-24	Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation	Julian Oestreich et.al.	2603.23047	translate	read	null
2026-03-24	Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps	Chanyoung Gwak et.al.	2603.23023	translate	read	null
2026-03-24	Can Large Language Models Reason and Optimize Under Constraints?	Fabien Bernier et.al.	2603.23004	translate	read	null
2026-03-24	JFTA-Bench: Evaluate LLM’s Ability of Tracking and Analyzing Malfunctions Using Fault Trees	Yuhui Wang et.al.	2603.22978	translate	read	null
2026-03-24	Beyond Theoretical Bounds: Empirical Privacy Loss Calibration for Text Rewriting Under Local Differential Privacy	Weijun Li et.al.	2603.22968	translate	read	null
2026-03-24	Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees	Ye Li et.al.	2603.22966	translate	read	null
2026-03-24	Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion	Shuangwu Qian et.al.	2603.22946	translate	read	null
2026-03-24	From Morality Installation in LLMs to LLMs in Morality-as-a-System	Gunter Bombaerts et.al.	2603.22944	translate	read	null
2026-03-24	Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning	Anshul Solanki et.al.	2603.22942	translate	read	null
2026-03-24	Ran Score: a LLM-based Evaluation Score for Radiology Report Generation	Ran Zhang et.al.	2603.22935	translate	read	null
2026-03-24	ProGRank: Probe-Gradient Reranking to Defend Dense-Retriever RAG from Corpus Poisoning	Xiangyu Yin et.al.	2603.22934	translate	read	null
2026-03-24	SoK: The Attack Surface of Agentic AI – Tools, and Autonomy	Ali Dehghantanha et.al.	2603.22928	translate	read	null
2026-03-24	Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion	Qi Sun et.al.	2603.22922	translate	read	null
2026-03-24	EVA: Efficient Reinforcement Learning for End-to-End Video Agent	Yaolun Zhang et.al.	2603.22918	translate	read	null
2026-03-24	ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling	Shaobo Ju et.al.	2603.22911	translate	read	null
2026-03-24	EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction	Yixuan Wang et.al.	2603.22910	translate	read	null
2026-03-24	Separating Diagnosis from Control: Auditable Policy Adaptation in Agent-Based Simulations with LLM-Based Diagnostics	Shaoxin Zhong et.al.	2603.22904	translate	read	null
2026-03-24	VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents	Pengsen Liu et.al.	2603.22892	translate	read	null
2026-03-24	TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration	Chunxiao Li et.al.	2603.22882	translate	read	null
2026-03-24	ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance	Hyojin Park et.al.	2603.22872	translate	read	null
2026-03-24	Dynamical Systems Theory Behind a Hierarchical Reasoning Model	Vasiliy A. Es’kin et.al.	2603.22871	translate	read	null
2026-03-24	Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories	Yang Li et.al.	2603.22869	translate	read	null
2026-03-24	Aerial Agentic AI: Synergizing LLM and SLM for Low-Altitude Wireless Networks	Li Dong et.al.	2603.22866	translate	read	null
2026-03-24	The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration	Haoyuan Xu et.al.	2603.22862	translate	read	null
2026-03-24	Who Sits Where? Automated Detection of Director Interlocks in Indian Companies	Prateek Sancheti et.al.	2603.22860	translate	read	null
2026-03-24	Retrieval-Guided Photovoltaic Inventory Estimation from Satellite Imagery for Distribution Grid Planning	Muhao Guo et.al.	2603.22856	translate	read	null
2026-03-24	Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts	Maida Aizaz et.al.	2603.22837	translate	read	null
2026-03-24	Improving Safety Alignment via Balanced Direct Preference Optimization	Shiji Zhao et.al.	2603.22829	translate	read	null
2026-03-24	Focus, Don’t Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding	Mincheol Kwon et.al.	2603.22815	translate	read	null
2026-03-24	Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration	Qiyao Sun et.al.	2603.22812	translate	read	null
2026-03-24	Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss	Blake Matheny et.al.	2603.22799	translate	read	null
2026-03-24	Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models	Amir Azarmehr et.al.	2603.22784	translate	read	null
2026-03-24	Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models	Wenyue Chen et.al.	2603.22782	translate	read	null
2026-03-24	KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao	Zhi Sun et.al.	2603.22779	translate	read	null
2026-03-24	AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model	Yagizhan Bilal Durak et.al.	2603.22777	translate	read	null
2026-03-24	Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference	Euijun Chung et.al.	2603.22774	translate	read	null
2026-03-24	DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona	Janghyeok Choi et.al.	2603.22765	translate	read	null
2026-03-24	ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding	Ao Cheng et.al.	2603.22763	translate	read	null
2026-03-24	MVPBench: A Multi-Video Perception Evaluation Benchmark for Multi-Modal Video Understanding	Purui Bai et.al.	2603.22756	translate	read	null
2026-03-24	PRISM: A Dual View of LLM Reasoning through Semantic Flow and Latent Computation	Ruidi Chang et.al.	2603.22754	translate	read	null
2026-03-24	CIPL: A Target-Independent Framework for Channel-Inversion Privacy Leakage in Agents	Tao Huang et.al.	2603.22751	translate	read	null
2026-03-24	Beyond Binary Correctness: Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks	Abhishek Chandwani et.al.	2603.22744	translate	read	null
2026-03-24	Explanation Generation for Contradiction Reconciliation with LLMs	Jason Chan et.al.	2603.22735	translate	read	null
2026-03-24	HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment	Sangmin Jo et.al.	2603.22721	translate	read	null
2026-03-24	Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval	Bushra Sabir et.al.	2603.22717	translate	read	null
2026-03-24	Detecting Non-Membership in LLM Training Data via Rank Correlations	Pranav Shetty et.al.	2603.22707	translate	read	null
2026-03-24	Synthetic or Authentic? Building Mental Patient Simulators from Longitudinal Evidence	Baihan Li et.al.	2603.22704	translate	read	null
2026-03-24	GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning	Jiayin Sun et.al.	2603.22687	translate	read	null
2026-03-24	Improving LLM Predictions via Inter-Layer Structural Encoders	Tom Ulanovski et.al.	2603.22665	translate	read	null
2026-03-24	Benchmarking Multi-Agent LLM Architectures for Financial Document Processing: A Comparative Study of Orchestration Patterns, Cost-Accuracy Tradeoffs and Production Scaling Strategies	Siddhant Kulkarni et.al.	2603.22651	translate	read	null
2026-03-23	AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research	Zefei Xie et.al.	2603.22648	translate	read	null
2026-03-23	Multi-Method Validation of Large Language Model Medical Translation Across High- and Low-Resource Languages	Chukwuebuka Anyaegbuna et.al.	2603.22642	translate	read	null
2026-03-23	LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation	Hailay Teklehaymanot et.al.	2603.22629	translate	read	null
2026-03-23	To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models	OFM Riaz Rahman Aranya et.al.	2603.22623	translate	read	null
2026-03-23	Emotional Support with Conversational AI: Talking to Machines About Life	Olivia Yan Huang et.al.	2603.22618	translate	read	null
2026-03-23	BioShield: A Context-Aware Firewall for Securing Bio-LLMs	Protiva Das et.al.	2603.22612	translate	read	null
2026-03-23	Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length	Jingxuan Chen et.al.	2603.22608	translate	read	null
2026-03-23	Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?	Richard J. Young et.al.	2603.22582	translate	read	null
2026-03-23	STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving	James Hugglestone et.al.	2603.22577	translate	read	null
2026-03-23	CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context	Giovana Kerche Bonás et.al.	2603.22576	translate	read	null
2026-03-23	TrustTrade: Human-Inspired Selective Consensus Reduces Decision Uncertainty in LLM Trading Agents	Minghan Li et.al.	2603.22567	translate	read	null
2026-03-23	Reddit After Roe: A Computational Analysis of Abortion Narratives and Barriers in the Wake of Dobbs	Aria Pessianzadeh et.al.	2603.22566	translate	read	null
2026-03-23	Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling	Young Hyun Cho et.al.	2603.22563	translate	read	null
2026-03-23	GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs	Achmad Anggawirya Alimin et.al.	2603.22528	translate	read	null
2026-03-23	LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface	Michael Hind et.al.	2603.22519	translate	read	null
2026-03-23	Generating and Evaluating Sustainable Procurement Criteria for the Swiss Public Sector using In-Context Prompting with Large Language Models	Yingqiang Gao et.al.	2603.22513	translate	read	null
2026-03-23	Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals	Ali Safari et.al.	2603.22510	translate	read	null
2026-03-23	Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning	Niyati Bafna et.al.	2603.22497	translate	read	null
2026-03-23	Tiny Inference-Time Scaling with Latent Verifiers	Davide Bucciarelli et.al.	2603.22492	translate	read	null
2026-03-23	From Brittle to Robust: Improving LLM Annotations for SE Optimization	Lohith Senthilkumar et.al.	2603.22474	translate	read	null
2026-03-23	LLM-guided headline rewriting for clickability enhancement without clickbait	Yehudit Aperstein et.al.	2603.22459	translate	read	null
2026-03-23	Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs	Haoming Meng et.al.	2603.22446	translate	read	null
2026-03-23	From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents	Ling Yue et.al.	2603.22386	translate	read	null
2026-03-23	FAAR: Format-Aware Adaptive Rounding for NVFP4	Hanglin Li et.al.	2603.22370	translate	read	null
2026-03-23	Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window	Ivan Dobrovolskyi et.al.	2603.22367	translate	read	null
2026-03-22	Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees	Alberlucia Rafael Soarez et.al.	2603.22355	translate	read	null
2026-03-21	Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study	Jenny Gao et.al.	2603.22344	translate	read	null
2026-03-21	T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search	Hyomin Lee et.al.	2603.22341	translate	read	null
2026-03-21	Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation	Chu Zhao et.al.	2603.22335	translate	read	null
2026-03-20	Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms	Arthur Dantas Mangussi et.al.	2603.22332	translate	read	null
2026-03-23	VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding	Ruoliu Yang et.al.	2603.22285	translate	read	null
2026-03-23	3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing	Haoyu Zhen et.al.	2603.22279	translate	read	null
2026-03-23	The Dual Mechanisms of Spatial Reasoning in Vision-Language Models	Kelly Cui et.al.	2603.22278	translate	read	null
2026-03-23	Greater accessibility can amplify discrimination in generative AI	Carolin Holtermann et.al.	2603.22260	translate	read	null
2026-03-23	RotorMap and Quantum Fingerprints of DNA Sequences via Rotary Position Embeddings	Danylo Yakymenko et.al.	2603.22245	translate	read	null
2026-03-23	Gumbel Distillation for Parallel Text Generation	Chi Zhang et.al.	2603.22216	translate	read	null
2026-03-23	Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models	Tom Biskupski et.al.	2603.22214	translate	read	null
2026-03-23	SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection	Kexian Tang et.al.	2603.22213	translate	read	null
2026-03-23	Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement	Junrong Guo et.al.	2603.22187	translate	read	null
2026-03-23	Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation	Ireh Kim et.al.	2603.22186	translate	read	null
2026-03-23	Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?	Oscar Novo et.al.	2603.22184	translate	read	null
2026-03-23	Closed-Loop Verbal Reinforcement Learning for Task-Level Robotic Planning	Dmitrii Plotnikov et.al.	2603.22169	translate	read	null
2026-03-23	Causal Evidence that Language Models use Confidence to Drive Behavior	Dharshan Kumaran et.al.	2603.22161	translate	read	null
2026-03-23	Multimodal Survival Analysis with Locally Deployable Large Language Models	Moritz Gögl et.al.	2603.22158	translate	read	null
2026-03-23	On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation	Kexin Huang et.al.	2603.22117	translate	read	null
2026-03-23	Lemma Discovery in Agentic Program Verification	Huan Zhao et.al.	2603.22114	translate	read	null
2026-03-23	SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning	Byungwoo Jeon et.al.	2603.22057	translate	read	null
2026-03-23	Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch	Stella Eva Tsiapali et.al.	2603.22056	translate	read	null
2026-03-23	Dynamic analysis enhances issue resolution	Mingwei Liu et.al.	2603.22048	translate	read	null
2026-03-23	AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing	Peter Pak et.al.	2603.22017	translate	read	null
2026-03-23	ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention	Xinyan Wang et.al.	2603.22016	translate	read	null
2026-03-23	SecureBreak – A dataset towards safe and secure models	Marco Arazzi et.al.	2603.21975	translate	read	null
2026-03-23	Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe	Xixi Wu et.al.	2603.21972	translate	read	null
2026-03-23	Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning	Ulugbek Shernazarov et.al.	2603.21970	translate	read	null
2026-03-23	Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention	Junhao Du et.al.	2603.21957	translate	read	null
2026-03-23	Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection	Youbin Kim et.al.	2603.21944	translate	read	null
2026-03-23	ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive Text-to-Image Retrieval	Zhuocheng Zhang et.al.	2603.21886	translate	read	null
2026-03-23	P^2O: Joint Policy and Prompt Optimization	Xinyu Lu et.al.	2603.21877	translate	read	null
2026-03-23	Holistic Scaling Laws for Optimal Mixture-of-Experts Architecture Optimization	Weilin Wan et.al.	2603.21862	translate	read	null
2026-03-23	Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models	Aryan Kasat et.al.	2603.21854	translate	read	null
2026-03-23	Asymmetric Dynamics of Partisan Warriors in YouTube Comments	Keyeun Lee et.al.	2603.21776	translate	read	null
2026-03-23	The Presupposition Problem in Representation Genesis	Yiling Wu et.al.	2603.21745	translate	read	null
2026-03-23	EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning	Andreas Sauter et.al.	2603.21728	translate	read	null
2026-03-23	CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning	Shuo Wang et.al.	2603.21725	translate	read	null
2026-03-23	Can a Robot Walk the Robotic Dog: Triple-Zero Collaborative Navigation for Heterogeneous Multi-Agent Systems	Yaxuan Wang et.al.	2603.21723	translate	read	null
2026-03-23	SemEval-2026 Task 12: Abductive Event Reasoning: Towards Real-World Event Causal Inference for Large Language Models	Pengfei Cao et.al.	2603.21720	translate	read	null
2026-03-23	Probing How Scalable Table Data Enhances General Long-Context Reasoning	Huaibing Xie et.al.	2603.21719	translate	read	null
2026-03-23	Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning	Xi Wang et.al.	2603.21708	translate	read	null
2026-03-23	Data-Free Layer-Adaptive Merging via Fisher Information for Long-to-Short Reasoning LLMs	Tian Xia et.al.	2603.21705	translate	read	null
2026-03-23	Rethinking Token Reduction for Large Vision-Language Models	Yi Wang et.al.	2603.21701	translate	read	null
2026-03-23	Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models	Rui Yang Tan et.al.	2603.21697	translate	read	null
2026-03-23	Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain	Mohammad Asadi et.al.	2603.21693	translate	read	null
2026-03-23	AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design	Yicai Xing et.al.	2603.21690	translate	read	null
2026-03-23	Is AI Ready for Multimodal Hate Speech Detection? A Comprehensive Dataset and Benchmark Evaluation	Rui Xing et.al.	2603.21686	translate	read	null
2026-03-23	Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion	Shixu Liu et.al.	2603.21673	translate	read	null
2026-03-23	HumanOmni-Speaker: Identifying Who said What and When	Detao Bai et.al.	2603.21664	translate	read	null
2026-03-23	TAMTRL: Teacher-Aligned Reward Reshaping for Multi-Turn Reinforcement Learning in Long-Context Compression	Li Wang et.al.	2603.21663	translate	read	null
2026-03-23	OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging	Meilin Liu et.al.	2603.21660	translate	read	null
2026-03-23	Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks	Yanming Mu et.al.	2603.21654	translate	read	null
2026-03-23	Auditing MCP Servers for Over-Privileged Tool Capabilities	Charoes Huang et.al.	2603.21641	translate	read	null
2026-03-23	Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks	Yiliang Song et.al.	2603.21636	translate	read	null
2026-03-23	AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents	Tianyi Li et.al.	2603.21613	translate	read	null
2026-03-23	Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence	Philip S. Yu et.al.	2603.21601	translate	read	null
2026-03-23	SSAM: Singular Subspace Alignment for Merging Multimodal Large Language Models	Md Kaykobad Reza et.al.	2603.21584	translate	read	null
2026-03-23	Overview of TREC 2025 Biomedical Generative Retrieval (BioGen) Track	Deepak Gupta et.al.	2603.21582	translate	read	null
2026-03-23	Mind over Space: Can Multimodal Large Language Models Mentally Navigate?	Qihui Zhu et.al.	2603.21577	translate	read	null
2026-03-23	Adaptive Robust Estimator for Multi-Agent Reinforcement Learning	Zhongyi Li et.al.	2603.21574	translate	read	null
2026-03-23	DATASHI: A Parallel English-Tashlhiyt Corpus for Orthography Normalization and Low-Resource Language Processing	Nasser-Eddine Monir et.al.	2603.21571	translate	read	null
2026-03-23	Kolmogorov Complexity Bounds for LLM Steganography and a Perplexity-Based Detection Proxy	Andrii Shportko et.al.	2603.21567	translate	read	null
2026-03-23	Counterfactual Credit Policy Optimization for Multi-Agent Collaboration	Zhongyi Li et.al.	2603.21563	translate	read	null
2026-03-23	AI In Cybersecurity Education – Scalable Agentic CTF Design Principles and Educational Outcomes	Haoran Xi et.al.	2603.21551	translate	read	null
2026-03-23	LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search	Yujia Chen et.al.	2603.21530	translate	read	null
2026-03-23	SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification	Migyeong Kang et.al.	2603.21529	translate	read	null
2026-03-23	VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection	Xinghan Li et.al.	2603.21526	translate	read	null
2026-03-23	CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs	Ravi Ranjan et.al.	2603.21524	translate	read	null
2026-03-23	SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems	Weizhe Xu et.al.	2603.21523	translate	read	null
2026-03-23	Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation	Lingzhe Zhang et.al.	2603.21522	translate	read	null
2026-03-23	Generalizable Self-Evolving Memory for Automatic Prompt Optimization	Guanbao Liang et.al.	2603.21520	translate	read	null
2026-03-23	Triangulating Temporal Dynamics in Multilingual Swiss Online News	Bros Victor et.al.	2603.21519	translate	read	null
2026-03-23	Learning Inflation Narratives from Reddit: How Lightweight LLMs Reveal Forward-Looking Economic Signals	Ryuichi Saito et.al.	2603.21501	translate	read	null
2026-03-23	Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment	Mohamed Sobhi Jabal et.al.	2603.21494	translate	read	null
2026-03-23	Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation	Jingnan Luo et.al.	2603.21488	translate	read	null
2026-03-23	TagLLM: A Fine-Grained Tag Generation Approach for Note Recommendation	Zhijian Chen et.al.	2603.21481	translate	read	null
2026-03-23	Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns	Wihan van der Heever et.al.	2603.21473	translate	read	null
2026-03-23	DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation	Siqi Guo et.al.	2603.21465	translate	read	null
2026-03-22	Deliberative multi-agent large language models improve clinical reasoning in ophthalmology	Ehsan Misaghi et.al.	2603.21447	translate	read	null
2026-03-22	KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning	Shuai Wang et.al.	2603.21440	translate	read	null
2026-03-22	DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation	Shuai Wang et.al.	2603.21430	translate	read	null
2026-03-22	Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models	Jingchen Sun et.al.	2603.21426	translate	read	null
2026-03-22	Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs	Mariela M. Nina et.al.	2603.21418	translate	read	null
2026-03-22	Enterprise Sales Copilot: Enabling Real-Time AI Support with Automatic Information Retrieval in Live Sales Calls	Jielin Qiu et.al.	2603.21416	translate	read	null
2026-03-22	Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures	Gregory M. Ruddell et.al.	2603.21415	translate	read	null
2026-03-22	Multi-Perspective LLM Annotations for Valid Analyses in Subjective Tasks	Navya Mehrotra et.al.	2603.21404	translate	read	null
2026-03-22	Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors	Johnathan Sun et.al.	2603.21398	translate	read	null
2026-03-22	Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models	Jinghan Cao et.al.	2603.21389	translate	read	null
2026-03-22	PLR: Plackett-Luce for Reordering In-Context Learning Examples	Pawel Batorski et.al.	2603.21373	translate	read	null
2026-03-22	TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference	Jaber Jaber et.al.	2603.21365	translate	read	null
2026-03-22	Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF	K. M. Jubair Sami et.al.	2603.21359	translate	read	null
2026-03-22	RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models	Dongyoung Kim et.al.	2603.21341	translate	read	null
2026-03-22	COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding	Xiaozhe Li et.al.	2603.21329	translate	read	null
2026-03-22	Improving Coherence and Persistence in Agentic AI for System Optimization	Pantea Karimi et.al.	2603.21321	translate	read	null
2026-03-22	Enhancing reasoning accuracy in large language models during inference time	Vinay Sharma et.al.	2603.21301	translate	read	null
2026-03-22	When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning	Zhengxian Wu et.al.	2603.21289	translate	read	null
2026-03-22	When the Chain Breaks: Interactive Diagnosis of LLM Chain-of-Thought Reasoning Errors	Shiwei Chen et.al.	2603.21286	translate	read	null
2026-03-22	WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making	Zongjie Li et.al.	2603.21280	translate	read	null
2026-03-22	Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations	Pranav Hemanth et.al.	2603.21278	translate	read	null
2026-03-22	Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity	Zihan Fang et.al.	2603.21276	translate	read	null
2026-03-22	Graph of States: Solving Abductive Tasks with Large Language Models	Yu Luo et.al.	2603.21250	translate	read	null
2026-03-22	Graph Fusion Across Languages using Large Language Models	Kaung Myat Kyaw et.al.	2603.21248	translate	read	null
2026-03-22	ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models	Haoyu Qiao et.al.	2603.21237	translate	read	null
2026-03-22	QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression	Zhongyang Li et.al.	2603.21232	translate	read	null
2026-03-22	Context Selection for Hypothesis and Statistical Evidence Extraction from Full-Text Scientific Articles	Sai Koneru et.al.	2603.21193	translate	read	null
2026-03-22	DS2SC-Agent: A Multi-Agent Automated Pipeline for Rapid Chiplet Model Generation	Yiwei Wu et.al.	2603.21190	translate	read	null
2026-03-22	GIDE: Unlocking Diffusion LLMs for Precise Training-Free Image Editing	Zifeng Zhu et.al.	2603.21176	translate	read	null
2026-03-22	Reward Sharpness-Aware Fine-Tuning for Diffusion Models	Kwanyoung Kim et.al.	2603.21175	translate	read	null
2026-03-22	Explainable Semantic Textual Similarity via Dissimilar Span Detection	Diego Miguel Lozano et.al.	2603.21174	translate	read	null
2026-03-22	Many Dialects, Many Languages, One Cultural Lens: Evaluating Multilingual VLMs for Bengali Culture Understanding Across Historically Linked Languages and Regional Dialects	Nurul Labib Sayeedi et.al.	2603.21165	translate	read	null
2026-03-22	Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning	Leonid Ugadiarov et.al.	2603.21162	translate	read	null
2026-03-22	Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs	Zihui Chen et.al.	2603.21155	translate	read	null
2026-03-22	TRACE: A Multi-Agent System for Autonomous Physical Reasoning in Seismological	Feng Liu et.al.	2603.21152	translate	read	null
2026-03-22	ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation	Zhuojie Yang et.al.	2603.21140	translate	read	null
2026-03-22	CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs	Shanmukha Vellamcheti et.al.	2603.21114	translate	read	null
2026-03-22	Evaluating Reasoning-Based Scaffolds for Human-AI Co-Annotation: The ReasonAlign Annotation Protocol	Smitha Muthya Sudheendra et.al.	2603.21094	translate	read	null
2026-03-22	CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models	Nan Zhou et.al.	2603.21077	translate	read	null
2026-03-22	When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound	Yasamin Medghalchi et.al.	2603.21047	translate	read	null
2026-03-22	Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models	Abdul-Salem Beibitkhan et.al.	2603.21036	translate	read	null
2026-03-22	KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph	Ye Tian et.al.	2603.21029	translate	read	null
2026-03-22	SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration	Zihan Guo et.al.	2603.21019	translate	read	null
2026-03-22	Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO	Jinquan Zheng et.al.	2603.21016	translate	read	null
2026-03-22	CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs	Florent Draye et.al.	2603.21014	translate	read	null
2026-03-22	ECI: Effective Contrastive Information to Evaluate Hard-Negatives	Aarush Sinha et.al.	2603.20990	translate	read	null
2026-03-22	Can we automatize scientific discovery in the cognitive sciences?	Akshay K. Jagadish et.al.	2603.20988	translate	read	null
2026-03-21	Detection of adversarial intent in Human-AI teams using LLMs	Abed K. Musaffar et.al.	2603.20976	translate	read	null
2026-03-21	Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification	Kemal Kirtac et.al.	2603.20965	translate	read	null
2026-03-21	Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models	Xinyue Liu et.al.	2603.20957	translate	read	null
2026-03-21	User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction	Yuren Hao et.al.	2603.20939	translate	read	null
2026-03-21	AC4A: Access Control for Agents	Reshabh K Sharma et.al.	2603.20933	translate	read	null
2026-03-21	Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues	Tai-Quan Peng et.al.	2603.20911	translate	read	null
2026-03-21	LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models	Amirmohammad Ziaei Bideh et.al.	2603.20910	translate	read	null
2026-03-21	Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach	Hongyu Cao et.al.	2603.20899	translate	read	null
2026-03-21	AcoustEmo: Open-Vocabulary Emotion Reasoning via Utterance-Aware Acoustic Q-Former	Liyun Zhang et.al.	2603.20894	translate	read	null
2026-03-21	RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation	Kaustubh D. Dhole et.al.	2603.20882	translate	read	null
2026-03-21	Engineering Pitfalls in AI Coding Tools: An Empirical Study of Bugs in Claude Code, Codex, and Gemini CLI	Ruixin Zhang et.al.	2603.20847	translate	read	null
2026-03-21	Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models	Enguang Wang et.al.	2603.20808	translate	read	null
2026-03-21	BenchBench: Benchmarking Automated Benchmark Generation	Yandan Zheng et.al.	2603.20807	translate	read	null
2026-03-21	RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution	Kaiyuan Li et.al.	2603.20799	translate	read	null
2026-03-21	The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing	Yuan Cao et.al.	2603.20795	translate	read	null
2026-03-21	Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement	Jiang Liu et.al.	2603.20781	translate	read	null
2026-03-21	SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval	Qunjie Huang et.al.	2603.20738	translate	read	null
2026-03-21	MzansiText and MzansiLM: An Open Corpus and Decoder-Only Language Model for South African Languages	Anri Lombard et.al.	2603.20732	translate	read	null
2026-03-21	Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation	Zihao Wang et.al.	2603.20725	translate	read	null
2026-03-21	Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark	Yifei Deng et.al.	2603.20721	translate	read	null
2026-03-21	NDT: Non-Differential Transformer and Its Application to Sentiment Analysis	Soudeep Ghoshal et.al.	2603.20704	translate	read	null
2026-03-21	Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs	Huan Zheng et.al.	2603.20698	translate	read	null
2026-03-21	AI-Driven Multi-Agent Simulation of Stratified Polyamory Systems: A Computational Framework for Optimizing Social Reproductive Efficiency	Yicai Xing et.al.	2603.20678	translate	read	null
2026-03-21	Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models	Ruixiang Liu et.al.	2603.20670	translate	read	null
2026-03-21	WWW.Serve: Interconnecting Global LLM Services through Decentralization	Huanyu Wang et.al.	2603.20661	translate	read	null
2026-03-21	A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation	Ling Xiao et.al.	2603.20648	translate	read	null
2026-03-21	Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention	Manh Nguyen et.al.	2603.20640	translate	read	null
2026-03-21	OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement	Jingbin Hu et.al.	2603.20638	translate	read	null
2026-03-21	AEGIS: From Clues to Verdicts – Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing	Sen Fang et.al.	2603.20637	translate	read	null
2026-03-21	A Modular LLM Framework for Explainable Price Outlier Detection	Shadi Sartipi et.al.	2603.20636	translate	read	null
2026-03-21	Optimal low-rank stochastic gradient estimation for LLM training	Zehao Li et.al.	2603.20632	translate	read	null
2026-03-21	Evaluating LLM-generated code for domain-specific languages: molecular dynamics with LAMMPS	Ethan Holbrook et.al.	2603.20630	translate	read	null
2026-03-21	The Art of Midwifery in LLMs: Optimizing Role Personas for Large Language Models as Moral Assistants	Yangyi Wu et.al.	2603.20626	translate	read	null
2026-03-21	JUBAKU: An Adversarial Benchmark for Exposing Culturally Grounded Stereotypes in Japanese LLMs	Taihei Shiotani et.al.	2603.20581	translate	read	null
2026-03-21	Context Cartography: Toward Structured Governance of Contextual Space in Large Language Model Systems	Zihua Wu et.al.	2603.20578	translate	read	null
2026-03-21	LJ-Bench: Ontology-Based Benchmark for U.S. Crime	Hung Yun Tseng et.al.	2603.20572	translate	read	null
2026-03-20	Permutation-Consensus Listwise Judging for Robust Factuality Evaluation	Tianyi Huang et.al.	2603.20562	translate	read	null
2026-03-20	Understanding Behavior Cloning with Action Quantization	Haoqun Cao et.al.	2603.20538	translate	read	null
2026-03-20	RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization	Shenyang Deng et.al.	2603.20527	translate	read	null
2026-03-20	Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study	Mohammed Rakibul Hasan et.al.	2603.20514	translate	read	null
2026-03-20	AE-LLM: Adaptive Efficiency Optimization for Large Language Models	Kaito Tanaka et.al.	2603.20492	translate	read	null
2026-03-20	Developing an ESG-Oriented Large Language Model through ESG Practices	Gabriel Assis et.al.	2603.20480	translate	read	null
2026-03-20	Diffutron: A Masked Diffusion Language Model for Turkish Language	Şuayp Talha Kocabay et.al.	2603.20466	translate	read	null
2026-03-20	Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents	Cailin Winston et.al.	2603.20449	translate	read	null
2026-03-20	A Training-Free Regeneration Paradigm: Contrastive Reflection Memory Guided Self-Verification and Self-Improvement	Yuran Li et.al.	2603.20441	translate	read	null
2026-03-20	Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health	Jingwei Huang et.al.	2603.20435	translate	read	null
2026-03-20	Coding Agents are Effective Long-Context Processors	Weili Cao et.al.	2603.20432	translate	read	null
2026-03-20	KV Cache Optimization Strategies for Scalable and Efficient LLM Inference	Yichun Xu et.al.	2603.20397	translate	read	null
2026-03-20	The production of meaning in the processing of natural language	Christopher J. Agostino et.al.	2603.20381	translate	read	null
2026-03-20	LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation	Jiazheng Xing et.al.	2603.20192	translate	read	null
2026-03-20	IndoorR2X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning	Fan Yang et.al.	2603.20182	translate	read	null
2026-03-20	AI Agents Can Already Autonomously Perform Experimental High Energy Physics	Eric A. Moreno et.al.	2603.20179	translate	read	null
2026-03-20	Learning Dynamic Belief Graphs for Theory-of-mind Reasoning	Ruxiao Chen et.al.	2603.20170	translate	read	null
2026-03-20	Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models	Qi Cao et.al.	2603.20161	translate	read	null
2026-03-20	Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification	Ali Sakour et.al.	2603.20149	translate	read	null
2026-03-12	MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning	Haozhan Shen et.al.	2603.12266	translate	read	null
2026-03-12	Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously	Yiran Guan et.al.	2603.12262	translate	read	null
2026-03-12	Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing	Baifeng Shi et.al.	2603.12254	translate	read	null
2026-03-12	EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models	Xuanlang Dai et.al.	2603.12252	translate	read	null
2026-03-12	Language Model Teams as Distributed Systems	Elizabeth Mieczkowski et.al.	2603.12229	translate	read	null
2026-03-12	Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration	Priyanka Kargupta et.al.	2603.12226	translate	read	null
2026-03-12	ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models	Yingxin Lai et.al.	2603.12208	translate	read	null
2026-03-12	CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks	Alexandre Le Mercier et.al.	2603.12206	translate	read	null
2026-03-12	IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse	Yushi Bai et.al.	2603.12201	translate	read	null
2026-03-12	Long-Context Encoder Models for Polish Language Understanding	Sławomir Dadas et.al.	2603.12191	translate	read	null
2026-03-12	LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning	Haiying Xu et.al.	2603.12166	translate	read	null
2026-03-12	LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation	Feiyu Duan et.al.	2603.12152	translate	read	null
2026-03-12	IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL	Zhoujun Cheng et.al.	2603.12151	translate	read	null
2026-03-12	Linking Perception, Confidence and Accuracy in MLLMs	Yuetian Du et.al.	2603.12149	translate	read	null
2026-03-12	EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next	Ye Pan et.al.	2603.12147	translate	read	null
2026-03-12	TopoBench: Benchmarking LLMs on Hard Topological Reasoning	Mayug Maniparambil et.al.	2603.12133	translate	read	null
2026-03-12	Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D	Agniv Sharma et.al.	2603.12126	translate	read	null
2026-03-12	Cross-Context Review: Improving LLM Output Quality by Separating Production and Review Sessions	Tae-Eun Song et.al.	2603.12123	translate	read	null
2026-03-12	SommBench: Assessing Sommelier Expertise of Language Models	William Brach et.al.	2603.12117	translate	read	null
2026-03-12	On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents	Deyu Zou et.al.	2603.12109	translate	read	null
2026-03-12	EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation	Yan Li et.al.	2603.12108	translate	read	null
2026-03-12	To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times	Thomas Hikaru Clark et.al.	2603.12105	translate	read	null
2026-03-12	Human-Centred LLM Privacy Audits: Findings and Frictions	Dimitri Staufer et.al.	2603.12094	translate	read	null
2026-03-12	Resource-Efficient Iterative LLM-Based NAS with Feedback Memory	Xiaojie Gu et.al.	2603.12091	translate	read	null
2026-03-12	Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems	Sarbartha Banerjee et.al.	2603.12023	translate	read	null
2026-03-12	BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs	Ilias Aarab et.al.	2603.11991	translate	read	null
2026-03-12	LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories	Qianpu Sun et.al.	2603.11987	translate	read	null
2026-03-12	CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading	Pranav Raikote et.al.	2603.11957	translate	read	null
2026-03-12	PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents	Minjia Wang et.al.	2603.11955	translate	read	null
2026-03-12	MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?	Xingze Zou et.al.	2603.11935	translate	read	null
2026-03-12	Chem4DLLM: 4D Multimodal LLMs for Chemical Dynamics Understanding	Xinyu Li et.al.	2603.11924	translate	read	null
2026-03-12	CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?	Ruirui Chen et.al.	2603.11915	translate	read	null
2026-03-12	Understanding LLM Behavior When Encountering User-Supplied Harmful Content in Harmless Tasks	Junjie Chu et.al.	2603.11914	translate	read	null
2026-03-12	Think While Watching: Online Streaming Segment-Level Memory for Multi-Turn Video Reasoning in Multimodal Large Language Models	Lu Wang et.al.	2603.11896	translate	read	null
2026-03-12	QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering	Haowei Cheng et.al.	2603.11890	translate	read	null
2026-03-12	Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language	Remigiusz Kinas et.al.	2603.11881	translate	read	null
2026-03-12	Silent Speech Interfaces in the Era of Large Language Models: A Comprehensive Taxonomy and Systematic Review	Kele Xu et.al.	2603.11877	translate	read	null
2026-03-12	AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization	Qiyang Li et.al.	2603.11873	translate	read	null
2026-03-12	ZeroSense:How Vision matters in Long Context Compression	Yonghan Gao et.al.	2603.11846	translate	read	null
2026-03-12	DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining	Yutong Yan et.al.	2603.11838	translate	read	null
2026-03-12	Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding	Jiahao Li et.al.	2603.11831	translate	read	null
2026-03-12	Large language models for optical network O&M: Agent-embedded workflow for automation	Shengnan Li et.al.	2603.11828	translate	read	null
2026-03-12	OMNIA: Closing the Loop by Leveraging LLMs for Knowledge Graph Completion	Frédéric Ieng et.al.	2603.11820	translate	read	null
2026-03-12	RADAR: Closed-Loop Robotic Data Generation via Semantic Planning and Autonomous Causal Environment Reset	Yongzhong Wang et.al.	2603.11811	translate	read	null
2026-03-12	Automating Skill Acquisition through Large-Scale Mining of Open-Source Agentic Repositories: A Framework for Multi-Agent Procedural Knowledge Extraction	Shuzhen Bi et.al.	2603.11808	translate	read	null
2026-03-12	DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering	Teng Lin et.al.	2603.11798	translate	read	null
2026-03-12	Language Generation with Replay: A Learning-Theoretic View of Model Collapse	Giorgio Racca et.al.	2603.11784	translate	read	null
2026-03-12	Large Language Models for Biomedical Article Classification	Jakub Proboszcz et.al.	2603.11780	translate	read	null
2026-03-12	Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents	Yaocong Li et.al.	2603.11772	translate	read	null
2026-03-12	Governing Evolving Memory in LLM Agents: Risks, Mechanisms, and the Stability and Safety Governed Memory (SSGM) Framework	Chingkwun Lam et.al.	2603.11768	translate	read	null
2026-03-12	Gender Bias in Generative AI-assisted Recruitment Processes	Martina Ullasci et.al.	2603.11736	translate	read	null
2026-03-12	When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows	Wenxian Yang et.al.	2603.11721	translate	read	null
2026-03-12	Scaling Laws for Educational AI Agents	Mengsong Wu et.al.	2603.11709	translate	read	null
2026-03-12	OSCBench: Benchmarking Object State Change in Text-to-Video Generation	Xianjing Han et.al.	2603.11698	translate	read	null
2026-03-12	Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks	Mei Chee Leong et.al.	2603.11689	translate	read	null
2026-03-12	SemBench: A Universal Semantic Framework for LLM Evaluation	Mikel Zubillaga et.al.	2603.11687	translate	read	null
2026-03-12	From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration	Gaole He et.al.	2603.11677	translate	read	null
2026-03-12	Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge	Junjie Wu et.al.	2603.11665	translate	read	null
2026-03-12	Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models	Xiquan Li et.al.	2603.11661	translate	read	null
2026-03-12	Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans	Sizhong Qin et.al.	2603.11640	translate	read	null
2026-03-12	VisDoT : Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought	Eunsoo Lee et.al.	2603.11631	translate	read	null
2026-03-12	Sema: A High-performance System for LLM-based Semantic Query Processing	Kangkang Qi et.al.	2603.11622	translate	read	null
2026-03-12	Taming OpenClaw: Security Analysis and Mitigation of Autonomous LLM Agent Threats	Xinhao Deng et.al.	2603.11619	translate	read	null
2026-03-12	LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference	Junkun Jiang et.al.	2603.11605	translate	read	null
2026-03-12	Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese	Masataka Kawai et.al.	2603.11597	translate	read	null
2026-03-12	Leveraging Large Language Models and Survival Analysis for Early Prediction of Chemotherapy Outcomes	Muhammad Faisal Shahid et.al.	2603.11594	translate	read	null
2026-03-12	UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization	Ofir Marom et.al.	2603.11583	translate	read	null
2026-03-12	Streaming Translation and Transcription Through Speech-to-Text Causal Alignment	Roman Koshkin et.al.	2603.11578	translate	read	null
2026-03-12	Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries	Zhenxu Tian et.al.	2603.11564	translate	read	null
2026-03-12	AI Knows What’s Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions	Alejandro R Jadad et.al.	2603.11559	translate	read	null
2026-03-12	FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval	Chenchen Zhao et.al.	2603.11520	translate	read	null
2026-03-12	Multi-Agent Collaboration for Automated Design Exploration on High Performance Computing Systems	Harshitha Menon et.al.	2603.11515	translate	read	null
2026-03-12	KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation	Qizhi Chen et.al.	2603.11501	translate	read	null
2026-03-12	Try, Check and Retry: A Divide-and-Conquer Framework for Boosting Long-context Tool-Calling Performance of LLMs	Kunfeng Chen et.al.	2603.11495	translate	read	null
2026-03-12	PRMB: Benchmarking Reward Models in Long-Horizon CBT-based Counseling Dialogue	Yougen Zhou et.al.	2603.11494	translate	read	null
2026-03-12	AutoVeriFix+: High-Correctness RTL Generation via Trace-Aware Causal Fix and Semantic Redundancy Pruning	Yan Tan et.al.	2603.11489	translate	read	null
2026-03-12	Quantized Inference for OneRec-V2	Yi Su et.al.	2603.11486	translate	read	null
2026-03-12	INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs	Junqi Yang et.al.	2603.11481	translate	read	null
2026-03-12	Deep Learning Network-Temporal Models For Traffic Prediction	Yufeng Xin et.al.	2603.11475	translate	read	null
2026-03-12	CoViLLM: An Adaptive Human-Robot Collaborative Assembly Framework Using Large Language Models for Manufacturing	Jiabao Zhao et.al.	2603.11461	translate	read	null
2026-03-12	LLM-Assisted Causal Structure Disambiguation and Factor Extraction for Legal Judgment Prediction	Yuzhi Liang et.al.	2603.11446	translate	read	null
2026-03-12	BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion	Varun Iyer et.al.	2603.11415	translate	read	null
2026-03-12	MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models	Michiko Yoshitake et.al.	2603.11414	translate	read	null
2026-03-12	Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects	Amani Maina-Kilaas et.al.	2603.11412	translate	read	null
2026-03-12	Speak or Stay Silent: Context-Aware Turn-Taking in Multi-Party Dialogue	Kratika Bhagtani et.al.	2603.11409	translate	read	null
2026-03-12	Beyond Polarity: Multi-Dimensional LLM Sentiment Signals for WTI Crude Oil Futures Return Prediction	Dehao Dai et.al.	2603.11408	translate	read	null
2026-03-12	Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning	Kevin H. Guo et.al.	2603.11394	translate	read	null
2026-03-12	To Believe or Not To Believe: Comparing Supporting Information Tools to Aid Human Judgments of AI Veracity	Jessica Irons et.al.	2603.11393	translate	read	null
2026-03-12	Agentic AI for Embodied-enhanced Beam Prediction in Low-Altitude Economy Networks	Min Hao et.al.	2603.11392	translate	read	null
2026-03-12	BEACON: Budget-Aware Entity Matching Across Domains (Extended Technical Report)	Nicholas Pulsone et.al.	2603.11391	translate	read	null
2026-03-12	Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment	Zhiyu Xue et.al.	2603.11388	translate	read	null
2026-03-11	DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding	Mingzhe Tao et.al.	2603.11380	translate	read	null
2026-03-11	Resolving Java Code Repository Issues with iSWE Agent	Jatin Ganhotra et.al.	2603.11356	translate	read	null
2026-03-11	Novelty Adaptation Through Hybrid Large Language Model (LLM)-Symbolic Planning and LLM-guided Reinforcement Learning	Hong Lu et.al.	2603.11351	translate	read	null
2026-03-11	FinRule-Bench: A Benchmark for Joint Reasoning over Financial Tables and Principles	Arun Vignesh Malarkkan et.al.	2603.11339	translate	read	null
2026-03-11	LLM-Augmented Digital Twin for Policy Evaluation in Short-Video Platforms	Haoting Zhang et.al.	2603.11333	translate	read	null
2026-03-11	Jailbreak Scaling Laws for Large Language Models: Polynomial-Exponential Crossover	Indranil Halder et.al.	2603.11331	translate	read	null
2026-03-11	Bridging the Cognitive Gap: Co-Designing and Evaluating a Voice-Enabled Community Chatbot for Older Adults	Feng Chen et.al.	2603.11303	translate	read	null
2026-03-11	Counterweights and Complementarities: The Convergence of AI and Blockchain Powering a Decentralized Future	Yibai Li et.al.	2603.11299	translate	read	null
2026-03-11	Temporal Text Classification with Large Language Models	Nishat Raihan et.al.	2603.11295	translate	read	null
2026-03-11	AI Psychometrics: Evaluating the Psychological Reasoning of Large Language Models with Psychometric Validities	Yibai Li et.al.	2603.11279	translate	read	null
2026-03-11	COMPASS: The explainable agentic framework for Sovereignty, Sustainability, Compliance, and Ethics	Jean-Sébastien et.al.	2603.11277	translate	read	null
2026-03-11	The Unlearning Mirage: A Dynamic Framework for Evaluating LLM Unlearning	Raj Sanjay Shah et.al.	2603.11266	translate	read	null
2026-03-11	Artificial Intelligence for Sentiment Analysis of Persian Poetry	Arash Zargar et.al.	2603.11254	translate	read	null
2026-03-11	LLMs Can Infer Political Alignment from Online Conversations	Byunghwee Lee et.al.	2603.11253	translate	read	null
2026-03-11	Reversible Lifelong Model Editing via Semantic Routing-Based LoRA	Haihua Luo et.al.	2603.11239	translate	read	null
2026-03-11	Markovian Generation Chains in Large Language Models	Mingmeng Geng et.al.	2603.11228	translate	read	null
2026-03-11	Security-by-Design for LLM-Based Code Generation: Leveraging Internal Representations for Concept-Driven Steering Mechanisms	Maximilian Wendlinger et.al.	2603.11212	translate	read	null
2026-03-11	Can LLMs Help Localize Fake Words in Partially Fake Speech?	Lin Zhang et.al.	2603.11205	translate	read	null
2026-03-11	DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning	Hanxu Hu et.al.	2603.11193	translate	read	null
2026-03-11	Systematic Scaling Analysis of Jailbreak Attacks in Large Language Models	Xiangwen Wang et.al.	2603.11149	translate	read	null
2026-03-11	H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code	Amit Singh et.al.	2603.11139	translate	read	null
2026-03-11	Enhancing Value Alignment of LLMs with Multi-agent system and Combinatorial Fusion	Yuanhong Wu et.al.	2603.11126	translate	read	null
2026-03-11	Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition	Yinfeng Xia et.al.	2603.11123	translate	read	null
2026-03-11	Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers	Mynampati Sri Ranganadha Avinash et.al.	2603.11114	translate	read	null
2026-03-11	Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining	Zhiyuan Zeng et.al.	2603.11103	translate	read	null
2026-03-11	Graph Tokenization for Bridging Graphs and Transformers	Zeyuan Guo et.al.	2603.11099	translate	read	null
2026-03-11	The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey	Juhee Kim et.al.	2603.11088	translate	read	null
2026-03-10	Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain	Yen-Ku Liu et.al.	2603.11082	translate	read	null
2026-03-10	CR-Bench: Evaluating the Real-World Utility of AI Code Review Agents	Kristen Pereira et.al.	2603.11078	translate	read	null
2026-03-10	Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global Attention Reallocation	Jingtao Wang et.al.	2603.11067	translate	read	null
2026-03-11	LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce	Hao N. Nguyen et.al.	2603.11025	translate	read	null
2026-03-11	Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style	Marvin Limpijankit et.al.	2603.11024	translate	read	null
2026-03-11	Leech Lattice Vector Quantization for Efficient LLM Compression	Tycho F. A. van der Ouderaa et.al.	2603.11021	translate	read	null
2026-03-11	A Systematic Study of Pseudo-Relevance Feedback with LLMs	Nour Jedidi et.al.	2603.11008	translate	read	null
2026-03-11	TOSSS: a CVE-based Software Security Benchmark for Large Language Models	Marc Damie et.al.	2603.10969	translate	read	null
2026-03-11	LLM2Vec-Gen: Generative Embeddings from Large Language Models	Parishad BehnamGhader et.al.	2603.10913	translate	read	null
2026-03-11	When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS	Anupam Purwar et.al.	2603.10904	translate	read	null
2026-03-11	LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation	Jinwoo Ahn et.al.	2603.10899	translate	read	null
2026-03-11	A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification	Yichi Zhu et.al.	2603.10891	translate	read	null
2026-03-11	Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models	Yixiu Mao et.al.	2603.10887	translate	read	null
2026-03-11	Exploring Indicators of Developers’ Sentiment Perceptions in Student Software Projects	Martin Obaidi et.al.	2603.10864	translate	read	null
2026-03-11	Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding	Lin Chen et.al.	2603.10863	translate	read	null
2026-03-11	OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs	Yujie Liao et.al.	2603.10862	translate	read	null
2026-03-11	Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis	Yujie Zheng et.al.	2603.10846	translate	read	null
2026-03-11	PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words	Yuzhi Liang et.al.	2603.10842	translate	read	null
2026-03-11	Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation	Thomas Thebaud et.al.	2603.10827	translate	read	null
2026-03-11	Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization	Linghao Zhang et.al.	2603.10808	translate	read	null
2026-03-11	Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services	Fabrizio Dimino et.al.	2603.10807	translate	read	null
2026-03-11	Semantic Satellite Communications for Synchronized Audiovisual Reconstruction	Fangyu Liu et.al.	2603.10791	translate	read	null
2026-03-11	Taking Shortcuts for Categorical VQA Using Super Neurons	Pierre Musacchio et.al.	2603.10781	translate	read	null
2026-03-11	Large Language Models as Annotators for Machine Translation Quality Estimation	Sidi Wang et.al.	2603.10775	translate	read	null
2026-03-11	Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness	Zhipeng Yang et.al.	2603.10771	translate	read	null
2026-03-11	mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR	Konstantin Dobler et.al.	2603.10767	translate	read	null
2026-03-11	CodePercept: Code-Grounded Visual STEM Perception for MLLMs	Tongkun Guan et.al.	2603.10757	translate	read	null
2026-03-11	CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems	Panagiotis Georgios Pennas et.al.	2603.10726	translate	read	null
2026-03-11	UAV traffic scene understanding: A cross-spectral guided approach and a unified benchmark	Yu Zhang et.al.	2603.10722	translate	read	null
2026-03-11	Prism- $Δ$ : Differential Subspace Steering for Prompt Highlighting in Large Language Models	Yuyao Ge et.al.	2603.10705	translate	read	null
2026-03-11	Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation	Yaxin Gong et.al.	2603.10673	translate	read	null
2026-03-11	ESG Reporting Lifecycle Management with Large Language Models and AI Agents	Thong Hoang et.al.	2603.10646	translate	read	null
2026-03-11	Making Bielik LLM Reason (Better): A Field Report	Adam Trybus et.al.	2603.10640	translate	read	null
2026-03-11	Reinforcement Learning with Conditional Expectation Reward	Changyi Xiao et.al.	2603.10624	translate	read	null
2026-03-11	Disentangling Similarity and Relatedness in Topic Models	Hanlin Xiao et.al.	2603.10619	translate	read	null
2026-03-11	MUNIChus: Multilingual News Image Captioning Benchmark	Yuji Chen et.al.	2603.10613	translate	read	null
2026-03-11	Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning	Zhaowei Zhang et.al.	2603.10588	translate	read	null
2026-03-11	Distilling LLM Semantic Priors into Encoder-Only Multi-Talker ASR with Talker-Count Routing	Hao Shi et.al.	2603.10587	translate	read	null
2026-03-11	End-to-End Chatbot Evaluation with Adaptive Reasoning and Uncertainty Filtering	Nhi Dang et.al.	2603.10570	translate	read	null
2026-03-11	Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents	Yuanhao Li et.al.	2603.10564	translate	read	null
2026-03-11	PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation	Yuchen Liu et.al.	2603.10560	translate	read	null
2026-03-11	Automatic End-to-End Data Integration using Large Language Models	Aaron Steiner et.al.	2603.10547	translate	read	null
2026-03-11	Resource-constrained Amazons chess decision framework integrating large language models and graph attention	Tianhao Qian et.al.	2603.10512	translate	read	null
2026-03-11	IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation	Jiahao Lyu et.al.	2603.10495	translate	read	null
2026-03-11	Human-AI Co-reasoning for Clinical Diagnosis with Evidence-Integrated Language Agent	Zhongzhen Huang et.al.	2603.10492	translate	read	null
2026-03-11	PEEM: Prompt Engineering Evaluation Metrics for Interpretable Joint Evaluation of Prompts and Responses	Minki Hong et.al.	2603.10477	translate	read	null
2026-03-11	Learning to Negotiate: Multi-Agent Deliberation for Collective Value Alignment in LLMs	Panatchakorn Anantaprayoon et.al.	2603.10476	translate	read	null
2026-03-11	Aligning Large Language Models with Searcher Preferences	Wei Wu et.al.	2603.10473	translate	read	null
2026-03-11	Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression	Hamidreza Dastmalchi et.al.	2603.10470	translate	read	null
2026-03-11	The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training	Hengjie Cao et.al.	2603.10444	translate	read	null
2026-03-11	Designing Service Systems from Textual Evidence	Ruicheng Ao et.al.	2603.10400	translate	read	null
2026-03-11	Verbalizing LLM’s Higher-order Uncertainty via Imprecise Probabilities	Anita Yang et.al.	2603.10396	translate	read	null
2026-03-11	Don’t Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw	Zhengyang Shan et.al.	2603.10387	translate	read	null
2026-03-11	Speech Codec Probing from Semantic and Phonetic Perspectives	Xuan Shi et.al.	2603.10371	translate	read	null
2026-03-11	GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning	Ruiheng Liu et.al.	2603.10370	translate	read	null
2026-03-11	Utility Function is All You Need: LLM-based Congestion Control	Neta Rozen-Schiff et.al.	2603.10357	translate	read	null
2026-03-11	S-HPLB: Efficient LLM Attention Serving via Sparsity-Aware Head Parallelism Load Balance	Di Liu et.al.	2603.10353	translate	read	null
2026-03-11	Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck	Hongbin Zhang et.al.	2603.10351	translate	read	null
2026-03-11	Multi-Modal Intelligent Channel Modeling: From Fine-tuned LLMs to Pre-trained Foundation Models	Lu Bai et.al.	2603.10343	translate	read	null
2026-03-11	AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU	Yuning Zhang et.al.	2603.10342	translate	read	null
2026-03-11	Large language models can disambiguate opioid slang on social media	Kristy A. Carpenter et.al.	2603.10313	translate	read	null
2026-03-11	Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas	Tim Schopf et.al.	2603.10303	translate	read	null
2026-03-11	Regime-aware financial volatility forecasting via in-context learning	Saba Asaad et.al.	2603.10299	translate	read	null
2026-03-11	GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification	Mayur Choudhary et.al.	2603.10298	translate	read	null
2026-03-11	Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation	Wuping Xin et.al.	2603.10294	translate	read	null
2026-03-11	Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums	Yiyuan Wang et.al.	2603.10285	translate	read	null
2026-03-10	SpecOps: A Fully Automated AI Agent Testing Framework in Real-World GUI Environments	Syed Yusuf Ahmed et.al.	2603.10268	translate	read	null
2026-03-10	GR-SAP: Generative Replay for Safety Alignment Preservation during Fine-Tuning	Zhouxiang Fang et.al.	2603.10243	translate	read	null
2026-03-10	S-GRADES – Studying Generalization of Student Response Assessments in Diverse Evaluative Settings	Tasfia Seuti et.al.	2603.10233	translate	read	null
2026-03-10	Hierarchical Task Model Predictive Control for Sequential Mobile Manipulation Tasks	Xintong Du et.al.	2603.10232	translate	read	null
2026-03-10	Paladin: A Policy Framework for Securing Cloud APIs by Combining Application Context with Generative AI	Shriti Priya et.al.	2603.10228	translate	read	null
2026-03-10	Rethinking the Harmonic Loss via Non-Euclidean Distance Layers	Maxwell Miller-Golub et.al.	2603.10225	translate	read	null
2026-03-10	Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models	Eric Yocam et.al.	2603.10195	translate	read	null
2026-03-10	MCP-in-SoS: Risk assessment framework for open-source MCP servers	Pratyay Kumar et.al.	2603.10194	translate	read	null
2026-03-10	Calibration-Reasoning Framework for Descriptive Speech Quality Assessment	Elizaveta Kostenok et.al.	2603.10175	translate	read	null
2026-03-10	Omics Data Discovery Agents	Alexandre Hutton et.al.	2603.10161	translate	read	null
2026-03-10	Social Knowledge for Cross-Domain User Preference Modeling	Nir Lotan et.al.	2603.10148	translate	read	null
2026-03-10	Reason and Verify: A Framework for Faithful Retrieval-Augmented Generation	Eeham Khan et.al.	2603.10143	translate	read	null
2026-03-10	The Generation-Recognition Asymmetry: Six Dimensions of a Fundamental Divide in Formal Language Theory	Romain Peyrichou et.al.	2603.10139	translate	read	null
2026-03-10	CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR	Sijia Cui et.al.	2603.10101	translate	read	null
2026-03-10	Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models	Daniel Hennes et.al.	2603.10098	translate	read	null
2026-03-10	Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference	Fan Yang et.al.	2603.10091	translate	read	null
2026-03-10	ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping	Zijian Zhu et.al.	2603.10088	translate	read	null
2026-03-10	Pooling Engram Conditional Memory in Large Language Models using CXL	Ruiyang Ma et.al.	2603.10087	translate	read	null
2026-03-10	KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization	Qitong Sun et.al.	2603.10085	translate	read	null
2026-03-10	Amnesia: Adversarial Semantic Layer Specific Activation Steering in Large Language Models	Ali Raza et.al.	2603.10080	translate	read	null
2026-03-10	Why LLMs Fail: A Failure Analysis and Partial Success Measurement for Automated Security Patch Generation	Amir Al-Maamari et.al.	2603.10072	translate	read	null
2026-03-10	ADVERSA: Measuring Multi-Turn Guardrail Degradation and Judge Reliability in Large Language Models	Harry Owiredu-Ashley et.al.	2603.10068	translate	read	null
2026-03-09	Training Language Models via Neural Cellular Automata	Dan Lee et.al.	2603.10055	translate	read	null
2026-03-08	Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction	Brian Freeman et.al.	2603.10047	translate	read	null
2026-03-10	Understanding the Use of a Large Language Model-Powered Guide to Make Virtual Reality Accessible for Blind and Low Vision People	Jazmin Collins et.al.	2603.09964	translate	read	null
2026-03-10	Think Before You Lie: How Reasoning Improves Honesty	Ann Yuan et.al.	2603.09957	translate	read	null
2026-03-10	Towards a Neural Debugger for Python	Maximilian Beck et.al.	2603.09951	translate	read	null
2026-03-10	PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs	Jinyue Li et.al.	2603.09943	translate	read	null
2026-03-10	Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions	Mingyang Song et.al.	2603.09938	translate	read	null
2026-03-10	WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition	Shan Ning et.al.	2603.09921	translate	read	null
2026-03-10	MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning	Yiyang Lu et.al.	2603.09892	translate	read	null
2026-03-10	Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts	Hongbo Bo et.al.	2603.09890	translate	read	null
2026-03-10	Benchmarking Political Persuasion Risks Across Frontier Large Language Models	Zhongren Chen et.al.	2603.09884	translate	read	null
2026-03-10	Do What I Say: A Spoken Prompt Dataset for Instruction-Following	Maike Züfle et.al.	2603.09881	translate	read	null
2026-03-10	InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing	Changyao Tian et.al.	2603.09877	translate	read	null
2026-03-10	MissBench: Benchmarking Multimodal Affective Analysis under Imbalanced Missing Modalities	Tien Anh Pham et.al.	2603.09874	translate	read	null
2026-03-10	GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection	Kai Yao et.al.	2603.09865	translate	read	null
2026-03-10	SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases	Laya Iyer et.al.	2603.09853	translate	read	null
2026-03-10	RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation	Haobo Zhang et.al.	2603.09843	translate	read	null
2026-03-10	One-Eval: An Agentic System for Automated and Traceable LLM Evaluation	Chengyu Shen et.al.	2603.09821	translate	read	null
2026-03-10	Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning	Tiehua Mei et.al.	2603.09803	translate	read	null
2026-03-10	MITRA: An AI Assistant for Knowledge Retrieval in Physics Collaborations	Abhishikth Mallampalli et.al.	2603.09800	translate	read	null
2026-03-10	Quantifying the Necessity of Chain of Thought through Opaque Serial Depth	Jonah Brown-Cohen et.al.	2603.09786	translate	read	null
2026-03-10	LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control	Mingyu Kang et.al.	2603.09759	translate	read	null
2026-03-10	Beyond Fine-Tuning: Robust Food Entity Linking under Ontology Drift with FoodOntoRAG	Jan Drole et.al.	2603.09758	translate	read	null
2026-03-10	Epistemic Closure: Autonomous Mechanism Completion for Physically Consistent Simulation	Yue Wua et.al.	2603.09756	translate	read	null
2026-03-10	Let’s Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments	Haoyuan Li et.al.	2603.09740	translate	read	null
2026-03-10	FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis	Xiaotian Hu et.al.	2603.09733	translate	read	null
2026-03-10	EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning	Chengjun Yu et.al.	2603.09731	translate	read	null
2026-03-10	WVA: A Global Optimization Control Plane for llmd	Abhishek Malvankar et.al.	2603.09730	translate	read	null
2026-03-10	RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation	Sihong Wu et.al.	2603.09723	translate	read	null
2026-03-10	OOD-MMSafe: Advancing MLLM Safety from Harmful Intent to Hidden Consequences	Ming Wen et.al.	2603.09706	translate	read	null
2026-03-10	Evaluation of LLMs in retrieving food and nutritional context for RAG systems	Maks Požarnik Vavken et.al.	2603.09704	translate	read	null
2026-03-10	An Empirical Study of Interaction Smells in Multi-Turn Human-LLM Collaborative Code Generation	Binquan Zhang et.al.	2603.09701	translate	read	null
2026-03-10	ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning	Davit Melikidze et.al.	2603.09692	translate	read	null
2026-03-10	ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling	Dechuan Teng et.al.	2603.09691	translate	read	null
2026-03-10	AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering	Nguyen Anh Tuong et.al.	2603.09689	translate	read	null
2026-03-10	Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records	Jacopo Vitale et.al.	2603.09685	translate	read	null
2026-03-10	EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages	Aman Sharma et.al.	2603.09678	translate	read	null
2026-03-10	MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants	Zuhao Zhang et.al.	2603.09652	translate	read	null
2026-03-10	Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models	Luc Builtjes et.al.	2603.09638	translate	read	null
2026-03-10	Grounding Synthetic Data Generation With Vision and Language Models	Ümit Mert Çağlar et.al.	2603.09625	translate	read	null
2026-03-10	Compartmentalization-Aware Automated Program Repair	Jia Hu et.al.	2603.09544	translate	read	null
2026-03-10	Dynamic Multimodal Expression Generation for LLM-Driven Pedagogical Agents: From User Experience Perspective	Ninghao Wan et.al.	2603.09536	translate	read	null
2026-03-10	Enhancing Debunking Effectiveness through LLM-based Personality Adaptation	Pietro Dell’Oglio et.al.	2603.09533	translate	read	null
2026-03-10	EmbC-Test: How to Speed Up Embedded Software Testing Using LLMs and RAG	Maximilian Harnot et.al.	2603.09497	translate	read	null
2026-03-10	GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models	Andrew Murray et.al.	2603.09481	translate	read	null
2026-03-10	CyberThreat-Eval: Can Large Language Models Automate Real-World Threat Research?	Xiangsen Chen et.al.	2603.09452	translate	read	null
2026-03-10	AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems	Athanasios Davvetas et.al.	2603.09435	translate	read	null
2026-03-10	Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs	Saugata Purkayastha et.al.	2603.09434	translate	read	null
2026-03-10	Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health	Trung Hieu Ngo et.al.	2603.09416	translate	read	null
2026-03-10	Quantifying and extending the coverage of spatial categorization data sets	Wanchun Li et.al.	2603.09373	translate	read	null
2026-03-10	The Virtuous Cycle: AI-Powered Vector Search and Vector Search-Augmented AI	Jiuqi Wei et.al.	2603.09347	translate	read	null
2026-03-10	TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation	Jiashuo Sun et.al.	2603.09341	translate	read	null
2026-03-10	Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments	Yang Li et.al.	2603.09337	translate	read	null
2026-03-10	Can ChatGPT Generate Realistic Synthetic System Requirement Specifications? Results of a Case Study	Alex R. Mattukat et.al.	2603.09335	translate	read	null
2026-03-10	OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models	Tengjin Weng et.al.	2603.09326	translate	read	null
2026-03-10	Curveball Steering: The Right Direction To Steer Isn’t Always Linear	Shivam Raval et.al.	2603.09313	translate	read	null
2026-03-10	Investor risk profiles of large language models	Hanyong Cho et.al.	2603.09303	translate	read	null
2026-03-10	Constructing a Portfolio Optimization Benchmark Framework for Evaluating Large Language Models	Hanyong Cho et.al.	2603.09301	translate	read	null
2026-03-10	TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA	Mengwei Yuan et.al.	2603.09297	translate	read	null
2026-03-10	ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization	Shimin Di et.al.	2603.09290	translate	read	null
2026-03-10	Evoking User Memory: Personalizing LLM via Recollection-Familiarity Adaptive Retrieval	Yingyi Zhang et.al.	2603.09250	translate	read	null
2026-03-10	Social-R1: Towards Human-like Social Reasoning in LLMs	Jincenzi Wu et.al.	2603.09249	translate	read	null
2026-03-10	Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness	Ding Linghu et.al.	2603.09231	translate	read	null
2026-03-10	TubeMLLM: A Foundation Model for Topology Knowledge Exploration in Vessel-like Anatomy	Yaoyu Liu et.al.	2603.09217	translate	read	null
2026-03-10	PIM-SHERPA: Software Method for On-device LLM Inference by Resolving PIM Memory Attribute and Layout Inconsistencies	Sunjung Lee et.al.	2603.09216	translate	read	null
2026-03-10	Acoustic and Semantic Modeling of Emotion in Spoken Language	Soumya Dutta et.al.	2603.09212	translate	read	null
2026-03-10	MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data	Zongxia Li et.al.	2603.09206	translate	read	null
2026-03-10	Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing	Benjamin Reichman et.al.	2603.09205	translate	read	null
2026-03-10	The Reasoning Trap – Logical Reasoning as a Mechanistic Pathway to Situational Awareness	Subramanyam Sahoo et.al.	2603.09200	translate	read	null
2026-03-10	DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval	Taegyeong Lee et.al.	2603.09185	translate	read	null
2026-03-10	Evaluating the Practical Effectiveness of LLM-Driven Index Tuning with Microsoft Database Tuning Advisor	Xiaoying Wang et.al.	2603.09181	translate	read	null
2026-03-10	Point Cloud as a Foreign Language for Multi-modal Large Language Model	Sneha Paul et.al.	2603.09173	translate	read	null
2026-03-10	Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL	Siyang Cai et.al.	2603.09161	translate	read	null
2026-03-10	RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning	Tzu-Heng Huang et.al.	2603.09160	translate	read	null
2026-03-10	Real-Time Trust Verification for Safe Agentic Actions using TrustBench	Tavishi Sharma et.al.	2603.09157	translate	read	null
2026-03-10	Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety	Trent R Northen et.al.	2603.09154	translate	read	null
2026-03-10	DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering	Tong Wang et.al.	2603.09152	translate	read	null
2026-03-10	Deep Tabular Research via Continual Experience-Driven Execution	Junnan Dong et.al.	2603.09151	translate	read	null
2026-03-10	QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model	Junjie Yin et.al.	2603.09125	translate	read	null
2026-03-10	Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards	Zhengzhao Ma et.al.	2603.09117	translate	read	null
2026-03-10	Progressive Representation Learning for Multimodal Sentiment Analysis with Incomplete Modalities	Jindi Bao et.al.	2603.09111	translate	read	null
2026-03-10	VIVID-Med: LLM-Supervised Structured Pretraining for Deployable Medical ViTs	Xiyao Wang et.al.	2603.09109	translate	read	null
2026-03-10	Composed Vision-Language Retrieval for Skin Cancer Case Search via Joint Alignment of Global and Local Representations	Yuheng Wang et.al.	2603.09108	translate	read	null
2026-03-10	Class Model Generation from Requirements using Large Language Models	Jackson Nguyen et.al.	2603.09100	translate	read	null
2026-03-10	Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs	Kaiser Sun et.al.	2603.09095	translate	read	null
2026-03-10	Chain of Event-Centric Causal Thought for Physically Plausible Video Generation	Zixuan Wang et.al.	2603.09094	translate	read	null
2026-03-10	Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting	Alvaro Paredes Amorin et.al.	2603.09085	translate	read	null
2026-03-10	Learning Adaptive LLM Decoding	Chloe H. Su et.al.	2603.09065	translate	read	null
2026-03-10	FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation	Yinpeng Wu et.al.	2603.09046	translate	read	null
2026-03-09	Automating Detection and Root-Cause Analysis of Flaky Tests in Quantum Software	Janakan Sivaloganathan et.al.	2603.09029	translate	read	null
2026-03-09	The Missing Memory Hierarchy: Demand Paging for LLM Context Windows	Tony Mason et.al.	2603.09023	translate	read	null
2026-03-09	Meissa: Multi-modal Medical Agentic Intelligence	Yixiong Chen et.al.	2603.09018	translate	read	null
2026-03-09	Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning	Juming Xiong et.al.	2603.08999	translate	read	null
2026-03-09	MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment	Kailong Fan et.al.	2603.08987	translate	read	null
2026-03-09	GenAI Is No Silver Bullet for Qualitative Research in Software Engineering	Neil A. Ernst et.al.	2603.08951	translate	read	null
2026-03-09	AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem	Rui Liu et.al.	2603.08938	translate	read	null
2026-03-09	VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs	Hezhao Zhang et.al.	2603.08936	translate	read	null
2026-03-09	PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration	Abdul Rehman Akbar et.al.	2603.08935	translate	read	null
2026-03-09	MEGC2026: Micro-Expression Grand Challenge on Visual Question Answering	Xinqi Fan et.al.	2603.08927	translate	read	null
2026-03-09	ConFu: Contemplate the Future for Better Speculative Sampling	Zongyue Qin et.al.	2603.08899	translate	read	null
2026-03-09	A Decentralized Frontier AI Architecture Based on Personal Instances, Synthetic Data, and Collective Context Synchronization	Jacek Małecki et.al.	2603.08893	translate	read	null
2026-03-09	LLM-Agent Interactions on Markets with Information Asymmetries	Alexander Erlei et.al.	2603.08853	translate	read	null
2026-03-09	Investigating the Effects of LLM Use on Critical Thinking Under Time Constraints: Access Timing and Time Availability	Jiayin Zhi et.al.	2603.08849	translate	read	null
2026-03-09	HMR-1: Hierarchical Massage Robot with Vision-Language-Model for Embodied Healthcare	Rongtao Xu et.al.	2603.08817	translate	read	null
2026-03-09	Scale-Plan: Scalable Language-Enabled Task Planning for Heterogeneous Multi-Robot Teams	Piyush Gupta et.al.	2603.08814	translate	read	null
2026-03-09	Large Language Model-Assisted Superconducting Qubit Experiments	Shiheng Li et.al.	2603.08801	translate	read	null
2026-03-09	Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLM	Junyuan Mao et.al.	2603.08800	translate	read	null
2026-03-09	Agentic Critical Training	Weize Liu et.al.	2603.08706	translate	read	null
2026-03-09	Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines	Akshay Gulati et.al.	2603.08704	translate	read	null
2026-03-09	UNBOX: Unveiling Black-box visual models with Natural-language	Simone Carnemolla et.al.	2603.08639	translate	read	null
2026-03-09	Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations	Jiangye Yuan et.al.	2603.08592	translate	read	null
2026-03-09	RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback	Xiaoying Zhang et.al.	2603.08561	translate	read	null
2026-03-09	SecAgent: Efficient Mobile GUI Agent with Semantic Context	Yiping Xie et.al.	2603.08533	translate	read	null
2026-03-09	SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement	Yi Chen et.al.	2603.08520	translate	read	null
2026-03-09	AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models	Xiaoquan Sun et.al.	2603.08519	translate	read	null
2026-03-09	Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA	Ummar Abbas et.al.	2603.08501	translate	read	null
2026-03-09	Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images	Qishun Yang et.al.	2603.08486	translate	read	null
2026-03-09	Behavioral Generative Agents for Power Dispatch and Auction	Shaoze Li et.al.	2603.08477	translate	read	null
2026-03-09	R2F: Repurposing Ray Frontiers for LLM-free Object Navigation	Francesco Argenziano et.al.	2603.08475	translate	read	null
2026-03-09	LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing	Dongfang Li et.al.	2603.08453	translate	read	null
2026-03-09	A prospective clinical feasibility study of a conversational diagnostic AI in an ambulatory primary care clinic	Peter Brodeur et.al.	2603.08448	translate	read	null
2026-03-09	LLM-Driven Online Aggregation for Unstructured Text Analytics	Chao Hui et.al.	2603.08443	translate	read	null
2026-03-09	Sandpiper: Orchestrated AI-Annotation for Educational Discourse at Scale	Daryl Hedley et.al.	2603.08406	translate	read	null
2026-03-09	Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective	Liyuan Mao et.al.	2603.08398	translate	read	null
2026-03-09	COACH meets QUORUM: A Framework and Pipeline for Aligning User, Expert and Developer Perspectives in LLM-generated Health Counselling	Yee Man Ng et.al.	2603.08392	translate	read	null
2026-03-09	AULLM++: Structural Reasoning with Large Language Models for Micro-Expression Recognition	Zhishu Liu et.al.	2603.08387	translate	read	null
2026-03-09	M $^3$ -ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering	Peijin Xie et.al.	2603.08369	translate	read	null
2026-03-09	SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation	Yagiz Can Akay et.al.	2603.08329	translate	read	null
2026-03-09	Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design	Hai Xia et.al.	2603.08322	translate	read	null
2026-03-09	CORE-Acu: Structured Reasoning Traces and Knowledge Graph Safety Verification for Acupuncture Clinical Decision Support	Liuyi Xu et.al.	2603.08321	translate	read	null
2026-03-09	AdaCultureSafe: Adaptive Cultural Safety Grounded by Cultural Knowledge in Large Language Models	Hankun Kang et.al.	2603.08275	translate	read	null
2026-03-09	How Much Do LLMs Hallucinate in Document Q&A Scenarios? A 172-Billion-Token Study Across Temperatures, Context Lengths, and Hardware Platforms	JV Roig et.al.	2603.08274	translate	read	null
2026-03-09	Towards a more efficient bias detection in financial language models	Firas Hadj Kacem et.al.	2603.08267	translate	read	null
2026-03-09	FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use	Jiaxuan Lu et.al.	2603.08262	translate	read	null
2026-03-09	NCL-UoR at SemEval-2026 Task 5: Embedding-Based Methods, Fine-Tuning, and LLMs for Word Sense Plausibility Rating	Tong Wu et.al.	2603.08256	translate	read	null
2026-03-09	Fibration Policy Optimization	Chang Li et.al.	2603.08239	translate	read	null
2026-03-09	The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs	Yonghong Deng et.al.	2603.08234	translate	read	null
2026-03-09	Supporting Workflow Reproducibility by Linking Bioinformatics Tools across Papers and Executable Code	Clémence Sebe et.al.	2603.08195	translate	read	null
2026-03-09	SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization	Yeonsik Park et.al.	2603.08185	translate	read	null
2026-03-09	TildeOpen LLM: Leveraging Curriculum Learning to Achieve Equitable Language Representation	Toms Bergmanis et.al.	2603.08182	translate	read	null
2026-03-09	AutoAdapt: An Automated Domain Adaptation Framework for LLMs	Sidharth Sinha et.al.	2603.08181	translate	read	null
2026-03-09	MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals	Junyu Shen et.al.	2603.08174	translate	read	null
2026-03-09	RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs	Zhijun Wang et.al.	2603.08166	translate	read	null
2026-03-09	The Differential Effects of Agreeableness and Extraversion on Older Adults’ Perceptions of Conversational AI Explanations in Assistive Settings	Niharika Mathur et.al.	2603.08164	translate	read	null
2026-03-09	Gender Bias in MT for a Genderless Language: New Benchmarks for Basque	Amaia Murillo et.al.	2603.08153	translate	read	null
2026-03-09	Gradually Excavating External Knowledge for Implicit Complex Question Answering	Chang Liu et.al.	2603.08148	translate	read	null
2026-03-09	EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery	Yougang Lyu et.al.	2603.08127	translate	read	null
2026-03-09	SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving	Zihan You et.al.	2603.08113	translate	read	null
2026-03-09	Invisible Safety Threat: Malicious Finetuning for LLM via Steganography	Guangnian Wan et.al.	2603.08104	translate	read	null
2026-03-09	Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization	Hongli Zhou et.al.	2603.08091	translate	read	null
2026-03-09	EAGLE-Pangu: Accelerator-Safe Tree Speculative Decoding on Ascend NPUs	Chang Han et.al.	2603.08088	translate	read	null
2026-03-09	From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation	Yudai Noda et.al.	2603.08086	translate	read	null
2026-03-09	The AI Amplifier Effect: Defining Human-AI Intimacy and Romantic Relationships with Conversational AI	Ching Christie Pang et.al.	2603.08084	translate	read	null
2026-03-09	High-Fidelity Pruning for Large Language Models	Yijun Zhu et.al.	2603.08083	translate	read	null
2026-03-09	Why Large Language Models can Secretly Outperform Embedding Similarity in Information Retrieval	Matei Benescu et.al.	2603.08077	translate	read	null
2026-03-09	Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models	Xuesong Wang et.al.	2603.08069	translate	read	null
2026-03-09	In-Context Reinforcement Learning for Tool Use in Large Language Models	Yaoqi Ye et.al.	2603.08068	translate	read	null
2026-03-09	Deterministic Differentiable Structured Pruning for Large Language Models	Weiyu Huang et.al.	2603.08065	translate	read	null
2026-03-09	CinemaWorld: Generative Augmented Reality with LLMs and 3D Scene Generation for Movie Augmentation	Keiichi Ihara et.al.	2603.08060	translate	read	null
2026-03-09	Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor	Jiayu Huang et.al.	2603.08058	translate	read	null
2026-03-09	S2S-FDD: Bridging Industrial Time Series and Natural Language for Explainable Zero-shot Fault Diagnosis	Baoxue Li et.al.	2603.08048	translate	read	null
2026-03-09	CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling	Dengcan Liu et.al.	2603.08035	translate	read	null
2026-03-09	ConflictBench: Evaluating Human-AI Conflict via Interactive and Visually Grounded Environments	Weixiang Zhao et.al.	2603.08024	translate	read	null
2026-03-09	Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization	Jingwei Li et.al.	2603.08022	translate	read	null
2026-03-09	Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared	Yafei Zhang et.al.	2603.08018	translate	read	null
2026-03-09	FedMomentum: Preserving LoRA Training Momentum in Federated Fine-Tuning	Peishen Yan et.al.	2603.08014	translate	read	null
2026-03-09	PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents	Yuxiang Chai et.al.	2603.08013	translate	read	null
2026-03-09	SmartThinker: Progressive Chain-of-Thought Length Calibration for Efficient Large Language Model Reasoning	Chenzhi Hu et.al.	2603.08000	translate	read	null
2026-03-09	CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval	Haozhou Li et.al.	2603.07997	translate	read	null
2026-03-09	AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models	Teng Wang et.al.	2603.07989	translate	read	null
2026-03-09	Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning	Wei Yang et.al.	2603.07972	translate	read	null
2026-03-09	GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial Accelerators	Wulve Yang et.al.	2603.07962	translate	read	null
2026-03-09	SGG-R $^{\rm 3}$ : From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation	Jiaye Feng et.al.	2603.07961	translate	read	null
2026-03-09	ELLMob: Event-Driven Human Mobility Generation with Self-Aligned LLM Framework	Yusong Wang et.al.	2603.07946	translate	read	null
2026-03-09	AI Agents, Language, Deep Learning and the Next Revolution in Science	Ke Li et.al.	2603.07940	translate	read	null
2026-03-09	Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis	Ethan Young et.al.	2603.07936	translate	read	null
2026-03-09	BRIDGE: Benchmark for multi-hop Reasoning In long multimodal Documents with Grounded Evidence	Biao Xiang et.al.	2603.07931	translate	read	null
2026-03-09	SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training	Xin-Cheng Wen et.al.	2603.07927	translate	read	null
2026-03-09	LeJOT-AutoML: LLM-Driven Feature Engineering for Job Execution Time Prediction in Databricks Cost Optimization	Lizhi Ma et.al.	2603.07897	translate	read	null
2026-03-09	Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference	Noah Golowich et.al.	2603.07887	translate	read	null
2026-03-09	CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases	Xiaona Xue et.al.	2603.07886	translate	read	null
2026-03-09	What Do AI Agents Talk About? Emergent Communication Structure in the First AI-Only Social Network	Taksch Dube et.al.	2603.07880	translate	read	null
2026-03-09	Hospitality-VQA: Decision-Oriented Informativeness Evaluation for Vision-Language Models	Jeongwoo Lee et.al.	2603.07868	translate	read	null
2026-03-08	An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data	Trinh Pham et.al.	2603.07841	translate	read	null
2026-03-08	AI Steerability 360: A Toolkit for Steering Large Language Models	Erik Miehling et.al.	2603.07837	translate	read	null
2026-03-08	AI Misuse in Education Is a Measurement Problem: Toward a Learning Visibility Framework	Eduardo Davalos et.al.	2603.07834	translate	read	null
2026-03-08	Benchmarking Large Language Models for Quebec Insurance: From Closed-Book to Retrieval-Augmented Generation	David Beauchemin et.al.	2603.07825	translate	read	null
2026-03-08	Reasoning Knowledge-Gap in Drone Planning via LLM-based Active Elicitation	Zeyu Fang et.al.	2603.07824	translate	read	null
2026-03-08	Temperature-Aware Scheduling of LLM Inference in Large-Scale Geo-Distributed Edge Data Centers with Distributed Optimization	Arash Khalatbarisoltani et.al.	2603.07810	translate	read	null
2026-03-08	Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context	Ashish Pandey et.al.	2603.07792	translate	read	null
2026-03-08	ArcLight: A Lightweight LLM Inference Architecture for Many-Core CPUs	Yuzhuang Xu et.al.	2603.07770	translate	read	null
2026-03-08	MedQ-Deg: A Multidimensional Benchmark for Evaluating MLLMs Across Medical Image Quality Degradations	Jiyao Liu et.al.	2603.07769	translate	read	null
2026-03-08	QuadAI at SemEval-2026 Task 3: Ensemble Learning of Hybrid RoBERTa and LLMs for Dimensional Aspect-Based Sentiment Analysis	A. J. W. de Vink et.al.	2603.07766	translate	read	null
2026-03-08	3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models	Shaoxiong Zhan et.al.	2603.07751	translate	read	null
2026-03-02	Symbol-Equivariant Recurrent Reasoning Models	Richard Freinschlag et.al.	2603.02193	translate	read	null
2026-03-02	Multi-Head Low-Rank Attention	Songtao Liu et.al.	2603.02188	translate	read	null
2026-03-02	How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks	Mohamed Amine Ferrag et.al.	2603.02156	translate	read	null
2026-03-02	Zero- and Few-Shot Named-Entity Recognition: Case Study and Dataset in the Crime Domain (CrimeNER)	Miguel Lopez-Duran et.al.	2603.02150	translate	read	null
2026-03-02	LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards	Guanzheng Chen et.al.	2603.02146	translate	read	null
2026-03-02	LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations	Veronika Solopova et.al.	2603.02128	translate	read	null
2026-03-02	Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning	Justin Waugh et.al.	2603.02119	translate	read	null
2026-03-02	Recursive Think-Answer Process for LLMs and VLMs	Byung-Kwan Lee et.al.	2603.02099	translate	read	null
2026-03-02	OmniRet: Efficient and High-Fidelity Omni Modality Retrieval	Chuong Huynh et.al.	2603.02098	translate	read	null
2026-03-02	ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels	Xiang Zheng et.al.	2603.02097	translate	read	null
2026-03-02	Adam Converges Without Any Modification On Update Rules	Yushun Zhang et.al.	2603.02092	translate	read	null
2026-03-02	Learning from Synthetic Data Improves Multi-hop Reasoning	Anmol Kabra et.al.	2603.02091	translate	read	null
2026-03-02	GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered	Jiale Lao et.al.	2603.02081	translate	read	null
2026-03-02	Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines	Ding Pan et.al.	2603.02075	translate	read	null
2026-03-02	Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning	Guilhem Fouilhé et.al.	2603.02070	translate	read	null
2026-03-02	Beyond Microservices: Testing Web-Scale RCA Methods on GPU-Driven LLM Workloads	Dominik Scheinert et.al.	2603.02057	translate	read	null
2026-03-02	Expanding LLM Agent Boundaries with Strategy-Guided Exploration	Andrew Szot et.al.	2603.02045	translate	read	null
2026-03-02	EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training	Aleksei Dorkin et.al.	2603.02041	translate	read	null
2026-03-02	LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers	Fabian Schmidt et.al.	2603.02035	translate	read	null
2026-03-02	MetaRCA: A Generalizable Root Cause Analysis Framework for Cloud-Native Systems Powered by Meta Causal Knowledge	Shuai Liang et.al.	2603.02032	translate	read	null
2026-03-02	Learning to Read Where to Look: Disease-Aware Vision-Language Pretraining for 3D CT	Simon Ging et.al.	2603.02026	translate	read	null
2026-03-02	MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning	Jiachun Li et.al.	2603.02024	translate	read	link
2026-03-02	CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production	Yixin Nie et.al.	2603.01973	translate	read	null
2026-03-02	LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations	Viet-Thanh Pham et.al.	2603.01952	translate	read	null
2026-03-02	When Numbers Tell Half the Story: Human-Metric Alignment in Topic Model Evaluation	Thibault Prouteau et.al.	2603.01945	translate	read	null
2026-03-02	Ignore All Previous Instructions: Jailbreaking as a de-escalatory peace building practise to resist LLM social media bots	Huw Day et.al.	2603.01942	translate	read	null
2026-03-02	Real Money, Fake Models: Deceptive Model Claims in Shadow APIs	Yage Zhang et.al.	2603.01919	translate	read	null
2026-03-02	AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth	Shixiang Song et.al.	2603.01914	translate	read	null
2026-03-02	Efficient RLVR Training via Weighted Mutual Information Data Selection	Xinyu Zhou et.al.	2603.01907	translate	read	null
2026-03-02	VietSuperSpeech: A Large-Scale Vietnamese Conversational Speech Dataset for ASR Fine-Tuning in Chatbot, Customer Support, and Call Center Applications	Loan Do et.al.	2603.01894	translate	read	null
2026-03-02	KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models	Songming Zhang et.al.	2603.01875	translate	read	null
2026-03-02	Let the Agent Search: Autonomous Exploration Beats Rigid Workflows in Temporal Question Answering	Xufei Lv et.al.	2603.01853	translate	read	null
2026-03-02	Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions	Vineeth Venugopal et.al.	2603.01834	translate	read	null
2026-03-02	OpenAutoNLU: Open Source AutoML Library for NLU	Grigory Arshinov et.al.	2603.01824	translate	read	null
2026-03-02	Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models	Eric Lacosse et.al.	2603.01822	translate	read	null
2026-03-02	Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding	Zhiyuan Zhou et.al.	2603.01816	translate	read	null
2026-03-02	Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition	Mingwei Liu et.al.	2603.01814	translate	read	null
2026-03-02	ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs	Xunlei Chen et.al.	2603.01792	translate	read	null
2026-03-02	nchellwig at SemEval-2026 Task 3: Self-Consistent Structured Generation (SCSG) for Dimensional Aspect-Based Sentiment Analysis using Large Language Models	Nils Constantin Hellwig et.al.	2603.01788	translate	read	null
2026-03-02	Co-Evolutionary Multi-Modal Alignment via Structured Adversarial Evolution	Guoxin Shi et.al.	2603.01784	translate	read	null
2026-03-02	GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation	Yifan Wang et.al.	2603.01783	translate	read	null
2026-03-02	LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction	Nils Constantin Hellwig et.al.	2603.01778	translate	read	null
2026-03-02	FreeAct: Freeing Activations for LLM Quantization	Xiaohao Liu et.al.	2603.01776	translate	read	null
2026-03-02	Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation	Harry Stuart et.al.	2603.01775	translate	read	null
2026-03-02	AnnoABSA: A Web-Based Annotation Tool for Aspect-Based Sentiment Analysis with Retrieval-Augmented Suggestions	Nils Constantin Hellwig et.al.	2603.01773	translate	read	null
2026-03-02	Bootstrapping Embeddings for Low Resource Languages	Merve Basoz et.al.	2603.01732	translate	read	null
2026-03-02	Learning Domain-Aware Task Prompt Representations for Multi-Domain All-in-One Image Restoration	Guanglu Dong et.al.	2603.01725	translate	read	null
2026-03-02	GMP: A Benchmark for Content Moderation under Co-occurring Violations and Dynamic Rules	Houde Dong et.al.	2603.01724	translate	read	null
2026-03-02	Changes in Manuscript Length, Research Team Size, and International Collaboration in the Post-2022 Period: Evidence from PLOS ONE	Yossi Ben-Zion et.al.	2603.01718	translate	read	null
2026-03-02	FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents	Qizheng Li et.al.	2603.01712	translate	read	null
2026-03-02	Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning	Haonan Jia et.al.	2603.01696	translate	read	null
2026-03-02	Building a Strong Instruction Language Model for a Less-Resourced Language	Domen Vreš et.al.	2603.01691	translate	read	null
2026-03-02	Surgical Post-Training: Cutting Errors, Keeping Knowledge	Wenye Lin et.al.	2603.01683	translate	read	null
2026-03-02	CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development	Yuhang Yang et.al.	2603.01654	translate	read	null
2026-03-02	LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence	Anka Chandrahas Tummepalli et.al.	2603.01651	translate	read	null
2026-03-02	Learning Structured Reasoning via Tractable Trajectory Control	Po-Nien Kung et.al.	2603.01641	translate	read	null
2026-03-02	Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning	Jiebin Zhang et.al.	2603.01639	translate	read	null
2026-03-02	Who Explains Privacy Policies to Me? Embodied and Textual LLM-Powered Privacy Assistants in Virtual Reality	Vincent Freiberger et.al.	2603.01638	translate	read	null
2026-03-02	DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving	Enhui Ma et.al.	2603.01637	translate	read	null
2026-03-02	Assessing Crime Disclosure Patterns in a Large-Scale Cybercrime Forum	Raphael Hoheisel et.al.	2603.01624	translate	read	null
2026-03-02	IDProxy: Cold-Start CTR Prediction for Ads and Recommendation at Xiaohongshu with Multimodal LLMs	Yubin Zhang et.al.	2603.01590	translate	read	null
2026-03-02	SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond	Xiangyang Zhu et.al.	2603.01589	translate	read	null
2026-03-02	DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern	Xiaoyi Pang et.al.	2603.01574	translate	read	null
2026-03-02	Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation	Yi Gu et.al.	2603.01565	translate	read	null
2026-03-02	From Secure Agentic AI to Secure Agentic Web: Challenges, Threats, and Future Directions	Zhihang Deng et.al.	2603.01564	translate	read	null
2026-03-02	LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models	Chenxing Wei et.al.	2603.01563	translate	read	null
2026-03-02	RubricBench: Aligning Model-Generated Rubrics with Human Standards	Qiyuan Zhang et.al.	2603.01562	translate	read	link
2026-03-02	Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring	Aditya Shukla et.al.	2603.01557	translate	read	null
2026-03-02	S5-HES Agent: Society 5.0-driven Agentic Framework to Democratize Smart Home Environment Simulation	Akila Siriweera et.al.	2603.01554	translate	read	null
2026-03-02	Extracting Training Dialogue Data from Large Language Model based Task Bots	Shuo Zhang et.al.	2603.01550	translate	read	null
2026-03-02	Training-Free Spatio-temporal Decoupled Reasoning Video Segmentation with Adaptive Object Memory	Zhengtong Zhu et.al.	2603.01545	translate	read	null
2026-03-02	FATE: Closed-Loop Feasibility-Aware Task Generation with Active Repair for Physically Grounded Robotic Curricula	Bingchuan Wei et.al.	2603.01505	translate	read	null
2026-03-02	GAC: Stabilizing Asynchronous RL Training for LLMs via Gradient Alignment Control	Haofeng Xu et.al.	2603.01501	translate	read	null
2026-03-02	Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report)	Yu Lin et.al.	2603.01499	translate	read	null
2026-03-02	Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision	Manisha Mukherjee et.al.	2603.01494	translate	read	null
2026-03-02	LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning	Chang Yao et.al.	2603.01488	translate	read	null
2026-03-02	Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents	Haojin Yang et.al.	2603.01481	translate	read	null
2026-03-02	SFCo-Nav: Efficient Zero-Shot Visual Language Navigation via Collaboration of Slow LLM and Fast Attributed Graph Alignment	Chaoran Xiong et.al.	2603.01477	translate	read	null
2026-03-02	Reconstructing Content via Collaborative Attention to Improve Multimodal Embedding Quality	Jiahan Chen et.al.	2603.01471	translate	read	null
2026-03-02	ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning	Congying Liu et.al.	2603.01464	translate	read	null
2026-03-02	Production-Grade AI Coding System for Client-Side Development	Ruihan Wang et.al.	2603.01460	translate	read	null
2026-03-02	From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents	Niu Lian et.al.	2603.01455	translate	read	null
2026-03-02	VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models	Duoxun Tang et.al.	2603.01454	translate	read	null
2026-03-02	Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents	Yuxin Liu et.al.	2603.01438	translate	read	null
2026-03-02	Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering	Kyle Cox et.al.	2603.01437	translate	read	null
2026-03-02	LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval	Jiajie Jin et.al.	2603.01425	translate	read	null
2026-03-02	Quantifying Conversational Reliability of Large Language Models under Multi-Turn Interaction	Jiyoon Myung et.al.	2603.01423	translate	read	null
2026-03-02	SciDER: Scientific Data-centric End-to-end Researcher	Ke Lin et.al.	2603.01421	translate	read	null
2026-03-02	ReFeed: Retrieval Feedback-Guided Dataset Construction for Style-Aware Query Rewriting	Jiyoon Myung et.al.	2603.01417	translate	read	null
2026-03-02	Jailbreaking Embodied LLMs via Action-level Manipulation	Xinyu Huang et.al.	2603.01414	translate	read	null
2026-03-02	When Humans Don’t Feel Like an Option: Contextual Factors That Shape When Older Adults Turn to Conversational AI for Emotional Support	Mengqi Shi et.al.	2603.01413	translate	read	null
2026-03-02	GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning	Yuchen Ying et.al.	2603.01410	translate	read	null
2026-03-02	MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning	Sicheng Zhu et.al.	2603.01409	translate	read	null
2026-03-02	Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models	Jinlong Li et.al.	2603.01400	translate	read	null
2026-03-02	Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification	Guang Huang et.al.	2603.01399	translate	read	null
2026-03-02	Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning	Zhongjian Zhang et.al.	2603.01385	translate	read	null
2026-03-02	3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs	Mehdi Makni et.al.	2603.01376	translate	read	null
2026-03-02	Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation	Chenxing Wei et.al.	2603.01375	translate	read	null
2026-03-02	PanCanBench: A Comprehensive Benchmark for Evaluating Large Language Models in Pancreatic Oncology	Yimin Zhao et.al.	2603.01343	translate	read	null
2026-03-02	Structural Hallucination in Large Language Models: A Network-Based Evaluation of Knowledge Organization and Citation Integrity	Moses Boudourides et.al.	2603.01341	translate	read	null
2026-03-01	SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution	Kang He et.al.	2603.01327	translate	read	null
2026-03-01	Truth as a Trajectory: What Internal Representations Reveal About Large Language Model Reasoning	Hamed Damirchi et.al.	2603.01326	translate	read	null
2026-03-01	Caught in a Mafia Romance: How Users Explore Intimate Roleplay and Narrative Exploration with Chatbots	Julia Kieserman et.al.	2603.01319	translate	read	null
2026-03-01	Actor’s Note: Examining the Role of AI-Generated Questions in Character Journaling for Actor Training	Sora Kang et.al.	2603.01314	translate	read	null
2026-03-01	Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models	Adel Javanmard et.al.	2603.01293	translate	read	null
2026-03-01	JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks	Masahiro Kaneko et.al.	2603.01291	translate	read	null
2026-03-01	Individual Turing Test: A Case Study of LLM-based Simulation Using Longitudinal Personal Data	Minghao Guo et.al.	2603.01289	translate	read	null
2026-03-01	Attention Smoothing Is All You Need For Unlearning	Saleh Zare Zade et.al.	2603.01285	translate	read	null
2026-03-01	GlassMol: Interpretable Molecular Property Prediction with Concept Bottleneck Models	Oscar Rivera et.al.	2603.01274	translate	read	null
2026-03-01	NeuroSCA: Neuro-Symbolic Constraint Abstraction for Smart Contract Hybrid Fuzzing	Haochen Liang et.al.	2603.01272	translate	read	null
2026-03-01	MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Heterogeneous Multi-Agent RL, LLM, VLM, and Human Decision-Makers	Abdulhamid M. Mousa et.al.	2603.01260	translate	read	null
2026-03-01	A Systematic Study of LLM-Based Architectures for Automated Patching	Qingxiao Xu et.al.	2603.01257	translate	read	null
2026-03-01	Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation	Liwen Sun et.al.	2603.01252	translate	read	null
2026-03-01	Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders	David Campbell et.al.	2603.01246	translate	read	null
2026-03-01	Suffix-Constrained Greedy Search Algorithms for Causal Language Models	Ayoub Hammal et.al.	2603.01243	translate	read	null
2026-03-01	Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape Model Confidence	Harshavardhan et.al.	2603.01239	translate	read	null
2026-03-01	The Lattice Representation Hypothesis of Large Language Models	Bo Xiong et.al.	2603.01227	translate	read	null
2026-03-01	Can Thinking Models Think to Detect Hateful Memes?	Mohamed Bayan Kmainasi et.al.	2603.01225	translate	read	null
2026-03-01	Generative AI & Fictionality: How Novels Power Large Language Models	Edwin Roland et.al.	2603.01220	translate	read	null
2026-03-01	Reasoning Boosts Opinion Alignment in LLMs	Frédéric Berdoz et.al.	2603.01214	translate	read	null
2026-03-01	Can AI Agents Agree?	Frédéric Berdoz et.al.	2603.01213	translate	read	null
2026-03-01	Token-level Data Selection for Safe LLM Fine-tuning	Yanping Li et.al.	2603.01185	translate	read	null
2026-03-01	HAVEN: High-Bandwidth Flash Augmented Vector Engine for Large-Scale Approximate Nearest-Neighbor Search Acceleration	Po-Kai Hsu et.al.	2603.01175	translate	read	null
2026-03-01	DEP: A Decentralized Large Language Model Evaluation Protocol	Jianxiang Peng et.al.	2603.01167	translate	read	null
2026-03-01	Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic	Hongyi Zhou et.al.	2603.01162	translate	read	null
2026-03-01	Semantic XPath: Structured Agentic Memory Access for Conversational AI	Yifan Simon Liu et.al.	2603.01160	translate	read	null
2026-03-01	vEcho: A Paradigm Shift from Vulnerability Verification to Proactive Discovery with Large Language Models	Mingcheng Jiang et.al.	2603.01154	translate	read	null
2026-03-01	ArtLLM: Generating Articulated Assets via 3D LLM	Penghao Wang et.al.	2603.01142	translate	read	null
2026-03-01	FCN-LLM: Empower LLM for Brain Functional Connectivity Network Understanding via Graph-level Multi-task Instruction Tuning	Xingcan Hu et.al.	2603.01135	translate	read	null
2026-03-01	MedCollab: Causal-Driven Multi-Agent Collaboration for Full-Cycle Clinical Diagnosis via IBIS-Structured Argumentation	Yuqi Zhan et.al.	2603.01131	translate	read	null
2026-03-01	From Dialogue to Execution: Mixture-of-Agents Assisted Interactive Planning for Behavior Tree-Based Long-Horizon Robot Execution	Kanata Suzuki et.al.	2603.01113	translate	read	null
2026-03-01	DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage	Haowen Gao et.al.	2603.01106	translate	read	null
2026-03-01	Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI	Sicheng Yang et.al.	2603.01104	translate	read	null
2026-03-01	Understanding LoRA as Knowledge Memory: An Empirical Analysis	Seungju Back et.al.	2603.01097	translate	read	null
2026-03-01	Alien Science: Sampling Coherent but Cognitively Unavailable Research Directions from Idea Atoms	Alejandro H. Artiles et.al.	2603.01092	translate	read	null
2026-03-01	CARD: Towards Conditional Design of Multi-agent Topological Structures	Tongtong Wu et.al.	2603.01089	translate	read	null
2026-03-01	Beyond Global Similarity: Towards Fine-Grained, Multi-Condition Multimodal Retrieval	Xuan Lu et.al.	2603.01082	translate	read	null
2026-03-01	How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning	Xiangxiang Zhang et.al.	2603.01070	translate	read	null
2026-03-01	GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant	Zhuokang Shen et.al.	2603.01059	translate	read	null
2026-03-01	MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning	Eileen Wang et.al.	2603.01055	translate	read	null
2026-03-01	CelloAI Benchmarks: Toward Repeatable Evaluation of AI Assistants	Mohammad Atif et.al.	2603.01051	translate	read	null
2026-03-01	Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems	Yuzhe Zhang et.al.	2603.01045	translate	read	null
2026-03-01	Thoth: Mid-Training Bridges LLMs to Time Series Understanding	Jiafeng Lin et.al.	2603.01042	translate	read	link
2026-03-01	One-Token Verification for Reasoning Correctness Estimation	Zhan Zhuang et.al.	2603.01025	translate	read	null
2026-03-01	GeoMCP: A Trustworthy Framework for AI-Assisted Analytical Geotechnical Engineering	Yared W. Bekele et.al.	2603.01022	translate	read	null
2026-03-01	FastCode: Fast and Cost-Efficient Code Understanding and Reasoning	Zhonghang Li et.al.	2603.01012	translate	read	null
2026-03-01	CollabEval: Enhancing LLM-as-a-Judge via Multi-Agent Collaboration	Yiyue Qian et.al.	2603.00993	translate	read	null
2026-03-01	Sustainable Code Generation Using Large Language Models: A Systematic Literature Review	Sabiya Banu Masthan Ali et.al.	2603.00989	translate	read	null
2026-03-01	HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents	Hongbo Jin et.al.	2603.00977	translate	read	null
2026-03-01	Stabilizing Policy Optimization via Logits Convexity	Hongzhan Chen et.al.	2603.00963	translate	read	null
2026-03-01	S-VoCAL: A Dataset and Evaluation Framework for Inferring Speaking Voice Character Attributes in Literature	Abigail Berthe-Pardo et.al.	2603.00958	translate	read	null
2026-03-01	Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos	Shreshth Saini et.al.	2603.00938	translate	read	null
2026-03-01	Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications	Md. Adnanul Islam et.al.	2603.00931	translate	read	null
2026-03-01	Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains	Manil Shrestha et.al.	2603.00924	translate	read	null
2026-03-01	Hybrid Neural-LLM Pipeline for Morphological Glossing in Endangered Language Documentation: A Case Study of Jungar Tuvan	Siyu Liang et.al.	2603.00923	translate	read	null
2026-03-01	DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving	Zhiye Wang et.al.	2603.00919	translate	read	null
2026-03-01	Prompt Sensitivity and Answer Consistency of Small Open-Source Large Language Models on Clinical Question Answering: Implications for Low-Resource Healthcare Deployment	Shravani Hariprasad et.al.	2603.00917	translate	read	null
2026-03-01	Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization	Theophilus Amaefuna et.al.	2603.00910	translate	read	null
2026-03-01	KVSlimmer: Theoretical Insights and Practical Optimizations for Asymmetric KV Merging	Lianjun Liu et.al.	2603.00907	translate	read	null
2026-03-01	pySpatial: Generating 3D Visual Programs for Zero-Shot Spatial Reasoning	Zhanpeng Luo et.al.	2603.00905	translate	read	null
2026-03-01	Detect Repair Verify for Securing LLM Generated Code: A Multi-Language Empirical Study	Cheng Cheng et.al.	2603.00897	translate	read	null
2026-03-01	Evaluating AI Grading on Real-World Handwritten College Mathematics: A Large-Scale Study Toward a Benchmark	Zhiqi Yu et.al.	2603.00895	translate	read	null
2026-03-01	CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning	Xinyu Zhu et.al.	2603.00889	translate	read	null
2026-03-01	BioProAgent: Neuro-Symbolic Grounding for Constrained Scientific Planning	Yuyang Liu et.al.	2603.00876	translate	read	null
2026-03-01	MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains	Xuying Ning et.al.	2603.00873	translate	read	null
2026-03-01	PARCER as an Operational Contract to Reduce Variance, Cost, and Risk in LLM Systems	Elzo Brito dos Santos Filho et.al.	2603.00856	translate	read	null
2026-03-01	Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models	Yichao Wu et.al.	2603.00846	translate	read	null

(<a href=../LLM.md>back to LLM</a>)