LLM - 2025-02

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
|---|---|---|---|---|---|---|
| 2025-02-28 | LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Komal Kumar et.al. | 2502.21321 | translate | read | null |
| 2025-02-28 | FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Yihong Dong et.al. | 2502.21309 | translate | read | null |
| 2025-02-28 | Contextualizing biological perturbation experiments through language | Menghua Wu et.al. | 2502.21290 | translate | read | null |
| 2025-02-28 | Adaptive Keyframe Sampling for Long Video Understanding | Xi Tang et.al. | 2502.21271 | translate | read | null |
| 2025-02-28 | Token-level Ensembling of Models with Different Vocabularies | Rachel Wicks et.al. | 2502.21265 | translate | read | null |
| 2025-02-28 | RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Yuheng Ji et.al. | 2502.21257 | translate | read | null |
| 2025-02-28 | Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Xiaomin Li et.al. | 2502.21239 | translate | read | null |
| 2025-02-28 | Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication | Daniil Filienko et.al. | 2502.21236 | translate | read | null |
| 2025-02-28 | ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs | Hao Ge et.al. | 2502.21231 | translate | read | null |
| 2025-02-27 | R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts | Zhongyang Li et.al. | 2502.20395 | translate | read | null |
| 2025-02-27 | Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis | Jeffrey Yang Fan Chiang et.al. | 2502.20383 | translate | read | null |
| 2025-02-27 | Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers | Shalev Lifshitz et.al. | 2502.20379 | translate | read | null |
| 2025-02-27 | PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation | Albert Gong et.al. | 2502.20377 | translate | read | null |
| 2025-02-27 | Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization | Ryan C. Barron et.al. | 2502.20364 | translate | read | null |
| 2025-02-27 | Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs | Kuan Lok Zhou et.al. | 2502.20356 | translate | read | null |
| 2025-02-27 | KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model | Kai Zhang et.al. | 2502.20350 | translate | read | null |
| 2025-02-27 | Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models | Yi Jing et.al. | 2502.20344 | translate | read | null |
| 2025-02-27 | Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Daniele Paliotta et.al. | 2502.20339 | translate | read | null |
| 2025-02-27 | Expertise Is What We Want | Alan Ashworth et.al. | 2502.20335 | translate | read | null |
| 2025-02-26 | Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing | Akshat Gupta et.al. | 2502.19416 | translate | read | null |
| 2025-02-26 | Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs | Dayu Yang et.al. | 2502.19411 | translate | read | null |
| 2025-02-26 | Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices | Xinru Wang et.al. | 2502.19410 | translate | read | null |
| 2025-02-26 | ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models | Danae Sánchez Villegas et.al. | 2502.19409 | translate | read | null |
| 2025-02-26 | Learning Code-Edit Embedding to Model Student Debugging Behavior | Hasnain Heickal et.al. | 2502.19407 | translate | read | null |
| 2025-02-26 | General Reasoning Requires Learning to Reason from the Get-go | Seungwook Han et.al. | 2502.19402 | translate | read | null |
| 2025-02-26 | TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding | Max Ku et.al. | 2502.19400 | translate | read | null |
| 2025-02-26 | Residual Speech Embeddings for Tone Classification: Removing Linguistic Content to Enhance Paralinguistic Analysis | Hamdan Al Ahbabi et.al. | 2502.19387 | translate | read | null |
| 2025-02-26 | DataMan: Data Manager for Pre-training Large Language Models | Ru Peng et.al. | 2502.19363 | translate | read | null |
| 2025-02-26 | Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? | Yancheng He et.al. | 2502.19361 | translate | read | null |
| 2025-02-25 | DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers | Xueguang Ma et.al. | 2502.18460 | translate | read | null |
| 2025-02-25 | LLM-Based Design Pattern Detection | Christian Schindler et.al. | 2502.18458 | translate | read | null |
| 2025-02-25 | FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response | Mollie Shichman et.al. | 2502.18452 | translate | read | null |
| 2025-02-25 | SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution | Yuxiang Wei et.al. | 2502.18449 | translate | read | null |
| 2025-02-25 | MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning | Chanwoo Park et.al. | 2502.18439 | translate | read | null |
| 2025-02-25 | TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Frederikus Hudi et.al. | 2502.18431 | translate | read | null |
| 2025-02-25 | OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference | Xiangyu Zhao et.al. | 2502.18411 | translate | read | null |
| 2025-02-25 | Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods | Nicola Cecere et.al. | 2502.18389 | translate | read | null |
| 2025-02-25 | How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities | Minhua Lin et.al. | 2502.18387 | translate | read | null |
| 2025-02-25 | MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning | Sepehr Asgarian et.al. | 2502.18371 | translate | read | null |
| 2025-02-24 | Introducing Visual Perception Token into Multimodal Large Language Model | Runpeng Yu et.al. | 2502.17425 | translate | read | link |
| 2025-02-24 | MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs | Jiarui Zhang et.al. | 2502.17422 | translate | read | link |
| 2025-02-24 | LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Penghui Yang et.al. | 2502.17421 | translate | read | link |
| 2025-02-24 | The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence | Tom Wollschläger et.al. | 2502.17420 | translate | read | null |
| 2025-02-24 | From System 1 to System 2: A Survey of Reasoning Large Language Models | Zhong-Zhi Li et.al. | 2502.17419 | translate | read | link |
| 2025-02-24 | Reasoning with Latent Thoughts: On the Power of Looped Transformers | Nikunj Saunshi et.al. | 2502.17416 | translate | read | null |
| 2025-02-24 | COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs | Liming Liu et.al. | 2502.17410 | translate | read | link |
| 2025-02-24 | Large Language Models are Powerful EHR Encoders | Stefan Hegselmann et.al. | 2502.17403 | translate | read | null |
| 2025-02-24 | DIS-CO: Discovering Copyrighted Content in VLMs Training Data | André V. Duarte et.al. | 2502.17358 | translate | read | link |
| 2025-02-24 | On Relation-Specific Neurons in Large Language Models | Yihong Liu et.al. | 2502.17355 | translate | read | link |
| 2025-02-21 | ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval | Guanqi Zhan et.al. | 2502.15682 | translate | read | null |
| 2025-02-21 | Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Jaydeep Borkar et.al. | 2502.15680 | translate | read | null |
| 2025-02-21 | FLEKE: Federated Locate-then-Edit Knowledge Editing | Zongkai Zhao et.al. | 2502.15677 | translate | read | null |
| 2025-02-21 | AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Zhining Zhang et.al. | 2502.15676 | translate | read | null |
| 2025-02-21 | Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing | Shoumik Saha et.al. | 2502.15666 | translate | read | null |
| 2025-02-21 | Machine-generated text detection prevents language model collapse | George Drayson et.al. | 2502.15654 | translate | read | null |
| 2025-02-21 | Empowering LLMs with Logical Reasoning: A Comprehensive Survey | Fengxiang Cheng et.al. | 2502.15652 | translate | read | null |
| 2025-02-21 | Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models | Anirudh Sundar et.al. | 2502.15639 | translate | read | null |
| 2025-02-21 | The Relationship Between Reasoning and Performance in Large Language Models – o3 (mini) Thinks Harder, Not Longer | Marthe Ballon et.al. | 2502.15631 | translate | read | null |
| 2025-02-21 | Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing | Qi Le et.al. | 2502.15618 | translate | read | null |
| 2025-02-20 | LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang et.al. | 2502.14866 | translate | read | link |
| 2025-02-20 | Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Shuyue Stella Li et.al. | 2502.14860 | translate | read | link |
| 2025-02-20 | FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling | Weilin Zhao et.al. | 2502.14856 | translate | read | null |
| 2025-02-20 | Prompt-to-Leaderboard | Evan Frick et.al. | 2502.14855 | translate | read | null |
| 2025-02-20 | GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Jianwen Luo et.al. | 2502.14848 | translate | read | null |
| 2025-02-20 | Red-Teaming LLM Multi-Agent Systems via Communication Attacks | Pengfei He et.al. | 2502.14847 | translate | read | null |
| 2025-02-20 | Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation | Yue Yang et.al. | 2502.14846 | translate | read | null |
| 2025-02-20 | Revealing and Mitigating Over-Attention in Knowledge Editing | Pinzheng Wang et.al. | 2502.14838 | translate | read | null |
| 2025-02-20 | Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs | Danni Liu et.al. | 2502.14830 | translate | read | null |
| 2025-02-20 | Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison | Aiswarya Baby et.al. | 2502.14827 | translate | read | null |
| 2025-02-19 | Where’s the Bug? Attention Probing for Scalable Fault Localization | Adam Stein et.al. | 2502.13966 | translate | read | null |
| 2025-02-19 | Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Michael Luo et.al. | 2502.13965 | translate | read | null |
| 2025-02-19 | MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads | Weihao Liu et.al. | 2502.13963 | translate | read | null |
| 2025-02-19 | Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering | William Jurayj et.al. | 2502.13962 | translate | read | link |
| 2025-02-19 | LIDDIA: Language-based Intelligent Drug Discovery Agent | Reza Averly et.al. | 2502.13959 | translate | read | null |
| 2025-02-19 | Neurosymbolic artificial intelligence via large language models and coherence-driven inference | Steve Huntsman et.al. | 2502.13953 | translate | read | null |
| 2025-02-19 | Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region | Chak Tou Leong et.al. | 2502.13946 | translate | read | null |
| 2025-02-19 | A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models | Hao Huang et.al. | 2502.13942 | translate | read | null |
| 2025-02-19 | LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization | Guanzheng Chen et.al. | 2502.13922 | translate | read | link |
| 2025-02-19 | Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis | Jiahao Gai et.al. | 2502.13921 | translate | read | null |
| 2025-02-18 | Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization | Shuo Xing et.al. | 2502.13146 | translate | read | link |
| 2025-02-18 | Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation | Bencheng Liao et.al. | 2502.13145 | translate | read | link |
| 2025-02-18 | UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models | Huawei Lin et.al. | 2502.13141 | translate | read | null |
| 2025-02-18 | Towards Quantum Tensor Decomposition in Biomedical Applications | Myson Burch et.al. | 2502.13140 | translate | read | null |
| 2025-02-18 | AIDE: AI-Driven Exploration in the Space of Code | Zhengyao Jiang et.al. | 2502.13138 | translate | read | link |
| 2025-02-18 | Theorem Prover as a Judge for Synthetic Data Generation | Joshua Ong Jun Leang et.al. | 2502.13137 | translate | read | null |
| 2025-02-18 | Learning to Defer for Causal Discovery with Imperfect Experts | Oscar Clivio et.al. | 2502.13132 | translate | read | null |
| 2025-02-18 | Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Jingyang Lin et.al. | 2502.13127 | translate | read | null |
| 2025-02-18 | RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises | Zenan Zhai et.al. | 2502.13125 | translate | read | null |
| 2025-02-18 | Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context | Marion Bartl et.al. | 2502.13120 | translate | read | null |
| 2025-02-17 | Idiosyncrasies in Large Language Models | Mingjie Sun et.al. | 2502.12150 | translate | read | link |
| 2025-02-17 | HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation | Ling Yang et.al. | 2502.12148 | translate | read | link |
| 2025-02-17 | Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control | Jinyan Su et.al. | 2502.12145 | translate | read | null |
| 2025-02-17 | Small Models Struggle to Learn from Strong Reasoners | Yuetai Li et.al. | 2502.12143 | translate | read | link |
| 2025-02-17 | SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs | Yige Xu et.al. | 2502.12134 | translate | read | null |
| 2025-02-17 | Transformer Dynamics: A neuroscientific approach to interpretability of large language models | Jesseba Fernando et.al. | 2502.12131 | translate | read | null |
| 2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | translate | read | link |
| 2025-02-17 | Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA | Patryk Marszałek et.al. | 2502.12122 | translate | read | null |
| 2025-02-17 | LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws | Prasanna Mayilvahanan et.al. | 2502.12120 | translate | read | null |
| 2025-02-17 | PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection | Jinhe Bi et.al. | 2502.12119 | translate | read | null |
| 2025-02-14 | MM-RLHF: The Next Step Forward in Multimodal LLM Alignment | Yi-Fan Zhang et.al. | 2502.10391 | translate | read | null |
| 2025-02-14 | Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction | WonJin Yoon et.al. | 2502.10388 | translate | read | null |
| 2025-02-14 | Enhancing Multilingual LLM Pretraining with Model-Based Data Selection | Bettina Messmer et.al. | 2502.10361 | translate | read | null |
| 2025-02-14 | Organize the Web: Constructing Domains Enhances Pre-Training Data Curation | Alexander Wettig et.al. | 2502.10341 | translate | read | null |
| 2025-02-14 | Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering | Nick Ferguson et.al. | 2502.10338 | translate | read | null |
| 2025-02-14 | LLM-Powered Preference Elicitation in Combinatorial Assignment | Ermis Soumalias et.al. | 2502.10308 | translate | read | null |
| 2025-02-14 | Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2 | Saem Hasan et.al. | 2502.10299 | translate | read | null |
| 2025-02-14 | Are Large Language Models the future crowd workers of Linguistics? | Iris Ferrazzo et.al. | 2502.10266 | translate | read | null |
| 2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | translate | read | link |
| 2025-02-14 | VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models | Gokul Karthik Kumar et.al. | 2502.10250 | translate | read | null |
| 2025-02-13 | Theoretical Benefit and Limitation of Diffusion Language Model | Guhao Feng et.al. | 2502.09622 | translate | read | null |
| 2025-02-13 | MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency | Dongzhi Jiang et.al. | 2502.09621 | translate | read | null |
| 2025-02-13 | Exploring the Potential of Encoder-free Architectures in 3D LMMs | Yiwen Tang et.al. | 2502.09620 | translate | read | link |
| 2025-02-13 | Human-LLM Coevolution: Evidence from Academic Writing | Mingmeng Geng et.al. | 2502.09606 | translate | read | null |
| 2025-02-13 | SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models | Yung-Sung Chuang et.al. | 2502.09604 | translate | read | link |
| 2025-02-13 | GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis | Angelos Zavras et.al. | 2502.09598 | translate | read | link |
| 2025-02-13 | Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Siyan Zhao et.al. | 2502.09597 | translate | read | link |
| 2025-02-13 | KIMAs: A Configurable Knowledge Integrated Multi-Agent System | Zitao Li et.al. | 2502.09596 | translate | read | null |
| 2025-02-13 | Logical forms complement probability in understanding language model (and human) performance | Yixuan Wang et.al. | 2502.09589 | translate | read | null |
| 2025-02-13 | Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks | Qian Wan et.al. | 2502.09577 | translate | read | null |
| 2025-02-12 | Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Andrianos Michail et.al. | 2502.08638 | translate | read | null |
| 2025-02-12 | Ensemble based approach to quantifying uncertainty of LLM based classifications | Srijith Rajamohan et.al. | 2502.08631 | translate | read | null |
| 2025-02-12 | Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks | Ang Li et.al. | 2502.08586 | translate | read | null |
| 2025-02-12 | QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval | Wonduk Seo et.al. | 2502.08557 | translate | read | null |
| 2025-02-12 | Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies | Sunnie S. Y. Kim et.al. | 2502.08554 | translate | read | null |
| 2025-02-12 | LLMs can implicitly learn from mistakes in-context | Lisa Alazraki et.al. | 2502.08550 | translate | read | null |
| 2025-02-12 | LLM Pretraining with Continuous Concepts | Jihoon Tack et.al. | 2502.08524 | translate | read | link |
| 2025-02-12 | The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data | Evgenii Evstafev et.al. | 2502.08515 | translate | read | null |
| 2025-02-12 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | translate | read | null |
| 2025-02-12 | Measuring Diversity in Synthetic Datasets | Yuchang Zhu et.al. | 2502.08512 | translate | read | null |
| 2025-02-11 | DarwinLM: Evolutionary Structured Pruning of Large Language Models | Shengkun Tang et.al. | 2502.07780 | translate | read | link |
| 2025-02-11 | Auditing Prompt Caching in Language Model APIs | Chenchen Gu et.al. | 2502.07776 | translate | read | link |
| 2025-02-11 | Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming | Azizjon Kobilov et.al. | 2502.07772 | translate | read | null |
| 2025-02-11 | Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers | Italo Santos et.al. | 2502.07763 | translate | read | null |
| 2025-02-11 | Scalable Fingerprinting of Large Language Models | Anshul Nasery et.al. | 2502.07760 | translate | read | null |
| 2025-02-11 | Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension | Wenbo Gong et.al. | 2502.07752 | translate | read | null |
| 2025-02-11 | WHODUNIT: Evaluation benchmark for culprit detection in mystery stories | Kshitij Gupta et.al. | 2502.07747 | translate | read | link |
| 2025-02-11 | The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing | Dirk Bergemann et.al. | 2502.07736 | translate | read | null |
| 2025-02-11 | Economics of Sourcing Human Data | Sebastin Santy et.al. | 2502.07732 | translate | read | null |
| 2025-02-11 | Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK | Marcos Cramer et.al. | 2502.07728 | translate | read | null |
| 2025-02-10 | Rationalization Models for Text-to-SQL | Gaetano Rossiello et.al. | 2502.06759 | translate | read | null |
| 2025-02-10 | Gradient Multi-Normalization for Stateless and Scalable LLM Training | Meyer Scetbon et.al. | 2502.06742 | translate | read | null |
| 2025-02-10 | VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data | Thomas Zeng et.al. | 2502.06737 | translate | read | null |
| 2025-02-10 | Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining | Daouda Sow et.al. | 2502.06733 | translate | read | null |
| 2025-02-10 | Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling | Runze Liu et.al. | 2502.06703 | translate | read | link |
| 2025-02-10 | Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations | Rui Chen et.al. | 2502.06669 | translate | read | null |
| 2025-02-10 | Automatic Evaluation of Healthcare LLMs Beyond Question-Answering | Anna Arias-Duart et.al. | 2502.06666 | translate | read | null |
| 2025-02-10 | On the Limitations of Combining Sentiment Analysis Tools in a Cross-Platform Setting | Martin Obaidi et.al. | 2502.06665 | translate | read | null |
| 2025-02-10 | EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models | Xingrun Xing et.al. | 2502.06663 | translate | read | link |
| 2025-02-10 | Unbiased Evaluation of Large Language Models from a Causal Perspective | Meilin Chen et.al. | 2502.06655 | translate | read | null |
| 2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Yunhang Shen et.al. | 2502.05177 | translate | read | link |
| 2025-02-07 | NoLiMa: Long-Context Evaluation Beyond Literal Matching | Ali Modarressi et.al. | 2502.05167 | translate | read | link |
| 2025-02-07 | DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails | Yihe Deng et.al. | 2502.05163 | translate | read | link |
| 2025-02-07 | A Lightweight Method to Disrupt Memorized Sequences in LLM | Parjanya Prajakta Prashant et.al. | 2502.05159 | translate | read | null |
| 2025-02-07 | Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment | Minh-Quan Le et.al. | 2502.05153 | translate | read | null |
| 2025-02-07 | Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation | Steffen Eger et.al. | 2502.05151 | translate | read | link |
| 2025-02-07 | CodeSCM: Causal Analysis for Multi-Modal Code Generation | Mukur Gupta et.al. | 2502.05150 | translate | read | null |
| 2025-02-07 | An Annotated Reading of ‘The Singer of Tales’ in the LLM Era | Kush R. Varshney et.al. | 2502.05148 | translate | read | null |
| 2025-02-07 | Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning | Matt von Hippel et.al. | 2502.05121 | translate | read | null |
| 2025-02-07 | Flexible and Efficient Grammar-Constrained Decoding | Kanghee Park et.al. | 2502.05111 | translate | read | null |
| 2025-02-06 | Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment | Zuyan Liu et.al. | 2502.04328 | translate | read | null |
| 2025-02-06 | Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions | Yik Siu Chan et.al. | 2502.04322 | translate | read | link |
| 2025-02-06 | ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Kamer Ali Yuksel et.al. | 2502.04315 | translate | read | null |
| 2025-02-06 | ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Yinjie Wang et.al. | 2502.04306 | translate | read | link |
| 2025-02-06 | Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization | Yuanye Liu et.al. | 2502.04295 | translate | read | link |
| 2025-02-06 | PILAF: Optimal Human Preference Sampling for Reward Modeling | Yunzhen Feng et.al. | 2502.04270 | translate | read | null |
| 2025-02-06 | How does a Multilingual LM Handle Multiple Languages? | Santhosh Kakarla et.al. | 2502.04269 | translate | read | null |
| 2025-02-06 | Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Marco Mistretta et.al. | 2502.04263 | translate | read | link |
| 2025-02-06 | TriNER: A Series of Named Entity Recognition Models For Hindi, Bengali & Marathi | Mohammed Amaan Dhamaskar et.al. | 2502.04245 | translate | read | null |
| 2025-02-06 | MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion | Xintong Hao et.al. | 2502.04235 | translate | read | null |
| 2025-02-05 | Do Large Language Model Benchmarks Test Reliability? | Joshua Vendrow et.al. | 2502.03461 | translate | read | null |
| 2025-02-05 | Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Boyao Wang et.al. | 2502.03460 | translate | read | null |
| 2025-02-05 | A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Yiye Chen et.al. | 2502.03450 | translate | read | null |
| 2025-02-05 | BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving | Ran Xin et.al. | 2502.03438 | translate | read | null |
| 2025-02-05 | On Fairness of Unified Multimodal Large Language Model for Image Generation | Ming Liu et.al. | 2502.03429 | translate | read | null |
| 2025-02-05 | Harnessing Large Language Models for Curated Code Reviews | Oussama Ben Sghaier et.al. | 2502.03425 | translate | read | null |
| 2025-02-05 | Investigating Corporate Social Responsibility Initiatives: Examining the case of corporate Covid-19 response | Meheli Basu et.al. | 2502.03421 | translate | read | null |
| 2025-02-05 | Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts | Nikta Gohari Sadr et.al. | 2502.03418 | translate | read | null |
| 2025-02-05 | SPRI: Aligning Large Language Models with Context-Situated Principles | Hongli Zhan et.al. | 2502.03397 | translate | read | null |
| 2025-02-05 | LIMO: Less is More for Reasoning | Yixin Ye et.al. | 2502.03387 | translate | read | null |
| 2025-02-04 | COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation | Xueqing Deng et.al. | 2502.02589 | translate | read | null |
| 2025-02-04 | A comparison of translation performance between DeepL and Supertext | Alex Flückiger et.al. | 2502.02577 | translate | read | null |
| 2025-02-04 | Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement | Soheil Abbasloo et.al. | 2502.02573 | translate | read | null |
| 2025-02-04 | Learning the RoPEs: Better 2D and 3D Position Encodings with STRING | Connor Schenck et.al. | 2502.02562 | translate | read | null |
| 2025-02-04 | LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World | Shrikara Arun et.al. | 2502.02539 | translate | read | null |
| 2025-02-04 | Adaptive Self-improvement LLM Agentic System for ML Library Development | Genghan Zhang et.al. | 2502.02534 | translate | read | null |
| 2025-02-04 | Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies | Han Zhou et.al. | 2502.02533 | translate | read | null |
| 2025-02-04 | Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search | Maohao Shen et.al. | 2502.02508 | translate | read | null |
| 2025-02-04 | EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization | Yize Wu et.al. | 2502.02493 | translate | read | null |
| 2025-02-04 | Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study | Menglong Cui et.al. | 2502.02481 | translate | read | null |
| 2025-02-03 | SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Dominik Wagner et.al. | 2501.19377 | translate | read | null |
