LLM - 2025-01

Publish Date Title Authors PDF Translate Read Code
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 translate read link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 translate read link
2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 translate read null
2025-01-31 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Wenzhi Fang et.al. 2501.19389 translate read null
2025-01-31 We’re Different, We’re the Same: Creative Homogeneity Across LLMs Emily Wenger et.al. 2501.19361 translate read null
2025-01-31 Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies Brandon P. Chelstrom et.al. 2501.19359 translate read null
2025-01-31 The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking Yuchun Miao et.al. 2501.19358 translate read null
2025-01-31 Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 Ting-Yao E. Hsu et.al. 2501.19353 translate read null
2025-01-31 Towards Adaptive Self-Improvement for Smarter Energy Systems Alexander Sommer et.al. 2501.19340 translate read null
2025-01-30 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Yue Wang et.al. 2501.18585 translate read null
2025-01-30 Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH Evgenii Evstafev et.al. 2501.18576 translate read null
2025-01-30 BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos Lehao Lin et.al. 2501.18565 translate read null
2025-01-30 Semantic Web and Creative AI – A Technical Report from ISWS 2023 Raia Abu Ahmad et.al. 2501.18542 translate read null
2025-01-30 Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges Manveer Singh Tamber et.al. 2501.18536 translate read link
2025-01-30 Differentially Private Steering for Large Language Model Alignment Anmol Goel et.al. 2501.18532 translate read link
2025-01-30 Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models Guanqun Cao et.al. 2501.18516 translate read null
2025-01-30 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Arthur Douillard et.al. 2501.18512 translate read null
2025-01-30 CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction Peter J. Bentley et.al. 2501.18504 translate read null
2025-01-30 A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models Changshu Liu et.al. 2501.18482 translate read null
2025-01-29 Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? Pouya Pezeshkpour et.al. 2501.17840 translate read link
2025-01-29 Leveraging Multimodal LLM for Inspirational User Interface Search Seokhyeon Park et.al. 2501.17799 translate read link
2025-01-29 BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights Chan-Jan Hsu et.al. 2501.17790 translate read null
2025-01-29 AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing Peter Pak et.al. 2501.17784 translate read null
2025-01-29 2SSP: A Two-Stage Framework for Structured Pruning of LLMs Fabrizio Sandri et.al. 2501.17771 translate read null
2025-01-29 Hybrid Graphs for Table-and-Text based Question Answering using LLMs Ankush Agarwal et.al. 2501.17767 translate read null
2025-01-29 On the Partitioning of GPU Power among Multi-Instances Tirth Vamja et.al. 2501.17752 translate read null
2025-01-29 Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation Aitor Arrieta et.al. 2501.17749 translate read null
2025-01-29 Using Code Generation to Solve Open Instances of Combinatorial Design Problems Christopher D. Rosin et.al. 2501.17725 translate read link
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 translate read link
2025-01-28 Cultural Differences and Perverse Incentives in Science Create a Bad Mix: Exploring Country-Level Publication Bias in Select ACM Conferences Aksheytha Chelikavada et.al. 2501.17150 translate read null
2025-01-28 FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Deren Lei et.al. 2501.17144 translate read link
2025-01-28 ASTRAL: Automated Safety Testing of Large Language Models Miriam Ugarte et.al. 2501.17132 translate read null
2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization Ruizhe Wang et.al. 2501.17116 translate read null
2025-01-28 Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction Carl-Leander Henneking et.al. 2501.17112 translate read null
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 translate read null
2025-01-28 Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models Minghan Li et.al. 2501.17039 translate read null
2025-01-28 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Manojkumar Parmar et.al. 2501.17030 translate read null
2025-01-28 Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs Alessandro Midolo et.al. 2501.17024 translate read null
2025-01-28 Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement Kei Katsumata et.al. 2501.17022 translate read null
2025-01-27 Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology Meiyun Cao et.al. 2501.16309 translate read null
2025-01-27 RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval Long Nguyen et.al. 2501.16303 translate read null
2025-01-27 Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width Zheng Liu et.al. 2501.16302 translate read null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 translate read null
2025-01-27 FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Renshan Zhang et.al. 2501.16297 translate read null
2025-01-27 Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models Jing Zhang et.al. 2501.16282 translate read null
2025-01-27 Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation Jiayi Hong et.al. 2501.16277 translate read null
2025-01-27 URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT Long Nguyen et.al. 2501.16276 translate read null
2025-01-27 A foundation model for human-AI collaboration in medical literature mining Zifeng Wang et.al. 2501.16255 translate read null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 translate read null
2025-01-24 HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Xin Zhou et.al. 2501.14729 translate read link
2025-01-24 Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? Ipek Baris Schlicht et.al. 2501.14719 translate read null
2025-01-24 Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models Naihao Deng et.al. 2501.14717 translate read null
2025-01-24 FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing James Seale Smith et.al. 2501.14713 translate read null
2025-01-24 The Karp Dataset Mason DiCicco et.al. 2501.14705 translate read null
2025-01-24 Rethinking Table Instruction Tuning Naihao Deng et.al. 2501.14693 translate read null
2025-01-24 An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations Shabnam Hassani et.al. 2501.14683 translate read null
2025-01-24 Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning Jisi Zhang et.al. 2501.14680 translate read null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 translate read link
2025-01-24 Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion Ziyao Xu et.al. 2501.14649 translate read link
2025-01-23 CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation Guofeng Cui et.al. 2501.13927 translate read null
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 translate read null
2025-01-23 Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models Linh Tran et.al. 2501.13904 translate read null
2025-01-23 Exploring Finetuned Audio-LLM on Heart Murmur Features Adrian Florea et.al. 2501.13884 translate read null
2025-01-23 The machine learning platform for developers of large systems Alexey Naikov et.al. 2501.13881 translate read null
2025-01-23 A RAG-Based Institutional Assistant Gustavo Kuratomi et.al. 2501.13880 translate read null
2025-01-23 Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Shiling Deng et.al. 2501.13851 translate read link
2025-01-23 On the Reasoning Capacity of AI Models and How to Quantify It Santosh Kumar Radha et.al. 2501.13833 translate read null
2025-01-23 Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Hao Zhang et.al. 2501.13831 translate read null
2025-01-23 Hallucinations Can Improve Large Language Models in Drug Discovery Shuzhou Yuan et.al. 2501.13824 translate read null
2025-01-22 A Rate-Distortion Framework for Summarization Enes Arda et.al. 2501.13100 translate read null
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 translate read null
2025-01-22 Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Bohao Yang et.al. 2501.13042 translate read link
2025-01-22 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Yantao Liu et.al. 2501.13007 translate read link
2025-01-22 Large Language Model-Based Semantic Communication System for Image Transmission Soheyb Ribouh et.al. 2501.12988 translate read null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 translate read null
2025-01-22 OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun et.al. 2501.12975 translate read link
2025-01-22 Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs Jan Corazza et.al. 2501.12972 translate read null
2025-01-22 It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act Kristof Meding et.al. 2501.12962 translate read null
2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 translate read null
2025-01-21 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 translate read link
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 translate read link
2025-01-21 Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 translate read null
2025-01-21 Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration Thomas Walshe et.al. 2501.12332 translate read null
2025-01-21 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Xianwei Zhuang et.al. 2501.12327 translate read link
2025-01-21 LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations Hasan Abu-Rasheed et.al. 2501.12300 translate read null
2025-01-21 MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks Qishen Zhou et.al. 2501.12281 translate read link
2025-01-21 Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Maosong Cao et.al. 2501.12273 translate read null
2025-01-21 FOCUS: First Order Concentrated Updating Scheme Yizhou Liu et.al. 2501.12243 translate read null
2025-01-21 InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models Pha Nguyen et.al. 2501.12231 translate read null
2025-01-17 FaceXBench: Evaluating Multimodal LLMs on Face Understanding Kartik Narayan et.al. 2501.10360 translate read link
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 translate read null
2025-01-17 Large language models for automated scholarly paper review: A survey Zhenzhen Zhuang et.al. 2501.10326 translate read null
2025-01-17 HiMix: Reducing Computational Complexity in Large Vision-Language Models Xuange Zhang et.al. 2501.10318 translate read null
2025-01-17 Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling Suvodip Dey et.al. 2501.10316 translate read link
2025-01-17 Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs Claudio Di Sipio et.al. 2501.10313 translate read null
2025-01-17 Computational Protein Science in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2501.10282 translate read null
2025-01-17 Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation Azat Abdullin et.al. 2501.10200 translate read null
2025-01-17 Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education William Hersh et.al. 2501.10186 translate read null
2025-01-17 Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval Vera Pavlova et.al. 2501.10175 translate read null
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757 translate read null
2025-01-16 Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Youngjoon Jang et.al. 2501.09754 translate read null
2025-01-16 OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi et.al. 2501.09751 translate read null
2025-01-16 Enhancing Lexicon-Based Text Embeddings with Large Language Models Yibin Lei et.al. 2501.09749 translate read null
2025-01-16 Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models Bihui Jin et.al. 2501.09745 translate read null
2025-01-16 KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports Hajung Kim et.al. 2501.09744 translate read null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 translate read null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 translate read link
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 translate read null
2025-01-16 Domain Adaptation of Foundation LLMs for e-Commerce Christian Herold et.al. 2501.09706 translate read null
2025-01-15 Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails Shaona Ghosh et.al. 2501.09004 translate read null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 translate read null
2025-01-15 Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models Emma Croxford et.al. 2501.08977 translate read null
2025-01-15 Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models Karukriti Kaushik Ghosh et.al. 2501.08974 translate read null
2025-01-15 Analyzing the Ethical Logic of Six Large Language Models W. Russell Neuman et.al. 2501.08951 translate read null
2025-01-15 Applying General Turn-taking Models to Conversational Human-Robot Interaction Gabriel Skantze et.al. 2501.08946 translate read null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 translate read null
2025-01-15 GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Liam Dugan et.al. 2501.08913 translate read null
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 translate read null
2025-01-15 XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Sida Tian et.al. 2501.08809 translate read null
2025-01-14 PokerBench: Training Large Language Models to become Professional Poker Players Richard Zhuang et.al. 2501.08328 translate read link
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 translate read null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 translate read null
2025-01-14 Exploring Robustness of Multilingual LLMs on Real-World Noisy Data Amirhossein Aliakbarzadeh et.al. 2501.08322 translate read link
2025-01-14 Enhancing Automated Interpretability with Output-Centric Feature Descriptions Yoav Gur-Arieh et.al. 2501.08319 translate read link
2025-01-14 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Abhilasha Ravichander et.al. 2501.08292 translate read null
2025-01-14 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Hongyu Li et.al. 2501.08282 translate read link
2025-01-14 Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing Pulkit Arora et.al. 2501.08276 translate read null
2025-01-14 TriMod Fusion for Multimodal Named Entity Recognition in Social Media Mosab Alfaqeeh et.al. 2501.08267 translate read null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 translate read null
2025-01-13 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Chengzu Li et.al. 2501.07542 translate read null
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 translate read null
2025-01-13 Investigating Large Language Models in Inferring Personality Traits from User Conversations Jianfeng Zhu et.al. 2501.07532 translate read null
2025-01-13 RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment Difei Gu et.al. 2501.07525 translate read link
2025-01-13 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 translate read null
2025-01-13 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards Yangsibo Huang et.al. 2501.07493 translate read null
2025-01-13 TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models Thales Sales Almeida et.al. 2501.07482 translate read null
2025-01-13 A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities Yihao Liu et.al. 2501.07468 translate read null
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI Rolf Pfister et.al. 2501.07458 translate read null
2025-01-13 Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection Xin Yin et.al. 2501.07425 translate read null
2025-01-10 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Omkar Thawakar et.al. 2501.06186 translate read link
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 translate read null
2025-01-10 Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories Gerd Kortemeyer et.al. 2501.06143 translate read null
2025-01-10 Supervision policies can shape long-term risk management in general-purpose AI models Manuel Cebrian et.al. 2501.06137 translate read link
2025-01-10 Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Yuya Asano et.al. 2501.06129 translate read null
2025-01-10 Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Fabian David Schmidt et.al. 2501.06117 translate read link
2025-01-10 From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy Elham Aghakhani et.al. 2501.06101 translate read null
2025-01-10 How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters Romina Oji et.al. 2501.06025 translate read link
2025-01-10 Addressing speaker gender bias in large scale speech translation systems Shubham Bansal et.al. 2501.05989 translate read null
2025-01-10 Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys Divya Mani Adhikari et.al. 2501.05985 translate read null
2025-01-09 ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Xingyu Fu et.al. 2501.05452 translate read link
2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Yunzhuo Hao et.al. 2501.05444 translate read null
2025-01-09 A survey of textual cyber abuse detection using cutting-edge language models and large language models Jose A. Diaz-Garcia et.al. 2501.05443 translate read null
2025-01-09 Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers Jerry Chongyi Hu et.al. 2501.05423 translate read null
2025-01-09 FairCode: Evaluating Social Bias of LLMs in Code Generation Yongkang Du et.al. 2501.05396 translate read link
2025-01-09 Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models Kristian G. Barman et.al. 2501.05382 translate read null
2025-01-09 Accelerated Diffusion Models via Speculative Sampling Valentin De Bortoli et.al. 2501.05370 translate read null
2025-01-09 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou et.al. 2501.05336 translate read link
2025-01-09 “What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles Xuewen Luo et.al. 2501.05322 translate read null
2025-01-09 CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models Yewei Song et.al. 2501.05255 translate read null
2025-01-08 Re-ranking the Context for Multimodal Retrieval Augmented Generation Matin Mortaheb et.al. 2501.04695 translate read null
2025-01-08 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Ruilin Luo et.al. 2501.04686 translate read link
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 translate read null
2025-01-08 Assessing Language Comprehension in Large Language Models Using Construction Grammar Wesley Scivetti et.al. 2501.04661 translate read null
2025-01-08 Multi-task retriever fine-tuning for domain-specific and efficient RAG Patrice Béchard et.al. 2501.04652 translate read null
2025-01-08 FlairGPT: Repurposing LLMs for Interior Designs Gabrielle Littlefair et.al. 2501.04648 translate read null
2025-01-08 Knowledge Retrieval Based on Generative AI Te-Lun Yang et.al. 2501.04635 translate read null
2025-01-08 “Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era Giulio Antonio Abbo et.al. 2501.04633 translate read null
2025-01-08 Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning Ivan Kankeu et.al. 2501.04591 translate read null
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 translate read link
2025-01-07 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Haobo Yuan et.al. 2501.04001 translate read link
2025-01-07 RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance Matin Mortaheb et.al. 2501.03995 translate read null
2025-01-07 Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles Yuxi Xia et.al. 2501.03991 translate read null
2025-01-07 (De)-Indexing and the Right to be Forgotten Salvatore Vilella et.al. 2501.03989 translate read null
2025-01-07 VLM-driven Behavior Tree for Context-aware Task Planning Naoki Wake et.al. 2501.03968 translate read null
2025-01-07 Vision Language Models as Values Detectors Giulio Antonio Abbo et.al. 2501.03957 translate read null
2025-01-07 Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States Jurgita Kapočiūtė-Dzikienė et.al. 2501.03952 translate read null
2025-01-07 Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection Pablo Miralles-González et.al. 2501.03940 translate read null
2025-01-07 Visual question answering: from early developments to recent advances – a survey Ngoc Dung Huynh et.al. 2501.03939 translate read null
2025-01-07 Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study Ramya Jonnala et.al. 2501.03904 translate read null
2025-01-06 BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Beichen Zhang et.al. 2501.03226 translate read link
2025-01-06 Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation Yuhui Zhang et.al. 2501.03225 translate read link
2025-01-06 Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text Ayat Najjar et.al. 2501.03212 translate read null
2025-01-06 Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity Ayat A. Najjar et.al. 2501.03203 translate read null
2025-01-06 CLIX: Cross-Lingual Explanations of Idiomatic Expressions Aaron Gluck et.al. 2501.03191 translate read null
2025-01-06 GLiREL – Generalist Model for Zero-Shot Relation Extraction Jack Boylan et.al. 2501.03172 translate read null
2025-01-06 Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text Ali Al-Lawati et.al. 2501.03166 translate read link
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 translate read null
2025-01-06 VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity Yerong Li et.al. 2501.03139 translate read null
2025-01-06 PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Mingyang Song et.al. 2501.03124 translate read link
2025-01-03 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Chaoyou Fu et.al. 2501.01957 translate read link
2025-01-03 Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap Weizhi Zhang et.al. 2501.01945 translate read null
2025-01-03 Abstractive Text Summarization for Contemporary Sanskrit Prose: Issues and Challenges Shagun Sinha et.al. 2501.01933 translate read null
2025-01-03 Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding Jiaming Li et.al. 2501.01926 translate read null
2025-01-03 Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Yifan Du et.al. 2501.01904 translate read link
2025-01-03 Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions Rachneet Sachdeva et.al. 2501.01872 translate read link
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 translate read null
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 translate read null
2025-01-03 Time Series Language Model for Descriptive Caption Generation Mohamed Trabelsi et.al. 2501.01832 translate read null
2025-01-03 Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models Yanjiang Liu et.al. 2501.01830 translate read null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 translate read link
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 translate read null
2025-01-02 Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers Seunghyun Lee et.al. 2501.01414 translate read null
2025-01-02 OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios Xize Cheng et.al. 2501.01384 translate read null
2025-01-02 CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering Ben Vardi et.al. 2501.01371 translate read null
2025-01-02 Embedding-based Approaches to Hyperpartisan News Detection Karthik Mohan et.al. 2501.01370 translate read null
2025-01-02 Aligning Large Language Models for Faithful Integrity Against Opposing Argument Yong Zhao et.al. 2501.01336 translate read null
2025-01-02 CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models Johan Wahréus et.al. 2501.01335 translate read link
2025-01-02 Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension Yanbo Fang et.al. 2501.01332 translate read null
2025-01-02 The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation Shuzheng Gao et.al. 2501.01329 translate read null

(<a href=../LLM.md>back to LLM</a>)