LLM - 2024-09

Publish Date Title Authors PDF Translate Read Code
2024-09-30 MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Haotian Zhang et.al. 2409.20566 translate read null
2024-09-30 Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos Md Mohaiminul Islam et.al. 2409.20557 translate read null
2024-09-30 LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation Ziyao Zhang et.al. 2409.20550 translate read null
2024-09-30 Robi Butler: Remote Multimodal Interactions with Household Robot Assistant Anxing Xiao et.al. 2409.20548 translate read null
2024-09-30 Uncertainty-Informed Screening for Safer Solvents Used in the Synthesis of Perovskite via Language Models Arpan Mukherjee et.al. 2409.20512 translate read null
2024-09-30 COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models Divyanshu Daiya et.al. 2409.20502 translate read null
2024-09-30 HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding Fan Yuan et.al. 2409.20429 translate read null
2024-09-30 World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering Jiacong Wang et.al. 2409.20424 translate read null
2024-09-27 LML: Language Model Learning a Dataset for Data-Augmented Prediction Praneeth Vadlapati et.al. 2409.18957 translate read link
2024-09-27 Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Jiaming Li et.al. 2409.18943 translate read link
2024-09-27 From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding Heqing Zou et.al. 2409.18938 translate read link
2024-09-27 AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow Huizi Yu et.al. 2409.18924 translate read null
2024-09-27 Soft Measures for Extracting Causal Collective Intelligence Maryam Berijanian et.al. 2409.18911 translate read link
2024-09-27 Multi-Source Hard and Soft Information Fusion Approach for Accurate Cryptocurrency Price Movement Prediction Saeed Mohammadi Dashtaki et.al. 2409.18895 translate read null
2024-09-27 HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models Yu Zhou et.al. 2409.18893 translate read null
2024-09-27 IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation Fan Lin et.al. 2409.18892 translate read null
2024-09-27 Predicting and analyzing memorization within fine-tuned Large Language Models Jérémie Dentan et.al. 2409.18858 translate read null
2024-09-27 Mitigating Selection Bias with Node Pruning and Auxiliary Options Hyeong Kyu Choi et.al. 2409.18857 translate read null
2024-09-26 EgoLM: Multi-Modal Language Model of Egocentric Motions Fangzhou Hong et.al. 2409.18127 translate read null
2024-09-26 Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography Yuexi Du et.al. 2409.18119 translate read link
2024-09-26 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Ye Liu et.al. 2409.18111 translate read link
2024-09-26 Infering Alt-text For UI Icons With Large Language Models During App Development Sabrina Haque et.al. 2409.18060 translate read null
2024-09-26 DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving Dingrui Wang et.al. 2409.18053 translate read null
2024-09-26 IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning Soeun Lee et.al. 2409.18046 translate read null
2024-09-26 Unveiling the Role of Pretraining in Direct Speech Translation Belen Alastruey et.al. 2409.18044 translate read null
2024-09-26 EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Kai Chen et.al. 2409.18042 translate read link
2024-09-26 Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective Yotam Wolf et.al. 2409.18028 translate read null
2024-09-26 An Adversarial Perspective on Machine Unlearning for AI Safety Jakub Łucki et.al. 2409.18025 translate read null
2024-09-25 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Matt Deitke et.al. 2409.17146 translate read link
2024-09-25 Attention Prompting on Image for Large Vision-Language Models Runpeng Yu et.al. 2409.17143 translate read link
2024-09-25 FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression Fazal Mittu et.al. 2409.17141 translate read link
2024-09-25 Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents Junting Lu et.al. 2409.17140 translate read null
2024-09-25 Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Fan Zhou et.al. 2409.17115 translate read link
2024-09-25 Accumulator-Aware Post-Training Quantization Ian Colbert et.al. 2409.17092 translate read null
2024-09-25 VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Yifei Liu et.al. 2409.17066 translate read link
2024-09-25 Using LLM for Real-Time Transcription and Summarization of Doctor-Patient Interactions into ePuskesmas in Indonesia Azmul Asmar Irfan et.al. 2409.17054 translate read null
2024-09-25 How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Francesco Verdini et.al. 2409.17044 translate read null
2024-09-25 Counterfactual Token Generation in Large Language Models Ivi Chatzi et.al. 2409.17027 translate read link
2024-09-24 MonoFormer: One Transformer for Both Diffusion and Autoregression Chuyang Zhao et.al. 2409.16280 translate read link
2024-09-24 A fast and sound tagging method for discontinuous named-entity recognition Caio Corro et.al. 2409.16243 translate read null
2024-09-24 LLM Echo Chamber: personalized and automated disinformation Tony Ma et.al. 2409.16241 translate read link
2024-09-24 Towards Enhancing Linked Data Retrieval in Conversational UIs using Large Language Models Omar Mussa et.al. 2409.16220 translate read null
2024-09-24 LLMCount: Enhancing Stationary mmWave Detection with Multimodal-LLM Boyan Li et.al. 2409.16209 translate read null
2024-09-25 CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data Qian-Wen Zhang et.al. 2409.16202 translate read link
2024-09-24 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models Haoran Que et.al. 2409.16191 translate read link
2024-09-24 Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation Xiaohong Liu et.al. 2409.16183 translate read null
2024-09-24 Cyber Knowledge Completion Using Large Language Models Braden K Webb et.al. 2409.16176 translate read null
2024-09-24 Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering Ziyu Zhao et.al. 2409.16167 translate read null
2024-09-20 Gender Representation and Bias in Indian Civil Service Mock Interviews Somonnoy Banerjee et.al. 2409.12194 translate read null
2024-09-18 To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Zayne Sprague et.al. 2409.12183 translate read link
2024-09-18 Finetuning Language Models to Emit Linguistic Expressions of Uncertainty Arslan Chaudhry et.al. 2409.12180 translate read null
2024-09-18 Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference Najmeh Forouzandehmehr et.al. 2409.12150 translate read null
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 translate read link
2024-09-18 Experimental Evidence That Conversational Artificial Intelligence Can Steer Consumer Behavior Without Detection Tobias Werner et.al. 2409.12143 translate read null
2024-09-18 MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion Kalakonda Sai Shashank et.al. 2409.12140 translate read link
2024-09-24 Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models Sijing Chen et.al. 2409.12139 translate read null
2024-09-18 Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement An Yang et.al. 2409.12122 translate read null
2024-09-18 Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference Edresson Casanova et.al. 2409.12117 translate read null
2024-09-17 AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs Basel Mousi et.al. 2409.11404 translate read null
2024-09-17 NVLM: Open Frontier-Class Multimodal LLMs Wenliang Dai et.al. 2409.11402 translate read null
2024-09-17 Says Who? Effective Zero-Shot Annotation of Focalization Rebecca M. M. Hicke et.al. 2409.11390 translate read null
2024-09-17 Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement Simon Yu et.al. 2409.11378 translate read link
2024-09-17 Towards Time Series Reasoning with LLMs Winnie Chow et.al. 2409.11376 translate read null
2024-09-17 Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification Fatema-E- Jannat et.al. 2409.11375 translate read null
2024-09-17 CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration Jiahui Gao et.al. 2409.11365 translate read null
2024-09-17 AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances Dhruv Agarwal et.al. 2409.11360 translate read null
2024-09-17 THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models Mengfei Liang et.al. 2409.11353 translate read null
2024-09-18 Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling Xinyue Fang et.al. 2409.11283 translate read null
2024-09-16 RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval Di Liu et.al. 2409.10516 translate read null
2024-09-16 Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models Momoko Shiraishi et.al. 2409.10506 translate read null
2024-09-16 DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction John Wu et.al. 2409.10504 translate read null
2024-09-16 Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles Kulin Shah et.al. 2409.10502 translate read link
2024-09-16 Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models Shaznin Sultana et.al. 2409.10490 translate read null
2024-09-16 XLM for Autonomous Driving Systems: A Comprehensive Review Sonda Fourati et.al. 2409.10484 translate read null
2024-09-16 Schrodinger’s Memory: Large Language Models Wei Wang et.al. 2409.10482 translate read null
2024-09-16 LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning Jicong Ao et.al. 2409.10444 translate read link
2024-09-16 A Large-Scale Privacy Assessment of Android Third-Party SDKs Mark Huasong Meng et.al. 2409.10411 translate read null
2024-09-17 Learnings from a Large-Scale Deployment of an LLM-Powered Expert-in-the-Loop Healthcare Chatbot Bhuvan Sachdeva et.al. 2409.10354 translate read null
2024-09-13 Agents in Software Engineering: Survey, Landscape, and Vision Yanxian Huang et.al. 2409.09030 translate read link
2024-09-13 Contri(e)ve: Context + Retrieve for Scholarly Question Answering Kanchan Shivashankar et.al. 2409.09010 translate read null
2024-09-13 Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance Lucio La Cava et.al. 2409.08963 translate read null
2024-09-13 Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions Zahra Ashktorab et.al. 2409.08937 translate read null
2024-09-13 SynSUM – Synthetic Benchmark with Structured and Unstructured Medical Records Paloma Rabaey et.al. 2409.08936 translate read link
2024-09-13 LLM-based Weak Supervision Framework for Query Intent Classification in Video Search Farnoosh Javadi et.al. 2409.08931 translate read null
2024-09-13 AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models Yifei Yao et.al. 2409.08904 translate read null
2024-09-13 A Market for Lemons? Strategic Directions for a Vigilant Application of Artificial Intelligence in Entrepreneurship Research Martin Obschonka et.al. 2409.08890 translate read null
2024-09-13 Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies Zhiqiang Zhong et.al. 2409.08864 translate read null
2024-09-13 FP-VEC: Fingerprinting Large Language Models via Efficient Vector Addition Zhenhua Xu et.al. 2409.08846 translate read null
2024-09-12 DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors Thomas Hanwen Zhu et.al. 2409.08278 translate read null
2024-09-12 Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti et.al. 2409.08264 translate read link
2024-09-12 OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Jiahao Nick Li et.al. 2409.08250 translate read null
2024-09-12 Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources Alisia Lupidi et.al. 2409.08239 translate read null
2024-09-12 LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems Hakan T. Otal et.al. 2409.08234 translate read link
2024-09-12 What Makes a Maze Look Like a Maze? Joy Hsu et.al. 2409.08202 translate read null
2024-09-12 Fine-tuning Large Language Models for Entity Matching Aaron Steiner et.al. 2409.08185 translate read link
2024-09-12 Faster Speech-LLaMA Inference with Multi-token Prediction Desh Raj et.al. 2409.08148 translate read null
2024-09-12 LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models Zhengliang Liu et.al. 2409.08147 translate read null
2024-09-12 WhisperNER: Unified Open Named Entity and Speech Recognition Gil Ayache et.al. 2409.08107 translate read null
2024-09-11 “My Grade is Wrong!”: A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays Shengxin Hong et.al. 2409.07453 translate read null
2024-09-11 SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories Ben Bogin et.al. 2409.07440 translate read link
2024-09-11 CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification Zeqing Qin et.al. 2409.07407 translate read null
2024-09-11 AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Han Wang et.al. 2409.07394 translate read link
2024-09-11 Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective Guimin Hu et.al. 2409.07388 translate read null
2024-09-11 Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code Khiem Ton et.al. 2409.07368 translate read null
2024-09-11 Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation SeongYeub Chu et.al. 2409.07355 translate read link
2024-09-11 Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks Md Zarif Hossain et.al. 2409.07353 translate read link
2024-09-11 Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Weixi Weng et.al. 2409.07331 translate read null
2024-09-11 MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Praveen K Kanithi et.al. 2409.07314 translate read null
2024-09-10 E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning Zihan Liao et.al. 2409.06679 translate read link
2024-09-10 LLaMA-Omni: Seamless Speech Interaction with Large Language Models Qingkai Fang et.al. 2409.06666 translate read link
2024-09-10 Human Perception of LLM-generated Text Content in Social Media Environments Kristina Radivojevic et.al. 2409.06653 translate read null
2024-09-10 Optimal Workload Placement on Multi-Instance GPUs Bekir Turkkan et.al. 2409.06646 translate read null
2024-09-10 EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis Danli Shi et.al. 2409.06644 translate read null
2024-09-10 MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders Wenyu Zhang et.al. 2409.06635 translate read null
2024-09-10 A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio Ningyuan Xi et.al. 2409.06624 translate read null
2024-09-10 Alleviating Hallucinations in Large Language Models with Scepticism Modeling Yetao Wu et.al. 2409.06601 translate read null
2024-09-10 GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Sacha Muller et.al. 2409.06595 translate read link
2024-09-10 MAPS: Energy-Reliability Tradeoff Management in Autonomous Vehicles Through LLMs Penetrated Science Mahdieh Aliazam et.al. 2409.06558 translate read null
2024-09-09 MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Run Luo et.al. 2409.05840 translate read null
2024-09-09 Are Large Language Models a Threat to Programming Platforms? An Exploratory Study Md Mustakim Billah et.al. 2409.05824 translate read null
2024-09-09 Benchmarking Chinese Knowledge Rectification in Large Language Models Tianhe Lu et.al. 2409.05806 translate read link
2024-09-09 Breaking Neural Network Scaling Laws with Modularity Akhilan Boopathy et.al. 2409.05780 translate read null
2024-09-09 Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models Emily Cheng et.al. 2409.05771 translate read null
2024-09-09 Model Input Verification of Large Scale Simulations Rumyana Neykova et.al. 2409.05768 translate read null
2024-09-09 A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System B. Sankar et.al. 2409.05747 translate read null
2024-09-09 LLMs Will Always Hallucinate, and We Need to Live With This Sourav Banerjee et.al. 2409.05746 translate read null
2024-09-09 A System and Benchmark for LLM-based Q\&A on Heterogeneous Data Achille Fokoue et.al. 2409.05735 translate read null
2024-09-09 Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach Meng Zhou et.al. 2409.05732 translate read link
2024-09-06 RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs Jiaxing Wu et.al. 2409.04421 translate read null
2024-09-06 Question-Answering Dense Video Events Hangyu Qin et.al. 2409.04388 translate read link
2024-09-06 Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs Aliakbar Nafar et.al. 2409.04318 translate read null
2024-09-06 An optically accelerated extreme learning machine using hot atomic vapors Pierre Azam et.al. 2409.04312 translate read null
2024-09-06 Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets Desiree Heim et.al. 2409.04286 translate read null
2024-09-06 Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models Yuxiao Huang et.al. 2409.04270 translate read null
2024-09-06 GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding Ziyin Zhang et.al. 2409.04183 translate read link
2024-09-06 Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering Larissa Pusch et.al. 2409.04181 translate read null
2024-09-06 From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks Andreas Stephan et.al. 2409.04168 translate read null
2024-09-06 Can OpenSource beat ChatGPT? – A Comparative Study of Large Language Models for Text-to-Code Generation Luis Mayer et.al. 2409.04164 translate read null
2024-09-05 Attention Heads of Large Language Models: A Survey Zifan Zheng et.al. 2409.03752 translate read link
2024-09-05 LLM-CI: Assessing Contextual Integrity Norms in Language Models Yan Shvartzshnaider et.al. 2409.03735 translate read null
2024-09-05 Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry Meena Jagadeesan et.al. 2409.03734 translate read null
2024-09-05 Planning In Natural Language Improves LLM Search For Code Generation Evan Wang et.al. 2409.03733 translate read null
2024-09-05 RAG based Question-Answering for Contextual Response Prediction System Sriram Veturi et.al. 2409.03708 translate read null
2024-09-05 TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems Stylianos Loukas Vasileiou et.al. 2409.03671 translate read null
2024-09-05 A Fused Large Language Model for Predicting Startup Success Abdurahman Maarouf et.al. 2409.03668 translate read null
2024-09-05 The representation landscape of few-shot learning and fine-tuning in large language models Diego Doimo et.al. 2409.03662 translate read link
2024-09-06 LLM-based multi-agent poetry generation in non-cooperative environments Ran Zhang et.al. 2409.03659 translate read link
2024-09-05 From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents Jifan Yu et.al. 2409.03512 translate read null
2024-09-04 RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) Yao Mu et.al. 2409.02920 translate read link
2024-09-05 LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA Jiajie Zhang et.al. 2409.02897 translate read link
2024-09-04 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture Xidong Wang et.al. 2409.02889 translate read link
2024-09-04 Historical German Text Normalization Using Type- and Token-Based Language Modeling Anton Ehrmanntraut et.al. 2409.02841 translate read null
2024-09-04 Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models Moein Shahiki Tash et.al. 2409.02836 translate read null
2024-09-04 CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models Wentao Liu et.al. 2409.02834 translate read link
2024-09-04 ExpLLM: Towards Chain of Thought for Facial Expression Recognition Xing Lan et.al. 2409.02828 translate read link
2024-09-04 Design Contradictions: Help or Hindrance? Aron E. Owen et.al. 2409.02823 translate read null
2024-09-04 Language Understanding as a Constraint on Consensus Size in LLM Societies Giordano De Marzo et.al. 2409.02822 translate read null
2024-09-04 Towards a Unified View of Preference Learning for Large Language Models: A Survey Bofei Gao et.al. 2409.02795 translate read link

(<a href=../LLM.md>back to LLM</a>)