LLM - 2024-07

| Publish Date | Title | Authors | PDF | Translate | Read | Code |
| --- | --- | --- | --- | --- | --- | --- |
| 2024-07-31 | Vision-Language Model Based Handwriting Verification | Mihir Chauhan et.al. | 2407.21788 | translate | read | null |
| 2024-07-31 | Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Shi Liu et.al. | 2407.21771 | translate | read | null |
| 2024-07-31 | ReplanVLM: Replanning Robotic Tasks with Visual Language Models | Aoran Mei et.al. | 2407.21762 | translate | read | null |
| 2024-07-31 | Adaptive Retrieval-Augmented Generation for Conversational Systems | Xi Wang et.al. | 2407.21712 | translate | read | null |
| 2024-07-31 | CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature | Stefan Langer et.al. | 2407.21708 | translate | read | null |
| 2024-07-31 | TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang et.al. | 2407.21693 | translate | read | null |
| 2024-07-31 | Synth-Empathy: Towards High-Quality Synthetic Empathy Data | Hao Liang et.al. | 2407.21669 | translate | read | link |
| 2024-07-31 | LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows | Lukas Teufelberger et.al. | 2407.21593 | translate | read | null |
| 2024-07-31 | A Performance Study of LLM-Generated Code on Leetcode | Tristan Coignion et.al. | 2407.21579 | translate | read | null |
| 2024-07-31 | PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning | Min Jae Jung et.al. | 2407.21571 | translate | read | null |
| 2024-07-30 | ThinK: Thinner Key Cache by Query-Driven Pruning | Yuhui Xu et.al. | 2407.21018 | translate | read | link |
| 2024-07-30 | CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Yuexi Du et.al. | 2407.21011 | translate | read | link |
| 2024-07-30 | The Dual-Edged Sword of Technical Debt: Benefits and Issues Analyzed Through Developer Discussions | Xiaozhou Li et.al. | 2407.21007 | translate | read | null |
| 2024-07-30 | MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning | Yupeng Chen et.al. | 2407.20999 | translate | read | null |
| 2024-07-30 | From Feature Importance to Natural Language Explanations Using LLMs with RAG | Sule Tekkesinoglu et.al. | 2407.20990 | translate | read | null |
| 2024-07-30 | Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks | Alakesh Kalita et.al. | 2407.20970 | translate | read | null |
| 2024-07-30 | Automated Review Generation Method Based on Large Language Models | Shican Wu et.al. | 2407.20906 | translate | read | link |
| 2024-07-30 | ThinkRepair: Self-Directed Automated Program Repair | Xin Yin et.al. | 2407.20898 | translate | read | link |
| 2024-07-30 | Effective Black Box Testing of Sentiment Analysis Classification Networks | Parsa Karbasizadeh et.al. | 2407.20884 | translate | read | null |
| 2024-07-30 | Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification | Boyang Zhang et.al. | 2407.20859 | translate | read | null |
| 2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | translate | read | null |
| 2024-07-29 | Can Editing LLMs Inject Harm? | Canyu Chen et.al. | 2407.20224 | translate | read | link |
| 2024-07-29 | QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval | Hongming Tan et.al. | 2407.20207 | translate | read | null |
| 2024-07-29 | MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Zehui Chen et.al. | 2407.20183 | translate | read | link |
| 2024-07-29 | Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning | Xingchen Zeng et.al. | 2407.20174 | translate | read | link |
| 2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | translate | read | link |
| 2024-07-29 | Language-Conditioned Offline RL for Multi-Robot Navigation | Steven Morad et.al. | 2407.20164 | translate | read | null |
| 2024-07-29 | rLLM: Relational Table Learning with LLMs | Weichen Li et.al. | 2407.20157 | translate | read | link |
| 2024-07-29 | ByteCheckpoint: A Unified Checkpointing System for LLM Development | Borui Wan et.al. | 2407.20143 | translate | read | null |
| 2024-07-29 | Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models | Zhe Li et.al. | 2407.20053 | translate | read | null |
| 2024-07-26 | Small Molecule Optimization with Large Language Models | Philipp Guevorguian et.al. | 2407.18897 | translate | read | link |
| 2024-07-26 | Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models | Mutahar Safdar et.al. | 2407.18827 | translate | read | null |
| 2024-07-26 | Automatic Detection of Moral Values in Music Lyrics | Vjosa Preniqi et.al. | 2407.18787 | translate | read | link |
| 2024-07-26 | The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs | Aleix Sant et.al. | 2407.18786 | translate | read | null |
| 2024-07-26 | TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals | Kevin Kliimask et.al. | 2407.18764 | translate | read | null |
| 2024-07-26 | Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery | Yuni Susanti et.al. | 2407.18752 | translate | read | link |
| 2024-07-26 | Towards Effective and Efficient Continual Pre-training of Large Language Models | Jie Chen et.al. | 2407.18743 | translate | read | link |
| 2024-07-26 | Towards Generalized Offensive Language Identification | Alphaeus Dmonte et.al. | 2407.18738 | translate | read | null |
| 2024-07-26 | LLASP: Fine-tuning Large Language Models for Answer Set Programming | Erica Coppolillo et.al. | 2407.18723 | translate | read | null |
| 2024-07-26 | Neurosymbolic AI for Enhancing Instructability in Generative AI | Amit Sheth et.al. | 2407.18722 | translate | read | null |
| 2024-07-25 | Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Yuxiao Qu et.al. | 2407.18219 | translate | read | null |
| 2024-07-25 | Exploring Scaling Trends in LLM Robustness | Nikolhaus Howe et.al. | 2407.18213 | translate | read | null |
| 2024-07-25 | Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Sanae Lotfi et.al. | 2407.18158 | translate | read | null |
| 2024-07-25 | Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Fakhraddin Alwajih et.al. | 2407.18129 | translate | read | null |
| 2024-07-25 | Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow | Tian Guo et.al. | 2407.18103 | translate | read | null |
| 2024-07-25 | PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization | Christopher Clarke et.al. | 2407.18078 | translate | read | link |
| 2024-07-25 | C2P: Featuring Large Language Models with Causal Reasoning | Abdolmahdi Bagheri et.al. | 2407.18069 | translate | read | null |
| 2024-07-25 | ComPeer: A Generative Conversational Agent for Proactive Peer Support | Tianjian Liu et.al. | 2407.18064 | translate | read | null |
| 2024-07-25 | Audio Entailment: Assessing Deductive Reasoning for Audio Understanding | Soham Deshmukh et.al. | 2407.18062 | translate | read | link |
| 2024-07-25 | Difficulty Estimation and Simplification of French Text Using LLMs | Henri Jamet et.al. | 2407.18061 | translate | read | null |
| 2024-07-24 | I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao et.al. | 2407.17469 | translate | read | link |
| 2024-07-24 | WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Wenting Zhao et.al. | 2407.17468 | translate | read | null |
| 2024-07-24 | CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu et.al. | 2407.17467 | translate | read | null |
| 2024-07-24 | $VILA^2$ : VILA Augmented VILA | Yunhao Fang et.al. | 2407.17453 | translate | read | null |
| 2024-07-24 | Generative AI in Evidence-Based Software Engineering: A White Paper | Mattel Esposito et.al. | 2407.17440 | translate | read | null |
| 2024-07-24 | Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? | Michael-Andrei Panaitescu-Liess et.al. | 2407.17417 | translate | read | null |
| 2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | translate | read | null |
| 2024-07-24 | Grammar-based Game Description Generation using Large Language Models | Tsunehiko Tanaka et.al. | 2407.17404 | translate | read | null |
| 2024-07-24 | 3D Question Answering for City Scene Understanding | Penglei Sun et.al. | 2407.17398 | translate | read | null |
| 2024-07-24 | ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi et.al. | 2407.17365 | translate | read | null |
| 2024-07-23 | Can Large Language Models Automatically Jailbreak GPT-4V? | Yuanwei Wu et.al. | 2407.16686 | translate | read | null |
| 2024-07-23 | RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent | Huiyu Xu et.al. | 2407.16667 | translate | read | null |
| 2024-07-23 | Course-Correction: Safety Alignment Using Synthetic Preferences | Rongwu Xu et.al. | 2407.16637 | translate | read | link |
| 2024-07-23 | Lawma: The Power of Specialization for Legal Tasks | Ricardo Dominguez-Olmedo et.al. | 2407.16615 | translate | read | null |
| 2024-07-23 | Shared Imagination: LLMs Hallucinate Alike | Yilun Zhou et.al. | 2407.16604 | translate | read | null |
| 2024-07-23 | Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs | Yifan Xia et.al. | 2407.16576 | translate | read | null |
| 2024-07-23 | Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Ioana Buhnila et.al. | 2407.16565 | translate | read | null |
| 2024-07-23 | Patched RTC: evaluating LLMs for diverse software development tasks | Asankhaya Sharma et.al. | 2407.16557 | translate | read | link |
| 2024-07-24 | MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues | Liyun Zhang et.al. | 2407.16552 | translate | read | null |
| 2024-07-23 | Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models | Aristeidis Panos et.al. | 2407.16526 | translate | read | null |
| 2024-07-22 | AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Junyu Xie et.al. | 2407.15850 | translate | read | link |
| 2024-07-22 | LLMmap: Fingerprinting For Large Language Models | Dario Pasquini et.al. | 2407.15847 | translate | read | null |
| 2024-07-22 | SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Mingze Xu et.al. | 2407.15841 | translate | read | link |
| 2024-07-22 | MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Yangzhou Liu et.al. | 2407.15838 | translate | read | link |
| 2024-07-22 | dMel: Speech Tokenization made Simple | He Bai et.al. | 2407.15835 | translate | read | link |
| 2024-07-22 | Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight | Ziyuan Huang et.al. | 2407.15819 | translate | read | null |
| 2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | translate | read | null |
| 2024-07-22 | MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation | Marco Simoni et.al. | 2407.15748 | translate | read | null |
| 2024-07-22 | OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context | Steffen Kleinle et.al. | 2407.15736 | translate | read | null |
| 2024-07-22 | TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON | John Chong Min Tan et.al. | 2407.15734 | translate | read | link |
| 2024-07-19 | Internal Consistency and Self-Feedback in Large Language Models: A Survey | Xun Liang et.al. | 2407.14507 | translate | read | link |
| 2024-07-19 | On Pre-training of Multimodal Language Models Customized for Chart Understanding | Wan-Cyuan Fan et.al. | 2407.14506 | translate | read | null |
| 2024-07-19 | Evaluating the Reliability of Self-Explanations in Large Language Models | Korbinian Randl et.al. | 2407.14487 | translate | read | link |
| 2024-07-19 | Contrastive Learning with Counterfactual Explanations for Radiology Report Generation | Mingjie Li et.al. | 2407.14474 | translate | read | null |
| 2024-07-19 | Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira et.al. | 2407.14467 | translate | read | null |
| 2024-07-19 | Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier | Zachary Wojtowicz et.al. | 2407.14452 | translate | read | null |
| 2024-07-19 | From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards | Nicole Sultanum et.al. | 2407.14451 | translate | read | null |
| 2024-07-19 | Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding | Renshan Zhang et.al. | 2407.14439 | translate | read | link |
| 2024-07-19 | The Vision of Autonomic Computing: Can LLMs Make It a Reality? | Zhiyang Zhang et.al. | 2407.14402 | translate | read | null |
| 2024-07-19 | Open Artificial Knowledge | Vadim Borisov et.al. | 2407.14371 | translate | read | null |
| 2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766 | translate | read | link |
| 2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | translate | read | null |
| 2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | translate | read | null |
| 2024-07-18 | CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications | Mirza Masfiqur Rahman et.al. | 2407.13742 | translate | read | null |
| 2024-07-18 | Baba Is AI: Break the Rules to Beat the Benchmark | Nathan Cloos et.al. | 2407.13729 | translate | read | null |
| 2024-07-18 | CoDefeater: Using LLMs To Find Defeaters in Assurance Cases | Usman Gohar et.al. | 2407.13717 | translate | read | null |
| 2024-07-18 | Understanding Reference Policies in Direct Preference Optimization | Yixin Liu et.al. | 2407.13709 | translate | read | link |
| 2024-07-18 | A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice | Shaina Raza et.al. | 2407.13699 | translate | read | null |
| 2024-07-18 | Prover-Verifier Games improve legibility of LLM outputs | Jan Hendrik Kirchner et.al. | 2407.13692 | translate | read | link |
| 2024-07-18 | COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization | Skyler Grandel et.al. | 2407.13648 | translate | read | null |
| 2024-07-17 | LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner et.al. | 2407.12753 | translate | read | null |
| 2024-07-17 | EchoSight: Advancing Visual-Language Models with Wiki Knowledge | Yibin Yan et.al. | 2407.12735 | translate | read | null |
| 2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | translate | read | null |
| 2024-07-17 | Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? | Ben Yao et.al. | 2407.12725 | translate | read | null |
| 2024-07-17 | The Future of Learning: Large Language Models through the Lens of Students | He Zhang et.al. | 2407.12723 | translate | read | null |
| 2024-07-17 | MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models | Leyang Shen et.al. | 2407.12709 | translate | read | link |
| 2024-07-17 | Patch-Level Training for Large Language Models | Chenze Shao et.al. | 2407.12665 | translate | read | link |
| 2024-07-17 | Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon et.al. | 2407.12642 | translate | read | null |
| 2024-07-17 | Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences | Claudio Pinhanez et.al. | 2407.12620 | translate | read | null |
| 2024-07-17 | AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism | William Brannon et.al. | 2407.12613 | translate | read | link |
| 2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | translate | read | null |
| 2024-07-16 | NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Mo Li et.al. | 2407.11963 | translate | read | link |
| 2024-07-16 | Code Documentation and Analysis to Secure Software Development | Paul Attie et.al. | 2407.11934 | translate | read | null |
| 2024-07-16 | What’s Wrong? Refining Meeting Summaries with LLM Feedback | Frederic Kirstein et.al. | 2407.11919 | translate | read | null |
| 2024-07-16 | Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads | Aritra Dhar et.al. | 2407.11888 | translate | read | null |
| 2024-07-16 | Schema Matching with Large Language Models: an Experimental Study | Marcel Parciak et.al. | 2407.11852 | translate | read | link |
| 2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | translate | read | link |
| 2024-07-16 | GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text | Kyle Hamilton et.al. | 2407.11827 | translate | read | null |
| 2024-07-16 | PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler et.al. | 2407.11798 | translate | read | null |
| 2024-07-16 | Large Language Models as Misleading Assistants in Conversation | Betty Li Hou et.al. | 2407.11789 | translate | read | null |
| 2024-07-15 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation | Bocheng Zou et.al. | 2407.10972 | translate | read | link |
| 2024-07-15 | Q-Sparse: All Large Language Models can be Fully Sparsely-Activated | Hongyu Wang et.al. | 2407.10969 | translate | read | null |
| 2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | translate | read | link |
| 2024-07-15 | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Han Guo et.al. | 2407.10960 | translate | read | link |
| 2024-07-15 | MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Chengguang Gan et.al. | 2407.10953 | translate | read | null |
| 2024-07-15 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Yaoting Wang et.al. | 2407.10947 | translate | read | link |
| 2024-07-15 | GRUtopia: Dream General Robots in a City at Scale | Hanqing Wang et.al. | 2407.10943 | translate | read | link |
| 2024-07-15 | Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak et.al. | 2407.10920 | translate | read | null |
| 2024-07-15 | FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets | Xiaohui Victor Li et.al. | 2407.10909 | translate | read | link |
| 2024-07-15 | Hey, That’s My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique | Mark Russinovich et.al. | 2407.10887 | translate | read | null |
| 2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | translate | read | null |
| 2024-07-12 | Human-like Episodic Memory for Infinite Context LLMs | Zafeirios Fountas et.al. | 2407.09450 | translate | read | link |
| 2024-07-12 | ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts | Amelia F. Hardy et.al. | 2407.09447 | translate | read | null |
| 2024-07-12 | MUSCLE: A Model Update Strategy for Compatible LLM Evolution | Jessica Echterhoff et.al. | 2407.09435 | translate | read | null |
| 2024-07-12 | Open (Clinical) LLMs are Sensitive to Instruction Phrasings | Alberto Mario Ceballos Arroyo et.al. | 2407.09429 | translate | read | null |
| 2024-07-12 | TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | Hang Zou et.al. | 2407.09424 | translate | read | null |
| 2024-07-12 | Mitigating Entity-Level Hallucination in Large Language Models | Weihang Su et.al. | 2407.09417 | translate | read | link |
| 2024-07-12 | SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers | Shraman Pramanick et.al. | 2407.09413 | translate | read | link |
| 2024-07-12 | PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Saber Zerhoudi et.al. | 2407.09394 | translate | read | link |
| 2024-07-12 | GAVEL: Generating Games Via Evolution and Language Models | Graham Todd et.al. | 2407.09388 | translate | read | link |
| 2024-07-11 | MAVIS: Mathematical Visual Instruction Tuning | Renrui Zhang et.al. | 2407.08739 | translate | read | link |
| 2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735 | translate | read | null |
| 2024-07-11 | Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Zihao Zhou et.al. | 2407.08733 | translate | read | null |
| 2024-07-11 | A Taxonomy for Data Contamination in Large Language Models | Medha Palavalli et.al. | 2407.08716 | translate | read | null |
| 2024-07-11 | GTA: A Benchmark for General Tool Agents | Jize Wang et.al. | 2407.08713 | translate | read | link |
| 2024-07-11 | Extracting Training Data from Document-Based VQA Models | Francesco Pinto et.al. | 2407.08707 | translate | read | null |
| 2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | translate | read | null |
| 2024-07-11 | Mitigating Catastrophic Forgetting in Language Transfer via Model Merging | Anton Alexandrov et.al. | 2407.08699 | translate | read | null |
| 2024-07-11 | Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight | Zhiqiang Xie et.al. | 2407.08694 | translate | read | null |
| 2024-07-11 | SEED-Story: Multimodal Long Story Generation with Large Language Model | Shuai Yang et.al. | 2407.08683 | translate | read | link |
| 2024-07-10 | Training on the Test Task Confounds Evaluation and Emergence | Ricardo Dominguez-Olmedo et.al. | 2407.07890 | translate | read | link |
| 2024-07-10 | Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization | Junkang Wu et.al. | 2407.07880 | translate | read | link |
| 2024-07-10 | FACTS About Building Retrieval Augmented Generation-based Chatbots | Rama Akkiraju et.al. | 2407.07858 | translate | read | null |
| 2024-07-10 | OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Sami Jaghouar et.al. | 2407.07852 | translate | read | link |
| 2024-07-10 | Natural Language Mechanisms via Self-Resolution with Foundation Models | Nicolas Della Penna et.al. | 2407.07845 | translate | read | null |
| 2024-07-10 | Transformer Alignment in Large Language Models | Murdock Aubry et.al. | 2407.07810 | translate | read | null |
| 2024-07-10 | Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann et.al. | 2407.07799 | translate | read | link |
| 2024-07-11 | Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard | Oguzhan Topsakal et.al. | 2407.07796 | translate | read | link |
| 2024-07-10 | Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Tianjie Ju et.al. | 2407.07791 | translate | read | link |
| 2024-07-10 | WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment | Jiefu Ou et.al. | 2407.07778 | translate | read | null |
| 2024-07-09 | AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jiaxi Cui et.al. | 2407.07094 | translate | read | link |
| 2024-07-09 | FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Liqun Ma et.al. | 2407.07093 | translate | read | link |
| 2024-07-09 | Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models | Logan Cross et.al. | 2407.07086 | translate | read | link |
| 2024-07-09 | Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities | Shaltiel Shmidman et.al. | 2407.07080 | translate | read | null |
| 2024-07-09 | Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang et.al. | 2407.07071 | translate | read | link |
| 2024-07-09 | Prompting Techniques for Secure Code Generation: A Systematic Investigation | Catherine Tony et.al. | 2407.07064 | translate | read | null |
| 2024-07-09 | Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Weize Chen et.al. | 2407.07061 | translate | read | link |
| 2024-07-09 | Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang et.al. | 2407.07053 | translate | read | link |
| 2024-07-09 | CorMulT: A Semi-supervised Modality Correlation-aware Multimodal Transformer for Sentiment Analysis | Yangmin Li et.al. | 2407.07046 | translate | read | null |
| 2024-07-09 | Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies | Inwon Kang et.al. | 2407.07019 | translate | read | null |
| 2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | translate | read | link |
| 2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188 | translate | read | null |
| 2024-07-08 | On Speeding Up Language Model Evaluation | Jin Peng Zhou et.al. | 2407.06172 | translate | read | link |
| 2024-07-08 | What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study | Shihan Dou et.al. | 2407.06153 | translate | read | null |
| 2024-07-08 | Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks | Lukas Netz et.al. | 2407.06146 | translate | read | null |
| 2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | translate | read | link |
| 2024-07-08 | Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization | Hannah K. Bako et.al. | 2407.06129 | translate | read | link |
| 2024-07-08 | Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities | Avinash Anand et.al. | 2407.06125 | translate | read | null |
| 2024-07-08 | Artificial Intuition: Efficient Classification of Scientific Abstracts | Harsh Sakhrani et.al. | 2407.06093 | translate | read | null |
| 2024-07-08 | Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models | Jinliang Lu et.al. | 2407.06089 | translate | read | null |
| 2024-07-05 | Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Rudolf Laine et.al. | 2407.04694 | translate | read | null |
| 2024-07-05 | ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Yuzhe Gu et.al. | 2407.04693 | translate | read | link |
| 2024-07-05 | Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Yuanze Lin et.al. | 2407.04681 | translate | read | null |
| 2024-07-05 | Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Ye Bai et.al. | 2407.04675 | translate | read | null |
| 2024-07-05 | Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Yongji Wu et.al. | 2407.04656 | translate | read | null |
| 2024-07-05 | Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework | Reza Averly et.al. | 2407.04629 | translate | read | null |
| 2024-07-05 | On scalable oversight with weak LLMs judging strong LLMs | Zachary Kenton et.al. | 2407.04622 | translate | read | null |
| 2024-07-05 | Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | Shumaila Javaid et.al. | 2407.04581 | translate | read | null |
| 2024-07-05 | VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models | Hang Gao et.al. | 2407.04573 | translate | read | null |
| 2024-07-05 | PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts | Ana-Cristina Rogoz et.al. | 2407.04541 | translate | read | link |
| 2024-07-03 | BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations | Zhantao Yang et.al. | 2407.03314 | translate | read | null |
| 2024-07-03 | Universal Length Generalization with Turing Programs | Kaiying Hou et.al. | 2407.03310 | translate | read | null |
| 2024-07-03 | Large Language Models for JSON Schema Discovery | Michael J. Mior et.al. | 2407.03286 | translate | read | null |
| 2024-07-03 | LLM Internal States Reveal Hallucination Risk Faced With a Query | Ziwei Ji et.al. | 2407.03282 | translate | read | null |
| 2024-07-03 | Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen et.al. | 2407.03227 | translate | read | null |
| 2024-07-03 | How Does Quantization Affect Multilingual LLMs? | Kelly Marchisio et.al. | 2407.03211 | translate | read | null |
| 2024-07-03 | TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang et.al. | 2407.03203 | translate | read | link |
| 2024-07-03 | Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models | Haritz Puerto et.al. | 2407.03181 | translate | read | link |
| 2024-07-03 | Investigating Decoder-only Large Language Models for Speech-to-text Translation | Chao-Wei Huang et.al. | 2407.03169 | translate | read | null |
| 2024-07-03 | SOS! Soft Prompt Attack Against Open-Source Large Language Models | Ziqing Yang et.al. | 2407.03160 | translate | read | null |
| 2024-07-02 | MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Huiqiang Jiang et.al. | 2407.02490 | translate | read | link |
| 2024-07-02 | Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Ali Safaya et.al. | 2407.02486 | translate | read | link |
| 2024-07-02 | RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Yue Yu et.al. | 2407.02485 | translate | read | null |
| 2024-07-02 | MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Binxu Li et.al. | 2407.02483 | translate | read | null |
| 2024-07-02 | Understanding Alignment in Multimodal LLMs: A Comprehensive Study | Elmira Amirloo et.al. | 2407.02477 | translate | read | null |
| 2024-07-02 | Open Scene Graphs for Open World Object-Goal Navigation | Joel Loo et.al. | 2407.02473 | translate | read | null |
| 2024-07-02 | Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I | Harrie Oosterhuis et.al. | 2407.02464 | translate | read | null |
| 2024-07-02 | Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling | Margaret Li et.al. | 2407.02446 | translate | read | null |
| 2024-07-02 | Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs | Jinmin Li et.al. | 2407.02411 | translate | read | null |
| 2024-07-02 | CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models | Song Wang et.al. | 2407.02408 | translate | read | null |
| 2024-07-01 | BMW Agents – A Framework For Task Automation Through Multi-Agent Collaboration | Noel Crawford et.al. | 2406.20041 | translate | read | null |

(<a href=../LLM.md>back to LLM</a>)