LLM - 2024-07 | Paper Arxiv Daily

LLM - 2024-07

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-07-31	Vision-Language Model Based Handwriting Verification	Mihir Chauhan et.al.	2407.21788	translate	read	null
2024-07-31	Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs	Shi Liu et.al.	2407.21771	translate	read	null
2024-07-31	ReplanVLM: Replanning Robotic Tasks with Visual Language Models	Aoran Mei et.al.	2407.21762	translate	read	null
2024-07-31	Adaptive Retrieval-Augmented Generation for Conversational Systems	Xi Wang et.al.	2407.21712	translate	read	null
2024-07-31	CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature	Stefan Langer et.al.	2407.21708	translate	read	null
2024-07-31	TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities	Ming Zhang et.al.	2407.21693	translate	read	null
2024-07-31	Synth-Empathy: Towards High-Quality Synthetic Empathy Data	Hao Liang et.al.	2407.21669	translate	read	link
2024-07-31	LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows	Lukas Teufelberger et.al.	2407.21593	translate	read	null
2024-07-31	A Performance Study of LLM-Generated Code on Leetcode	Tristan Coignion et.al.	2407.21579	translate	read	null
2024-07-31	PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning	Min Jae Jung et.al.	2407.21571	translate	read	null
2024-07-30	ThinK: Thinner Key Cache by Query-Driven Pruning	Yuhui Xu et.al.	2407.21018	translate	read	link
2024-07-30	CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning	Yuexi Du et.al.	2407.21011	translate	read	link
2024-07-30	The Dual-Edged Sword of Technical Debt: Benefits and Issues Analyzed Through Developer Discussions	Xiaozhou Li et.al.	2407.21007	translate	read	null
2024-07-30	MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning	Yupeng Chen et.al.	2407.20999	translate	read	null
2024-07-30	From Feature Importance to Natural Language Explanations Using LLMs with RAG	Sule Tekkesinoglu et.al.	2407.20990	translate	read	null
2024-07-30	Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks	Alakesh Kalita et.al.	2407.20970	translate	read	null
2024-07-30	Automated Review Generation Method Based on Large Language Models	Shican Wu et.al.	2407.20906	translate	read	link
2024-07-30	ThinkRepair: Self-Directed Automated Program Repair	Xin Yin et.al.	2407.20898	translate	read	link
2024-07-30	Effective Black Box Testing of Sentiment Analysis Classification Networks	Parsa Karbasizadeh et.al.	2407.20884	translate	read	null
2024-07-30	Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification	Boyang Zhang et.al.	2407.20859	translate	read	null
2024-07-29	Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing	Ekaterina Iakovleva et.al.	2407.20232	translate	read	null
2024-07-29	Can Editing LLMs Inject Harm?	Canyu Chen et.al.	2407.20224	translate	read	link
2024-07-29	QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval	Hongming Tan et.al.	2407.20207	translate	read	null
2024-07-29	MindSearch: Mimicking Human Minds Elicits Deep AI Searcher	Zehui Chen et.al.	2407.20183	translate	read	link
2024-07-29	Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning	Xingchen Zeng et.al.	2407.20174	translate	read	link
2024-07-29	Diffusion Feedback Helps CLIP See Better	Wenxuan Wang et.al.	2407.20171	translate	read	link
2024-07-29	Language-Conditioned Offline RL for Multi-Robot Navigation	Steven Morad et.al.	2407.20164	translate	read	null
2024-07-29	rLLM: Relational Table Learning with LLMs	Weichen Li et.al.	2407.20157	translate	read	link
2024-07-29	ByteCheckpoint: A Unified Checkpointing System for LLM Development	Borui Wan et.al.	2407.20143	translate	read	null
2024-07-29	Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models	Zhe Li et.al.	2407.20053	translate	read	null
2024-07-26	Small Molecule Optimization with Large Language Models	Philipp Guevorguian et.al.	2407.18897	translate	read	link
2024-07-26	Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models	Mutahar Safdar et.al.	2407.18827	translate	read	null
2024-07-26	Automatic Detection of Moral Values in Music Lyrics	Vjosa Preniqi et.al.	2407.18787	translate	read	link
2024-07-26	The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs	Aleix Sant et.al.	2407.18786	translate	read	null
2024-07-26	TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals	Kevin Kliimask et.al.	2407.18764	translate	read	null
2024-07-26	Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery	Yuni Susanti et.al.	2407.18752	translate	read	link
2024-07-26	Towards Effective and Efficient Continual Pre-training of Large Language Models	Jie Chen et.al.	2407.18743	translate	read	link
2024-07-26	Towards Generalized Offensive Language Identification	Alphaeus Dmonte et.al.	2407.18738	translate	read	null
2024-07-26	LLASP: Fine-tuning Large Language Models for Answer Set Programming	Erica Coppolillo et.al.	2407.18723	translate	read	null
2024-07-26	Neurosymbolic AI for Enhancing Instructability in Generative AI	Amit Sheth et.al.	2407.18722	translate	read	null
2024-07-25	Recursive Introspection: Teaching Language Model Agents How to Self-Improve	Yuxiao Qu et.al.	2407.18219	translate	read	null
2024-07-25	Exploring Scaling Trends in LLM Robustness	Nikolhaus Howe et.al.	2407.18213	translate	read	null
2024-07-25	Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models	Sanae Lotfi et.al.	2407.18158	translate	read	null
2024-07-25	Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic	Fakhraddin Alwajih et.al.	2407.18129	translate	read	null
2024-07-25	Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow	Tian Guo et.al.	2407.18103	translate	read	null
2024-07-25	PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization	Christopher Clarke et.al.	2407.18078	translate	read	link
2024-07-25	C2P: Featuring Large Language Models with Causal Reasoning	Abdolmahdi Bagheri et.al.	2407.18069	translate	read	null
2024-07-25	ComPeer: A Generative Conversational Agent for Proactive Peer Support	Tianjian Liu et.al.	2407.18064	translate	read	null
2024-07-25	Audio Entailment: Assessing Deductive Reasoning for Audio Understanding	Soham Deshmukh et.al.	2407.18062	translate	read	link
2024-07-25	Difficulty Estimation and Simplification of French Text Using LLMs	Henri Jamet et.al.	2407.18061	translate	read	null
2024-07-24	I Could’ve Asked That: Reformulating Unanswerable Questions	Wenting Zhao et.al.	2407.17469	translate	read	link
2024-07-24	WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries	Wenting Zhao et.al.	2407.17468	translate	read	null
2024-07-24	CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models	Jiawei Gu et.al.	2407.17467	translate	read	null
2024-07-24	$VILA^2$ : VILA Augmented VILA	Yunhao Fang et.al.	2407.17453	translate	read	null
2024-07-24	Generative AI in Evidence-Based Software Engineering: A White Paper	Mattel Esposito et.al.	2407.17440	translate	read	null
2024-07-24	Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?	Michael-Andrei Panaitescu-Liess et.al.	2407.17417	translate	read	null
2024-07-24	(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork	Tianjin Huang et.al.	2407.17412	translate	read	null
2024-07-24	Grammar-based Game Description Generation using Large Language Models	Tsunehiko Tanaka et.al.	2407.17404	translate	read	null
2024-07-24	3D Question Answering for City Scene Understanding	Penglei Sun et.al.	2407.17398	translate	read	null
2024-07-24	ViPer: Visual Personalization of Generative Models via Individual Preference Learning	Sogand Salehi et.al.	2407.17365	translate	read	null
2024-07-23	Can Large Language Models Automatically Jailbreak GPT-4V?	Yuanwei Wu et.al.	2407.16686	translate	read	null
2024-07-23	RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent	Huiyu Xu et.al.	2407.16667	translate	read	null
2024-07-23	Course-Correction: Safety Alignment Using Synthetic Preferences	Rongwu Xu et.al.	2407.16637	translate	read	link
2024-07-23	Lawma: The Power of Specialization for Legal Tasks	Ricardo Dominguez-Olmedo et.al.	2407.16615	translate	read	null
2024-07-23	Shared Imagination: LLMs Hallucinate Alike	Yilun Zhou et.al.	2407.16604	translate	read	null
2024-07-23	Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs	Yifan Xia et.al.	2407.16576	translate	read	null
2024-07-23	Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models	Ioana Buhnila et.al.	2407.16565	translate	read	null
2024-07-23	Patched RTC: evaluating LLMs for diverse software development tasks	Asankhaya Sharma et.al.	2407.16557	translate	read	link
2024-07-24	MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues	Liyun Zhang et.al.	2407.16552	translate	read	null
2024-07-23	Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models	Aristeidis Panos et.al.	2407.16526	translate	read	null
2024-07-22	AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description	Junyu Xie et.al.	2407.15850	translate	read	link
2024-07-22	LLMmap: Fingerprinting For Large Language Models	Dario Pasquini et.al.	2407.15847	translate	read	null
2024-07-22	SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models	Mingze Xu et.al.	2407.15841	translate	read	link
2024-07-22	MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity	Yangzhou Liu et.al.	2407.15838	translate	read	link
2024-07-22	dMel: Speech Tokenization made Simple	He Bai et.al.	2407.15835	translate	read	link
2024-07-22	Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight	Ziyuan Huang et.al.	2407.15819	translate	read	null
2024-07-22	Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach	Rian Dolphin et.al.	2407.15788	translate	read	null
2024-07-22	MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation	Marco Simoni et.al.	2407.15748	translate	read	null
2024-07-22	OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context	Steffen Kleinle et.al.	2407.15736	translate	read	null
2024-07-22	TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON	John Chong Min Tan et.al.	2407.15734	translate	read	link
2024-07-19	Internal Consistency and Self-Feedback in Large Language Models: A Survey	Xun Liang et.al.	2407.14507	translate	read	link
2024-07-19	On Pre-training of Multimodal Language Models Customized for Chart Understanding	Wan-Cyuan Fan et.al.	2407.14506	translate	read	null
2024-07-19	Evaluating the Reliability of Self-Explanations in Large Language Models	Korbinian Randl et.al.	2407.14487	translate	read	link
2024-07-19	Contrastive Learning with Counterfactual Explanations for Radiology Report Generation	Mingjie Li et.al.	2407.14474	translate	read	null
2024-07-19	Check-Eval: A Checklist-based Approach for Evaluating Text Quality	Jayr Pereira et.al.	2407.14467	translate	read	null
2024-07-19	Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier	Zachary Wojtowicz et.al.	2407.14452	translate	read	null
2024-07-19	From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards	Nicole Sultanum et.al.	2407.14451	translate	read	null
2024-07-19	Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding	Renshan Zhang et.al.	2407.14439	translate	read	link
2024-07-19	The Vision of Autonomic Computing: Can LLMs Make It a Reality?	Zhiyang Zhang et.al.	2407.14402	translate	read	null
2024-07-19	Open Artificial Knowledge	Vadim Borisov et.al.	2407.14371	translate	read	null
2024-07-18	Visual Haystacks: Answering Harder Questions About Sets of Images	Tsung-Han Wu et.al.	2407.13766	translate	read	link
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	translate	read	null
2024-07-18	Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2407.13757	translate	read	null
2024-07-18	CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications	Mirza Masfiqur Rahman et.al.	2407.13742	translate	read	null
2024-07-18	Baba Is AI: Break the Rules to Beat the Benchmark	Nathan Cloos et.al.	2407.13729	translate	read	null
2024-07-18	CoDefeater: Using LLMs To Find Defeaters in Assurance Cases	Usman Gohar et.al.	2407.13717	translate	read	null
2024-07-18	Understanding Reference Policies in Direct Preference Optimization	Yixin Liu et.al.	2407.13709	translate	read	link
2024-07-18	A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice	Shaina Raza et.al.	2407.13699	translate	read	null
2024-07-18	Prover-Verifier Games improve legibility of LLM outputs	Jan Hendrik Kirchner et.al.	2407.13692	translate	read	link
2024-07-18	COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization	Skyler Grandel et.al.	2407.13648	translate	read	null
2024-07-17	LookupViT: Compressing visual information to a limited number of tokens	Rajat Koner et.al.	2407.12753	translate	read	null
2024-07-17	EchoSight: Advancing Visual-Language Models with Wiki Knowledge	Yibin Yan et.al.	2407.12735	translate	read	null
2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	translate	read	null
2024-07-17	Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models?	Ben Yao et.al.	2407.12725	translate	read	null
2024-07-17	The Future of Learning: Large Language Models through the Lens of Students	He Zhang et.al.	2407.12723	translate	read	null
2024-07-17	MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models	Leyang Shen et.al.	2407.12709	translate	read	link
2024-07-17	Patch-Level Training for Large Language Models	Chenze Shao et.al.	2407.12665	translate	read	link
2024-07-17	Zero-shot Text-guided Infinite Image Synthesis with LLM guidance	Soyeong Kwon et.al.	2407.12642	translate	read	null
2024-07-17	Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences	Claudio Pinhanez et.al.	2407.12620	translate	read	null
2024-07-17	AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism	William Brannon et.al.	2407.12613	translate	read	link
2024-07-16	UrbanWorld: An Urban World Model for 3D City Generation	Yu Shang et.al.	2407.11965	translate	read	null
2024-07-16	NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?	Mo Li et.al.	2407.11963	translate	read	link
2024-07-16	Code Documentation and Analysis to Secure Software Development	Paul Attie et.al.	2407.11934	translate	read	null
2024-07-16	What’s Wrong? Refining Meeting Summaries with LLM Feedback	Frederic Kirstein et.al.	2407.11919	translate	read	null
2024-07-16	Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads	Aritra Dhar et.al.	2407.11888	translate	read	null
2024-07-16	Schema Matching with Large Language Models: an Experimental Study	Marcel Parciak et.al.	2407.11852	translate	read	link
2024-07-16	LoFTI: Localization and Factuality Transfer to Indian Locales	Sona Elza Simon et.al.	2407.11833	translate	read	link
2024-07-16	GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text	Kyle Hamilton et.al.	2407.11827	translate	read	null
2024-07-16	PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	Branden Butler et.al.	2407.11798	translate	read	null
2024-07-16	Large Language Models as Misleading Assistants in Conversation	Betty Li Hou et.al.	2407.11789	translate	read	null
2024-07-15	VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation	Bocheng Zou et.al.	2407.10972	translate	read	link
2024-07-15	Q-Sparse: All Large Language Models can be Fully Sparsely-Activated	Hongyu Wang et.al.	2407.10969	translate	read	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	translate	read	link
2024-07-15	Fast Matrix Multiplications for Lookup Table-Quantized LLMs	Han Guo et.al.	2407.10960	translate	read	link
2024-07-15	MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models	Chengguang Gan et.al.	2407.10953	translate	read	null
2024-07-15	Can Textual Semantics Mitigate Sounding Object Segmentation Preference?	Yaoting Wang et.al.	2407.10947	translate	read	link
2024-07-15	GRUtopia: Dream General Robots in a City at Scale	Hanqing Wang et.al.	2407.10943	translate	read	link
2024-07-15	Benchmarking Vision Language Models for Cultural Understanding	Shravan Nayak et.al.	2407.10920	translate	read	null
2024-07-15	FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets	Xiaohui Victor Li et.al.	2407.10909	translate	read	link
2024-07-15	Hey, That’s My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique	Mark Russinovich et.al.	2407.10887	translate	read	null
2024-07-12	FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3	Georgios Makridis et.al.	2407.09467	translate	read	null
2024-07-12	Human-like Episodic Memory for Infinite Context LLMs	Zafeirios Fountas et.al.	2407.09450	translate	read	link
2024-07-12	ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts	Amelia F. Hardy et.al.	2407.09447	translate	read	null
2024-07-12	MUSCLE: A Model Update Strategy for Compatible LLM Evolution	Jessica Echterhoff et.al.	2407.09435	translate	read	null
2024-07-12	Open (Clinical) LLMs are Sensitive to Instruction Phrasings	Alberto Mario Ceballos Arroyo et.al.	2407.09429	translate	read	null
2024-07-12	TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models	Hang Zou et.al.	2407.09424	translate	read	null
2024-07-12	Mitigating Entity-Level Hallucination in Large Language Models	Weihang Su et.al.	2407.09417	translate	read	link
2024-07-12	SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers	Shraman Pramanick et.al.	2407.09413	translate	read	link
2024-07-12	PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents	Saber Zerhoudi et.al.	2407.09394	translate	read	link
2024-07-12	GAVEL: Generating Games Via Evolution and Language Models	Graham Todd et.al.	2407.09388	translate	read	link
2024-07-11	MAVIS: Mathematical Visual Instruction Tuning	Renrui Zhang et.al.	2407.08739	translate	read	link
2024-07-11	Real-Time Anomaly Detection and Reactive Planning with Large Language Models	Rohan Sinha et.al.	2407.08735	translate	read	null
2024-07-11	Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist	Zihao Zhou et.al.	2407.08733	translate	read	null
2024-07-11	A Taxonomy for Data Contamination in Large Language Models	Medha Palavalli et.al.	2407.08716	translate	read	null
2024-07-11	GTA: A Benchmark for General Tool Agents	Jize Wang et.al.	2407.08713	translate	read	link
2024-07-11	Extracting Training Data from Document-Based VQA Models	Francesco Pinto et.al.	2407.08707	translate	read	null
2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	translate	read	null
2024-07-11	Mitigating Catastrophic Forgetting in Language Transfer via Model Merging	Anton Alexandrov et.al.	2407.08699	translate	read	null
2024-07-11	Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight	Zhiqiang Xie et.al.	2407.08694	translate	read	null
2024-07-11	SEED-Story: Multimodal Long Story Generation with Large Language Model	Shuai Yang et.al.	2407.08683	translate	read	link
2024-07-10	Training on the Test Task Confounds Evaluation and Emergence	Ricardo Dominguez-Olmedo et.al.	2407.07890	translate	read	link
2024-07-10	Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization	Junkang Wu et.al.	2407.07880	translate	read	link
2024-07-10	FACTS About Building Retrieval Augmented Generation-based Chatbots	Rama Akkiraju et.al.	2407.07858	translate	read	null
2024-07-10	OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training	Sami Jaghouar et.al.	2407.07852	translate	read	link
2024-07-10	Natural Language Mechanisms via Self-Resolution with Foundation Models	Nicolas Della Penna et.al.	2407.07845	translate	read	null
2024-07-10	Transformer Alignment in Large Language Models	Murdock Aubry et.al.	2407.07810	translate	read	null
2024-07-10	Attribute or Abstain: Large Language Models as Long Document Assistants	Jan Buchmann et.al.	2407.07799	translate	read	link
2024-07-11	Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard	Oguzhan Topsakal et.al.	2407.07796	translate	read	link
2024-07-10	Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities	Tianjie Ju et.al.	2407.07791	translate	read	link
2024-07-10	WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment	Jiefu Ou et.al.	2407.07778	translate	read	null
2024-07-09	AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning	Jiaxi Cui et.al.	2407.07094	translate	read	link
2024-07-09	FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation	Liqun Ma et.al.	2407.07093	translate	read	link
2024-07-09	Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models	Logan Cross et.al.	2407.07086	translate	read	link
2024-07-09	Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities	Shaltiel Shmidman et.al.	2407.07080	translate	read	null
2024-07-09	Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps	Yung-Sung Chuang et.al.	2407.07071	translate	read	link
2024-07-09	Prompting Techniques for Secure Code Generation: A Systematic Investigation	Catherine Tony et.al.	2407.07064	translate	read	null
2024-07-09	Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence	Weize Chen et.al.	2407.07061	translate	read	link
2024-07-09	Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model	Wenqi Zhang et.al.	2407.07053	translate	read	link
2024-07-09	CorMulT: A Semi-supervised Modality Correlation-aware Multimodal Transformer for Sentiment Analysis	Yangmin Li et.al.	2407.07046	translate	read	null
2024-07-09	Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies	Inwon Kang et.al.	2407.07019	translate	read	null
2024-07-08	Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision	Orr Zohar et.al.	2407.06189	translate	read	link
2024-07-08	CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation	Xinying Guo et.al.	2407.06188	translate	read	null
2024-07-08	On Speeding Up Language Model Evaluation	Jin Peng Zhou et.al.	2407.06172	translate	read	link
2024-07-08	What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study	Shihan Dou et.al.	2407.06153	translate	read	null
2024-07-08	Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks	Lukas Netz et.al.	2407.06146	translate	read	null
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	translate	read	link
2024-07-08	Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization	Hannah K. Bako et.al.	2407.06129	translate	read	link
2024-07-08	Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities	Avinash Anand et.al.	2407.06125	translate	read	null
2024-07-08	Artificial Intuition: Efficient Classification of Scientific Abstracts	Harsh Sakhrani et.al.	2407.06093	translate	read	null
2024-07-08	Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models	Jinliang Lu et.al.	2407.06089	translate	read	null
2024-07-05	Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs	Rudolf Laine et.al.	2407.04694	translate	read	null
2024-07-05	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models	Yuzhe Gu et.al.	2407.04693	translate	read	link
2024-07-05	Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge	Yuanze Lin et.al.	2407.04681	translate	read	null
2024-07-05	Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition	Ye Bai et.al.	2407.04675	translate	read	null
2024-07-05	Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement	Yongji Wu et.al.	2407.04656	translate	read	null
2024-07-05	Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework	Reza Averly et.al.	2407.04629	translate	read	null
2024-07-05	On scalable oversight with weak LLMs judging strong LLMs	Zachary Kenton et.al.	2407.04622	translate	read	null
2024-07-05	Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions	Shumaila Javaid et.al.	2407.04581	translate	read	null
2024-07-05	VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models	Hang Gao et.al.	2407.04573	translate	read	null
2024-07-05	PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts	Ana-Cristina Rogoz et.al.	2407.04541	translate	read	link
2024-07-03	BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations	Zhantao Yang et.al.	2407.03314	translate	read	null
2024-07-03	Universal Length Generalization with Turing Programs	Kaiying Hou et.al.	2407.03310	translate	read	null
2024-07-03	Large Language Models for JSON Schema Discovery	Michael J. Mior et.al.	2407.03286	translate	read	null
2024-07-03	LLM Internal States Reveal Hallucination Risk Faced With a Query	Ziwei Ji et.al.	2407.03282	translate	read	null
2024-07-03	Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning	Zhili Shen et.al.	2407.03227	translate	read	null
2024-07-03	How Does Quantization Affect Multilingual LLMs?	Kelly Marchisio et.al.	2407.03211	translate	read	null
2024-07-03	TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts	Ruida Wang et.al.	2407.03203	translate	read	link
2024-07-03	Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models	Haritz Puerto et.al.	2407.03181	translate	read	link
2024-07-03	Investigating Decoder-only Large Language Models for Speech-to-text Translation	Chao-Wei Huang et.al.	2407.03169	translate	read	null
2024-07-03	SOS! Soft Prompt Attack Against Open-Source Large Language Models	Ziqing Yang et.al.	2407.03160	translate	read	null
2024-07-02	MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention	Huiqiang Jiang et.al.	2407.02490	translate	read	link
2024-07-02	Neurocache: Efficient Vector Retrieval for Long-range Language Modeling	Ali Safaya et.al.	2407.02486	translate	read	link
2024-07-02	RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs	Yue Yu et.al.	2407.02485	translate	read	null
2024-07-02	MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Binxu Li et.al.	2407.02483	translate	read	null
2024-07-02	Understanding Alignment in Multimodal LLMs: A Comprehensive Study	Elmira Amirloo et.al.	2407.02477	translate	read	null
2024-07-02	Open Scene Graphs for Open World Object-Goal Navigation	Joel Loo et.al.	2407.02473	translate	read	null
2024-07-02	Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I	Harrie Oosterhuis et.al.	2407.02464	translate	read	null
2024-07-02	Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling	Margaret Li et.al.	2407.02446	translate	read	null
2024-07-02	Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs	Jinmin Li et.al.	2407.02411	translate	read	null
2024-07-02	CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models	Song Wang et.al.	2407.02408	translate	read	null
2024-07-01	BMW Agents – A Framework For Task Automation Through Multi-Agent Collaboration	Noel Crawford et.al.	2406.20041	translate	read	null

(<a href=../LLM.md>back to LLM</a>)