LLM - 2024-04 | Paper Arxiv Daily

LLM - 2024-04

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-04-30	DOCCI: Descriptions of Connected and Contrasting Images	Yasumasa Onoe et.al.	2404.19753	translate	read	null
2024-04-30	Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation	Yunhao Ge et.al.	2404.19752	translate	read	null
2024-04-30	PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification	Leon Garza et.al.	2404.19744	translate	read	null
2024-04-30	Better & Faster Large Language Models via Multi-token Prediction	Fabian Gloeckle et.al.	2404.19737	translate	read	null
2024-04-30	A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications	Steph Buongiorno et.al.	2404.19729	translate	read	null
2024-04-30	PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games	Steph Buongiorno et.al.	2404.19721	translate	read	null
2024-04-30	Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns	Constantinos Patsakis et.al.	2404.19715	translate	read	null
2024-04-30	Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models	Scott Sumpter et.al.	2404.19713	translate	read	null
2024-04-30	When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively	Tiziano Labruna et.al.	2404.19705	translate	read	link
2024-04-30	Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners	Chun Feng et.al.	2404.19696	translate	read	null
2024-04-29	Hallucination of Multimodal Large Language Models: A Survey	Zechen Bai et.al.	2404.18930	translate	read	link
2024-04-29	DPO Meets PPO: Reinforced Token Optimization for RLHF	Han Zhong et.al.	2404.18922	translate	read	link
2024-04-29	TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation	Junhao Cheng et.al.	2404.18919	translate	read	null
2024-04-29	Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting	Fangcheng Liu et.al.	2404.18911	translate	read	link
2024-04-29	Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking	Hong Jin Kang et.al.	2404.18881	translate	read	link
2024-04-29	More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness	Aaron J. Li et.al.	2404.18870	translate	read	link
2024-04-29	Truth-value judgment in language models: belief directions are context sensitive	Stefan F. Schouten et.al.	2404.18865	translate	read	null
2024-04-29	Performance-Aligned LLMs for Generating Fast Code	Daniel Nichols et.al.	2404.18864	translate	read	null
2024-04-29	VERT: Verified Equivalent Rust Transpilation with Few-Shot Learning	Aidan Z. H. Yang et.al.	2404.18852	translate	read	null
2024-04-29	It’s Difficult to be Neutral – Human and LLM-based Sentiment Annotation of Patient Comments	Petter Mæhlum et.al.	2404.18832	translate	read	null
2024-04-26	Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo	Stephen Zhao et.al.	2404.17546	translate	read	link
2024-04-26	Large Language Model Agent as a Mechanical Designer	Yayati Jadhav et.al.	2404.17525	translate	read	null
2024-04-26	On the Use of Large Language Models to Generate Capability Ontologies	Luis Miguel Vieira da Silva et.al.	2404.17524	translate	read	null
2024-04-26	Enhancing Legal Compliance and Regulation Analysis with Large Language Models	Shabnam Hassani et.al.	2404.17522	translate	read	null
2024-04-26	A Comprehensive Evaluation on Event Reasoning of Large Language Models	Zhengwei Tao et.al.	2404.17513	translate	read	link
2024-04-26	Learning text-to-video retrieval from image captioning	Lucas Ventura et.al.	2404.17498	translate	read	null
2024-04-26	CEval: A Benchmark for Evaluating Counterfactual Text Generation	Van Bach Nguyen et.al.	2404.17475	translate	read	link
2024-04-26	Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System	Robin Schmucker et.al.	2404.17460	translate	read	null
2024-04-26	“ChatGPT Is Here to Help, Not to Replace Anybody” – An Evaluation of Students’ Opinions On Integrating ChatGPT In CS Courses	Bruno Pereira Cipriano et.al.	2404.17443	translate	read	null
2024-04-26	InspectorRAGet: An Introspection Platform for RAG Evaluation	Kshitij Fadnis et.al.	2404.17347	translate	read	link
2024-04-25	Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials	Ye Fang et.al.	2404.16829	translate	read	null
2024-04-25	How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites	Zhe Chen et.al.	2404.16821	translate	read	link
2024-04-25	IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages	Harman Singh et.al.	2404.16816	translate	read	link
2024-04-25	Make Your LLM Fully Utilize the Context	Shengnan An et.al.	2404.16811	translate	read	link
2024-04-25	Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning	Tianhui Zhang et.al.	2404.16807	translate	read	null
2024-04-25	Weak-to-Strong Extrapolation Expedites Alignment	Chujie Zheng et.al.	2404.16792	translate	read	link
2024-04-25	SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension	Bohao Li et.al.	2404.16790	translate	read	link
2024-04-25	Continual Learning of Large Language Models: A Comprehensive Survey	Haizhou Shi et.al.	2404.16789	translate	read	link
2024-04-25	Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model	Runzhe Zhan et.al.	2404.16766	translate	read	null
2024-04-25	RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis	Xiaoman Zhang et.al.	2404.16754	translate	read	null
2024-04-24	Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data	Aliaksei Vertsel et.al.	2404.15604	translate	read	null
2024-04-24	ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction	Henry Peng Zou et.al.	2404.15592	translate	read	link
2024-04-24	Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?	Hossein Salami et.al.	2404.15578	translate	read	null
2024-04-23	PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models	Shashi Kant Gupta et.al.	2404.15549	translate	read	null
2024-04-23	Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models	Mihir Parmar et.al.	2404.15522	translate	read	link
2024-04-23	Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval	Young Kyun Jang et.al.	2404.15516	translate	read	null
2024-04-23	ToM-LM: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models	Weizhi Tang et.al.	2404.15515	translate	read	null
2024-04-23	GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots	Simranjit Singh et.al.	2404.15500	translate	read	null
2024-04-23	IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents	Jean-Philippe Corbeil et.al.	2404.15488	translate	read	link
2024-04-23	Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance	Het Patel et.al.	2404.15485	translate	read	null
2024-04-23	Aligning LLM Agents by Learning Latent Preference from User Edits	Ge Gao et.al.	2404.15269	translate	read	link
2024-04-23	XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts	Yifeng Ding et.al.	2404.15247	translate	read	link
2024-04-23	Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models	Aidan Z. H. Yang et.al.	2404.15236	translate	read	null
2024-04-23	Re-Thinking Inverse Graphics With Large Language Models	Peter Kulits et.al.	2404.15228	translate	read	null
2024-04-23	Setting up the Data Printer with Improved English to Ukrainian Machine Translation	Yurii Paniv et.al.	2404.15196	translate	read	link
2024-04-23	Regressive Side Effects of Training Language Models to Mimic Student Misconceptions	Shashank Sonkar et.al.	2404.15156	translate	read	null
2024-04-23	Bias patterns in the application of LLMs for clinical decision support: A comprehensive study	Raphael Poulain et.al.	2404.15149	translate	read	null
2024-04-23	Rethinking LLM Memorization through the Lens of Adversarial Compression	Avi Schwarzschild et.al.	2404.15146	translate	read	null
2024-04-23	MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning	Sunan He et.al.	2404.15127	translate	read	link
2024-04-23	Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation	Xun Wu et.al.	2404.15100	translate	read	null
2024-04-22	AutoAD III: The Prequel – Back to the Pixels	Tengda Han et.al.	2404.14412	translate	read	null
2024-04-22	SpaceByte: Towards Deleting Tokenization from Large Language Modeling	Kevin Slagle et.al.	2404.14408	translate	read	link
2024-04-22	RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?	Adrian de Wynter et.al.	2404.14397	translate	read	link
2024-04-22	A Survey on Self-Evolution of Large Language Models	Zhengwei Tao et.al.	2404.14387	translate	read	null
2024-04-22	Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph	Xiaochen Kev Gao et.al.	2404.14372	translate	read	link
2024-04-22	Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data	Fahim Tajwar et.al.	2404.14367	translate	read	link
2024-04-22	Better Synthetic Data by Retrieving and Transforming Existing Datasets	Saumya Gandhi et.al.	2404.14361	translate	read	link
2024-04-22	Rethinking Legal Compliance Automation: Opportunities with Large Language Models	Shabnam Hassani et.al.	2404.14356	translate	read	null
2024-04-22	Automated Long Answer Grading with RiceChem Dataset	Shashank Sonkar et.al.	2404.14316	translate	read	null
2024-04-22	Explaining Arguments’ Strength: Unveiling the Role of Attacks and Supports (Technical Report)	Xiang Yin et.al.	2404.14304	translate	read	null
2024-04-19	MoVA: Adapting Mixture of Vision Experts to Multimodal Context	Zhuofan Zong et.al.	2404.13046	translate	read	link
2024-04-19	Unified Scene Representation and Reconstruction for 3D Large Language Models	Tao Chu et.al.	2404.13044	translate	read	null
2024-04-19	Data Alignment for Zero-Shot Concept Generation in Dermatology AI	Soham Gadgil et.al.	2404.13043	translate	read	null
2024-04-19	LaPA: Latent Prompt Assist Model For Medical Visual Question Answering	Tiancheng Gu et.al.	2404.13039	translate	read	link
2024-04-19	Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs	Biyang Guo et.al.	2404.13033	translate	read	link
2024-04-19	When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering	Stephen Choi et.al.	2404.13028	translate	read	null
2024-04-19	Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models	Chuofan Ma et.al.	2404.13013	translate	read	link
2024-04-19	Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs	Clemencia Siro et.al.	2404.12994	translate	read	link
2024-04-19	RedactBuster: Entity Type Recognition from Redacted Documents	Mirco Beltrame et.al.	2404.12991	translate	read	null
2024-04-19	FineRec:Exploring Fine-grained Sequential Recommendation	Xiaokun Zhang et.al.	2404.12975	translate	read	null
2024-04-18	BLINK: Multimodal Large Language Models Can See but Not Perceive	Xingyu Fu et.al.	2404.12390	translate	read	null
2024-04-18	MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale	Xiaotang Gai et.al.	2404.12372	translate	read	null
2024-04-18	When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes	Asaf Yehudai et.al.	2404.12365	translate	read	link
2024-04-18	Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation	Jingmin Sun et.al.	2404.12355	translate	read	link
2024-04-18	V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning	Hang Hua et.al.	2404.12353	translate	read	null
2024-04-18	Large Language Models in Targeted Sentiment Analysis	Nicolay Rusnachenko et.al.	2404.12342	translate	read	link
2024-04-18	Normative Requirements Operationalization with Large Language Models	Nick Feng et.al.	2404.12335	translate	read	null
2024-04-18	Large Language Models for Synthetic Participatory Planning of Shared Automated Electric Mobility Systems	Jiangbo Yu et.al.	2404.12317	translate	read	null
2024-04-18	Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair	Yusuke Sakai et.al.	2404.12299	translate	read	null
2024-04-18	Augmenting emotion features in irony detection with Large language modeling	Yucheng Lin et.al.	2404.12291	translate	read	null
2024-04-17	A Deep Dive into Large Language Models for Automated Bug Localization and Repair	Soneya Binta Hossain et.al.	2404.11595	translate	read	null
2024-04-17	Related Work and Citation Text Generation: A Survey	Xiangci Li et.al.	2404.11588	translate	read	null
2024-04-17	LLMTune: Accelerate Database Knob Tuning with Large Language Models	Xinmei Huang et.al.	2404.11581	translate	read	null
2024-04-17	MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation	Kuan-Chieh et.al.	2404.11565	translate	read	null
2024-04-17	Quantifying Multilingual Performance of Large Language Models Across Languages	Zihao Li et.al.	2404.11553	translate	read	link
2024-04-17	Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis	Soyoung Yang et.al.	2404.11539	translate	read	null
2024-04-17	Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization	Costas Mavromatis et.al.	2404.11531	translate	read	null
2024-04-17	Embedding Privacy in Computational Social Science and Artificial Intelligence Research	Keenan Jones et.al.	2404.11515	translate	read	null
2024-04-17	Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models	Yushuo Chen et.al.	2404.11502	translate	read	link
2024-04-17	Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models	Yue Zhou et.al.	2404.11500	translate	read	link
2024-04-16	Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback	Qiwei Di et.al.	2404.10776	translate	read	null
2024-04-16	LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?	Yuchi Wang et.al.	2404.10763	translate	read	link
2024-04-16	Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification	Yu-Yang Li et.al.	2404.10757	translate	read	null
2024-04-16	Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study	Shusheng Xu et.al.	2404.10719	translate	read	null
2024-04-16	An empirical study on code review activity prediction in practice	Doriane Olewicki et.al.	2404.10703	translate	read	null
2024-04-16	Automating REST API Postman Test Cases Using LLM	S Deepika Sri et.al.	2404.10678	translate	read	null
2024-04-16	ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images	Quan Van Nguyen et.al.	2404.10652	translate	read	link
2024-04-16	Self-playing Adversarial Language Game Enhances LLM Reasoning	Pengyu Cheng et.al.	2404.10642	translate	read	link
2024-04-16	HLAT: High-quality Large Language Model Pre-trained on AWS Trainium	Haozheng Fan et.al.	2404.10630	translate	read	null
2024-04-16	Private Attribute Inference from Images with Vision-Language Models	Batuhan Tömekçe et.al.	2404.10618	translate	read	null
2024-04-15	Personalized Collaborative Fine-Tuning for On-Device Large Language Models	Nicolas Wagner et.al.	2404.09753	translate	read	null
2024-04-15	Quantization of Large Language Models with an Overdetermined Basis	Daniil Merkulov et.al.	2404.09737	translate	read	null
2024-04-15	Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model	Hyunsoo Cho et.al.	2404.09717	translate	read	null
2024-04-15	Enhancing Robot Explanation Capabilities through Vision-Language Models: a Preliminary Study by Interpreting Visual Inputs for Improved Human-Robot Interaction	David Sobrín-Hidalgo et.al.	2404.09705	translate	read	null
2024-04-15	Generative AI for Game Theory-based Mobile Networking	Long He et.al.	2404.09699	translate	read	null
2024-04-15	Are Large Language Models Reliable Argument Quality Annotators?	Nailia Mirzakhmedova et.al.	2404.09696	translate	read	null
2024-04-15	LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models	Guangyan Li et.al.	2404.09695	translate	read	null
2024-04-15	Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation	Juhwan Choi et.al.	2404.09682	translate	read	link
2024-04-15	Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection	Jiaqi Zhu et.al.	2404.09654	translate	read	null
2024-04-15	Bridging Vision and Language Spaces with Assignment Prediction	Jungin Park et.al.	2404.09632	translate	read	link
2024-04-12	Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts	Övgü Özdemir et.al.	2404.08589	translate	read	link
2024-04-12	Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation	Hanlin Tian et.al.	2404.08570	translate	read	null
2024-04-12	RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs	Shreyas Chaudhari et.al.	2404.08555	translate	read	null
2024-04-12	Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward	Xuan Xie et.al.	2404.08517	translate	read	null
2024-04-12	Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction	Haoran Qiu et.al.	2404.08509	translate	read	link
2024-04-12	LaSagnA: Language-based Segmentation Assistant for Complex Queries	Cong Wei et.al.	2404.08506	translate	read	link
2024-04-12	Strategic Interactions between Large Language Models-based Agents in Beauty Contests	Siting Lu et.al.	2404.08492	translate	read	null
2024-04-12	Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian	Stefano De Paoli et.al.	2404.08488	translate	read	null
2024-04-12	Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task	Hassan Ali et.al.	2404.08424	translate	read	null
2024-04-12	AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees	William Fleshman et.al.	2404.08417	translate	read	null
2024-04-11	OpenBias: Open-set Bias Detection in Text-to-Image Generative Models	Moreno D’Incà et.al.	2404.07990	translate	read	link
2024-04-11	View Selection for 3D Captioning via Diffusion Ranking	Tiange Luo et.al.	2404.07984	translate	read	null
2024-04-11	Manipulating Large Language Models to Increase Product Visibility	Aounon Kumar et.al.	2404.07981	translate	read	link
2024-04-11	LLoCO: Learning Long Contexts Offline	Sijun Tan et.al.	2404.07979	translate	read	link
2024-04-11	Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models	Haotian Zhang et.al.	2404.07973	translate	read	null
2024-04-11	Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation	Jinkyung Park et.al.	2404.07926	translate	read	null
2024-04-11	LaVy: Vietnamese Multimodal Large Language Model	Chi Tran et.al.	2404.07922	translate	read	link
2024-04-11	AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs	Zeyi Liao et.al.	2404.07921	translate	read	link
2024-04-11	DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering Documentation	Anna C. Doris et.al.	2404.07917	translate	read	link
2024-04-11	High-Dimension Human Value Representation in Large Language Models	Samuel Cahyawijaya et.al.	2404.07900	translate	read	link
2024-04-10	UMBRAE: Unified Multimodal Decoding of Brain Signals	Weihao Xia et.al.	2404.07202	translate	read	null
2024-04-10	Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention	Tsendsuren Munkhdalai et.al.	2404.07143	translate	read	link
2024-04-11	Semantically-correlated memories in a dense associative model	Thomas F Burns et.al.	2404.07123	translate	read	null
2024-04-10	Continuous Language Model Interpolation for Dynamic and Controllable Text Generation	Sara Kangaslahti et.al.	2404.07117	translate	read	null
2024-04-11	From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications	Yongqiang Ma et.al.	2404.07108	translate	read	null
2024-04-10	Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs	Bowen Jin et.al.	2404.07103	translate	read	link
2024-04-10	Dynamic Generation of Personalities with Large Language Models	Jianzhi Liu et.al.	2404.07084	translate	read	null
2024-04-10	VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning	Alexandros Xenos et.al.	2404.07078	translate	read	link
2024-04-10	Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?	Mingyu Jin et.al.	2404.07066	translate	read	link
2024-04-10	Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study	Alessandro Stolfo et.al.	2404.07060	translate	read	null
2024-04-09	Pitfalls of Conversational LLMs on News Debiasing	Ipek Baris Schlicht et.al.	2404.06488	translate	read	null
2024-04-09	Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks	Chonghua Wang et.al.	2404.06480	translate	read	link
2024-04-09	Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models	Zihan Fang et.al.	2404.06448	translate	read	null
2024-04-09	Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems	Kunal Garg et.al.	2404.06413	translate	read	null
2024-04-09	AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents	Luca Gioacchini et.al.	2404.06411	translate	read	link
2024-04-09	Take a Look at it! Rethinking How to Evaluate Language Model Jailbreak	Hongyu Cai et.al.	2404.06407	translate	read	link
2024-04-09	Apprentices to Research Assistants: Advancing Research with Large Language Models	M. Namvarpour et.al.	2404.06404	translate	read	null
2024-04-09	MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies	Shengding Hu et.al.	2404.06395	translate	read	link
2024-04-09	MuPT: A Generative Symbolic Music Pretrained Transformer	Xingwei Qu et.al.	2404.06393	translate	read	null
2024-04-09	Latent Distance Guided Alignment Training for Large Language Models	Haotian Luo et.al.	2404.06390	translate	read	null
2024-04-08	MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding	Bo He et.al.	2404.05726	translate	read	link
2024-04-08	Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs	Keen You et.al.	2404.05719	translate	read	null
2024-04-08	Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding	Ahmad Idrissi-Yaghir et.al.	2404.05694	translate	read	null
2024-04-08	Evaluating Mathematical Reasoning Beyond Accuracy	Shijie Xia et.al.	2404.05692	translate	read	link
2024-04-08	Retrieval-Augmented Open-Vocabulary Object Detection	Jooyeon Kim et.al.	2404.05687	translate	read	link
2024-04-08	MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Kunpeng Song et.al.	2404.05674	translate	read	link
2024-04-08	CoReS: Orchestrating the Dance of Reasoning and Segmentation	Xiaoyi Bao et.al.	2404.05673	translate	read	link
2024-04-08	Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data	Haitham Hammami et.al.	2404.05632	translate	read	link
2024-04-08	LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking	Faren Yan et.al.	2404.05624	translate	read	null
2024-04-08	MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering	Iñigo Alonso et.al.	2404.05590	translate	read	null
2024-04-05	Physical Property Understanding from Language-Embedded Feature Fields	Albert J. Zhai et.al.	2404.04242	translate	read	null
2024-04-05	Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents	Harsh Kohli et.al.	2404.04237	translate	read	null
2024-04-05	Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation	Tianqi Zhong et.al.	2404.04232	translate	read	link
2024-04-05	Social Skill Training with Large Language Models	Diyi Yang et.al.	2404.04204	translate	read	null
2024-04-05	Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model	Xinrun Du et.al.	2404.04167	translate	read	null
2024-04-05	Large language models as oracles for instantiating ontologies with domain-specific knowledge	Giovanni Ciatto et.al.	2404.04108	translate	read	link
2024-04-05	Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo	Barkavi Sundararajan et.al.	2404.04103	translate	read	link
2024-04-05	Robust Preference Optimization with Provable Noise Tolerance for LLMs	Xize Liang et.al.	2404.04102	translate	read	null
2024-04-05	Assessing the quality of information extraction	Filip Seitl et.al.	2404.04068	translate	read	null
2024-04-05	CLUE: A Clinical Language Understanding Evaluation for LLMs	Amin Dada et.al.	2404.04067	translate	read	link
2024-04-04	CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching	Dongzhi Jiang et.al.	2404.03653	translate	read	link
2024-04-04	AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent	Hanyu Lai et.al.	2404.03648	translate	read	link
2024-04-04	Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra	Darioush Kevian et.al.	2404.03647	translate	read	null
2024-04-04	Training LLMs over Neurally Compressed Text	Brian Lester et.al.	2404.03626	translate	read	null
2024-04-04	Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph	Marco Bronzini et.al.	2404.03623	translate	read	link
2024-04-04	Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models	Wenshan Wu et.al.	2404.03622	translate	read	link
2024-04-04	DeViDe: Faceted medical knowledge for improved medical vision-language pre-training	Haozhe Luo et.al.	2404.03618	translate	read	null
2024-04-04	Sailor: Open Language Models for South-East Asia	Longxu Dou et.al.	2404.03608	translate	read	link
2024-04-04	Evaluating LLMs at Detecting Errors in LLM Responses	Ryo Kamoi et.al.	2404.03602	translate	read	link
2024-04-04	Intent Detection and Entity Extraction from BioMedical Literature	Ankan Mullick et.al.	2404.03598	translate	read	link
2024-04-03	ALOHa: A New Measure for Hallucination in Captioning Models	Suzanne Petryk et.al.	2404.02904	translate	read	null
2024-04-03	MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment	Duygu Ceylan et.al.	2404.02899	translate	read	null
2024-04-03	ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline	Yifan Xu et.al.	2404.02893	translate	read	link
2024-04-03	Integrating Explanations in Learning LTL Specifications from Demonstrations	Ashutosh Gupta et.al.	2404.02872	translate	read	null
2024-04-03	Toward Inference-optimal Mixture-of-Expert Large Language Models	Longfei Yun et.al.	2404.02852	translate	read	null
2024-04-03	I-Design: Personalized LLM Interior Designer	Ata Çelen et.al.	2404.02838	translate	read	null
2024-04-03	Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models	Wanyun Cui et.al.	2404.02837	translate	read	null
2024-04-03	Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison	Maxime Bouthors et.al.	2404.02835	translate	read	null
2024-04-03	Empowering Biomedical Discovery with AI Agents	Shanghua Gao et.al.	2404.02831	translate	read	null
2024-04-03	BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models	Qijun Luo et.al.	2404.02827	translate	read	link
2024-04-02	Topic-based Watermarks for LLM-Generated Text	Alexander Nemecek et.al.	2404.02138	translate	read	null
2024-04-02	Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models	Wanyong Feng et.al.	2404.02124	translate	read	null
2024-04-02	GINopic: Topic Modeling with Graph Isomorphism Network	Suman Adhya et.al.	2404.02115	translate	read	link
2024-04-02	CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems	Sara Rosenthal et.al.	2404.02103	translate	read	link
2024-04-02	Advancing LLM Reasoning Generalists with Preference Trees	Lifan Yuan et.al.	2404.02078	translate	read	link
2024-04-02	Digital Forgetting in Large Language Models: A Survey of Unlearning Methods	Alberto Blanco-Justicia et.al.	2404.02062	translate	read	null
2024-04-02	Long-context LLMs Struggle with Long In-context Learning	Tianle Li et.al.	2404.02060	translate	read	link
2024-04-02	Deconstructing In-Context Learning: Understanding Prompts via Corruption	Namrata Shivagunde et.al.	2404.02054	translate	read	link
2024-04-02	BERTopic-Driven Stock Market Predictions: Unraveling Sentiment Insights	Enmin Zhu et.al.	2404.02053	translate	read	null
2024-04-02	A Survey on Large Language Model-Based Game Agents	Sihao Hu et.al.	2404.02039	translate	read	link
2024-04-01	Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want	Weifeng Lin et.al.	2403.20271	translate	read	link

(<a href=../LLM.md>back to LLM</a>)