LLM - 2024-03 | Paper Arxiv Daily

LLM - 2024-03

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-03-29	Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models	Atsuyuki Miyai et.al.	2403.20331	translate	read	link
2024-03-29	Gecko: Versatile Text Embeddings Distilled from Large Language Models	Jinhyuk Lee et.al.	2403.20327	translate	read	null
2024-03-29	Convolutional Prompting meets Language Models for Continual Learning	Anurag Roy et.al.	2403.20317	translate	read	null
2024-03-29	Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference	Jovan Stojkovic et.al.	2403.20306	translate	read	null
2024-03-29	Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain	Burcu Sayin et.al.	2403.20288	translate	read	null
2024-03-29	LUQ: Long-text Uncertainty Quantification for LLMs	Caiqi Zhang et.al.	2403.20279	translate	read	null
2024-03-29	Latxa: An Open Language Model and Evaluation Suite for Basque	Julen Etxaniz et.al.	2403.20266	translate	read	link
2024-03-29	ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models	Thibaut Thonet et.al.	2403.20262	translate	read	null
2024-03-29	Using LLMs to Model the Beliefs and Preferences of Targeted Populations	Keiichi Namikoshi et.al.	2403.20252	translate	read	null
2024-03-28	InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction	Sirui Xu et.al.	2403.19652	translate	read	null
2024-03-28	MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Kai Zhang et.al.	2403.19651	translate	read	null
2024-03-28	Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning	Chenyang Liu et.al.	2403.19646	translate	read	link
2024-03-28	Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models	Yucheng Shi et.al.	2403.19631	translate	read	null
2024-03-28	Semantic Map-based Generation of Navigation Instructions	Chengzu Li et.al.	2403.19603	translate	read	link
2024-03-28	LocCa: Visual Pretraining with Location-aware Captioners	Bo Wan et.al.	2403.19596	translate	read	null
2024-03-28	Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation	Zhongliang Zhou et.al.	2403.19584	translate	read	null
2024-03-28	WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models	Piotr Molenda et.al.	2403.19548	translate	read	null
2024-03-28	LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae	Celia Chen et.al.	2403.19506	translate	read	null
2024-03-28	Evolving Assembly Code in an Adversarial Environment	Irina Maliukov et.al.	2403.19489	translate	read	null
2024-03-27	Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models	Yanwei Li et.al.	2403.18814	translate	read	link
2024-03-27	ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation	Suraj Patni et.al.	2403.18807	translate	read	link
2024-03-27	Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation	Mateusz Klimaszewski et.al.	2403.18804	translate	read	null
2024-03-27	Long-form factuality in large language models	Jerry Wei et.al.	2403.18802	translate	read	link
2024-03-27	3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation	Ehsan Latif et.al.	2403.18778	translate	read	null
2024-03-27	CheckEval: Robust Evaluation Framework using Large Language Model via Checklist	Yukyung Lee et.al.	2403.18771	translate	read	null
2024-03-27	MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model	Yike Wu et.al.	2403.18760	translate	read	null
2024-03-27	Understanding the Learning Dynamics of Alignment with Human Feedback	Shawn Im et.al.	2403.18742	translate	read	null
2024-03-27	PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations	Ehsan Latif et.al.	2403.18721	translate	read	null
2024-03-27	NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method	Jakub Hoscilowicz et.al.	2403.18680	translate	read	link
2024-03-26	MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution	Wei Tao et.al.	2403.17927	translate	read	null
2024-03-26	LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning	Rui Pan et.al.	2403.17919	translate	read	null
2024-03-26	Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach	Andrea Ferrario et.al.	2403.17873	translate	read	null
2024-03-26	Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications	Philip Lippmann et.al.	2403.17860	translate	read	null
2024-03-26	ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages	Bhawna Piryani et.al.	2403.17859	translate	read	link
2024-03-26	Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs	David R. Mortensen et.al.	2403.17856	translate	read	null
2024-03-26	ArabicaQA: A Comprehensive Dataset for Arabic Question Answering	Abdelrahman Abdallah et.al.	2403.17848	translate	read	link
2024-03-26	Assessment of Multimodal Large Language Models in Alignment with Human Values	Zhelun Shi et.al.	2403.17830	translate	read	null
2024-03-26	Accelerating Radio Spectrum Regulation Workflows with Large Language Models (LLMs)	Amir Ghasemi et.al.	2403.17819	translate	read	null
2024-03-26	Are Compressed Language Models Less Subgroup Robust?	Leonidas Gee et.al.	2403.17811	translate	read	link
2024-03-25	Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making	Shuai Ma et.al.	2403.16812	translate	read	null
2024-03-25	An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems	Hanqing Yang et.al.	2403.16809	translate	read	null
2024-03-25	Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback	Zhangqian Bi et.al.	2403.16792	translate	read	null
2024-03-25	All Artificial, Less Intelligence: GenAI through the Lens of Formal Verification	Deepak Narayan Gadde et.al.	2403.16750	translate	read	null
2024-03-25	Synapse: Learning Preferential Concepts from Visual Demonstrations	Sadanand Modak et.al.	2403.16689	translate	read	null
2024-03-25	Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography	Jiayue Zhang et.al.	2403.16687	translate	read	null
2024-03-25	ToXCL: A Unified Framework for Toxic Speech Detection and Explanation	Nhat M. Hoang et.al.	2403.16685	translate	read	link
2024-03-25	RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict	Yirong Zeng et.al.	2403.16662	translate	read	link
2024-03-25	Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT	Rohit Raju et.al.	2403.16655	translate	read	null
2024-03-25	CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment	Feiteng Fang et.al.	2403.16649	translate	read	null
2024-03-25	Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations	Fan Li et.al.	2403.16645	translate	read	null
2024-03-25	Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units	Biswesh Mohapatra et.al.	2403.16609	translate	read	null
2024-03-25	TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques	Ashok Urlana et.al.	2403.16592	translate	read	null
2024-03-25	Can Large Language Models (or Humans) Distill Text?	Nicolas Audinet de Pieuchon et.al.	2403.16584	translate	read	null
2024-03-22	LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models	Yuzhang Shang et.al.	2403.15388	translate	read	null
2024-03-22	Long-CLIP: Unlocking the Long-Text Capability of CLIP	Beichen Zhang et.al.	2403.15378	translate	read	null
2024-03-22	Can large language models explore in-context?	Akshay Krishnamurthy et.al.	2403.15371	translate	read	null
2024-03-22	CoLLEGe: Concept Embedding Generation for Large Language Models	Ryan Teehan et.al.	2403.15362	translate	read	null
2024-03-22	Multi-Review Fusion-in-Context	Aviv Slobodkin et.al.	2403.15351	translate	read	null
2024-03-22	CO-Fun: A German Dataset on Company Outsourcing in Fund Prospectuses for Named Entity Recognition and Relation Extraction	Neda Foroutan et.al.	2403.15322	translate	read	null
2024-03-22	Sphere Neural-Networks for Rational Reasoning	Tiansi Dong et.al.	2403.15297	translate	read	null
2024-03-22	Measuring Gender and Racial Biases in Large Language Models	Jiafu An et.al.	2403.15281	translate	read	null
2024-03-22	Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review	Jinge Wang et.al.	2403.15274	translate	read	null
2024-03-22	Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs	Xiaobin Zhang et.al.	2403.15273	translate	read	null
2024-03-21	MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?	Renrui Zhang et.al.	2403.14624	translate	read	null
2024-03-21	Language Repository for Long Video Understanding	Kumara Kahatapitiya et.al.	2403.14622	translate	read	link
2024-03-21	Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey	Zeyu Han et.al.	2403.14608	translate	read	null
2024-03-21	MyVLM: Personalizing VLMs for User-Specific Queries	Yuval Alaluf et.al.	2403.14599	translate	read	null
2024-03-21	Large Language Models for Multi-Choice Question Classification of Medical Subjects	Víctor Ponce-López et.al.	2403.14582	translate	read	null
2024-03-21	RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain	William James Bolton et.al.	2403.14578	translate	read	link
2024-03-21	A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students’ Formative Assessment Responses in Science	Clayton Cohn et.al.	2403.14565	translate	read	null
2024-03-21	EDT: Improving Large Language Models’ Generation by Entropy-based Dynamic Temperature Sampling	Shimao Zhang et.al.	2403.14541	translate	read	null
2024-03-21	Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference	Han Zhao et.al.	2403.14520	translate	read	null
2024-03-21	The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs)	Joschka Haltaufderheide et.al.	2403.14473	translate	read	null
2024-03-20	RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition	Ziyu Liu et.al.	2403.13805	translate	read	null
2024-03-20	Learning from Models and Data for Visual Grounding	Ruozhen He et.al.	2403.13804	translate	read	null
2024-03-20	Reverse Training to Nurse the Reversal Curse	Olga Golovneva et.al.	2403.13799	translate	read	null
2024-03-20	Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts	Guangzeng Han et.al.	2403.13786	translate	read	null
2024-03-20	Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval	Aymene Berriche et.al.	2403.13747	translate	read	null
2024-03-20	EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation	Atnafu Lambebo Tonja et.al.	2403.13737	translate	read	null
2024-03-20	Large Language Models meet Network Slicing Management and Orchestration	Abdulhalim Dandoush et.al.	2403.13721	translate	read	null
2024-03-20	RoleInteract: Evaluating the Social Interaction of Role-Playing Agents	Hongzhan Chen et.al.	2403.13679	translate	read	null
2024-03-20	Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese	Meet Doshi et.al.	2403.13638	translate	read	null
2024-03-20	VL-Mamba: Exploring State Space Models for Multimodal Learning	Yanyuan Qiao et.al.	2403.13600	translate	read	null
2024-03-19	Dated Data: Tracing Knowledge Cutoffs in Large Language Models	Jeffrey Cheng et.al.	2403.12958	translate	read	null
2024-03-19	Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models	Joana Ribeiro de Faria et.al.	2403.12936	translate	read	null
2024-03-19	Rapid AIdeation: Generating Ideas With the Self and in Collaboration With Large Language Models	Gionnieve Lim et.al.	2403.12928	translate	read	null
2024-03-19	Supporting Energy Policy Research with Large Language Models	Grant Buster et.al.	2403.12924	translate	read	null
2024-03-19	Semantic Layering in Room Segmentation via LLMs	Taehyeon Kim et.al.	2403.12920	translate	read	null
2024-03-19	Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference	Baolin Li et.al.	2403.12900	translate	read	null
2024-03-19	mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding	Anwen Hu et.al.	2403.12895	translate	read	link
2024-03-19	MEDBind: Unifying Language and Multimodal Medical Data Embeddings	Yuan Gao et.al.	2403.12894	translate	read	null
2024-03-19	HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning	Fucai Ke et.al.	2403.12884	translate	read	null
2024-03-19	Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models	Zehui Chen et.al.	2403.12881	translate	read	link
2024-03-18	HDLdebugger: Streamlining HDL debugging with Large Language Models	Xufeng Yao et.al.	2403.11671	translate	read	null
2024-03-18	Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model	Haoyun Xu et.al.	2403.11621	translate	read	null
2024-03-18	Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines	Ekaterina Trofimova et.al.	2403.11585	translate	read	null
2024-03-18	Reinforcement Learning with Token-level Feedback for Controllable Text Generation	Wendi Li et.al.	2403.11558	translate	read	null
2024-03-18	LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning	Shu Wang et.al.	2403.11552	translate	read	link
2024-03-18	TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling	Weiran Chen et.al.	2403.11550	translate	read	null
2024-03-18	DEE: Dual-stage Explainable Evaluation Method for Text Generation	Shenyu Zhang et.al.	2403.11509	translate	read	null
2024-03-18	Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis	Vishnu Sashank Dorbala et.al.	2403.11487	translate	read	null
2024-03-18	VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding	Yue Fan et.al.	2403.11481	translate	read	null
2024-03-18	HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models	Huy Nghiem et.al.	2403.11456	translate	read	link
2024-03-14	Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference	Piotr Nawrot et.al.	2403.09636	translate	read	null
2024-03-14	3D-VLA: A 3D Vision-Language-Action Generative World Model	Haoyu Zhen et.al.	2403.09631	translate	read	null
2024-03-14	MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training	Brandon McKinzie et.al.	2403.09611	translate	read	null
2024-03-14	Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey	Xiaoyu Liu et.al.	2403.09606	translate	read	null
2024-03-14	Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis	Gregory Coppola et.al.	2403.09599	translate	read	null
2024-03-14	ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models	Runyu Ma et.al.	2403.09583	translate	read	null
2024-03-14	Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation	Yunhao Gou et.al.	2403.09572	translate	read	null
2024-03-14	Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models	Laura Fernández-Becerra et.al.	2403.09567	translate	read	null
2024-03-14	Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models	Ali Nouri et.al.	2403.09565	translate	read	null
2024-03-14	Less is More: Data Value Estimation for Visual Instruction Tuning	Zikang Liu et.al.	2403.09559	translate	read	null
2024-03-13	Simple and Scalable Strategies to Continually Pre-train Large Language Models	Adam Ibrahim et.al.	2403.08763	translate	read	null
2024-03-13	Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework	Jingling Li et.al.	2403.08743	translate	read	null
2024-03-13	The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models	Carlo Nicolini et.al.	2403.08739	translate	read	null
2024-03-13	Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization	Renjie Pi et.al.	2403.08730	translate	read	null
2024-03-14	SOTOPIA- $π$ : Interactive Learning of Socially Intelligent Language Agents	Ruiyi Wang et.al.	2403.08715	translate	read	link
2024-03-13	Review of Generative AI Methods in Cybersecurity	Yagmur Yigit et.al.	2403.08701	translate	read	null
2024-03-13	TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning	Shangding Gu et.al.	2403.08694	translate	read	null
2024-03-13	Token Alignment via Character Matching for Subword Completion	Ben Athiwaratkun et.al.	2403.08688	translate	read	null
2024-03-13	Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records	Erlend Frayling et.al.	2403.08664	translate	read	null
2024-03-13	Human Alignment of Large Language Models through Online Preference Optimisation	Daniele Calandriello et.al.	2403.08635	translate	read	null
2024-03-12	Beyond Text: Frozen Large Language Models in Visual Signal Comprehension	Lei Zhu et.al.	2403.07874	translate	read	link
2024-03-12	Rethinking Generative Large Language Model Evaluation for Semantic Comprehension	Fangyun Wei et.al.	2403.07872	translate	read	null
2024-03-12	Exploring Safety Generalization Challenges of Large Language Models via Code	Qibing Ren et.al.	2403.07865	translate	read	null
2024-03-12	DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies	William Xie et.al.	2403.07832	translate	read	null
2024-03-12	The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing	Jianchen Wang et.al.	2403.07825	translate	read	null
2024-03-12	Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Sainbayar Sukhbaatar et.al.	2403.07816	translate	read	null
2024-03-12	Fine-tuning Large Language Models with Sequential Instructions	Hanxu Hu et.al.	2403.07794	translate	read	link
2024-03-12	Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations	Carlos Jose Xavier Cruz et.al.	2403.07769	translate	read	link
2024-03-12	Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings	Sahand Sharifzadeh et.al.	2403.07750	translate	read	null
2024-03-12	FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models	Yan Liu et.al.	2403.07747	translate	read	null
2024-03-11	Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena	Leonie Weissweiler et.al.	2403.06965	translate	read	null
2024-03-11	Materials science in the era of large language models: a perspective	Ge Lei et.al.	2403.06949	translate	read	null
2024-03-11	Naming, Describing, and Quantifying Visual Objects in Humans and LLMs	Alberto Testoni et.al.	2403.06935	translate	read	null
2024-03-11	ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis	Yanming Liu et.al.	2403.06932	translate	read	link
2024-03-11	MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning	Yichuan Li et.al.	2403.06914	translate	read	null
2024-03-11	Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents	Nishchal Prasad et.al.	2403.06872	translate	read	null
2024-03-11	Development of a Reliable and Accessible Caregiving Language Model (CaLM)	Bambang Parmanto et.al.	2403.06857	translate	read	null
2024-03-11	DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation	Guosheng Zhao et.al.	2403.06845	translate	read	null
2024-03-11	RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback	Yanming Liu et.al.	2403.06840	translate	read	link
2024-03-11	ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts	Lyuye Zhang et.al.	2403.06838	translate	read	null
2024-03-08	Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	Machel Reid et.al.	2403.05530	translate	read	null
2024-03-08	GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM	Hao Kang et.al.	2403.05527	translate	read	link
2024-03-08	Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola	Yijiang Li et.al.	2403.05523	translate	read	null
2024-03-08	Will GPT-4 Run DOOM?	Adrian de Wynter et.al.	2403.05468	translate	read	null
2024-03-08	Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs	Arijit Nag et.al.	2403.05434	translate	read	null
2024-03-08	Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings	Wei Zhou et.al.	2403.05338	translate	read	null
2024-03-08	ChatASU: Evoking LLM’s Reflexion to Truly Understand Aspect Sentiment in Dialogues	Yiding Liu et.al.	2403.05326	translate	read	null
2024-03-08	RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation	Zihao Wang et.al.	2403.05313	translate	read	null
2024-03-08	Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents	Jinyang Li et.al.	2403.05307	translate	read	null
2024-03-08	ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications	Sotaro Takeshita et.al.	2403.05303	translate	read	link
2024-03-07	Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Yifan Wang et.al.	2403.04765	translate	read	null
2024-03-07	iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries	Adam Coscia et.al.	2403.04760	translate	read	link
2024-03-07	KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts	Adam Coscia et.al.	2403.04758	translate	read	link
2024-03-07	LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error	Boshi Wang et.al.	2403.04746	translate	read	link
2024-03-07	SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM	Jielin Qiu et.al.	2403.04735	translate	read	null
2024-03-07	ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes	Hashmat Shadab Malik et.al.	2403.04701	translate	read	null
2024-03-07	Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification	Ekaterina Fadeeva et.al.	2403.04696	translate	read	null
2024-03-07	PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation	Junsong Chen et.al.	2403.04692	translate	read	null
2024-03-07	Telecom Language Models: Must They Be Large?	Nicola Piovesan et.al.	2403.04666	translate	read	null
2024-03-07	QAQ: Quality Adaptive Quantization for LLM KV Cache	Shichen Dong et.al.	2403.04643	translate	read	link
2024-03-06	Bridging Language and Items for Retrieval and Recommendation	Yupeng Hou et.al.	2403.03952	translate	read	link
2024-03-06	Did Translation Models Get More Robust Without Anyone Even Noticing?	Ben Peters et.al.	2403.03923	translate	read	null
2024-03-06	Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing	Asmita et.al.	2403.03897	translate	read	null
2024-03-06	SaulLM-7B: A pioneering Large Language Model for Law	Pierre Colombo et.al.	2403.03883	translate	read	null
2024-03-06	Learning to Decode Collaboratively with Multiple Language Models	Shannon Zejiang Shen et.al.	2403.03870	translate	read	link
2024-03-06	On the Origins of Linear Representations in Large Language Models	Yibo Jiang et.al.	2403.03867	translate	read	null
2024-03-06	KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions	Fangyuan Xu et.al.	2403.03866	translate	read	null
2024-03-06	Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning	Deepanway Ghosal et.al.	2403.03864	translate	read	link
2024-03-06	X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification	Hanzi Xu et.al.	2403.03863	translate	read	link
2024-03-06	Emojinize : Enriching Any Text with Emoji Translations	Lars Henning Klein et.al.	2403.03857	translate	read	null
2024-03-05	The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning	Nathaniel Li et.al.	2403.03218	translate	read	null
2024-03-05	CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments	Savitha Sam Abraham et.al.	2403.03203	translate	read	null
2024-03-05	Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement	Rafaela Martelo et.al.	2403.03188	translate	read	link
2024-03-05	MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting	Fangchen Liu et.al.	2403.03174	translate	read	null
2024-03-05	SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection	Peng Qi et.al.	2403.03170	translate	read	null
2024-03-05	PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset	Arda Uzunoğlu et.al.	2403.03167	translate	read	link
2024-03-05	Quantum Many-Body Physics Calculations with Large Language Models	Haining Pan et.al.	2403.03154	translate	read	null
2024-03-05	Language Guided Exploration for RL Agents in Text Environments	Hitesh Golchha et.al.	2403.03141	translate	read	null
2024-03-05	Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution	Flor Miriam Plaza-del-Arco et.al.	2403.03121	translate	read	null
2024-03-05	“In Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning	Chuanqi Cheng et.al.	2403.03102	translate	read	null
2024-03-02	LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems	Tasnim Ahmed et.al.	2403.01342	translate	read	null
2024-03-02	Chaining thoughts and LLMs to learn DNA structural biophysics	Tyler D. Ross et.al.	2403.01332	translate	read	null
2024-03-02	VNLP: Turkish NLP Package	Meliksah Turker et.al.	2403.01309	translate	read	null
2024-03-02	VBART: The Turkish LLM	Meliksah Turker et.al.	2403.01308	translate	read	null
2024-03-02	ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation	Moran Yanuka et.al.	2403.01306	translate	read	null
2024-03-02	Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Alexander Scarlatos et.al.	2403.01304	translate	read	link
2024-03-02	NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention	Tianyi Zhang et.al.	2403.01273	translate	read	null
2024-03-02	Employing LLMs for Incident Response Planning and Review	Sam Hays et.al.	2403.01271	translate	read	null
2024-03-02	A comprehensive cross-language framework for harmful content detection with the aid of sentiment analysis	Mohammad Dehghani et.al.	2403.01270	translate	read	null
2024-03-02	Dissecting Language Models: Machine Unlearning via Selective Pruning	Nicholas Pochinkov et.al.	2403.01267	translate	read	null

(<a href=../LLM.md>back to LLM</a>)