LLM - 2024-03

Publish Date Title Authors PDF Translate Read Code
2024-03-29 Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Atsuyuki Miyai et.al. 2403.20331 translate read link
2024-03-29 Gecko: Versatile Text Embeddings Distilled from Large Language Models Jinhyuk Lee et.al. 2403.20327 translate read null
2024-03-29 Convolutional Prompting meets Language Models for Continual Learning Anurag Roy et.al. 2403.20317 translate read null
2024-03-29 Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference Jovan Stojkovic et.al. 2403.20306 translate read null
2024-03-29 Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain Burcu Sayin et.al. 2403.20288 translate read null
2024-03-29 LUQ: Long-text Uncertainty Quantification for LLMs Caiqi Zhang et.al. 2403.20279 translate read null
2024-03-29 Latxa: An Open Language Model and Evaluation Suite for Basque Julen Etxaniz et.al. 2403.20266 translate read link
2024-03-29 ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Thibaut Thonet et.al. 2403.20262 translate read null
2024-03-29 Using LLMs to Model the Beliefs and Preferences of Targeted Populations Keiichi Namikoshi et.al. 2403.20252 translate read null
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 translate read null
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 translate read null
2024-03-28 Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning Chenyang Liu et.al. 2403.19646 translate read link
2024-03-28 Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models Yucheng Shi et.al. 2403.19631 translate read null
2024-03-28 Semantic Map-based Generation of Navigation Instructions Chengzu Li et.al. 2403.19603 translate read link
2024-03-28 LocCa: Visual Pretraining with Location-aware Captioners Bo Wan et.al. 2403.19596 translate read null
2024-03-28 Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation Zhongliang Zhou et.al. 2403.19584 translate read null
2024-03-28 WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models Piotr Molenda et.al. 2403.19548 translate read null
2024-03-28 LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae Celia Chen et.al. 2403.19506 translate read null
2024-03-28 Evolving Assembly Code in an Adversarial Environment Irina Maliukov et.al. 2403.19489 translate read null
2024-03-27 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Yanwei Li et.al. 2403.18814 translate read link
2024-03-27 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807 translate read link
2024-03-27 Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation Mateusz Klimaszewski et.al. 2403.18804 translate read null
2024-03-27 Long-form factuality in large language models Jerry Wei et.al. 2403.18802 translate read link
2024-03-27 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation Ehsan Latif et.al. 2403.18778 translate read null
2024-03-27 CheckEval: Robust Evaluation Framework using Large Language Model via Checklist Yukyung Lee et.al. 2403.18771 translate read null
2024-03-27 MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model Yike Wu et.al. 2403.18760 translate read null
2024-03-27 Understanding the Learning Dynamics of Alignment with Human Feedback Shawn Im et.al. 2403.18742 translate read null
2024-03-27 PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations Ehsan Latif et.al. 2403.18721 translate read null
2024-03-27 NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method Jakub Hoscilowicz et.al. 2403.18680 translate read link
2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao et.al. 2403.17927 translate read null
2024-03-26 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Rui Pan et.al. 2403.17919 translate read null
2024-03-26 Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach Andrea Ferrario et.al. 2403.17873 translate read null
2024-03-26 Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications Philip Lippmann et.al. 2403.17860 translate read null
2024-03-26 ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Bhawna Piryani et.al. 2403.17859 translate read link
2024-03-26 Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs David R. Mortensen et.al. 2403.17856 translate read null
2024-03-26 ArabicaQA: A Comprehensive Dataset for Arabic Question Answering Abdelrahman Abdallah et.al. 2403.17848 translate read link
2024-03-26 Assessment of Multimodal Large Language Models in Alignment with Human Values Zhelun Shi et.al. 2403.17830 translate read null
2024-03-26 Accelerating Radio Spectrum Regulation Workflows with Large Language Models (LLMs) Amir Ghasemi et.al. 2403.17819 translate read null
2024-03-26 Are Compressed Language Models Less Subgroup Robust? Leonidas Gee et.al. 2403.17811 translate read link
2024-03-25 Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making Shuai Ma et.al. 2403.16812 translate read null
2024-03-25 An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems Hanqing Yang et.al. 2403.16809 translate read null
2024-03-25 Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback Zhangqian Bi et.al. 2403.16792 translate read null
2024-03-25 All Artificial, Less Intelligence: GenAI through the Lens of Formal Verification Deepak Narayan Gadde et.al. 2403.16750 translate read null
2024-03-25 Synapse: Learning Preferential Concepts from Visual Demonstrations Sadanand Modak et.al. 2403.16689 translate read null
2024-03-25 Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography Jiayue Zhang et.al. 2403.16687 translate read null
2024-03-25 ToXCL: A Unified Framework for Toxic Speech Detection and Explanation Nhat M. Hoang et.al. 2403.16685 translate read link
2024-03-25 RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict Yirong Zeng et.al. 2403.16662 translate read link
2024-03-25 Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT Rohit Raju et.al. 2403.16655 translate read null
2024-03-25 CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment Feiteng Fang et.al. 2403.16649 translate read null
2024-03-25 Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations Fan Li et.al. 2403.16645 translate read null
2024-03-25 Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units Biswesh Mohapatra et.al. 2403.16609 translate read null
2024-03-25 TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques Ashok Urlana et.al. 2403.16592 translate read null
2024-03-25 Can Large Language Models (or Humans) Distill Text? Nicolas Audinet de Pieuchon et.al. 2403.16584 translate read null
2024-03-22 LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Yuzhang Shang et.al. 2403.15388 translate read null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 translate read null
2024-03-22 Can large language models explore in-context? Akshay Krishnamurthy et.al. 2403.15371 translate read null
2024-03-22 CoLLEGe: Concept Embedding Generation for Large Language Models Ryan Teehan et.al. 2403.15362 translate read null
2024-03-22 Multi-Review Fusion-in-Context Aviv Slobodkin et.al. 2403.15351 translate read null
2024-03-22 CO-Fun: A German Dataset on Company Outsourcing in Fund Prospectuses for Named Entity Recognition and Relation Extraction Neda Foroutan et.al. 2403.15322 translate read null
2024-03-22 Sphere Neural-Networks for Rational Reasoning Tiansi Dong et.al. 2403.15297 translate read null
2024-03-22 Measuring Gender and Racial Biases in Large Language Models Jiafu An et.al. 2403.15281 translate read null
2024-03-22 Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review Jinge Wang et.al. 2403.15274 translate read null
2024-03-22 Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs Xiaobin Zhang et.al. 2403.15273 translate read null
2024-03-21 MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Renrui Zhang et.al. 2403.14624 translate read null
2024-03-21 Language Repository for Long Video Understanding Kumara Kahatapitiya et.al. 2403.14622 translate read link
2024-03-21 Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Zeyu Han et.al. 2403.14608 translate read null
2024-03-21 MyVLM: Personalizing VLMs for User-Specific Queries Yuval Alaluf et.al. 2403.14599 translate read null
2024-03-21 Large Language Models for Multi-Choice Question Classification of Medical Subjects Víctor Ponce-López et.al. 2403.14582 translate read null
2024-03-21 RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain William James Bolton et.al. 2403.14578 translate read link
2024-03-21 A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students’ Formative Assessment Responses in Science Clayton Cohn et.al. 2403.14565 translate read null
2024-03-21 EDT: Improving Large Language Models’ Generation by Entropy-based Dynamic Temperature Sampling Shimao Zhang et.al. 2403.14541 translate read null
2024-03-21 Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference Han Zhao et.al. 2403.14520 translate read null
2024-03-21 The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs) Joschka Haltaufderheide et.al. 2403.14473 translate read null
2024-03-20 RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition Ziyu Liu et.al. 2403.13805 translate read null
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804 translate read null
2024-03-20 Reverse Training to Nurse the Reversal Curse Olga Golovneva et.al. 2403.13799 translate read null
2024-03-20 Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts Guangzeng Han et.al. 2403.13786 translate read null
2024-03-20 Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval Aymene Berriche et.al. 2403.13747 translate read null
2024-03-20 EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation Atnafu Lambebo Tonja et.al. 2403.13737 translate read null
2024-03-20 Large Language Models meet Network Slicing Management and Orchestration Abdulhalim Dandoush et.al. 2403.13721 translate read null
2024-03-20 RoleInteract: Evaluating the Social Interaction of Role-Playing Agents Hongzhan Chen et.al. 2403.13679 translate read null
2024-03-20 Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese Meet Doshi et.al. 2403.13638 translate read null
2024-03-20 VL-Mamba: Exploring State Space Models for Multimodal Learning Yanyuan Qiao et.al. 2403.13600 translate read null
2024-03-19 Dated Data: Tracing Knowledge Cutoffs in Large Language Models Jeffrey Cheng et.al. 2403.12958 translate read null
2024-03-19 Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models Joana Ribeiro de Faria et.al. 2403.12936 translate read null
2024-03-19 Rapid AIdeation: Generating Ideas With the Self and in Collaboration With Large Language Models Gionnieve Lim et.al. 2403.12928 translate read null
2024-03-19 Supporting Energy Policy Research with Large Language Models Grant Buster et.al. 2403.12924 translate read null
2024-03-19 Semantic Layering in Room Segmentation via LLMs Taehyeon Kim et.al. 2403.12920 translate read null
2024-03-19 Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference Baolin Li et.al. 2403.12900 translate read null
2024-03-19 mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Anwen Hu et.al. 2403.12895 translate read link
2024-03-19 MEDBind: Unifying Language and Multimodal Medical Data Embeddings Yuan Gao et.al. 2403.12894 translate read null
2024-03-19 HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning Fucai Ke et.al. 2403.12884 translate read null
2024-03-19 Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models Zehui Chen et.al. 2403.12881 translate read link
2024-03-18 HDLdebugger: Streamlining HDL debugging with Large Language Models Xufeng Yao et.al. 2403.11671 translate read null
2024-03-18 Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model Haoyun Xu et.al. 2403.11621 translate read null
2024-03-18 Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines Ekaterina Trofimova et.al. 2403.11585 translate read null
2024-03-18 Reinforcement Learning with Token-level Feedback for Controllable Text Generation Wendi Li et.al. 2403.11558 translate read null
2024-03-18 LLM^3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning Shu Wang et.al. 2403.11552 translate read link
2024-03-18 TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling Weiran Chen et.al. 2403.11550 translate read null
2024-03-18 DEE: Dual-stage Explainable Evaluation Method for Text Generation Shenyu Zhang et.al. 2403.11509 translate read null
2024-03-18 Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis Vishnu Sashank Dorbala et.al. 2403.11487 translate read null
2024-03-18 VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding Yue Fan et.al. 2403.11481 translate read null
2024-03-18 HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models Huy Nghiem et.al. 2403.11456 translate read link
2024-03-14 Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference Piotr Nawrot et.al. 2403.09636 translate read null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631 translate read null
2024-03-14 MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Brandon McKinzie et.al. 2403.09611 translate read null
2024-03-14 Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey Xiaoyu Liu et.al. 2403.09606 translate read null
2024-03-14 Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis Gregory Coppola et.al. 2403.09599 translate read null
2024-03-14 ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models Runyu Ma et.al. 2403.09583 translate read null
2024-03-14 Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation Yunhao Gou et.al. 2403.09572 translate read null
2024-03-14 Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models Laura Fernández-Becerra et.al. 2403.09567 translate read null
2024-03-14 Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models Ali Nouri et.al. 2403.09565 translate read null
2024-03-14 Less is More: Data Value Estimation for Visual Instruction Tuning Zikang Liu et.al. 2403.09559 translate read null
2024-03-13 Simple and Scalable Strategies to Continually Pre-train Large Language Models Adam Ibrahim et.al. 2403.08763 translate read null
2024-03-13 Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework Jingling Li et.al. 2403.08743 translate read null
2024-03-13 The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models Carlo Nicolini et.al. 2403.08739 translate read null
2024-03-13 Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization Renjie Pi et.al. 2403.08730 translate read null
2024-03-14 SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents Ruiyi Wang et.al. 2403.08715 translate read link
2024-03-13 Review of Generative AI Methods in Cybersecurity Yagmur Yigit et.al. 2403.08701 translate read null
2024-03-13 TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning Shangding Gu et.al. 2403.08694 translate read null
2024-03-13 Token Alignment via Character Matching for Subword Completion Ben Athiwaratkun et.al. 2403.08688 translate read null
2024-03-13 Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records Erlend Frayling et.al. 2403.08664 translate read null
2024-03-13 Human Alignment of Large Language Models through Online Preference Optimisation Daniele Calandriello et.al. 2403.08635 translate read null
2024-03-12 Beyond Text: Frozen Large Language Models in Visual Signal Comprehension Lei Zhu et.al. 2403.07874 translate read link
2024-03-12 Rethinking Generative Large Language Model Evaluation for Semantic Comprehension Fangyun Wei et.al. 2403.07872 translate read null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865 translate read null
2024-03-12 DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies William Xie et.al. 2403.07832 translate read null
2024-03-12 The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing Jianchen Wang et.al. 2403.07825 translate read null
2024-03-12 Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Sainbayar Sukhbaatar et.al. 2403.07816 translate read null
2024-03-12 Fine-tuning Large Language Models with Sequential Instructions Hanxu Hu et.al. 2403.07794 translate read link
2024-03-12 Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations Carlos Jose Xavier Cruz et.al. 2403.07769 translate read link
2024-03-12 Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings Sahand Sharifzadeh et.al. 2403.07750 translate read null
2024-03-12 FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models Yan Liu et.al. 2403.07747 translate read null
2024-03-11 Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena Leonie Weissweiler et.al. 2403.06965 translate read null
2024-03-11 Materials science in the era of large language models: a perspective Ge Lei et.al. 2403.06949 translate read null
2024-03-11 Naming, Describing, and Quantifying Visual Objects in Humans and LLMs Alberto Testoni et.al. 2403.06935 translate read null
2024-03-11 ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Yanming Liu et.al. 2403.06932 translate read link
2024-03-11 MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning Yichuan Li et.al. 2403.06914 translate read null
2024-03-11 Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents Nishchal Prasad et.al. 2403.06872 translate read null
2024-03-11 Development of a Reliable and Accessible Caregiving Language Model (CaLM) Bambang Parmanto et.al. 2403.06857 translate read null
2024-03-11 DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation Guosheng Zhao et.al. 2403.06845 translate read null
2024-03-11 RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback Yanming Liu et.al. 2403.06840 translate read link
2024-03-11 ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts Lyuye Zhang et.al. 2403.06838 translate read null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530 translate read null
2024-03-08 GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM Hao Kang et.al. 2403.05527 translate read link
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523 translate read null
2024-03-08 Will GPT-4 Run DOOM? Adrian de Wynter et.al. 2403.05468 translate read null
2024-03-08 Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs Arijit Nag et.al. 2403.05434 translate read null
2024-03-08 Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings Wei Zhou et.al. 2403.05338 translate read null
2024-03-08 ChatASU: Evoking LLM’s Reflexion to Truly Understand Aspect Sentiment in Dialogues Yiding Liu et.al. 2403.05326 translate read null
2024-03-08 RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation Zihao Wang et.al. 2403.05313 translate read null
2024-03-08 Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents Jinyang Li et.al. 2403.05307 translate read null
2024-03-08 ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications Sotaro Takeshita et.al. 2403.05303 translate read link
2024-03-07 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 translate read null
2024-03-07 iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries Adam Coscia et.al. 2403.04760 translate read link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758 translate read link
2024-03-07 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error Boshi Wang et.al. 2403.04746 translate read link
2024-03-07 SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM Jielin Qiu et.al. 2403.04735 translate read null
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701 translate read null
2024-03-07 Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification Ekaterina Fadeeva et.al. 2403.04696 translate read null
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 translate read null
2024-03-07 Telecom Language Models: Must They Be Large? Nicola Piovesan et.al. 2403.04666 translate read null
2024-03-07 QAQ: Quality Adaptive Quantization for LLM KV Cache Shichen Dong et.al. 2403.04643 translate read link
2024-03-06 Bridging Language and Items for Retrieval and Recommendation Yupeng Hou et.al. 2403.03952 translate read link
2024-03-06 Did Translation Models Get More Robust Without Anyone Even Noticing? Ben Peters et.al. 2403.03923 translate read null
2024-03-06 Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing Asmita et.al. 2403.03897 translate read null
2024-03-06 SaulLM-7B: A pioneering Large Language Model for Law Pierre Colombo et.al. 2403.03883 translate read null
2024-03-06 Learning to Decode Collaboratively with Multiple Language Models Shannon Zejiang Shen et.al. 2403.03870 translate read link
2024-03-06 On the Origins of Linear Representations in Large Language Models Yibo Jiang et.al. 2403.03867 translate read null
2024-03-06 KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions Fangyuan Xu et.al. 2403.03866 translate read null
2024-03-06 Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning Deepanway Ghosal et.al. 2403.03864 translate read link
2024-03-06 X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification Hanzi Xu et.al. 2403.03863 translate read link
2024-03-06 Emojinize: Enriching Any Text with Emoji Translations Lars Henning Klein et.al. 2403.03857 translate read null
2024-03-05 The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Nathaniel Li et.al. 2403.03218 translate read null
2024-03-05 CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Savitha Sam Abraham et.al. 2403.03203 translate read null
2024-03-05 Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement Rafaela Martelo et.al. 2403.03188 translate read link
2024-03-05 MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting Fangchen Liu et.al. 2403.03174 translate read null
2024-03-05 SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection Peng Qi et.al. 2403.03170 translate read null
2024-03-05 PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset Arda Uzunoğlu et.al. 2403.03167 translate read link
2024-03-05 Quantum Many-Body Physics Calculations with Large Language Models Haining Pan et.al. 2403.03154 translate read null
2024-03-05 Language Guided Exploration for RL Agents in Text Environments Hitesh Golchha et.al. 2403.03141 translate read null
2024-03-05 Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution Flor Miriam Plaza-del-Arco et.al. 2403.03121 translate read null
2024-03-05 “In Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning Chuanqi Cheng et.al. 2403.03102 translate read null
2024-03-02 LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems Tasnim Ahmed et.al. 2403.01342 translate read null
2024-03-02 Chaining thoughts and LLMs to learn DNA structural biophysics Tyler D. Ross et.al. 2403.01332 translate read null
2024-03-02 VNLP: Turkish NLP Package Meliksah Turker et.al. 2403.01309 translate read null
2024-03-02 VBART: The Turkish LLM Meliksah Turker et.al. 2403.01308 translate read null
2024-03-02 ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation Moran Yanuka et.al. 2403.01306 translate read null
2024-03-02 Improving the Validity of Automatically Generated Feedback via Reinforcement Learning Alexander Scarlatos et.al. 2403.01304 translate read link
2024-03-02 NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention Tianyi Zhang et.al. 2403.01273 translate read null
2024-03-02 Employing LLMs for Incident Response Planning and Review Sam Hays et.al. 2403.01271 translate read null
2024-03-02 A comprehensive cross-language framework for harmful content detection with the aid of sentiment analysis Mohammad Dehghani et.al. 2403.01270 translate read null
2024-03-02 Dissecting Language Models: Machine Unlearning via Selective Pruning Nicholas Pochinkov et.al. 2403.01267 translate read null
