LLM - 2024-12 | Paper Arxiv Daily

Publish Date	Title	Authors	PDF (arXiv ID)	Translate	Read	Code
2024-12-30	Distributed Mixture-of-Agents for Edge Inference with Large Language Models	Purbesh Mitra et.al.	2412.21200	translate	read	link
2024-12-31	HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation	Zhaojian Yu et.al.	2412.21199	translate	read	link
2024-12-30	Facilitating large language model Russian adaptation with Learned Embedding Propagation	Mikhail Tikhomirov et.al.	2412.21140	translate	read	link
2024-12-30	ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation	Ruixuan Liu et.al.	2412.21123	translate	read	null
2024-12-30	Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense	Yuyang Zhou et.al.	2412.21051	translate	read	link
2024-12-30	TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Chia-Yu Hung et.al.	2412.21037	translate	read	link
2024-12-30	GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models	Shangyu Xing et.al.	2412.21036	translate	read	null
2024-12-30	Automated Robustness Testing for LLM-based NLP Software	Mingxuan Xiao et.al.	2412.21016	translate	read	link
2024-12-30	MapQaTor: A System for Efficient Annotation of Map Query Datasets	Mahir Labib Dihan et.al.	2412.21015	translate	read	link
2024-12-31	Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria	Joonwon Jang et.al.	2412.21006	translate	read	null
2024-12-27	Can AI Help with Your Personal Finances?	Oudom Hean et.al.	2412.19784	translate	read	null
2024-12-27	Machine Learning for Sentiment Analysis of Imported Food in Trinidad and Tobago	Cassandra Daniels et.al.	2412.19781	translate	read	null
2024-12-27	Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration	Le Chen et.al.	2412.19770	translate	read	link
2024-12-27	Can Large Language Models Adapt to Other Agents In-Context?	Matthew Riemer et.al.	2412.19726	translate	read	null
2024-12-27	Text2Insight: Transform natural language text into insights seamlessly using multi-model architecture	Pradeep Sain et.al.	2412.19718	translate	read	null
2024-12-27	Toward Adaptive Reasoning in Large Language Models with Thought Rollback	Sijia Chen et.al.	2412.19707	translate	read	link
2024-12-27	A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization	Jingchun Lian et.al.	2412.19685	translate	read	null
2024-12-27	Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework	Jiang Liu et.al.	2412.19684	translate	read	null
2024-12-27	CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs	Siyu Wang et.al.	2412.19663	translate	read	link
2024-12-27	FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios	Kaiyi Pang et.al.	2412.19652	translate	read	null
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	translate	read	link
2024-12-24	A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs	OpenMind et.al.	2412.18588	translate	read	null
2024-12-24	Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control	Sergey Sedov et.al.	2412.18582	translate	read	null
2024-12-24	Zero-resource Speech Translation and Recognition with LLMs	Karel Mundnich et.al.	2412.18566	translate	read	null
2024-12-24	Distilling Fine-grained Sentiment Understanding from Large Language Models	Yice Zhang et.al.	2412.18552	translate	read	link
2024-12-24	Token-Budget-Aware LLM Reasoning	Tingxu Han et.al.	2412.18547	translate	read	link
2024-12-24	PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction	Xingjian Xu et.al.	2412.18541	translate	read	null
2024-12-24	Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation	Derong Xu et.al.	2412.18537	translate	read	link
2024-12-24	Automated Code Review In Practice	Umut Cihan et.al.	2412.18531	translate	read	null
2024-12-24	Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving	Hao Pang et.al.	2412.18511	translate	read	null
2024-12-23	ChatGarment: Garment Estimation, Generation and Editing via Large Language Models	Siyuan Bian et.al.	2412.17811	translate	read	null
2024-12-23	Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective	Xinmiao Yu et.al.	2412.17787	translate	read	null
2024-12-23	ResearchTown: Simulator of Human Research Community	Haofei Yu et.al.	2412.17767	translate	read	link
2024-12-23	Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy	Priyaranjan Pattnayak et.al.	2412.17759	translate	read	null
2024-12-23	ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback	Wei Zhang et.al.	2412.17754	translate	read	null
2024-12-23	Deliberation in Latent Space via Differentiable Cache Augmentation	Luyang Liu et.al.	2412.17747	translate	read	null
2024-12-23	YuLan-Mini: An Open Data-efficient Language Model	Yiwen Hu et.al.	2412.17743	translate	read	link
2024-12-23	Reasoning to Attend: Try to Understand How Token Works	Rui Qian et.al.	2412.17741	translate	read	link
2024-12-23	Knowledge Editing through Chain-of-Thought	Changyue Wang et.al.	2412.17727	translate	read	link
2024-12-23	Understanding the Logic of Direct Preference Alignment through Logic	Kyle Richardson et.al.	2412.17696	translate	read	null
2024-12-20	HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding	Chenxin Tao et.al.	2412.16158	translate	read	null
2024-12-20	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	translate	read	link
2024-12-20	Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation	Seyedreza Mohseni et.al.	2412.16135	translate	read	link
2024-12-20	Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information	Dirk Bergemann et.al.	2412.16132	translate	read	null
2024-12-20	PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics	Daniil Larionov et.al.	2412.16120	translate	read	null
2024-12-20	Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts	Muhammad Abdullah Sohail et.al.	2412.16119	translate	read	link
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	translate	read	link
2024-12-20	The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse	Mahyar Habibi et.al.	2412.16114	translate	read	null
2024-12-20	Logical Consistency of Large Language Models in Fact-checking	Bishwamittra Ghosh et.al.	2412.16100	translate	read	null
2024-12-20	The Evolution of LLM Adoption in Industry Data Curation Practices	Crystal Qian et.al.	2412.16089	translate	read	null
2024-12-19	UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency	Enis Simsar et.al.	2412.15216	translate	read	null
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	translate	read	null
2024-12-19	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	translate	read	link
2024-12-19	AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving	Shuo Xing et.al.	2412.15206	translate	read	link
2024-12-19	MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark	Qihao Zhao et.al.	2412.15194	translate	read	link
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	translate	read	null
2024-12-19	Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning	Simon Frieder et.al.	2412.15184	translate	read	null
2024-12-19	HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages	Aman Chaturvedi et.al.	2412.15178	translate	read	null
2024-12-19	Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Federico Castagna et.al.	2412.15177	translate	read	link
2024-12-19	Rethinking Uncertainty Estimation in Natural Language Generation	Lukas Aichberger et.al.	2412.15176	translate	read	null
2024-12-18	Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces	Jihan Yang et.al.	2412.14171	translate	read	link
2024-12-18	TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks	Frank F. Xu et.al.	2412.14161	translate	read	link
2024-12-18	Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models	Atin Sakkeer Hussain et.al.	2412.14146	translate	read	null
2024-12-18	LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research	Tianyang Gu et.al.	2412.14141	translate	read	null
2024-12-18	Design choices made by LLM-based test generators prevent them from finding bugs	Noble Saji Mathews et.al.	2412.14137	translate	read	null
2024-12-18	Adversarial Hubness in Multi-Modal Retrieval	Tingwei Zhang et.al.	2412.14113	translate	read	link
2024-12-18	Alignment faking in large language models	Ryan Greenblatt et.al.	2412.14093	translate	read	link
2024-12-18	Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report	Markus Dablander et.al.	2412.14085	translate	read	null
2024-12-18	Rango: Adaptive Retrieval-Augmented Proving for Automated Software Verification	Kyle Thompson et.al.	2412.14063	translate	read	null
2024-12-18	Understanding and Evaluating Trust in Generative AI and Large Language Models for Spreadsheets	Simon Thorne et.al.	2412.14062	translate	read	null
2024-12-17	SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Sheng Yin et.al.	2412.13178	translate	read	link
2024-12-17	DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation	Miriam Wanner et.al.	2412.13175	translate	read	null
2024-12-17	Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study	Bolei Ma et.al.	2412.13169	translate	read	link
2024-12-17	C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System	Parker Addison et.al.	2412.13163	translate	read	null
2024-12-17	BanglishRev: A Large-Scale Bangla-English and Code-mixed Dataset of Product Reviews in E-Commerce	Mohammad Nazmush Shamael et.al.	2412.13161	translate	read	null
2024-12-17	SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction	Chao Ma et.al.	2412.13148	translate	read	null
2024-12-17	Are Your LLMs Capable of Stable Reasoning?	Junnan Liu et.al.	2412.13147	translate	read	link
2024-12-17	AI PERSONA: Towards Life-long Personalization of LLMs	Tiannan Wang et.al.	2412.13103	translate	read	null
2024-12-17	AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark	Jianlyu Chen et.al.	2412.13102	translate	read	link
2024-12-17	Modality-Inconsistent Continual Learning of Multimodal Large Language Models	Weiguo Pian et.al.	2412.13050	translate	read	null
2024-12-16	SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator	Guoxuan Chen et.al.	2412.12094	translate	read	link
2024-12-16	Instruction-based Image Manipulation by Watching How Things Move	Mingdeng Cao et.al.	2412.12087	translate	read	null
2024-12-16	CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology	Yuxuan Sun et.al.	2412.12077	translate	read	null
2024-12-16	CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding	Guo Chen et.al.	2412.12075	translate	read	null
2024-12-16	Making FETCH! Happen: Finding Emergent Dog Whistles Through Common Habitats	Kuleen Sasse et.al.	2412.12072	translate	read	link
2024-12-16	How Private are Language Models in Abstractive Summarization?	Anthony Hughes et.al.	2412.12040	translate	read	null
2024-12-16	Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection	Ira Ceka et.al.	2412.12039	translate	read	null
2024-12-16	SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval	Yueqian Lin et.al.	2412.12009	translate	read	null
2024-12-16	Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm	Rajat Khanda et.al.	2412.12006	translate	read	null
2024-12-16	The Open Source Advantage in Large Language Models (LLMs)	Jiya Manchanda et.al.	2412.12004	translate	read	null
2024-12-13	UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities	Muhammad Uzair Khattak et.al.	2412.10372	translate	read	link
2024-12-13	Robust image classification with multi-modal large language models	Francesco Villani et.al.	2412.10353	translate	read	null
2024-12-13	COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models	Yuchen Ren et.al.	2412.10347	translate	read	null
2024-12-13	Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining	Zhiqi Ge et.al.	2412.10342	translate	read	null
2024-12-13	AdvPrefix: An Objective for Nuanced LLM Jailbreaks	Sicheng Zhu et.al.	2412.10321	translate	read	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	translate	read	link
2024-12-13	DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding	Zhiyu Wu et.al.	2412.10302	translate	read	link
2024-12-13	Buzz to Broadcast: Predicting Sports Viewership Using Social Media Engagement	Anakin Trotter et.al.	2412.10298	translate	read	link
2024-12-13	Still “Talking About Large Language Models”: Some Clarifications	Murray Shanahan et.al.	2412.10291	translate	read	null
2024-12-13	One world, one opinion? The superstar effect in LLM responses	Sofie Goethals et.al.	2412.10281	translate	read	null
2024-12-12	Doe-1: Closed-Loop Autonomous Driving with Large World Model	Wenzhao Zheng et.al.	2412.09627	translate	read	link
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	translate	read	null
2024-12-12	Olympus: A Universal Task Router for Computer Vision Tasks	Yuanze Lin et.al.	2412.09612	translate	read	link
2024-12-12	SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding	Hao Li et.al.	2412.09604	translate	read	null
2024-12-12	Do Multimodal Large Language Models See Like Humans?	Jiaying Lin et.al.	2412.09603	translate	read	null
2024-12-12	InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions	Pan Zhang et.al.	2412.09596	translate	read	link
2024-12-12	OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages	Chester Palen-Michel et.al.	2412.09587	translate	read	null
2024-12-12	DISHONEST: Dissecting misInformation Spread using Homogeneous sOcial NEtworks and Semantic Topic classification	Caleb Stam et.al.	2412.09578	translate	read	null
2024-12-12	DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction	Yu Feng et.al.	2412.09572	translate	read	null
2024-12-12	Does Representation Matter? Exploring Intermediate Layers in Large Language Models	Oscar Skean et.al.	2412.09563	translate	read	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	translate	read	null
2024-12-11	Fast Prompt Alignment for Text-to-Image Generation	Khalil Mrini et.al.	2412.08639	translate	read	link
2024-12-11	Multimodal Latent Language Modeling with Next-Token Diffusion	Yutao Sun et.al.	2412.08635	translate	read	null
2024-12-11	Synthetic Vision: Training Vision-Language Models to Understand Physics	Vahid Balazadeh et.al.	2412.08619	translate	read	null
2024-12-11	Image Retrieval Methods in the Dissimilarity Space	Madhu Kiran et.al.	2412.08618	translate	read	null
2024-12-11	Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models	Jiahui Li et.al.	2412.08615	translate	read	link
2024-12-11	Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning	Fan Lu et.al.	2412.08614	translate	read	link
2024-12-11	Preference Discerning with LLM-Enhanced Generative Retrieval	Fabian Paischer et.al.	2412.08604	translate	read	null
2024-12-11	Empirical Measurements of AI Training Power Demand on a GPU-Accelerated Node	Imran Latif et.al.	2412.08602	translate	read	null
2024-12-11	Leveraging Graph-RAG and Prompt Engineering to Enhance LLM-Based Automated Requirement Traceability and Compliance Checks	Arsalan Masoudifard et.al.	2412.08593	translate	read	null
2024-12-10	BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities	Sahal Shaji Mullappilly et.al.	2412.07769	translate	read	null
2024-12-10	Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences	Alan Nawzad Amin et.al.	2412.07763	translate	read	link
2024-12-10	Zero-Shot ATC Coding with Large Language Models for Clinical Assessments	Zijian Chen et.al.	2412.07743	translate	read	null
2024-12-10	Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance	Wanwen Chen et.al.	2412.07741	translate	read	null
2024-12-10	Granite Guardian	Inkit Padhi et.al.	2412.07724	translate	read	link
2024-12-10	DriveMM: All-in-One Large Multimodal Model for Autonomous Driving	Zhijian Huang et.al.	2412.07689	translate	read	link
2024-12-10	Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions	Anant Prakash Awasthi et.al.	2412.07687	translate	read	null
2024-12-10	TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation	Alfredo Garrachón Ruiz et.al.	2412.07682	translate	read	null
2024-12-10	Ask Humans or AI? Exploring Their Roles in Visualization Troubleshooting	Shuyu Shen et.al.	2412.07673	translate	read	null
2024-12-10	FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks	Bocheng Chen et.al.	2412.07672	translate	read	null
2024-12-09	Training Large Language Models to Reason in a Continuous Latent Space	Shibo Hao et.al.	2412.06769	translate	read	null
2024-12-09	Why Do Developers Engage with ChatGPT in Issue-Tracker? Investigating Usage and Reliance on ChatGPT-Generated Code	Joy Krishan Das et.al.	2412.06757	translate	read	null
2024-12-09	Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models	Neel Jain et.al.	2412.06748	translate	read	null
2024-12-09	JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM	Takuro Fujii et.al.	2412.06738	translate	read	null
2024-12-09	AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark	Lan Li et.al.	2412.06724	translate	read	null
2024-12-09	DEEPER: Dense Electroencephalography Passage Retrieval	Niall McGuire et.al.	2412.06695	translate	read	null
2024-12-09	OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions	Yi-Kai Zhang et.al.	2412.06693	translate	read	null
2024-12-09	Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach	Weichao Xu et.al.	2412.06684	translate	read	null
2024-12-09	Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework	Tianming Liu et.al.	2412.06681	translate	read	null
2024-12-09	I Don’t Know: Explicit Modeling of Uncertainty with an [IDK] Token	Roi Cohen et.al.	2412.06676	translate	read	null
2024-12-06	Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling	Zhe Chen et.al.	2412.05271	translate	read	null
2024-12-06	APOLLO: SGD-like Memory, AdamW-level Performance	Hanqing Zhu et.al.	2412.05270	translate	read	link
2024-12-06	CompCap: Improving Multimodal Large Language Models with Composite Captions	Xiaohui Chen et.al.	2412.05243	translate	read	null
2024-12-06	MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale	Jarvis Guo et.al.	2412.05237	translate	read	link
2024-12-06	BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits	Wazib Ansar et.al.	2412.05225	translate	read	null
2024-12-06	100% Hallucination Elimination Using Acurai	Michael C. Wood et.al.	2412.05223	translate	read	null
2024-12-06	Evaluating and Aligning CodeLLMs on Human Preference	Jian Yang et.al.	2412.05210	translate	read	link
2024-12-06	A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges	Aditi Singh et.al.	2412.05208	translate	read	null
2024-12-06	Are Frontier Large Language Models Suitable for Q&A in Science Centres?	Jacob Watson et.al.	2412.05200	translate	read	null
2024-12-06	SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot	Jinlin Wu et.al.	2412.05187	translate	read	link
2024-12-05	p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay	Jun Zhang et.al.	2412.04449	translate	read	link
2024-12-05	EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios	Lu Qiu et.al.	2412.04447	translate	read	null
2024-12-05	Moto: Latent Motion Token as the Bridging Language for Robot Manipulation	Yi Chen et.al.	2412.04445	translate	read	link
2024-12-05	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	translate	read	link
2024-12-05	Grounding Descriptions in Images informs Zero-Shot Visual Recognition	Shaunak Halbe et.al.	2412.04429	translate	read	link
2024-12-05	Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion	Jiuhai Chen et.al.	2412.04424	translate	read	link
2024-12-05	Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation	Xuying Li et.al.	2412.04415	translate	read	null
2024-12-05	Retrieval-Augmented Machine Translation with Unstructured Knowledge	Jiaan Wang et.al.	2412.04342	translate	read	link
2024-12-05	Liquid: Language Models are Scalable Multi-modal Generators	Junfeng Wu et.al.	2412.04332	translate	read	link
2024-12-05	The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation	Fredrik Carlsson et.al.	2412.04318	translate	read	null
2024-12-04	From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents	Xinyi Mou et.al.	2412.03563	translate	read	link
2024-12-04	SPICE: Smart Projection Interface for Cooking Enhancement	Vera Prohaska et.al.	2412.03551	translate	read	null
2024-12-04	Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models	Natalie Mackraz et.al.	2412.03537	translate	read	null
2024-12-04	A Review on Scientific Knowledge Extraction using Large Language Models in Biomedical Sciences	Gabriel Lino Garcia et.al.	2412.03531	translate	read	null
2024-12-04	FANAL – Financial Activity News Alerting Language Modeling Framework	Urjitkumar Patel et.al.	2412.03527	translate	read	null
2024-12-04	You’re (Not) My Type – Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks?	Dominic Lohr et.al.	2412.03516	translate	read	null
2024-12-04	Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective	Neta Shaul et.al.	2412.03487	translate	read	null
2024-12-04	Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning	Neale Ratzlaff et.al.	2412.03467	translate	read	null
2024-12-04	From Words to Workflows: Automating Business Processes	Laura Minkova et.al.	2412.03446	translate	read	null
2024-12-04	RedStone: Curating General, Code, Math, and QA Data for Large Language Models	Yaoyao Chang et.al.	2412.03398	translate	read	null
2024-12-03	T-REG: Preference Optimization with Token-Level Reward Regularization	Wenxuan Zhou et.al.	2412.02685	translate	read	link
2024-12-03	Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models	Yuda Song et.al.	2412.02674	translate	read	null
2024-12-03	LLM-Enhanced Path Planning: Safe and Efficient Autonomous Navigation with Instructional Inputs	Pranav Doma et.al.	2412.02655	translate	read	null
2024-12-03	Time-Reversal Provides Unsupervised Feedback to LLMs	Yerram Varun et.al.	2412.02626	translate	read	null
2024-12-03	Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback	Hiroki Furuta et.al.	2412.02617	translate	read	null
2024-12-03	AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?	Kaixiong Gong et.al.	2412.02611	translate	read	link
2024-12-03	Interpretable Company Similarity with Sparse Autoencoders	Marco Molinari et.al.	2412.02605	translate	read	null
2024-12-03	CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs	Abhas Kumar et.al.	2412.02602	translate	read	null
2024-12-03	PrefixLLM: LLM-aided Prefix Circuit Design	Weihua Xiao et.al.	2412.02594	translate	read	null
2024-12-03	OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation	Junyuan Zhang et.al.	2412.02592	translate	read	link
2024-12-02	T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs	Shukang Yin et.al.	2411.19951	translate	read	link
2024-12-02	Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability	Zicheng Lin et.al.	2411.19943	translate	read	link
2024-12-02	LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states	Luis Ibanez-Lissen et.al.	2411.19876	translate	read	null
