Reinforcement Learning - 2024-02

Publish Date Title Authors PDF Translate Read Code
2024-02-29 Curiosity-driven Red-teaming for Large Language Models Zhang-Wei Hong et.al. 2402.19464 translate read link
2024-02-29 ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Yifei Zhou et.al. 2402.19446 translate read link
2024-02-29 Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation Jonathan Yang et.al. 2402.19432 translate read null
2024-02-29 Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning Greg d’Eon et.al. 2402.19420 translate read null
2024-02-29 RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu et.al. 2402.19299 translate read null
2024-02-29 StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds Prasad Cheema et.al. 2402.19287 translate read null
2024-02-29 Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning Jingxuan Yang et.al. 2402.19275 translate read null
2024-02-29 Deep Reinforcement Learning: A Convex Optimization Approach Ather Gattami et.al. 2402.19212 translate read null
2024-02-29 ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration Angelo Caregnato-Neto et.al. 2402.19128 translate read null
2024-02-29 Temporal-Aware Deep Reinforcement Learning for Energy Storage Bidding in Energy and Contingency Reserve Markets Jinhao Li et.al. 2402.19110 translate read null
2024-02-28 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang et.al. 2402.18571 translate read link
2024-02-28 Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks Benjamin David Evans et.al. 2402.18558 translate read null
2024-02-28 Human-Centric Aware UAV Trajectory Planning in Search and Rescue Missions Employing Multi-Objective Reinforcement Learning with AHP and Similarity-Based Experience Replay Mahya Ramezani et.al. 2402.18487 translate read null
2024-02-28 FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist Wentao Zhang et.al. 2402.18485 translate read null
2024-02-28 Implementing Online Reinforcement Learning with Clustering Neural Networks James E. Smith et.al. 2402.18472 translate read null
2024-02-28 Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning Jin Hwa Lee et.al. 2402.18361 translate read null
2024-02-28 Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks Tianxu An et.al. 2402.18345 translate read null
2024-02-28 Whole-body Humanoid Robot Locomotion with Human Reference Qiang Zhang et.al. 2402.18294 translate read null
2024-02-28 Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization Shuo Yang et.al. 2402.18284 translate read null
2024-02-28 Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment Joachim Grimstad et.al. 2402.18246 translate read null

(<a href=../Reinforcement_Learning.md>back to Reinforcement Learning</a>)