Reinforcement Learning - 2024-02
Reinforcement Learning - 2024-02
| Publish Date | Title | Authors | Translate | Read | Code | |
|---|---|---|---|---|---|---|
| 2024-02-29 | Curiosity-driven Red-teaming for Large Language Models | Zhang-Wei Hong et.al. | 2402.19464 | translate | read | link |
| 2024-02-29 | ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Yifei Zhou et.al. | 2402.19446 | translate | read | link |
| 2024-02-29 | Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation | Jonathan Yang et.al. | 2402.19432 | translate | read | null |
| 2024-02-29 | Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning | Greg d’Eon et.al. | 2402.19420 | translate | read | null |
| 2024-02-29 | RL-GPT: Integrating Reinforcement Learning and Code-as-policy | Shaoteng Liu et.al. | 2402.19299 | translate | read | null |
| 2024-02-29 | StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds | Prasad Cheema et.al. | 2402.19287 | translate | read | null |
| 2024-02-29 | Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning | Jingxuan Yang et.al. | 2402.19275 | translate | read | null |
| 2024-02-29 | Deep Reinforcement Learning: A Convex Optimization Approach | Ather Gattami et.al. | 2402.19212 | translate | read | null |
| 2024-02-29 | ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration | Angelo Caregnato-Neto et.al. | 2402.19128 | translate | read | null |
| 2024-02-29 | Temporal-Aware Deep Reinforcement Learning for Energy Storage Bidding in Energy and Contingency Reserve Markets | Jinhao Li et.al. | 2402.19110 | translate | read | null |
| 2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571 | translate | read | link |
| 2024-02-28 | Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks | Benjamin David Evans et.al. | 2402.18558 | translate | read | null |
| 2024-02-28 | Human-Centric Aware UAV Trajectory Planning in Search and Rescue Missions Employing Multi-Objective Reinforcement Learning with AHP and Similarity-Based Experience Replay | Mahya Ramezani et.al. | 2402.18487 | translate | read | null |
| 2024-02-28 | FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist | Wentao Zhang et.al. | 2402.18485 | translate | read | null |
| 2024-02-28 | Implementing Online Reinforcement Learning with Clustering Neural Networks | James E. Smith et.al. | 2402.18472 | translate | read | null |
| 2024-02-28 | Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning | Jin Hwa Lee et.al. | 2402.18361 | translate | read | null |
| 2024-02-28 | Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks | Tianxu An et.al. | 2402.18345 | translate | read | null |
| 2024-02-28 | Whole-body Humanoid Robot Locomotion with Human Reference | Qiang Zhang et.al. | 2402.18294 | translate | read | null |
| 2024-02-28 | Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | Shuo Yang et.al. | 2402.18284 | translate | read | null |
| 2024-02-28 | Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment | Joachim Grimstad et.al. | 2402.18246 | translate | read | null |
(<a href=../Reinforcement_Learning.md>back to Reinforcement Learning</a>)