Reinforcement Learning - 2024-02

Publish Date	Title	Authors	PDF	Translate	Read	Code
2024-02-29	Curiosity-driven Red-teaming for Large Language Models	Zhang-Wei Hong et.al.	2402.19464	translate	read	link
2024-02-29	ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL	Yifei Zhou et.al.	2402.19446	translate	read	link
2024-02-29	Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation	Jonathan Yang et.al.	2402.19432	translate	read	null
2024-02-29	Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning	Greg d’Eon et.al.	2402.19420	translate	read	null
2024-02-29	RL-GPT: Integrating Reinforcement Learning and Code-as-policy	Shaoteng Liu et.al.	2402.19299	translate	read	null
2024-02-29	StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds	Prasad Cheema et.al.	2402.19287	translate	read	null
2024-02-29	Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning	Jingxuan Yang et.al.	2402.19275	translate	read	null
2024-02-29	Deep Reinforcement Learning: A Convex Optimization Approach	Ather Gattami et.al.	2402.19212	translate	read	null
2024-02-29	ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration	Angelo Caregnato-Neto et.al.	2402.19128	translate	read	null
2024-02-29	Temporal-Aware Deep Reinforcement Learning for Energy Storage Bidding in Energy and Contingency Reserve Markets	Jinhao Li et.al.	2402.19110	translate	read	null
2024-02-28	Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards	Haoxiang Wang et.al.	2402.18571	translate	read	link
2024-02-28	Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks	Benjamin David Evans et.al.	2402.18558	translate	read	null
2024-02-28	Human-Centric Aware UAV Trajectory Planning in Search and Rescue Missions Employing Multi-Objective Reinforcement Learning with AHP and Similarity-Based Experience Replay	Mahya Ramezani et.al.	2402.18487	translate	read	null
2024-02-28	FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist	Wentao Zhang et.al.	2402.18485	translate	read	null
2024-02-28	Implementing Online Reinforcement Learning with Clustering Neural Networks	James E. Smith et.al.	2402.18472	translate	read	null
2024-02-28	Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning	Jin Hwa Lee et.al.	2402.18361	translate	read	null
2024-02-28	Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks	Tianxu An et.al.	2402.18345	translate	read	null
2024-02-28	Whole-body Humanoid Robot Locomotion with Human Reference	Qiang Zhang et.al.	2402.18294	translate	read	null
2024-02-28	Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization	Shuo Yang et.al.	2402.18284	translate	read	null
2024-02-28	Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment	Joachim Grimstad et.al.	2402.18246	translate	read	null

(<a href=../Reinforcement_Learning.md>back to Reinforcement Learning</a>)