Reinforcement Learning - 2024-07

Publish Date Title Authors PDF Translate Read Code
2024-07-31 CREW: Facilitating Human-AI Teaming Research Lingyu Zhang et.al. 2408.00170 translate read null
2024-07-31 Formal Ethical Obligations in Reinforcement Learning Agents: Verification and Policy Updates Colin Shea-Blymyer et.al. 2408.00147 translate read null
2024-07-31 Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment Dickness Kwesiga et.al. 2408.00098 translate read null
2024-07-31 Berkeley Humanoid: A Research Platform for Learning-based Control Qiayuan Liao et.al. 2407.21781 translate read null
2024-07-31 Human-Machine Co-Adaptation for Robot-Assisted Rehabilitation via Dual-Agent Multiple Model Reinforcement Learning (DAMMRL) Yang An et.al. 2407.21734 translate read null
2024-07-31 Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-Bénard convection Joel Vasanth et.al. 2407.21565 translate read null
2024-07-31 Black box meta-learning intrinsic rewards for sparse-reward environments Octavio Pappalardo et.al. 2407.21546 translate read null
2024-07-31 Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network Jeffrey Redondo et.al. 2407.21460 translate read null
2024-07-31 ProSpec RL: Plan Ahead, then Execute Liangliang Liu et.al. 2407.21359 translate read null
2024-07-31 Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks David Valencia et.al. 2407.21338 translate read null
2024-07-31 Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Taehyun Cho et.al. 2407.21260 translate read null
2024-07-30 VITAL: Visual Teleoperation to Enhance Robot Learning through Human-in-the-Loop Corrections Hamidreza Kasaei et.al. 2407.21244 translate read null
2024-07-30 Learning Stable Robot Grasping with Transformer-based Tactile Control Policies En Yen Puang et.al. 2407.21172 translate read link
2024-07-30 Securing Proof of Stake Blockchains: Leveraging Multi-Agent Reinforcement Learning for Detecting and Mitigating Malicious Nodes Faisal Haque Bappy et.al. 2407.20983 translate read null
2024-07-30 How to Choose a Reinforcement-Learning Algorithm Fabian Bongratz et.al. 2407.20917 translate read null
2024-07-30 ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning Hosung Lee et.al. 2407.20806 translate read link
2024-07-30 Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning Norman Di Palo et.al. 2407.20798 translate read link
2024-07-30 Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization Michael Kölle et.al. 2407.20739 translate read null
2024-07-30 Online Prediction-Assisted Safe Reinforcement Learning for Electric Vehicle Charging Station Recommendation in Dynamically Coupled Transportation-Power Systems Qionghua Liao et.al. 2407.20679 translate read null
2024-07-30 Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations Yupei Yang et.al. 2407.20651 translate read null
2024-07-30 Wireless Multi-User Interactive Virtual Reality in Metaverse with Edge-Device Collaborative Computing Caolu Xu et.al. 2407.20523 translate read null
2024-07-30 Boosting Efficiency in Task-Agnostic Exploration through Causal Knowledge Yupei Yang et.al. 2407.20506 translate read link
2024-07-29 A Method for Fast Autonomy Transfer in Reinforcement Learning Dinuka Sahabandu et.al. 2407.20466 translate read null
2024-07-29 SAPG: Split and Aggregate Policy Gradients Jayesh Singla et.al. 2407.20230 translate read null
2024-07-29 Privileged Reinforcement and Communication Learning for Distributed, Bandwidth-limited Multi-robot Exploration Yixiao Ma et.al. 2407.20203 translate read null
2024-07-29 Language-Conditioned Offline RL for Multi-Robot Navigation Steven Morad et.al. 2407.20164 translate read null
2024-07-29 Quantum Machine Learning Architecture Search via Deep Reinforcement Learning Xin Dai et.al. 2407.20147 translate read null
2024-07-29 Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning Liyuan Mao et.al. 2407.20109 translate read null
2024-07-29 Counterfactual rewards promote collective transport using individually controlled swarm microrobots Veit-Lorenz Heuthe et.al. 2407.20041 translate read null
2024-07-29 Collision Probability Distribution Estimation via Temporal Difference Learning Thomas Steinecker et.al. 2407.20000 translate read link
2024-07-29 Integrated Communications and Security: RIS-Assisted Simultaneous Transmission and Generation of Secret Keys Ning Gao et.al. 2407.19960 translate read null
2024-07-29 A Differential Dynamic Programming Framework for Inverse Reinforcement Learning Kun Cao et.al. 2407.19902 translate read null
2024-07-29 Imitation Learning for Intra-Day Power Grid Operation through Topology Actions Matthijs de Jong et.al. 2407.19865 translate read null
2024-07-26 SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments Shu Ishida et.al. 2407.18913 translate read null
2024-07-26 Lessons from Learning to Spin “Pens” Jun Wang et.al. 2407.18902 translate read null
2024-07-26 SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces Seunghyeop Nam et.al. 2407.18892 translate read null
2024-07-26 An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization Swetha Ganesh et.al. 2407.18878 translate read null
2024-07-26 QT-TDM: Planning with Transformer Dynamics Model and Autoregressive Q-Learning Mostafa Kotb et.al. 2407.18841 translate read null
2024-07-26 The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning Andrew Patterson et.al. 2407.18840 translate read null
2024-07-26 Learning a Shape-Conditioned Agent for Purely Tactile In-Hand Manipulation of Various Objects Johannes Pitz et.al. 2407.18834 translate read null
2024-07-26 Online Planning in POMDPs with State-Requests Raphael Avalos et.al. 2407.18812 translate read null
2024-07-26 Tuning the kinetics of intracellular transport Ardra Suchitran et.al. 2407.18784 translate read null
2024-07-26 A Deep Reinforcement Learning Approach to Wavefront Control for Exoplanet Imaging Yann Gutierrez et.al. 2407.18733 translate read null
2024-07-25 Recursive Introspection: Teaching Language Model Agents How to Self-Improve Yuxiao Qu et.al. 2407.18219 translate read null
2024-07-25 Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning Samuel Yen-Chi Chen et.al. 2407.18202 translate read null
2024-07-25 Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation Jean Seong Bjorn Choe et.al. 2407.18143 translate read null
2024-07-25 MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning Guided Library Tuning Mingju Liu et.al. 2407.18110 translate read link
2024-07-25 Principal-Agent Reinforcement Learning Dima Ivanov et.al. 2407.18074 translate read null
2024-07-25 Multi-Agent Deep Reinforcement Learning for Resilience Optimization in 5G RAN Soumeya Kaada et.al. 2407.18066 translate read null
2024-07-25 Personalized and Context-aware Route Planning for Edge-assisted Vehicles Dinesh Cyril Selvaraj et.al. 2407.17980 translate read null
2024-07-25 Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization Feihu Huang et.al. 2407.17823 translate read null
2024-07-25 Advanced deep-reinforcement-learning methods for flow control: group-invariant and positional-encoding networks improve learning speed and quality Joogoo Jeon et.al. 2407.17822 translate read null
2024-07-25 Preliminary Results of Neuromorphic Controller Design and a Parkinson’s Disease Dataset Building for Closed-Loop Deep Brain Stimulation Ananna Biswas et.al. 2407.17756 translate read null
2024-07-24 Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning Shuang Qiu et.al. 2407.17466 translate read null
2024-07-24 Toward human-centered shared autonomy AI paradigms for human-robot teaming in healthcare Reza Abiri et.al. 2407.17464 translate read null
2024-07-24 SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning Jianpeng Yao et.al. 2407.17460 translate read null
2024-07-24 Joint Transmit and Jamming Power Optimization for Secrecy in Energy Harvesting Networks: A Reinforcement Learning Approach Shalini Tripathi et.al. 2407.17435 translate read null
2024-07-24 Market Making with Exogenous Competition Robert Boyce et.al. 2407.17393 translate read null
2024-07-24 MoveLight: Enhancing Traffic Signal Control through Movement-Centric Deep Reinforcement Learning Junqi Shao et.al. 2407.17303 translate read null
2024-07-24 Pretrained Visual Representations in Reinforcement Learning Emlyn Williams et.al. 2407.17238 translate read null
2024-07-24 Sublinear Regret for An Actor-Critic Algorithm in Continuous-Time Linear-Quadratic Reinforcement Learning Yilie Huang et.al. 2407.17226 translate read null
2024-07-24 Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural Combinatorial Optimization Jonathan Pirnay et.al. 2407.17206 translate read link
2024-07-24 Path Following and Stabilisation of a Bicycle Model using a Reinforcement Learning Approach Sebastian Weyrer et.al. 2407.17156 translate read null
2024-07-23 A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data Adrian Remonda et.al. 2407.16680 translate read link
2024-07-23 From Imitation to Refinement – Residual RL for Precise Visual Assembly Lars Ankile et.al. 2407.16677 translate read null
2024-07-23 Efficient Discovery of Actual Causality using Abstraction-Refinement Arshia Rafieioskouei et.al. 2407.16629 translate read null
2024-07-23 Functional Acceleration for Policy Mirror Descent Veronica Chelu et.al. 2407.16602 translate read null
2024-07-23 Real-Time Interactions Between Human Controllers and Remote Devices in Metaverse Kan Chen et.al. 2407.16591 translate read null
2024-07-23 TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback Eunseop Yoon et.al. 2407.16574 translate read null
2024-07-23 Cross Anything: General Quadruped Robot Navigation through Complex Terrains Shaoting Zhu et.al. 2407.16412 translate read null
2024-07-23 Evaluating Uncertainties in Electricity Markets via Machine Learning and Quantum Computing Shuyang Zhu et.al. 2407.16404 translate read null
2024-07-23 Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM Errors in the Field Isaac Boixaderas et.al. 2407.16377 translate read null
2024-07-23 Arbitrary quantum states preparation aided by deep reinforcement learning Zhao-Wei Wang et.al. 2407.16368 translate read null
2024-07-22 WayEx: Waypoint Exploration using a Single Demonstration Mara Levy et.al. 2407.15849 translate read null
2024-07-23 QueST: Self-Supervised Skill Abstractions for Learning Continuous Control Atharva Mete et.al. 2407.15840 translate read null
2024-07-22 Importance Sampling-Guided Meta-Training for Intelligent Agents in Highly Interactive Environments Mansur Arief et.al. 2407.15839 translate read null
2024-07-22 On shallow planning under partial observability Randy Lefebvre et.al. 2407.15820 translate read null
2024-07-22 Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning Zhecheng Yuan et.al. 2407.15815 translate read null
2024-07-22 Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye et.al. 2407.15786 translate read null
2024-07-22 Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems Amirhassan Babazadeh Darabi et.al. 2407.15784 translate read null
2024-07-22 How to Shrink Confidence Sets for Many Equivalent Discrete Distributions? Odalric-Ambrym Maillard et.al. 2407.15662 translate read null
2024-07-22 Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN Norman Becker et.al. 2407.15656 translate read null
2024-07-22 Reinforcement Learning Meets Visual Odometry Nico Messikommer et.al. 2407.15626 translate read null
2024-07-19 Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification Thomas Kwa et.al. 2407.14503 translate read null
2024-07-19 Explainable Post hoc Portfolio Management Financial Policy of a Deep Reinforcement Learning agent Alejandra de la Rica Escudero et.al. 2407.14486 translate read link
2024-07-19 Data-Centric Human Preference Optimization with Rationales Hoang Anh Just et.al. 2407.14477 translate read null
2024-07-19 FuzzTheREST: An Intelligent Automated Black-box RESTful API Fuzzer Tiago Dias et.al. 2407.14361 translate read null
2024-07-19 Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning Nihal Acharya Adde et.al. 2407.14262 translate read null
2024-07-19 On Policy Evaluation Algorithms in Distributional Reinforcement Learning Julian Gerstenberg et.al. 2407.14175 translate read null
2024-07-19 A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C Neil De La Fuente et.al. 2407.14151 translate read link
2024-07-19 Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing Adarsh M. Subramaniam et.al. 2407.13995 translate read null
2024-07-19 The Effect of Training Schedules on Morphological Robustness and Generalization Edoardo Barba et.al. 2407.13965 translate read link
2024-07-18 Event-Triggered Reinforcement Learning Based Joint Resource Allocation for Ultra-Reliable Low-Latency V2X Communications Nasir Khan et.al. 2407.13947 translate read null
2024-07-18 Random Latent Exploration for Deep Reinforcement Learning Srinath Mahankali et.al. 2407.13755 translate read null
2024-07-18 Optimistic Q-learning for average reward and episodic reinforcement learning Priyank Agrawal et.al. 2407.13743 translate read null
2024-07-18 Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review Masatoshi Uehara et.al. 2407.13734 translate read null
2024-07-18 A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice Shaina Raza et.al. 2407.13699 translate read null
2024-07-18 Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error Ally Yalei Du et.al. 2407.13622 translate read null
2024-07-18 Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation Alessandro Flaborea et.al. 2407.13567 translate read null
2024-07-18 Model-based Policy Optimization using Symbolic World Model Andrey Gorodetskiy et.al. 2407.13518 translate read null
2024-07-18 Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving Generalization Carolin Benjamins et.al. 2407.13513 translate read null
2024-07-18 LIMT: Language-Informed Multi-Task Visual World Models Elie Aljalbout et.al. 2407.13466 translate read null
2024-07-18 The Art of Imitation: Learning Long-Horizon Manipulation Tasks from Few Demonstrations Jan Ole von Hartz et.al. 2407.13432 translate read null
2024-07-17 Navigating the Smog: A Cooperative Multi-Agent RL for Accurate Air Pollution Mapping through Data Assimilation Ichrak Mokhtari et.al. 2407.12539 translate read null
2024-07-17 Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models Xihe Qiu et.al. 2407.12532 translate read null
2024-07-17 Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments Runfa Chen et.al. 2407.12505 translate read null
2024-07-17 Estimating Reaction Barriers with Deep Reinforcement Learning Adittya Pal et.al. 2407.12453 translate read null
2024-07-17 Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning Xu-Hui Liu et.al. 2407.12448 translate read link
2024-07-17 Variable-Agnostic Causal Exploration for Reinforcement Learning Minh Hoang Nguyen et.al. 2407.12437 translate read null
2024-07-17 Flow Matching Imitation Learning for Multi-Support Manipulation Quentin Rouxel et.al. 2407.12381 translate read null
2024-07-17 A foundation model approach to guide antimicrobial peptide design in the era of artificial intelligence driven scientific discovery Jike Wang et.al. 2407.12296 translate read null
2024-07-17 Chip Placement with Diffusion Vint Lee et.al. 2407.12282 translate read null
2024-07-17 Individualized Federated Learning for Traffic Prediction with Error Driven Aggregation Hang Chen et.al. 2407.12226 translate read link
2024-07-16 Why long model-based rollouts are no reason for bad Q-value estimates Philipp Wissmann et.al. 2407.11751 translate read null
2024-07-16 Pareto local search for a multi-objective demand response problem in residential areas with heat pumps and electric vehicles Thomas Dengiz et.al. 2407.11719 translate read null
2024-07-16 A Comparative Analysis of Interactive Reinforcement Learning Algorithms in Warehouse Robot Grid Based Environment Arunabh Bora et.al. 2407.11671 translate read null
2024-07-16 Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion Henri-Jacques Geiß et.al. 2407.11658 translate read null
2024-07-16 Building Resilience in Wireless Communication Systems With a Secret-Key Budget Karl-Ludwig Besser et.al. 2407.11604 translate read null
2024-07-16 Learning to Imitate Spatial Organization in Multi-robot Systems Ayomide O. Agunloye et.al. 2407.11592 translate read null
2024-07-16 Green Resource Allocation in Cloud-Native O-RAN Enabled Small Cell Networks Rana M. Sohaib et.al. 2407.11563 translate read null
2024-07-16 RobotKeyframing: Learning Locomotion with High-Level Objectives via Mixture of Dense and Sparse Rewards Fatemeh Zargarbashi et.al. 2407.11562 translate read null
2024-07-16 Imitation learning with artificial neural networks for demand response with a heuristic control approach for heat pumps Thomas Dengiz et.al. 2407.11561 translate read null
2024-07-16 DRL-based Joint Resource Scheduling of eMBB and URLLC in O-RAN Rana M. Sohaib et.al. 2407.11558 translate read null
2024-07-15 Walking the Values in Bayesian Inverse Reinforcement Learning Ondrej Bajgar et.al. 2407.10971 translate read null
2024-07-15 BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning Haohong Lin et.al. 2407.10967 translate read null
2024-07-15 Hedging Beyond the Mean: A Distributional Reinforcement Learning Perspective for Hedging Portfolios with Structured Products Anil Sharma et.al. 2407.10903 translate read null
2024-07-15 Offline Reinforcement Learning with Imputed Rewards Carlo Romeo et.al. 2407.10839 translate read null
2024-07-15 Exploration in Knowledge Transfer Utilizing Reinforcement Learning Adam Jedlička et.al. 2407.10835 translate read null
2024-07-15 GuideLight: “Industrial Solution” Guidance for More Practical Traffic Signal Control Agents Haoyuan Jiang et.al. 2407.10811 translate read null
2024-07-15 DINO Pre-training for Vision-based End-to-end Autonomous Driving Shubham Juneja et.al. 2407.10803 translate read null
2024-07-15 Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning Alessandro Montenegro et.al. 2407.10775 translate read null
2024-07-16 Back to Newton’s Laws: Learning Vision-based Agile Flight via Differentiable Physics Yuang Zhang et.al. 2407.10648 translate read null
2024-07-15 Balancing the Scales: Reinforcement Learning for Fair Classification Leon Eshuijs et.al. 2407.10629 translate read null
2024-07-12 Learning Coordinated Maneuver in Adversarial Environments Zechen Hu et.al. 2407.09469 translate read null
2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 translate read null
2024-07-12 A Benchmark Environment for Offline Reinforcement Learning in Racing Games Girolamo Macaluso et.al. 2407.09415 translate read link
2024-07-12 Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments Zoya Volovikova et.al. 2407.09287 translate read null
2024-07-12 GNN with Model-based RL for Multi-agent Systems Hanxiao Chen et.al. 2407.09249 translate read null
2024-07-12 Constrained Intrinsic Motivation for Reinforcement Learning Xiang Zheng et.al. 2407.09247 translate read null
2024-07-12 Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network Shun Kotoku et.al. 2407.09124 translate read null
2024-07-12 New Desiderata for Direct Preference Optimization Xiangkun Hu et.al. 2407.09072 translate read null
2024-07-12 Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control Huayu Chen et.al. 2407.09024 translate read null
2024-07-12 Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control Sicong Jiang et.al. 2407.08964 translate read null
2024-07-11 MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces Wayne Wu et.al. 2407.08725 translate read null
2024-07-11 RoboMorph: Evolving Robot Morphology using Large Language Models Kevin Qiu et.al. 2407.08626 translate read null
2024-07-11 A Review of Nine Physics Engines for Reinforcement Learning Research Michael Kaup et.al. 2407.08590 translate read null
2024-07-11 HACMan++: Spatially-Grounded Motion Primitives for Manipulation Bowen Jiang et.al. 2407.08585 translate read null
2024-07-11 Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives Diego Dall’Alba et.al. 2407.08506 translate read null
2024-07-11 TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations Junik Bae et.al. 2407.08464 translate read null
2024-07-11 Distributed Deep Reinforcement Learning Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing Cui Zhang et.al. 2407.08462 translate read null
2024-07-11 Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning Shulin Song et.al. 2407.08458 translate read link
2024-07-11 A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning Adrien Banse et.al. 2407.08324 translate read null
2024-07-11 A Deep Reinforcement Learning Framework and Methodology for Reducing the Sim-to-Real Gap in ASV Navigation Luis F W Batista et.al. 2407.08263 translate read null
2024-07-10 Learning In-Hand Translation Using Tactile Skin With Shear and Normal Force Sensing Jessica Yin et.al. 2407.07885 translate read null
2024-07-10 Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation Eugene Teoh et.al. 2407.07868 translate read null
2024-07-10 Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems Gianluigi Silvestri et.al. 2407.07794 translate read null
2024-07-11 BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark Nikita Chernyadev et.al. 2407.07788 translate read null
2024-07-10 Continuous Control with Coarse-to-fine Reinforcement Learning Younggyo Seo et.al. 2407.07787 translate read null
2024-07-10 Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control Elahe Delavari et.al. 2407.07684 translate read null
2024-07-10 Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning Dake Zhang et.al. 2407.07631 translate read null
2024-07-10 Resource Allocation for Twin Maintenance and Computing Task Processing in Digital Twin Vehicular Edge Computing Network Yu Xie et.al. 2407.07575 translate read link
2024-07-10 CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias Jiacheng Shen et.al. 2407.07454 translate read link
2024-07-10 Real-time system optimal traffic routing under uncertainties – Can physics models boost reinforcement learning? Zemian Ke et.al. 2407.07364 translate read null
2024-07-09 Safe and Reliable Training of Learning-Based Aerospace Controllers Udayan Mandal et.al. 2407.07088 translate read null
2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 translate read link
2024-07-09 Can Learned Optimization Make Reinforcement Learning Less Difficult? Alexander David Goldie et.al. 2407.07082 translate read link
2024-07-09 A Unified Approach to Multi-task Legged Navigation: Temporal Logic Meets Reinforcement Learning Jesse Jiang et.al. 2407.06931 translate read null
2024-07-09 Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning Francisco Giral et.al. 2407.06909 translate read null
2024-07-09 Learning From Crowdsourced Noisy Labels: A Signal Processing Perspective Shahana Ibrahim et.al. 2407.06902 translate read null
2024-07-09 Energy Efficient Fair STAR-RIS for Mobile Users Ashok S. Kumar et.al. 2407.06868 translate read null
2024-07-09 Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning Augustine N. Mavor-Parker et.al. 2407.06756 translate read null
2024-07-09 Hierarchical Average-Reward Linearly-solvable Markov Decision Processes Guillermo Infante et.al. 2407.06690 translate read null
2024-07-09 Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning Fanyue Wei et.al. 2407.06642 translate read link
2024-07-08 Periodic agent-state based Q-learning for POMDPs Amit Sinha et.al. 2407.06121 translate read null
2024-07-08 QTRL: Toward Practical Quantum Reinforcement Learning via Quantum-Train Chen-Yu Liu et.al. 2407.06103 translate read null
2024-07-08 Stranger Danger! Identifying and Avoiding Unpredictable Pedestrians in RL-based Social Robot Navigation Sara Pohland et.al. 2407.06056 translate read link
2024-07-08 iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement Aoyu Pang et.al. 2407.06025 translate read link
2024-07-08 Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals Moritz Reuss et.al. 2407.05996 translate read null
2024-07-08 On Bellman equations for continuous-time policy evaluation I: discretization and approximation Wenlong Mou et.al. 2407.05966 translate read null
2024-07-08 Graph Anomaly Detection with Noisy Labels by Reinforcement Learning Zhu Wang et.al. 2407.05934 translate read null
2024-07-08 FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging Pranab Sahoo et.al. 2407.05800 translate read link
2024-07-08 Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning Jakob Nyberg et.al. 2407.05775 translate read link
2024-07-08 Multi-agent Reinforcement Learning-based Network Intrusion Detection System Amine Tellache et.al. 2407.05766 translate read null
2024-07-05 Graph Reinforcement Learning in Power Grids: A Survey Mohamed Hassouna et.al. 2407.04522 translate read null
2024-07-05 Using Petri Nets as an Integrated Constraint Mechanism for Reinforcement Learning Tasks Timon Sachweh et.al. 2407.04481 translate read null
2024-07-05 Hindsight Preference Learning for Offline Preference-based Reinforcement Learning Chen-Xiao Gao et.al. 2407.04451 translate read link
2024-07-05 Enhancing Safety for Autonomous Agents in Partly Concealed Urban Traffic Environments Through Representation-Based Shielding Pierre Haritz et.al. 2407.04343 translate read null
2024-07-05 Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning I Lee et.al. 2407.04315 translate read null
2024-07-05 Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling Jiawei Xu et.al. 2407.04285 translate read null
2024-07-05 Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator Mehryar Abbasi et.al. 2407.04258 translate read null
2024-07-05 PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots Zhiyuan Xiao et.al. 2407.04224 translate read null
2024-07-05 Autoverse: An Evolvable Game Language for Learning Robust Embodied Agents Sam Earle et.al. 2407.04221 translate read null
2024-07-04 Orchestrating LLMs with Different Personalizations Jin Peng Zhou et.al. 2407.04181 translate read null
2024-07-03 Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations Trevor Ablett et.al. 2407.03311 translate read link
2024-07-03 A Review of the Applications of Deep Learning-Based Emergent Communication Brendon Boldt et.al. 2407.03302 translate read null
2024-07-03 Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks Mintae Kim et.al. 2407.03280 translate read null
2024-07-03 Policy-guided Monte Carlo on general state spaces: Application to glass-forming mixtures Leonardo Galliano et.al. 2407.03275 translate read null
2024-07-03 PPO-based Dynamic Control of Uncertain Floating Platforms in the Zero-G Environment Mahya Ramezani et.al. 2407.03224 translate read null
2024-07-03 Combining AI Control Systems and Human Decision Support via Robustness and Criticality Walt Woods et.al. 2407.03210 translate read null
2024-07-03 Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning Runyu Ding et.al. 2407.03162 translate read null
2024-07-03 Reinforcement Learning for Sequence Design Leveraging Protein Language Models Jithendaraa Subramanian et.al. 2407.03154 translate read null
2024-07-03 Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes Asaf Cassel et.al. 2407.03065 translate read null
2024-07-03 Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment Janghwan Lee et.al. 2407.03051 translate read null
2024-07-02 PWM: Policy Learning with Large World Models Ignat Georgiev et.al. 2407.02466 translate read null
2024-07-02 Predicting Visual Attention in Graphic Design Documents Souradeep Chakraborty et.al. 2407.02439 translate read null
2024-07-02 Reinforcement Learning and Machine ethics: a systematic review Ajay Vishwanath et.al. 2407.02425 translate read null
2024-07-02 Talking to Machines: do you read me? Lina M. Rojas-Barahona et.al. 2407.02354 translate read null
2024-07-02 DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics Tyler Ga Wei Lum et.al. 2407.02274 translate read null
2024-07-02 Safe CoR: A Dual-Expert Approach to Integrating Imitation Learning and Safe Reinforcement Learning Using Constraint Rewards Hyeokjin Kwon et.al. 2407.02245 translate read null
2024-07-02 Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization Yuchen Hu et.al. 2407.02243 translate read null
2024-07-02 Safety-Driven Deep Reinforcement Learning Framework for Cobots: A Sim2Real Approach Ammar N. Abbas et.al. 2407.02231 translate read link
2024-07-02 Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning Zakariae El Asri et.al. 2407.02217 translate read null
2024-07-02 Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning Yifang Chen et.al. 2407.02119 translate read null
