Reinforcement Learning - 2024-12

Publish Date Title Authors PDF Translate Read Code
2024-12-30 Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 Reza Azadeh et.al. 2412.21088 translate read null
2024-12-30 Learning Epidemiological Dynamics via the Finite Expression Method Jianda Du et.al. 2412.21049 translate read null
2024-12-30 Weber-Fechner Law in Temporal Difference learning derived from Control as Inference Keiichiro Takahashi et.al. 2412.21004 translate read null
2024-12-30 LEASE: Offline Preference-based Reinforcement Learning with High Sample Efficiency Xiao-Yin Liu et.al. 2412.21001 translate read link
2024-12-30 UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Fangwei Zhong et.al. 2412.20977 translate read null
2024-12-30 Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients Dongdong Li et.al. 2412.20845 translate read null
2024-12-30 Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret Emilio Jorge et.al. 2412.20824 translate read null
2024-12-29 The intrinsic motivation of reinforcement and imitation learning for sequential tasks Sao Mai Nguyen et.al. 2412.20573 translate read null
2024-12-29 Diminishing Return of Value Expansion Methods Daniel Palenicek et.al. 2412.20537 translate read link
2024-12-29 Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics Neil De La Fuente et.al. 2412.20523 translate read null
2024-12-27 From Ceilings to Walls: Universal Dynamic Perching of Small Aerial Robots on Surfaces with Variable Orientations Bryan Habas et.al. 2412.19765 translate read null
2024-12-27 Adaptive Context-Aware Multi-Path Transmission Control for VR/AR Content: A Deep Reinforcement Learning Approach Shakil Ahmed et.al. 2412.19737 translate read null
2024-12-27 Goal-oriented Communications based on Recursive Early Exit Neural Networks Jary Pomponi et.al. 2412.19587 translate read null
2024-12-27 Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization Shixuan Liu et.al. 2412.19578 translate read null
2024-12-27 Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing Yongbiao Gao et.al. 2412.19563 translate read null
2024-12-27 Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning Xuan Zhou et.al. 2412.19538 translate read null
2024-12-27 An Overview of Machine Learning-Driven Resource Allocation in IoT Networks Zhengdong Li et.al. 2412.19478 translate read null
2024-12-27 DeepSeek-V3 Technical Report DeepSeek-AI et.al. 2412.19437 translate read link
2024-12-27 Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback Seong Jin Lee et.al. 2412.19436 translate read null
2024-12-27 Comparing Few to Rank Many: Active Human Preference Learning using Randomized Frank-Wolfe Kiran Koshy Thekumparampil et.al. 2412.19396 translate read null
2024-12-24 Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making David Shoresh et.al. 2412.18593 translate read null
2024-12-24 Dynamic Optimization of Portfolio Allocation Using Deep Reinforcement Learning Gang Huang et.al. 2412.18563 translate read link
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 translate read null
2024-12-24 Joint Adaptive OFDM and Reinforcement Learning Design for Autonomous Vehicles: Leveraging Age of Updates Mamady Delamou et.al. 2412.18500 translate read null
2024-12-24 Contrastive Representation for Interactive Recommendation Jingyu Li et.al. 2412.18396 translate read link
2024-12-24 Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies Qi Liu et.al. 2412.18296 translate read null
2024-12-24 Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Jiacai Liu et.al. 2412.18279 translate read null
2024-12-24 Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks Changfu Xu et.al. 2412.18212 translate read link
2024-12-24 Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search Thet Htar Su et.al. 2412.18208 translate read null
2024-12-24 Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models Xiaomeng Hu et.al. 2412.18171 translate read null
2024-12-23 HyperQ-Opt: Q-learning for Hyperparameter Optimization Md. Tarek Hasan et.al. 2412.17765 translate read null
2024-12-23 Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking Yun Liu et.al. 2412.17730 translate read null
2024-12-23 SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC Yue Deng et.al. 2412.17707 translate read link
2024-12-23 Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning Huchen Jiang et.al. 2412.17397 translate read null
2024-12-23 Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets Akane Tsuboya et.al. 2412.17344 translate read null
2024-12-23 Multimodal Deep Reinforcement Learning for Portfolio Optimization Sumit Nawathe et.al. 2412.17293 translate read null
2024-12-23 LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation Riku Uemura et.al. 2412.17282 translate read null
2024-12-23 ACECode: A Reinforcement Learning Framework for Aligning Code Efficiency and Correctness in Code Language Models Chengran Yang et.al. 2412.17264 translate read null
2024-12-23 A Coalition Game for On-demand Multi-modal 3D Automated Delivery System Farzan Moosavi et.al. 2412.17252 translate read null
2024-12-23 Model-free stochastic linear quadratic design by semidefinite programming Jing Guo et.al. 2412.17230 translate read null
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 translate read null
2024-12-20 APIRL: Deep Reinforcement Learning for REST API Fuzzing Myles Foley et.al. 2412.15991 translate read link
2024-12-20 Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning Jingbo Chen et.al. 2412.15975 translate read null
2024-12-20 From General to Specific: Tailoring Large Language Models for Personalized Healthcare Ruize Shi et.al. 2412.15957 translate read null
2024-12-20 What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning Yiran Ma et.al. 2412.15904 translate read null
2024-12-20 Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback Jiaming Ji et.al. 2412.15838 translate read link
2024-12-20 MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control Sunbowen Lee et.al. 2412.15703 translate read link
2024-12-20 AIR: Unifying Individual and Cooperative Exploration in Collective Multi-Agent Reinforcement Learning Guangchong Zhou et.al. 2412.15700 translate read link
2024-12-20 Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning Lunjun Liu et.al. 2412.15639 translate read null
2024-12-20 Dexterous Manipulation Based on Prior Dexterous Grasp Pose Knowledge Hengxu Yan et.al. 2412.15587 translate read null
2024-12-19 Qwen2.5 Technical Report Qwen et.al. 2412.15115 translate read null
2024-12-19 Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination Leonardo Barcellona et.al. 2412.14957 translate read null
2024-12-19 Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities Daniil Medyakov et.al. 2412.14935 translate read null
2024-12-19 Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning Anthony Kobanda et.al. 2412.14865 translate read null
2024-12-19 Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning Mohammadreza nakhaei et.al. 2412.14834 translate read link
2024-12-19 Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning Aditya Kapoor et.al. 2412.14779 translate read null
2024-12-19 Learning to Generate Research Idea with Dynamic Control Ruochen Li et.al. 2412.14626 translate read null
2024-12-19 Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Tao He et.al. 2412.14584 translate read null
2024-12-19 Single-Loop Federated Actor-Critic across Heterogeneous Environments Ye Zhu et.al. 2412.14555 translate read null
2024-12-18 Implementing TD3 to train a Neural Network to fly a Quadcopter through an FPV Gate Patrick Thomas et.al. 2412.14367 translate read null
2024-12-18 Learning from Massive Human Videos for Universal Humanoid Pose Control Jiageng Mao et.al. 2412.14172 translate read null
2024-12-18 Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective Zhiyuan Zeng et.al. 2412.14135 translate read null
2024-12-18 Alignment faking in large language models Ryan Greenblatt et.al. 2412.14093 translate read link
2024-12-18 Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning Adi Shuchami et.al. 2412.14039 translate read null
2024-12-18 Robust Optimal Safe and Stability Guaranteeing Reinforcement Learning Control for Quadcopter Sanghyoup Gu et.al. 2412.14003 translate read null
2024-12-18 Harvesting energy from turbulent winds with Reinforcement Learning Lorenzo Basile et.al. 2412.13961 translate read null
2024-12-18 RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation Kun Wu et.al. 2412.13877 translate read null
2024-12-18 AI-Powered Algorithm-Centric Quantum Processor Topology Design Tian Li et.al. 2412.13805 translate read link
2024-12-18 Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN Pengxiang Li et.al. 2412.13795 translate read link
2024-12-18 A hybrid learning agent for episodic learning tasks with unknown target distance Oliver Sefrin et.al. 2412.13686 translate read null
2024-12-17 ExBody2: Advanced Expressive Humanoid Whole-Body Control Mazeyu Ji et.al. 2412.13196 translate read null
2024-12-17 Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning Chenglin Li et.al. 2412.13184 translate read link
2024-12-17 Learning Visuotactile Estimation and Control for Non-prehensile Manipulation under Occlusions Juan Del Aguila Ferrandis et.al. 2412.13157 translate read null
2024-12-17 Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs – A Graph Sequential Embedding Method Jiate Li et.al. 2412.13134 translate read link
2024-12-17 Active Reinforcement Learning Strategies for Offline Policy Improvement Ambedkar Dukkipati et.al. 2412.13106 translate read null
2024-12-17 Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks Kevin McKee et.al. 2412.13093 translate read null
2024-12-17 SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks Mátyás Vincze et.al. 2412.13053 translate read null
2024-12-17 Relational Neurosymbolic Markov Models Lennert De Smet et.al. 2412.13023 translate read null
2024-12-17 Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences Antonios Gasteratos et.al. 2412.12990 translate read null
2024-12-17 Guiding Generative Protein Language Models with Reinforcement Learning Filippo Stocco et.al. 2412.12979 translate read null
2024-12-16 MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization Bhavya Sukhija et.al. 2412.12098 translate read null
2024-12-16 Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation Eliot Xing et.al. 2412.12089 translate read null
2024-12-16 Artificial Intelligence in Traffic Systems Ritwik Raj Saxena et.al. 2412.12046 translate read null
2024-12-16 Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps Linfeng Zhao et.al. 2412.12024 translate read null
2024-12-16 Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm Rajat Khanda et.al. 2412.12006 translate read null
2024-12-16 AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws Oren Neumann et.al. 2412.11979 translate read link
2024-12-16 Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning Qi Sun et.al. 2412.11974 translate read link
2024-12-16 Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery Minjae Cho et.al. 2412.11930 translate read null
2024-12-16 Generalized Bayesian deep reinforcement learning Shreya Sinha Roy et.al. 2412.11743 translate read null
2024-12-16 Learning UAV-based path planning for efficient localization of objects using prior knowledge Rick van Essen et.al. 2412.11717 translate read null
2024-12-13 A Novel Framework Using Deep Reinforcement Learning for Join Order Selection Chang Liu et.al. 2412.10253 translate read null
2024-12-13 Physics Instrument Design with Reinforcement Learning Shah Rukh Qasim et.al. 2412.10237 translate read null
2024-12-13 Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation Federico Julian Camerota Verdù et.al. 2412.10163 translate read null
2024-12-13 AMUSE: Adaptive Model Updating using a Simulated Environment Louis Chislett et.al. 2412.10119 translate read null
2024-12-13 Reward Machine Inference for Robotic Manipulation Mattijs Baert et.al. 2412.10096 translate read null
2024-12-13 Optimized Coordination Strategy for Multi-Aerospace Systems in Pick-and-Place Tasks By Deep Neural Network Ye Zhang et.al. 2412.09877 translate read null
2024-12-13 RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning Charles Xu et.al. 2412.09858 translate read null
2024-12-13 ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression Kai Yao et.al. 2412.09812 translate read null
2024-12-12 GainAdaptor: Learning Quadrupedal Locomotion with Dual Actors for Adaptable and Energy-Efficient Walking on Various Terrains Mincheol Kim et.al. 2412.09520 translate read null
2024-12-12 Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles Xi Lin et.al. 2412.09466 translate read link
2024-12-12 Learning to Adapt: Bio-Inspired Gait Strategies for Versatile Quadruped Locomotion Joseph Humphreys et.al. 2412.09440 translate read null
2024-12-12 Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer Adam Labiosa et.al. 2412.09417 translate read null
2024-12-12 Does Low Spoilage Under Cold Conditions Foster Cultural Complexity During the Foraging Era? – A Theoretical and Computational Inquiry Minhyeok Lee et.al. 2412.09335 translate read null
2024-12-12 Learning to be Indifferent in Complex Decisions: A Coarse Payoff-Assessment Model Philippe Jehiel et.al. 2412.09321 translate read null
2024-12-12 Learning Novel Skills from Language-Generated Demonstrations Ao-Qun Jin et.al. 2412.09286 translate read null
2024-12-12 Student-Informed Teacher Training Nico Messikommer et.al. 2412.09149 translate read null
2024-12-12 Reconfigurable Intelligent Surface for Internet of Robotic Things Wanli Ni et.al. 2412.09117 translate read null
2024-12-12 In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning Songjun Tu et.al. 2412.09104 translate read null
2024-12-11 Learning Sketch Decompositions in Planning via Deep Reinforcement Learning Michael Aichmüller et.al. 2412.08574 translate read null
2024-12-11 GenPlan: Generative sequence models as adaptive planners Akash Karthikeyan et.al. 2412.08565 translate read null
2024-12-11 An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios Leandro Parada et.al. 2412.08562 translate read null
2024-12-11 MaestroMotif: Skill Design from Artificial Intelligence Feedback Martin Klissarov et.al. 2412.08542 translate read null
2024-12-11 Subspace-wise Hybrid RL for Articulated Object Manipulation Yujin Kim et.al. 2412.08522 translate read null
2024-12-11 Multi-perspective Alignment for Increasing Naturalness in Neural Machine Translation Huiyuan Lai et.al. 2412.08473 translate read null
2024-12-11 IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health Gauri Jain et.al. 2412.08463 translate read link
2024-12-11 SINERGYM – A virtual testbed for building energy optimization with Reinforcement Learning Alejandro Campoy-Nieves et.al. 2412.08293 translate read link
2024-12-11 Coarse-to-Fine: A Dual-Phase Channel-Adaptive Method for Wireless Image Transmission Hanlei Li et.al. 2412.08211 translate read null
2024-12-11 Learn How to Query from Unlabeled Data Streams in Federated Learning Yuchang Sun et.al. 2412.08138 translate read link
2024-12-10 Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control Chenhao Lu et.al. 2412.07773 translate read null
2024-12-10 Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data Zhiyuan Zhou et.al. 2412.07762 translate read null
2024-12-10 Optimizing Sensor Redundancy in Sequential Decision-Making Problems Jonas Nüßlein et.al. 2412.07686 translate read null
2024-12-10 Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization Zongkai Liu et.al. 2412.07639 translate read null
2024-12-10 Swarm Behavior Cloning Jonas Nüßlein et.al. 2412.07617 translate read null
2024-12-10 Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery Amin Abyaneh et.al. 2412.07544 translate read null
2024-12-10 ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning Hongshu Guo et.al. 2412.07507 translate read null
2024-12-10 Optimizing pulsed blowing parameters for active separation control in a one-sided diffuser using reinforcement learning Alexandra Müller et.al. 2412.07480 translate read null
2024-12-10 Progressive-Resolution Policy Distillation: Leveraging Coarse-Resolution Simulation for Time-Efficient Fine-Resolution Policy Learning Yuki Kadokawa et.al. 2412.07477 translate read null
2024-12-10 RLT4Rec: Reinforcement Learning Transformer for User Cold Start and Item Recommendation Dilina Chandika Rajapakse et.al. 2412.07403 translate read null
2024-12-09 Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning Ali Devran Kara et.al. 2412.06735 translate read null
2024-12-09 Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone Max Sobol Mark et.al. 2412.06685 translate read null
2024-12-09 Off-Policy Maximum Entropy RL with Future State and Action Visitation Measures Adrien Bolland et.al. 2412.06655 translate read null
2024-12-09 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Egor Cherepanov et.al. 2412.06531 translate read null
2024-12-09 SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation Catalin E. Brita et.al. 2412.06486 translate read link
2024-12-09 Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios Alberto Sinigaglia et.al. 2412.06390 translate read null
2024-12-09 Tracking control of latent dynamic systems with application to spacecraft attitude control Congxi Zhang et.al. 2412.06342 translate read null
2024-12-09 Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi F. Bredell et.al. 2412.06333 translate read null
2024-12-09 Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information Junqiao Wang et.al. 2412.06313 translate read null
2024-12-09 A Scalable Decentralized Reinforcement Learning Framework for UAV Target Localization Using Recurrent PPO Leon Fernando et.al. 2412.06231 translate read null
2024-12-06 Reinforcement Learning: An Overview Kevin Murphy et.al. 2412.05265 translate read null
2024-12-06 TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft Qian Long et.al. 2412.05255 translate read link
2024-12-06 LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds James Beetham et.al. 2412.05232 translate read null
2024-12-06 FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation Qinglun Zhang et.al. 2412.04987 translate read null
2024-12-06 Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task Raphael C. Engelhardt et.al. 2412.04974 translate read null
2024-12-06 DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling Minzheng Wang et.al. 2412.04905 translate read link
2024-12-06 Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment Ran Tian et.al. 2412.04835 translate read null
2024-12-06 Learning-based Control for Tendon-Driven Continuum Robotic Arms Nima Maghooli et.al. 2412.04829 translate read null
2024-12-06 A Temporally Correlated Latent Exploration for Reinforcement Learning SuMin Oh et.al. 2412.04775 translate read null
2024-12-06 Measuring Goal-Directedness Matt MacDermott et.al. 2412.04758 translate read null
2024-12-05 Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy Keru Chen et.al. 2412.04426 translate read null
2024-12-05 Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach Haoran Su et.al. 2412.04369 translate read null
2024-12-05 Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting Edoardo Cetin et.al. 2412.04368 translate read null
2024-12-05 Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles Ke Sun et.al. 2412.04341 translate read null
2024-12-05 Action Mapping for Reinforcement Learning in Continuous Environments with Constraints Mirco Theile et.al. 2412.04327 translate read null
2024-12-05 GRAM: Generalization in Deep RL with a Robust Adaptation Module James Queeney et.al. 2412.04323 translate read link
2024-12-05 Reinforcement Learning from Wild Animal Videos Elliot Chane-Sane et.al. 2412.04273 translate read null
2024-12-05 HyperMARL: Adaptive Hypernetworks for Multi-Agent RL Kale-ab Abebe Tessera et.al. 2412.04233 translate read null
2024-12-05 A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks Murad Dawood et.al. 2412.04153 translate read null
2024-12-05 Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning Shicheng Zhou et.al. 2412.04078 translate read link
2024-12-04 AI-Driven Day-to-Day Route Choice Leizhen Wang et.al. 2412.03338 translate read null
2024-12-04 Rotograb: Combining Biomimetic Hands with Industrial Grippers using a Rotating Thumb Arnaud Bersier et.al. 2412.03279 translate read null
2024-12-04 Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning Mianchu Wang et.al. 2412.03258 translate read null
2024-12-04 Alignment at Pre-training! Towards Native Alignment for Arabic LLMs Juhao Liang et.al. 2412.03253 translate read link
2024-12-04 Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning Nozomu Masuya et.al. 2412.03252 translate read null
2024-12-04 Using Deep Reinforcement Learning to Enhance Channel Sampling Patterns in Integrated Sensing and Communication Federico Mason et.al. 2412.03157 translate read null
2024-12-04 Experience-driven discovery of planning strategies Ruiqi He et.al. 2412.03111 translate read null
2024-12-04 Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies Junchao Fan et.al. 2412.03051 translate read null
2024-12-04 Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator Kaiwen Jiang et.al. 2412.03012 translate read null
2024-12-04 Data Acquisition for Improving Model Fairness using Reinforcement Learning Jahid Hasan et.al. 2412.03009 translate read null
2024-12-03 UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping Wenbo Wang et.al. 2412.02699 translate read link
2024-12-03 Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving Yupeng Zheng et.al. 2412.02689 translate read null
2024-12-03 T-REG: Preference Optimization with Token-Level Reward Regularization Wenxuan Zhou et.al. 2412.02685 translate read link
2024-12-03 AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms Biman Barua et.al. 2412.02610 translate read null
2024-12-03 Explainable CTR Prediction via LLM Reasoning Xiaohan Yu et.al. 2412.02588 translate read null
2024-12-03 Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework Ziheng Liu et.al. 2412.02581 translate read null
2024-12-03 Generating Critical Scenarios for Testing Automated Driving Systems Trung-Hieu Nguyen et.al. 2412.02574 translate read link
2024-12-03 Cooperative Cruising: Reinforcement Learning based Time-Headway Control for Increased Traffic Efficiency Yaron Veksler et.al. 2412.02520 translate read null
2024-12-03 Reinforcement learning to learn quantum states for Heisenberg scaling accuracy Jeongwoo Jae et.al. 2412.02334 translate read null
2024-12-03 Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning Alejandro Mendoza Barrionuevo et.al. 2412.02316 translate read null

(<a href=../Reinforcement_Learning.md>back to Reinforcement Learning</a>)