MATLAB Reinforcement Learning Tutorial

Multimodal Reinforcement Learning with Agentic Verifier for AI Agents

Agentic reasoning models trained with multimodal reinforcement learning (MMRL) have become increasingly capable, yet they are almost universally optimized using sparse, outcome-based rewards computed ...

IEEE

LLM4RL: Enhancing Reinforcement Learning with Large Language Models

Abstract: Integrating large language models (LLMs) into reinforcement learning (RL) promises to enhance the learning performance. Traditional RL faces challenges in industrial settings, including ...

Frontiers

Solving robotics tasks with prior demonstration via exploration-efficient deep ...

This paper proposes an exploration-efficient deep reinforcement learning with reference (DRLR) policy framework for learning robotics tasks incorporating demonstrations. The DRLR framework is ...

GitHub

Demystifying Reinforcement Learning in Agentic Reasoning

An overview of our research on agentic RL. In this work, we systematically investigate three dimensions of agentic RL: data, algorithms, and reasoning modes. Our findings reveal: Real end-to-end ...

Frontiers

LG-H-PPO: offline hierarchical PPO for robot path planning on a latent graph

The path planning capability of autonomous robots in complex environments is crucial for their widespread application in the real world. However, long-term decision-making and sparse reward signals ...

IEEE

Safe Reinforcement Learning via Episodic Control

Abstract: Safe reinforcement learning (Safe RL) aims to learn policies capable of learning and adapting within complex environments while ensuring actions remain free from catastrophic consequences.
