Rl

Repositories related to the topic rl (reinforcement learning)

An elegant PyTorch deep reinforcement learning library.

8,632 · 1,166
Python

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!

2,956 · 172
Python

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

81295
Python

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

2,936 · 389
Python

Distributed RL System for LLM Reasoning

2,031 · 123
Python

TTRL: Test-Time Reinforcement Learning

71156
Python

AI-Native Risk Intelligence Systems. OpenDeRisk: an intelligent risk manager for your application systems, providing comprehensive, in-depth protection 24 hours a day, 7 days a week.

57056
Python

🔁 AMP-RSL-RL: Adversarial Motion Priors for robotic RL (PPO + motion imitation)

1308
Python

Multi-Objective Reinforcement Learning algorithms implementations.

40571
Python

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

1295
Python

A repo for open research on building large reasoning models

726
Python

Reasoning model trained using GRPO towards Rosetta REF2015 for protein stability

858
Python

Gym-based PyTorch deep reinforcement learning (DRL), with algorithms including PPO, PPG, DQN, SAC, DDPG, and TD3

11312
Python

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

935
Python

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

672
Python
