ThinkRL
GitHub
API Reference

Algorithms

ThinkRL implements state-of-the-art reinforcement learning algorithms optimized for human feedback training, from cutting-edge research to proven baselines.

Algorithm Comparison

AlgorithmBest ForSample EfficiencyStability
VAPOGeneral RLHF⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐
DAPOTransfer learning⭐⭐⭐⭐⭐⭐⭐⭐⭐
GRPOPreference learning⭐⭐⭐⭐⭐⭐⭐⭐
COPOExploration tasks⭐⭐⭐⭐⭐⭐⭐
PAPOMultimodal⭐⭐⭐⭐⭐⭐⭐⭐
PPOBaseline⭐⭐⭐⭐⭐⭐⭐⭐
REINFORCESimple tasks⭐⭐⭐⭐⭐

Available Algorithms