Robotics hlfshell • 1y ago • 80%

Using RLHF to train champion-level drone racing agents

A cool application of RLHF (Reinforcement Learning from Human Feedback, the same approach OpenAI used to train ChatGPT).

The authors trained an agent to fly FPV drones at a level surpassing world champions.
