Reinforcement Learning
Recordings for the [RLC](https://rl-conference.cc/) keynote talks have been released. Keynote speakers: - David Silver - Doina Precup (Not recorded) - Peter Stone - Finale Doshi-Velez - Sergey Levine - Emma Brunskill - Andrew Barto
Reinforcement Learning
howrar
•
1mo ago
•
50%
OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished so it's hard for me to get excited about it, but the rest of you might find this interesting.