Reinforcement Learning and RLHF in a nutshell
A very high-level summary of RL.
Figure: DALL-E's somewhat inaccurate representation of an RL state space.
RL is an approach to efficiently searching the state space of a well-defined system. We can think of the system as a graph whose vertices are states and whose edges are actions that lead to a change in state. In its