Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com)
69 points by ash_at_hny 13 hours ago