RLHF, or Why ChatGPT is Free https://theservitor.com/understanding-rlhf-reinforced-learning-from-human-feedback/