Thinking about high quality human data

https://lilianweng.github.io/posts/2024-02-05-human-data-quality/

The alignment stage of LLM post-training requires lots of human-generated data, and the quality is bounded on how good that data is. Lots of techniques here, with citations.

Discover more from Ian’s Blog

Subscribe now to keep reading and get access to the full archive.

Continue reading