Skip to content

Latest commit

 

History

History
9.06 MB

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.pdf

File metadata and controls

9.06 MB
Loading