Training language models to follow instructions with human feedback
Titolo: Training language models to follow instructions with human feedback
Anno di pubblicazione:2022
Autori: Long Ouyang; Jeff Wu; Xu Jiang; Diogo Almeida; Carroll L. Wainwright; Pamela Mishkin; Chong Zhang; Sandhini Agarwal; Katarina Slama; Alex Ray; John Schulman; Jacob Hilton; Fraser Kelton; Luke Miller; Maddie Simens; Amanda Askell; Peter Welinder; Paul Christiano; Jan Leike; Ryan Lowe;
URL: https://arxiv.org/pdf/2203.02155.pdf