OpenAssistant Conversations - Democratizing Large Language Model Alignment - Research Collection

Metadata only

Author

Kilcher, Yannic

von Rütte, Dimitri

Anagnostidis, Sotiris

Barhoum, Abdullah

Nguyen, Duc Minh

Stanley, Oliver

Nagyfi, Richárd

Glushkov, David

Dantuluri, Arnav

Maguire, Andrew

Schuhmann, Christoph

Mattick, Alexander

Date

2023

Type

Conference Paper

ETH Bibliography

yes

Altmetrics

Abstract

Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (\textit{SFT}) and reinforcement learning from human feedback (\textit{RLHF}) greatl Show more

Publication status

published

External links

https://neurips.cc/virtual/2023/oral/73741
https://papers.nips.cc/paper_files/paper/2023/hash/949f0f8f32267d297c2d4e3ee10a2e7e-Abstract-Datasets_and_Benchmarks.html

Editor

Naumann, Tristan

Globerson, Amir

Book title

Advances in Neural Information Processing Systems 36

Pages / Article No.

47669 - 47681

Publisher

Curran

Event

37th Conference on Neural Information Processing Systems (NeurIPS Datasets and Benchmarks.2023), New Orleans, LA, USA, December 10-16, 2023

Subject

dataset; human labels; instruction tuning; conversation; RLHF; open-source

Organisational unit

09462 - Hofmann, Thomas / Hofmann, Thomas

More

Show all metadata

ETH Bibliography

yes

Altmetrics