Show simple item record

dc.contributor.author
Köpf, Andreas
dc.contributor.author
Kilcher, Yannic
dc.contributor.author
von Rütte, Dimitri
dc.contributor.author
Anagnostidis, Sotiris
dc.contributor.author
Tam, Zhi-Rui
dc.contributor.author
Stevens, Keith
dc.contributor.author
Barhoum, Abdullah
dc.contributor.author
Nguyen, Duc Minh
dc.contributor.author
Stanley, Oliver
dc.contributor.author
Nagyfi, Richárd
dc.contributor.author
ES, Shahul
dc.contributor.author
Suri, Sameer
dc.contributor.author
Glushkov, David
dc.contributor.author
Dantuluri, Arnav
dc.contributor.author
Maguire, Andrew
dc.contributor.author
Schuhmann, Christoph
dc.contributor.author
Nguyen, Huu
dc.contributor.author
Mattick, Alexander
dc.contributor.editor
Oh, Alice
dc.contributor.editor
Naumann, Tristan
dc.contributor.editor
Globerson, Amir
dc.contributor.editor
Saenko, Kate
dc.contributor.editor
Hardt, Moritz
dc.contributor.editor
Levine, Sergey
dc.date.accessioned
2024-10-09T13:19:17Z
dc.date.available
2024-01-16T14:04:05Z
dc.date.available
2024-01-17T13:04:30Z
dc.date.available
2024-10-09T13:15:58Z
dc.date.available
2024-10-09T13:19:17Z
dc.date.issued
2023
dc.identifier.isbn
978-1-7138-9992-1
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/653249
dc.description.abstract
Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (\textit{SFT}) and reinforcement learning from human feedback (\textit{RLHF}) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like \textit{RLHF} rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 complete and fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Models trained on OpenAssistant Conversations show consistent improvements on standard benchmarks over respective base models. We release our code\footnote{\git} and data\footnote{\data} under a fully permissive licence.
en_US
dc.language.iso
en
en_US
dc.publisher
Curran
en_US
dc.subject
dataset
en_US
dc.subject
human labels
en_US
dc.subject
instruction tuning
en_US
dc.subject
conversation
en_US
dc.subject
RLHF
en_US
dc.subject
open-source
en_US
dc.title
OpenAssistant Conversations - Democratizing Large Language Model Alignment
en_US
dc.type
Conference Paper
dc.date.published
2023-09-26
ethz.book.title
Advances in Neural Information Processing Systems 36
en_US
ethz.pages.start
47669
en_US
ethz.pages.end
47681
en_US
ethz.event
37th Conference on Neural Information Processing Systems (NeurIPS Datasets and Benchmarks.2023)
en_US
ethz.event.location
New Orleans, LA, USA
en_US
ethz.event.date
December 10-16, 2023
en_US
ethz.publication.place
Red Hook, NY
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09462 - Hofmann, Thomas / Hofmann, Thomas
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09462 - Hofmann, Thomas / Hofmann, Thomas
en_US
ethz.identifier.url
https://neurips.cc/virtual/2023/oral/73741
ethz.identifier.url
https://papers.nips.cc/paper_files/paper/2023/hash/949f0f8f32267d297c2d4e3ee10a2e7e-Abstract-Datasets_and_Benchmarks.html
ethz.date.deposited
2024-01-16T14:04:05Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Metadata only
en_US
ethz.rosetta.installDate
2024-10-09T13:15:59Z
ethz.rosetta.lastUpdated
2024-10-09T13:15:59Z
ethz.rosetta.exportRequired
true
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=OpenAssistant%20Conversations%20-%20Democratizing%20Large%20Language%20Model%20Alignment&rft.date=2023&rft.spage=47669&rft.epage=47681&rft.au=K%C3%B6pf,%20Andreas&Kilcher,%20Yannic&von%20R%C3%BCtte,%20Dimitri&Anagnostidis,%20Sotiris&Tam,%20Zhi-Rui&rft.isbn=978-1-7138-9992-1&rft.genre=proceeding&rft.btitle=Advances%20in%20Neural%20Information%20Processing%20Systems%2036
 Search print copy at ETH Library

Files in this item

FilesSizeFormatOpen in viewer

There are no files associated with this item.

Publication type

Show simple item record