OpenAssistant Conversations - Democratizing Large Language Model Alignment

Köpf, Andreas; Kilcher, Yannic; von Rütte, Dimitri; Anagnostidis, Sotiris; Tam, Zhi-Rui; Stevens, Keith; Barhoum, Abdullah; Nguyen, Duc Minh; Stanley, Oliver; Nagyfi, Richárd; ES, Shahul; Suri, Sameer; Glushkov, David; Dantuluri, Arnav; Maguire, Andrew; Schuhmann, Christoph; Nguyen, Huu; Mattick, Alexander

Show simple item record

dc.contributor.author

Köpf, Andreas

dc.contributor.author

Kilcher, Yannic

dc.contributor.author

von Rütte, Dimitri

dc.contributor.author

Anagnostidis, Sotiris

dc.contributor.author

Tam, Zhi-Rui

dc.contributor.author

Stevens, Keith

dc.contributor.author

Barhoum, Abdullah

dc.contributor.author

Nguyen, Duc Minh

dc.contributor.author

Stanley, Oliver

dc.contributor.author

Nagyfi, Richárd

dc.contributor.author

ES, Shahul

dc.contributor.author

Suri, Sameer

dc.contributor.author

Glushkov, David

dc.contributor.author

Dantuluri, Arnav

dc.contributor.author

Maguire, Andrew

dc.contributor.author

Schuhmann, Christoph

dc.contributor.author

Nguyen, Huu

dc.contributor.author

Mattick, Alexander

dc.contributor.editor

Oh, Alice

dc.contributor.editor

Naumann, Tristan

dc.contributor.editor

Globerson, Amir

dc.contributor.editor

Saenko, Kate

dc.contributor.editor

Hardt, Moritz

dc.contributor.editor

Levine, Sergey

dc.date.accessioned

2024-10-09T13:19:17Z

dc.date.available

2024-01-16T14:04:05Z

dc.date.available

2024-01-17T13:04:30Z

dc.date.available

2024-10-09T13:15:58Z

dc.date.available

2024-10-09T13:19:17Z

dc.date.issued

2023

dc.identifier.isbn

978-1-7138-9992-1

en_US

dc.identifier.uri

http://hdl.handle.net/20.500.11850/653249

dc.description.abstract

Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (\textit{SFT}) and reinforcement learning from human feedback (\textit{RLHF}) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like \textit{RLHF} rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 complete and fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Models trained on OpenAssistant Conversations show consistent improvements on standard benchmarks over respective base models. We release our code\footnote{\git} and data\footnote{\data} under a fully permissive licence.

en_US

dc.language.iso

en

en_US

dc.publisher

Curran

en_US

dc.subject

dataset

en_US

dc.subject

human labels

en_US

dc.subject

instruction tuning

en_US

dc.subject

conversation

en_US

dc.subject

RLHF

en_US

dc.subject

open-source

en_US

dc.title

OpenAssistant Conversations - Democratizing Large Language Model Alignment

en_US

dc.type

Conference Paper

dc.date.published

2023-09-26

ethz.book.title

Advances in Neural Information Processing Systems 36

en_US

ethz.pages.start

47669

en_US

ethz.pages.end

47681

en_US

ethz.event

37th Conference on Neural Information Processing Systems (NeurIPS Datasets and Benchmarks.2023)

en_US

ethz.event.location

New Orleans, LA, USA

en_US

ethz.event.date

December 10-16, 2023

en_US

ethz.publication.place

Red Hook, NY

en_US

ethz.publication.status

published

en_US

ethz.leitzahl

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09462 - Hofmann, Thomas / Hofmann, Thomas

en_US

ethz.leitzahl.certified

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09462 - Hofmann, Thomas / Hofmann, Thomas

en_US

ethz.identifier.url

https://neurips.cc/virtual/2023/oral/73741

ethz.identifier.url

https://papers.nips.cc/paper_files/paper/2023/hash/949f0f8f32267d297c2d4e3ee10a2e7e-Abstract-Datasets_and_Benchmarks.html

ethz.date.deposited

2024-01-16T14:04:05Z

ethz.source

FORM

ethz.eth

yes

en_US

ethz.availability

Metadata only

en_US

ethz.rosetta.installDate

2024-10-09T13:15:59Z

ethz.rosetta.lastUpdated

2024-10-09T13:15:59Z

ethz.rosetta.exportRequired

true

ethz.rosetta.versionExported

true

ethz.COinS

ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=OpenAssistant%20Conversations%20-%20Democratizing%20Large%20Language%20Model%20Alignment&rft.date=2023&rft.spage=47669&rft.epage=47681&rft.au=K%C3%B6pf,%20Andreas&Kilcher,%20Yannic&von%20R%C3%BCtte,%20Dimitri&Anagnostidis,%20Sotiris&Tam,%20Zhi-Rui&rft.isbn=978-1-7138-9992-1&rft.genre=proceeding&rft.btitle=Advances%20in%20Neural%20Information%20Processing%20Systems%2036

Search print copy at ETH Library

Files in this item

Files	Size	Format	Open in viewer
There are no files associated with this item.

Publication type

Conference Paper [35367]

Show simple item record

Research Collection

Search

OpenAssistant Conversations - Democratizing Large Language Model Alignment Mendeley CSV RIS BibTeX

Files in this item

Publication type

OpenAssistant Conversations - Democratizing Large Language Model Alignment

Mendeley

CSV

RIS

BibTeX