The CRINGE Loss: Learning what language not to model

Standard language model training employs gold human documents or human-human interaction data, and treats all training data as positive examples. Growing evidence shows that even with very large amounts of positive training data, issues remain that can be alleviated with relatively small amounts of Show more

Permanent link

https://doi.org/10.3929/ethz-b-000669683

Publication status

published

External links

https://doi.org/10.18653/v1/2023.acl-long.493

Editor

Rogers, Anna

Boyd-Graber, Jordan

Okazaki, Naoaki

Book title

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)