The CRINGE Loss: Learning what language not to model

Standard language model training employs gold human documents or human-human interaction data, and treats all training data as positive examples. Growing evidence shows that even with very large amounts of positive training data, issues remain that can be alleviated with relatively small amounts of Mehr anzeigen

Persistenter Link

https://doi.org/10.3929/ethz-b-000669683

Publikationsstatus

published

Externe Links

https://doi.org/10.18653/v1/2023.acl-long.493

Herausgeber(in)

Rogers, Anna

Boyd-Graber, Jordan

Okazaki, Naoaki

Buchtitel

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)