Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages
Open access
Date
2023-12
Type
Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
Surprisal theory (Hale, 2001; Levy, 2008) posits that a word’s reading time is proportional to its surprisal (i.e., to its negative log probability given the preceding context). Since we are unable to access a word’s ground-truth probability, surprisal theory has been empirically tested using surprisal estimates from language models (LMs). Under the premise that surprisal theory holds, we would expect that higher quality language models provide more powerful predictors of human reading behavior—a conjecture we dub the quality–power (QP) hypothesis. Unfortunately, empirical support for the QP hypothesis is mixed. Some studies in English have found correlations between LM quality and predictive power, but other studies using Japanese data, as well as using larger English LMs, find no such correlations. In this work, we conduct a systematic crosslinguistic assessment of the QP hypothesis. We train LMs from scratch on small- and medium-sized datasets from 13 languages (across five language families) and assess their ability to predict eye tracking data. We find correlations between LM quality and power in eleven of these thirteen languages, suggesting that, within the range of model classes and sizes tested, better language models are indeed better predictors of human language processing behaviors.
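The quantity the abstract refers to reduces to a one-line formula: the surprisal of a word is the negative log of its probability given the preceding context. A minimal Python sketch, with toy conditional probabilities standing in for actual LM estimates (the words and probability values below are illustrative, not taken from the paper):

```python
import math

def surprisal(prob: float) -> float:
    """Surprisal in bits: -log2 p(word | preceding context)."""
    return -math.log2(prob)

# Toy conditional probabilities p(word | context), as a stand-in for
# estimates that would come from a trained language model.
probs = {"the": 0.5, "cat": 0.125, "sat": 0.25}

# Surprisal theory predicts reading time is proportional to these values:
# a rarer (lower-probability) word carries higher surprisal.
surprisals = {w: surprisal(p) for w, p in probs.items()}
```

In the paper's setting, the probabilities would instead be produced by the LMs trained on each language, and the resulting surprisals used as predictors of eye tracking measures.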
Permanent link
https://doi.org/10.3929/ethz-b-000650659
Publication status
published
Book title
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Publisher
Association for Computational Linguistics
Organisational unit
09682 - Cotterell, Ryan
09462 - Hofmann, Thomas
Related publications and datasets
Is supplemented by: https://github.com/rycolab/quality-power-hypothesis