Open access
Date
2023-07Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
We introduce a novel dependency parser, the hexatagger, that constructs dependency trees by tagging the words in a sentence with elements from a finite set of possible tags. In contrast to many approaches to dependency parsing, our approach is fully parallelizable at training time, i.e., the structure-building actions needed to build a dependency parse can be predicted in parallel to each other. Additionally, exact decoding is linear in time and space complexity. Furthermore, we derive a probabilistic dependency parser that predicts hexatags using no more than a linear model with features from a pretrained language model, i.e., we forsake a bespoke architecture explicitly designed for the task. Despite the generality and simplicity of our approach, we achieve state-of-the-art performance of 96.4 LAS and 97.4 UAS on the Penn Treebank test set. Additionally, our parser’s linear time complexity and parallelism significantly improve computational efficiency, with a roughly 10-times speed-up over previous state-of-the-art models during decoding. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000637527Publication status
publishedExternal links
Book title
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Volume 2: Short PapersPages / Article No.
Publisher
Association for Computational LinguisticsEvent
Organisational unit
02219 - ETH AI Center / ETH AI Center09682 - Cotterell, Ryan / Cotterell, Ryan
More
Show all metadata
ETH Bibliography
yes
Altmetrics