Metadata only
Date
2021-05Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
Translating renderings (e. g. PDFs, scans) into hierarchical document structures is extensively demanded in the daily routines of many real-world applications. However, a holistic, principled approach to inferring the complete hierarchical structure of documents is missing. As a remedy, we developed "DocParser": an end-to-end system for parsing the complete document structure - including all text elements, nested figures, tables, and table cell structures. Our second contribution is to provide a dataset for evaluating hierarchical document structure parsing. Our third contribution is to propose a scalable learning framework for settings where domain-specific data are scarce, which we address by a novel approach to weak supervision that significantly improves the document structure parsing performance. Our experiments confirm the effectiveness of our proposed weak supervision: Compared to the baseline without weak supervision, it improves the mean average precision for detecting document entities by 39.1% and improves the F1 score of classifying hierarchical relations by 35.8%. Show more
Publication status
publishedExternal links
Journal / series
Proceedings of the AAAI Conference on Artificial IntelligenceVolume
Pages / Article No.
Publisher
AAAIEvent
Subject
Applications; Information extractionOrganisational unit
09623 - Feuerriegel, Stefan (ehemalig) / Feuerriegel, Stefan (former)09588 - Zhang, Ce (ehemalig) / Zhang, Ce (former)
Funding
184628 - EASEML: Toward a More Accessible and Usable Machine Learning Platform for Non-expert Users (SNF)
957407 - Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning (EC)
Related publications and datasets
Is cited by: https://doi.org/10.3929/ethz-b-000530965
More
Show all metadata
ETH Bibliography
yes
Altmetrics