A dataset for Text Detection, Optical Character Recognition, Spatial Layout Analysis and Form Understanding.
A dataset for the document understanding community.
If you use this dataset for your research, please cite our paper:
BibTeX format:
@inproceedings{jaume2019,
title = {FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents},
author = {Guillaume Jaume and Hazim Kemal Ekenel and Jean-Philippe Thiran},
booktitle = {Accepted to ICDAR-OST},
year = {2019}
}
Word grouping and semantic entity labeling.