Unlike previous datasets containing the basic tables, ComFinTab contains a large ratio of compound tables, which is much more challenging and requires methods.
Oct 10, 2022 · We release a new benchmark named ComFinTab with rich annotations that support both table recognition and understanding tasks.
Oct 14, 2022 · Experiments compared with the previous modeling methods demonstrate the effectiveness and robustness of our proposed framework. The major ...
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding Liang ... End-to-End Compound Table Understanding with Multi-Modal Modeling Zaisheng ...
Jun 12, 2024 · In this paper, we propose a new problem, multimodal table understanding, where the model needs to generate correct responses to various table-related requests ...
Missing: Compound | Show results with:Compound
Aug 11, 2024 · This process aims at aligning the visual features of diversified table images with the ground-truth textual table represen- tations, which ...
Missing: Compound | Show results with:Compound
The proposed model consists of one shared encoder, one shared decoder, and three separate decoders which are used for learning three sub-tasks of table.
Missing: Compound Modal
Mar 15, 2023 · In this paper, we propose an end-to-end multi-task learning model for image-based table recognition. The proposed model consists of one shared encoder, one ...
Missing: Compound Modal Modeling.
Jun 12, 2024 · The paper presents a novel approach for Multimodal Table Understanding, which aims to extract information from tables that contain both textual and visual ...
End-to-End Compound Table Understanding with Multi-Modal Modeling · hmtl icon ... DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding · hmtl ...