Academic Paper Parsing

Publisher-Adaptive HTML Parsing System for Figure and Caption Extraction from Scientific Papers

Publisher-adaptive extraction of figures, captions, and caption-mentioning body sentences from academic-paper HTML.

avatar
Woonghee Lee

CREMA: Contextual Reconstruction and Extraction of Multimodal Assets from academic papers

Academic paper visual-element extraction with translation and layout reconstruction.

avatar
Woonghee Lee