CREMA: Contextual Reconstruction and Extraction of Multimodal Assets from academic papers
Academic paper visual-element extraction with translation and layout reconstruction.
Academic paper visual-element extraction with translation and layout reconstruction.