Document to Markdown This collection contains models which convert text or multimodal documents to markdown format for various downstream tasks. rednote-hilab/dots.ocr Image-Text-to-Text β’ 3B β’ Updated Oct 31, 2025 β’ 185k β’ 1.3k numind/NuMarkdown-8B-Thinking Image-to-Text β’ Updated Nov 13, 2025 β’ 114k β’ 452 zai-org/GLM-4.5V Image-Text-to-Text β’ 108B β’ Updated Oct 25, 2025 β’ 76.2k β’ β’ 718 microsoft/kosmos-2.5 Image-Text-to-Text β’ Updated Aug 28, 2025 β’ 83.4k β’ 270
Document Datasets docling-project/DocLayNet Updated Jan 25, 2023 β’ 741 β’ 133 common-pile/caselaw_access_project Viewer β’ Updated Jun 6, 2025 β’ 5.52M β’ 2.43k β’ 211 llamaindex/vdr-multilingual-test Viewer β’ Updated Jan 10, 2025 β’ 15k β’ 285 β’ 3 PleIAs/common_corpus Viewer β’ Updated Feb 19 β’ 69.9k β’ 198k β’ 395
Document to Markdown This collection contains models which convert text or multimodal documents to markdown format for various downstream tasks. rednote-hilab/dots.ocr Image-Text-to-Text β’ 3B β’ Updated Oct 31, 2025 β’ 185k β’ 1.3k numind/NuMarkdown-8B-Thinking Image-to-Text β’ Updated Nov 13, 2025 β’ 114k β’ 452 zai-org/GLM-4.5V Image-Text-to-Text β’ 108B β’ Updated Oct 25, 2025 β’ 76.2k β’ β’ 718 microsoft/kosmos-2.5 Image-Text-to-Text β’ Updated Aug 28, 2025 β’ 83.4k β’ 270
Document Datasets docling-project/DocLayNet Updated Jan 25, 2023 β’ 741 β’ 133 common-pile/caselaw_access_project Viewer β’ Updated Jun 6, 2025 β’ 5.52M β’ 2.43k β’ 211 llamaindex/vdr-multilingual-test Viewer β’ Updated Jan 10, 2025 β’ 15k β’ 285 β’ 3 PleIAs/common_corpus Viewer β’ Updated Feb 19 β’ 69.9k β’ 198k β’ 395