Skip to content
Change the repository type filter

All

    Repositories list

    • MinerU

      Public
      A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
      Python
      GNU Affero General Public License v3.0
      1.8k24k483Updated Jan 10, 2025Jan 10, 2025
    • LabelLLM

      Public
      The Open-Source Data Annotation Platform
      TypeScript
      Apache License 2.0
      4962390Updated Jan 10, 2025Jan 10, 2025
    • WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据规模均超过150GB
      MIT License
      0300Updated Jan 10, 2025Jan 10, 2025
    • Data annotation component library --provided as NPM packages
      TypeScript
      Apache License 2.0
      177022Updated Jan 9, 2025Jan 9, 2025
    • labelU

      Public
      Data annotation toolbox supports image, audio and video data.
      Python
      Apache License 2.0
      93950130Updated Jan 9, 2025Jan 9, 2025
    • VHM

      Public
      VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
      Python
      Apache License 2.0
      46410Updated Jan 8, 2025Jan 8, 2025
    • UrBench

      Public
      [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
      Python
      Other
      0400Updated Jan 7, 2025Jan 7, 2025
    • DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
      Python
      GNU Affero General Public License v3.0
      5373551Updated Jan 6, 2025Jan 6, 2025
    • A Comprehensive Toolkit for High-Quality PDF Content Extraction
      Python
      GNU Affero General Public License v3.0
      4166.3k694Updated Jan 3, 2025Jan 3, 2025
    • A Comprehensive Benchmark for Document Parsing and Evaluation
      Python
      Apache License 2.0
      1918360Updated Jan 2, 2025Jan 2, 2025
    • CRaFT

      Public
      [AAAI25] Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
      Python
      0100Updated Jan 1, 2025Jan 1, 2025
    • UniMERNet

      Public
      UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
      Python
      Apache License 2.0
      22247120Updated Dec 26, 2024Dec 26, 2024
    • OHR-Bench

      Public
      OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
      Python
      115200Updated Dec 19, 2024Dec 19, 2024
    • The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
      Python
      Apache License 2.0
      24120Updated Dec 14, 2024Dec 14, 2024
    • Miner-PDF-Benchmark

      Public archive
      MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
      Python
      Apache License 2.0
      52000Updated Dec 11, 2024Dec 11, 2024
    • LOKI

      Public
      The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
      Python
      112210Updated Nov 24, 2024Nov 24, 2024
    • Python
      Apache License 2.0
      2934560Updated Nov 22, 2024Nov 22, 2024
    • .github

      Public
      2000Updated Sep 12, 2024Sep 12, 2024
    • ECCV2024_Parrot Captions Teach CLIP to Spot Text
      Python
      Apache License 2.0
      26330Updated Sep 6, 2024Sep 6, 2024
    • The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"
      JavaScript
      11310Updated Sep 2, 2024Sep 2, 2024
    • MLS-BRN

      Public
      [CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
      Python
      25650Updated Aug 30, 2024Aug 30, 2024
    • datasets resource
      99720Updated Aug 9, 2024Aug 9, 2024
    • CHARM

      Public
      [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs
      Python
      Apache License 2.0
      22800Updated Jul 27, 2024Jul 27, 2024
    • magic-doc

      Public
      Python
      Apache License 2.0
      33401220Updated Jul 26, 2024Jul 26, 2024
    • dsdl-sdk

      Public
      Jupyter Notebook
      Apache License 2.0
      61300Updated May 29, 2024May 29, 2024
    • dsdl-docs

      Public
      Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)
      HTML
      Apache License 2.0
      64610Updated May 29, 2024May 29, 2024
    • MLLM-DataEngine: An Iterative Refinement Approach for MLLM
      Python
      Apache License 2.0
      44100Updated May 24, 2024May 24, 2024
    • Python
      Apache License 2.0
      32620Updated May 13, 2024May 13, 2024
    • WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。
      01210Updated Apr 18, 2024Apr 18, 2024
    • VIGC

      Public
      AAAI 2024: Visual Instruction Generation and Correction
      Python
      Apache License 2.0
      39130Updated Feb 4, 2024Feb 4, 2024