Skip to content
Change the repository type filter

All

    Repositories list

    • MinerU

      Public
      A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
      Python
      GNU Affero General Public License v3.0
      98513k1627Updated Oct 24, 2024Oct 24, 2024
    • A Comprehensive Toolkit for High-Quality PDF Content Extraction
      Python
      GNU Affero General Public License v3.0
      3475.2k513Updated Oct 24, 2024Oct 24, 2024
    • DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
      Python
      GNU Affero General Public License v3.0
      1020300Updated Oct 23, 2024Oct 23, 2024
    • Data annotation component library --provided as NPM packages
      TypeScript
      Apache License 2.0
      156152Updated Oct 23, 2024Oct 23, 2024
    • The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
      Apache License 2.0
      11700Updated Oct 22, 2024Oct 22, 2024
    • LOKI

      Public
      The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
      Python
      110510Updated Oct 21, 2024Oct 21, 2024
    • labelU

      Public
      Data annotation toolbox supports image, audio and video data.
      Python
      75831100Updated Oct 17, 2024Oct 17, 2024
    • UniMERNet

      Public
      UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
      Python
      Apache License 2.0
      1718360Updated Sep 30, 2024Sep 30, 2024
    • .github

      Public
      2000Updated Sep 12, 2024Sep 12, 2024
    • ECCV2024_Parrot Captions Teach CLIP to Spot Text
      Python
      Apache License 2.0
      26030Updated Sep 6, 2024Sep 6, 2024
    • The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"
      JavaScript
      1500Updated Sep 2, 2024Sep 2, 2024
    • MLS-BRN

      Public
      [CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
      Python
      23620Updated Aug 30, 2024Aug 30, 2024
    • UrBench

      Public
      JavaScript
      Apache License 2.0
      0000Updated Aug 30, 2024Aug 30, 2024
    • LabelLLM

      Public
      The Open-Source Data Annotation Platform
      TypeScript
      Apache License 2.0
      4354640Updated Aug 12, 2024Aug 12, 2024
    • datasets resource
      88220Updated Aug 9, 2024Aug 9, 2024
    • MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
      Python
      Apache License 2.0
      51360Updated Aug 2, 2024Aug 2, 2024
    • CHARM

      Public
      [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs
      Python
      Apache License 2.0
      22300Updated Jul 27, 2024Jul 27, 2024
    • magic-doc

      Public
      Python
      Apache License 2.0
      25341170Updated Jul 26, 2024Jul 26, 2024
    • Python
      Apache License 2.0
      1924050Updated Jul 18, 2024Jul 18, 2024
    • dsdl-sdk

      Public
      Jupyter Notebook
      Apache License 2.0
      61300Updated May 29, 2024May 29, 2024
    • dsdl-docs

      Public
      Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)
      HTML
      Apache License 2.0
      64610Updated May 29, 2024May 29, 2024
    • MLLM-DataEngine: An Iterative Refinement Approach for MLLM
      Python
      Apache License 2.0
      43400Updated May 24, 2024May 24, 2024
    • Python
      Apache License 2.0
      32720Updated May 13, 2024May 13, 2024
    • WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。
      01210Updated Apr 18, 2024Apr 18, 2024
    • H2RSVLM

      Public
      H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
      Apache License 2.0
      14430Updated Apr 1, 2024Apr 1, 2024
    • VIGC

      Public
      AAAI 2024: Visual Instruction Generation and Correction
      Python
      Apache License 2.0
      39020Updated Feb 4, 2024Feb 4, 2024
    • HA-DPO

      Public
      Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
      Python
      Apache License 2.0
      56000Updated Jan 30, 2024Jan 30, 2024
    • 万卷1.0多模态语料
      Creative Commons Attribution 4.0 International
      26540190Updated Oct 20, 2023Oct 20, 2023
    • SDK of OpenDataLab - https://opendatalab.org.cn
      Python
      MIT License
      45622Updated Aug 1, 2023Aug 1, 2023
    • Python
      99420Updated May 16, 2023May 16, 2023