    Repositories list

    • docling

      Public
      Get your documents ready for gen AI
      Python
      MIT License
      70114k968 Updated Dec 13, 2024
    • Python
      MIT License
      1053101 Updated Dec 13, 2024
    • A Python library to define and validate data types in Docling.
      Python
      MIT License
      173952 Updated Dec 13, 2024
    • Running Docling as an API service
      Makefile
      MIT License
      73033 Updated Dec 10, 2024
    • Interact with the Deep Search platform for new knowledge explorations and discoveries
      Python
      MIT License
      21139811 Updated Dec 9, 2024
    • Simple package to extract text with coordinates from programmatic PDFs
      C++
      MIT License
      103760 Updated Dec 9, 2024
    • Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
      C++
      MIT License
      62721 Updated Dec 9, 2024
    • CSS
      MIT License
      11000 Updated Dec 2, 2024
    • Examples using the Deep Search functionalities
      Python
      MIT License
      175004 Updated Nov 28, 2024
    • PatCID

      Public
      Python
      MIT License
      13320 Updated Nov 28, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      0800 Updated Nov 28, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      35300 Updated Nov 22, 2024
    • quackling

      Public archive
      Build document-native LLM applications
      Python
      MIT License
      25100 Updated Sep 11, 2024
    • Mognet is a fast, simple framework to build distributed applications using task queues.
      Python
      MIT License
      3901 Updated Aug 7, 2024
    • Python
      MIT License
      0600 Updated Jul 8, 2024
    • Python
      MIT License
      0700 Updated Jul 8, 2024
    • SemTabNet

      Public
      Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
      Python
      MIT License
      1600 Updated Jul 1, 2024
    • .github

      Public
      0100 Updated Jun 24, 2024
    • Repository to detect scientific software in documents for Chan Zuckerberg Initiative workshop
      Python
      MIT License
      1200 Updated Oct 26, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      16k100 Updated May 18, 2023
    • Website of the ICDAR 2023 DocLayNet competition
      2100 Updated Apr 26, 2023
    • DocLayNet

      Public
      DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
      Other
      1629030 Updated Feb 1, 2023
    • Example NLP Annotator API used for integrating with the IBM DeepSearch CPS platform
      Python
      Apache License 2.0
      41000 Updated Sep 8, 2022