Docling parses PDF, DOCX, PPTX, HTML, and other formats into a rich, unified representation that includes document layout and tables, making them ready for generative AI workflows such as RAG. This integration provides Docling’s capabilities via the DoclingLoader document loader.

Installation and setup

Install langchain-docling with your package manager, for example pip:
pip install langchain-docling

Document loader

The DoclingLoader class in langchain-docling seamlessly integrates Docling into LangChain, enabling you to:
  • use various document types in your LLM applications with ease and speed, and
  • leverage Docling’s rich representation for advanced, document-native grounding.
Basic usage looks as follows:
from langchain_docling import DoclingLoader

FILE_PATH = ["https://arxiv.org/pdf/2408.09869"]  # Docling Technical Report

loader = DoclingLoader(file_path=FILE_PATH)

docs = loader.load()
For end-to-end usage, check out this example.
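Once the documents are loaded, it can help to glance at what came back before wiring them into a pipeline. The following is a minimal sketch, not part of langchain-docling: preview_documents is a hypothetical helper that relies only on the page_content and metadata attributes of LangChain Document objects. The sample objects and their metadata are illustrative stand-ins so the snippet runs without downloading anything; in practice you would pass the result of loader.load().

```python
from types import SimpleNamespace


def preview_documents(docs, width=60):
    """Return one summary line per document: index, metadata keys, text preview.

    Hypothetical helper: works on any object exposing `page_content` (str)
    and `metadata` (dict), as LangChain Document objects do.
    """
    lines = []
    for i, doc in enumerate(docs):
        # Collapse runs of whitespace so the preview fits on one line.
        text = " ".join(doc.page_content.split())
        preview = text if len(text) <= width else text[:width] + "..."
        keys = ", ".join(sorted(doc.metadata))
        lines.append(f"[{i}] metadata keys: {keys or '(none)'} | {preview}")
    return lines


# Stand-in objects (assumed values) so the sketch runs offline.
# In practice: docs = loader.load(); preview_documents(docs)
sample_docs = [
    SimpleNamespace(
        page_content="Docling  Technical\nReport",
        metadata={"source": "https://arxiv.org/pdf/2408.09869"},
    ),
    SimpleNamespace(page_content="x" * 100, metadata={}),
]

for line in preview_documents(sample_docs):
    print(line)
```

Listing the metadata keys alongside a short text preview is a quick way to see which grounding information each loaded document carries.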

Additional resources