Index
Docling simplifies document processing by parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the generative AI ecosystem.
Getting started
🐣 Ready to kick off your Docling journey? Let's dive right into it!
⬇️ Installation
Quickly install Docling in your environment ▶️ Quickstart
Get a jumpstart on basic Docling usage 🧩 Concepts
Learn Docling fundamentals and get a glimpse under the hood 🧑🏽🍳 Examples
Try out recipes for various use cases, including conversion, RAG, and more 🤖 Integrations
Check out integrations with popular AI tools and frameworks 📖 Reference
See more API details
Quickly install Docling in your environment ▶️ Quickstart
Get a jumpstart on basic Docling usage 🧩 Concepts
Learn Docling fundamentals and get a glimpse under the hood 🧑🏽🍳 Examples
Try out recipes for various use cases, including conversion, RAG, and more 🤖 Integrations
Check out integrations with popular AI tools and frameworks 📖 Reference
See more API details
Features
- 🗂️ Parsing of multiple document formats including PDF, DOCX, PPTX, XLSX, HTML, EPUB, WAV, MP3, WebVTT, email formats (EML, MSG), images (PNG, TIFF, JPEG, ...), LaTeX, DocLang, plain text, and more
- 📑 Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more
- 🧬 A unified, expressive DoclingDocument representation format
- ↪️ Various export formats and options, including Markdown, HTML, WebVTT, DocLang, DocTags and lossless JSON
- 📜 Support for several application-specific XML schemas including DocLang, USPTO patents, JATS articles, and XBRL financial reports.
- 🔒 Local execution capabilities for sensitive data and air-gapped environments
- 🤖 Plug-and-play integrations incl. LangChain, LlamaIndex, Crew AI & Haystack for agentic AI
- 🔍 Extensive OCR support for scanned PDFs and images
- 👓 Support for several Visual Language Models, such as (GraniteDocling)
- 🎙️ Audio support with Automatic Speech Recognition (ASR) models
- 🔌 Connect to any agent using the MCP server
- 🌐 Run Docling as a service with the API server (docling-serve)
- 💻 Simple and convenient CLI
What's new
- 📄 Parsing of ODF (OpenDocument Format) files for text documents (
.odt), spreadsheets (.ods), and presentations (.odp) - 💼 Parsing of XBRL (eXtensible Business Reporting Language) documents for financial reports
- 📧 Parsing of email files (
.eml,.msg) - 📚 Parsing of EPUB (Electronic Publication) files for e-books
- 📝 Parsing of plain-text files (
.txt,.text) and Markdown supersets (.qmd,.Rmd) - 📊 Chart understanding (Barchart, Piechart, LinePlot): convert them into tables or code and add detailed descriptions
Coming soon
- 📝 Metadata extraction, including title, authors, references & language
- 📝 Complex chemistry understanding (Molecular structures)
What's next
🚀 The journey has just begun! Join us and become a part of the growing Docling community.
Live assistant
Do you want to leverage the power of AI and get live support on Docling? Try out the Chat with Dosu functionalities provided by our friends at Dosu.
LF AI & Data
Docling is hosted as a project in the LF AI & Data Foundation.
IBM ❤️ Open Source AI
The project was started by the AI for knowledge team at IBM Research Zurich.