docext is a tool for extracting structured information from documents such as invoices, passports, and other forms. It uses vision-language models (VLMs) to identify and extract both individual fields and tabular data from document images.
Cost / License
- Free
- Open Source
Platforms
- Self-Hosted
- Docker
- Python

docext is the most popular Self-Hosted alternative to PDF Tables, and also the most popular Open Source one.