Cost / License
- Paid
- Proprietary
Platforms
- Windows



+1





docext is a powerful tool for extracting structured information from documents such as invoices, passports, and other forms. It leverages vision-language models (VLMs) to accurately identify and extract both field data and tabular information from document images.

Detect and export HTML tables to CSV, Excel, JSON, NDJSON or SQL. Automatic cleaning, reusable profiles, and 100% local processing: your data never leaves your browser.




Docsumo's PDF table extractor is a life-saviour when you need to analyze or store data quickly from PDF documents. Docsumo extracts tables from scanned or regular PDF files in a matter of seconds and lets you download and store data in tabular form.
