docext

docext extracts structured information from documents such as invoices, passports, and other forms. It uses vision-language models (VLMs) to identify and extract both field data and tabular information from document images.

Cost / License

  • Free
  • Open Source

Platforms

  • Self-Hosted
  • Docker
  • Python

Features

  1.  OCR
  2.  Structured data
  3.  PDF OCR
  4.  REST API
  5.  Python-based
  6.  On-premises software

docext information

  • Developed by

    NanoNets (US)
  • Licensing

    Open Source (Apache-2.0) and Free product.
  • Alternatives

    11 alternatives listed
  • Supported Languages

    • English

AlternativeTo Categories

Development, Office & Productivity

GitHub repository

  •  1,814 Stars
  •  136 Forks
  •  21 Open Issues
docext was added to AlternativeTo by Paul.

What is docext?

Features:

  • User-friendly interface: Built with Gradio for easy document processing
  • Flexible extraction: Define custom fields or use pre-built templates
  • Table extraction: Extract structured tabular data from documents
  • Confidence scoring: Get confidence levels for extracted information
  • On-premises deployment: Run entirely on your own infrastructure
  • Multi-page support: Process documents with multiple pages
  • REST API: Programmatic access for integration with your applications
  • Pre-built templates: Ready-to-use templates for common document types:
    • Invoices
    • Passports
    • Add or delete fields and columns to adapt a template to other document types
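
The template customization and confidence scoring described above can be illustrated with a minimal Python sketch. It does not call docext and does not reflect docext's actual API or schema: the names FieldSpec, INVOICE_TEMPLATE, customize, and split_by_confidence are hypothetical, and the numeric 0-to-1 confidence scale and the 0.8 threshold are assumptions chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldSpec:
    """One field to extract (hypothetical shape, not docext's schema)."""
    name: str
    description: str = ""


# Hypothetical starting template for invoices.
INVOICE_TEMPLATE = [
    FieldSpec("invoice_number", "Identifier printed on the invoice"),
    FieldSpec("invoice_date", "Date the invoice was issued"),
    FieldSpec("total_amount", "Grand total including tax"),
]


def customize(template, add=(), remove=()):
    """Return a new template with `remove` names dropped and `add` fields appended."""
    removed = set(remove)
    kept = [f for f in template if f.name not in removed]
    existing = {f.name for f in kept}
    # Skip additions whose name is already present so names stay unique.
    kept.extend(f for f in add if f.name not in existing)
    return kept


def split_by_confidence(results, threshold=0.8):
    """Split {name: (value, confidence)} into accepted values and names to review.

    Assumes confidence is a number between 0 and 1 (an illustrative choice).
    """
    accepted = {n: v for n, (v, c) in results.items() if c >= threshold}
    review = sorted(n for n, (_, c) in results.items() if c < threshold)
    return accepted, review
```

In a workflow like this, fields that land in the review list would typically be routed to a person for checking, while accepted values flow on to downstream systems.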
