docext icon
docext icon

docext

 Like

docext is a powerful tool for extracting structured information from documents such as invoices, passports, and other forms. It leverages vision-language models (VLMs) to accurately identify and extract both field data and tabular information from document images.

docext screenshot 1

License model

  • FreeOpen Source

Country of Origin

  • US flagUnited States

Platforms

  • Self-Hosted
  • Docker
  • Python
  No rating
0likes
0comments
0news articles

Features

Suggest and vote on features
  1.  OCR
  2.  Structured data
  3.  PDF OCR
  4.  REST API
  5.  Python-based
  6.  On-premises software

 Tags

docext News & Activities

Highlights All activities

Recent activities

Show all activities

docext information

  • Developed by

    US flagNanoNets
  • Licensing

    Open Source (Apache-2.0) and Free product.
  • Written in

  • Alternatives

    10 alternatives listed
  • Supported Languages

    • English

AlternativeTo Categories

DevelopmentOffice & Productivity

GitHub repository

  •  965 Stars
  •  73 Forks
  •  10 Open Issues
  •   Updated Jun 13, 2025 
View on GitHub

Our users have written 0 comments and reviews about docext, and it has gotten 0 likes

docext was added to AlternativeTo by Paul on Apr 8, 2025 and this page was last updated Apr 8, 2025.
No comments or reviews, maybe you want to be first?
Post comment/review

What is docext?

docext is a powerful tool for extracting structured information from documents such as invoices, passports, and other forms. It leverages vision-language models (VLMs) to accurately identify and extract both field data and tabular information from document images.

Features:

  • User-friendly interface: Built with Gradio for easy document processing
  • Flexible extraction: Define custom fields or use pre-built templates
  • Table extraction: Extract structured tabular data from documents
  • Confidence scoring: Get confidence levels for extracted information
  • On-premises deployment: Run entirely on your own infrastructure
  • Multi-page support: Process documents with multiple pages
  • REST API: Programmatic access for integration with your applications
  • Pre-built templates: Ready-to-use templates for common document types:
    • Invoices
    • Passports
    • Add/delete new fields/columns for other templates.

Official Links