Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Cost / License
- Free
- Open Source (Apache-2.0)
Platforms
- Windows
- Mac
- Linux






Easy-to-use PDF and email parser. Automatically extract text from emails and PDFs using our powerful OCR engine. Send extracted data to Google Sheets or hundreds of connected CRMs and applications.



A lightweight, elegant reference manager. Organize your library, extract annotations from PDFs, auto-enrich metadata, and export BibTeX or RIS. Your data stays local.



