

Pix2text
Open-source Python 3 tool that uses lightweight models to recognize layouts, tables, math formulas (as LaTeX), and multilingual text in images and convert them into Markdown. It supports 80+ languages, runs offline, and extracts structured data for documentation, study, or research.
Cost / License
- Free
- Open Source (MIT)
Platforms
- Mac
- Windows
- Linux
Features
- OCR
- Multiple languages
- Image to text
- AI-Powered
Pix2text information
What is Pix2Text?
Pix2Text (P2T) is an open-source Python tool similar to Mathpix. It recognizes layouts, tables, images, text, and mathematical formulas, and merges the results into a single Markdown document. It can also convert PDF files to Markdown, whatever their content. P2T combines several models: a Layout Analysis Model, a Table Recognition Model, a Text Recognition Engine, a Mathematical Formula Detection Model, and a Mathematical Formula Recognition Model. The Text Recognition Engine supports over 80 languages, using the open-source OCR tool CnOCR for English and Simplified Chinese and EasyOCR for other languages. The two formula models locate mathematical formulas and transcribe them so they can be embedded in the Markdown output.
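The split between the two OCR engines described above can be sketched in a few lines of Python. This is only an illustration of the routing rule, not Pix2Text's own code: the function name `choose_ocr_engine`, the language codes `en` and `ch_sim`, and the choice to send mixed requests (for example English plus Vietnamese) to EasyOCR are all assumptions made for the example.

```python
# Hypothetical helper for illustration; it is not part of the Pix2Text API.
# Assumed language codes for English and Simplified Chinese.
CNOCR_LANGUAGES = {"en", "ch_sim"}


def choose_ocr_engine(languages):
    """Pick an engine name for the requested language codes.

    Returns "cnocr" when every requested language is English or
    Simplified Chinese, and "easyocr" otherwise (assumed behavior
    for mixed requests).
    """
    requested = set(languages)
    if not requested:
        raise ValueError("at least one language code is required")
    # Subset check: all requested languages are covered by CnOCR.
    return "cnocr" if requested <= CNOCR_LANGUAGES else "easyocr"


print(choose_ocr_engine(["en", "ch_sim"]))  # cnocr
print(choose_ocr_engine(["en", "vi"]))      # easyocr
```

Keeping the rule as a single subset check makes the fallback explicit: any language outside the assumed CnOCR set moves the whole request to EasyOCR.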






