Pix2text

Open-source Python 3 tool that uses lightweight models to recognize layouts, tables, math formulas (as LaTeX), and multilingual text in images, and converts them into Markdown. It supports 80+ languages, runs offline, and extracts structured data for documentation, study, or research.

Cost / License

  • Free
  • Open Source (MIT)

Platforms

  • Mac
  • Windows
  • Linux


Pix2text information

  • Developed by

    breezedeus (CN)
  • Licensing

    Open Source (MIT) and Free product.
  • Alternatives

    2 alternatives listed
  • Supported Languages

    • English

AlternativeTo Categories

Office & Productivity, Development

GitHub repository

  •  3,141 Stars
  •  272 Forks
  •  36 Open Issues
Pix2text was added to AlternativeTo by eliasbuenosdias. Pix2text is sometimes referred to as P2T.

What is Pix2text?

Pix2Text (P2T) is an open-source Python tool similar to Mathpix. It recognizes layouts, tables, images, text, and mathematical formulas in an image and combines the results into Markdown. It can also convert PDF files into Markdown, regardless of their content. P2T combines several models: a Layout Analysis Model, a Table Recognition Model, a Text Recognition Engine, a Mathematical Formula Detection Model, and a Mathematical Formula Recognition Model. The Text Recognition Engine supports over 80 languages, using the open-source OCR tool CnOCR for English and Simplified Chinese and EasyOCR for other languages. The two formula models locate mathematical formulas and recognize them so that they can be included in the Markdown output.
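
The description above can be illustrated with a minimal sketch in plain Python. It does not call Pix2Text, CnOCR, or EasyOCR: the names `choose_text_engine`, `Element`, and `to_markdown` are hypothetical, the language codes `en` and `ch_sim` are assumed to follow EasyOCR-style naming, and top-to-bottom ordering is an assumed stand-in for whatever the layout model actually produces. It only shows the routing rule (CnOCR for English and Simplified Chinese, EasyOCR otherwise) and how recognized pieces might be joined into Markdown.

```python
from dataclasses import dataclass

# Assumption: EasyOCR-style codes, where "ch_sim" means Simplified Chinese.
CNOCR_LANGUAGES = {"en", "ch_sim"}


def choose_text_engine(languages):
    """Pick an OCR backend name for the requested languages (hypothetical helper)."""
    requested = set(languages)
    if not requested:
        raise ValueError("at least one language is required")
    # English and Simplified Chinese go to CnOCR; any other language goes to EasyOCR.
    return "CnOCR" if requested <= CNOCR_LANGUAGES else "EasyOCR"


@dataclass
class Element:
    """One recognized page element (hypothetical structure, not the P2T API)."""
    kind: str     # "text", "formula", or "table"
    content: str  # plain text, LaTeX source, or an already-rendered Markdown table
    top: float    # vertical position on the page, used here as the reading order


def to_markdown(elements):
    """Join recognized elements into Markdown, ordered from top to bottom."""
    parts = []
    for el in sorted(elements, key=lambda e: e.top):
        if el.kind == "formula":
            # Wrap LaTeX in display-math delimiters.
            parts.append(f"$$\n{el.content}\n$$")
        else:
            parts.append(el.content)
    return "\n\n".join(parts)
```

For example, `choose_text_engine(["en", "vi"])` selects EasyOCR because Vietnamese falls outside the CnOCR set, and `to_markdown` places a formula element after a text element that sits higher on the page.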
