Textricator

2 likes

Textricator is a tool for extracting text from computer-generated PDFs and generating structured data . If you have a bunch of PDFs with the same format (or one big, consistently formatted PDF) and you want to extract the data to CSV or JSON,.

Cost / License

Free
Open Source (AGPL-3.0)

Application type

Web Scraping Tool

Platforms

Mac
Windows
Linux

Textricator alternatives

2likes

1comment

12alternatives

0articles

Features

No features, maybe you want to suggest one?

Textricator News & Activities

Highlights All activities

Recent activities

No activities found.

Textricator information

Developed by
Measures for Justice (MFJ)
Licensing
Open Source (AGPL-3.0) and Free product.
Alternatives
12 alternatives listed
Supported Languages
- English

GitHub repository

353 Stars
38 Forks
12 Open Issues
Updated Mar 14, 2025

View on GitHub

Popular alternatives

View all

Our users have written 1 comments and reviews about Textricator, and it has gotten 2 likes

Textricator was added to AlternativeTo by Hugo Albarracin on May 12, 2020 and this page was last updated May 12, 2020.

Comments and Reviews

Top Positive Comment

Hugo Albarracin

★

May 12, 2020

Textricator is a tool to extract text from documents and generate structured data. https://textricator.mfj.io

Featured in Lists

Data mining

A list with 33 apps by 6feriolimarco without a description.

List by Francisco Ferioli Marco with 33 apps, updated Aug 16, 2020

What is Textricator?

Textricator is a tool to extract text from documents and generate structured data.

If you have a bunch of PDFs with the same format (or one big, consistently formatted PDF) and you want to extract the data to CSV or JSON, Textricator can help! It can even work on OCR'ed documents!

Textricator is released under the GNU Affero General Public License Version 3.

Textricator is deployed to Maven Central with GAV io.mfj:textricator.

This application is actively used and developed by Measures for Justice. We welcome feedback, bug reports, and contributions. Create an issue, send a pull request, or email us at textricator@mfj.io. If you use Textricator, please let us know. Send us your mailing address and we will mail you a sticker.

io.mfj.textricator.Textricator is the main entry point for library usage.

io.mfj.textricator.cli.TextricatorCli is the command-line interface.

The CLI has three subcommands, to use the three main features of Textricator:

text - Extract text from the PDF and generate JSON. table - Parse the text that is in columns and rows. See Table section. form - Parse the text with a configured finite state machine. See Form section.

Textricator

Cost / License

Application type

Platforms

Textricator

Features

Tags

Textricator News & Activities

Recent activities

Textricator information

Developed by

Licensing

Alternatives

Supported Languages

GitHub repository

Popular alternatives

Comments and Reviews

Featured in Lists

Data mining

What is Textricator?

Official Links

AppStores & Other Links

Social Networks