Tabula
Tabula is a tool for liberating data tables locked inside PDF files. Extract tables from PDFs.
- Free • Open Source
- Mac
- Windows
- Linux
What is Tabula?
How Can Tabula Help Me? If you’ve ever tried to do anything with data provided to you in PDFs, you know how painful it is — there's no easy way to copy-and-paste rows of data out of PDF files. Tabula allows you to extract that data into a CSV or Microsoft Excel spreadsheet using a simple, easy-to-use interface. Tabula works on Mac, Windows and Linux.
Who Uses Tabula? Tabula is used to power investigative reporting at news organizations of all sizes, including ProPublica, The Times of London, Foreign Policy, La Nación (Argentina), The New York Times and the St. Paul (MN) Pioneer Press.
Grassroots organizations like SchoolCuts.org rely on Tabula to turn clunky documents into human-friendly public resources.
And researchers of all kinds use Tabula to turn PDF reports into Excel spreadsheets, CSVs, and JSON files for use in analysis and database applications.
Features Vote on or suggest new features
Supported Languages
- English
Comments and Reviews Post a comment / review all • positive • negative relevance • date
Tags
- pdf-to-csv
- pdf-to-json
- table
- auto-detect-structured-data
- extract-multipage-tables
- pdf-tool
- PDF Reader

Summary
Our users have written 1 comments and reviews about Tabula, and it has gotten 6 likes
- Developed by Manuel Aristarán, Mike Tigas and Jeremy B. Merrill
- Open Source (MIT) and Free product.
- Written in
- 17 alternatives listed
GitHub repository
- 5,720 Stars
- 587 Forks
- 542 Open Issues
- Updated Jun 21, 2022
Popular alternatives
- 1
- 2
- 3
Recent user activities on Tabula
- 2397165052 liked Tabula2323 days ago
- onomou added Tabula as alternative(s) to Docparser3 months ago
- NosaLee added Tabula as alternative(s) to PDF to DOC12 months ago
I like Tabula because it's the best, free, and opensource software that can quickly and easily extract structured data from PDFs.
It's quite powerful, yet easy to use. I've used it on PDF documents with table structured text data, with 100s of pages - to quickly extract this data into a csv format, enabling me to then import this data into a database.
It has a powerful Autodetect Tables feature, along with tools and settings to tweak and customise it's table recognition.
In my experience, it has been exceptionally accurate.
I've not used any paid tools, but this is THE best free tool. I have tried using Word, which does a very good job of converting PDFs into editable documents. But is a nightmare if you have a multipage document of structured data (say 400 page long table), and you want it to be converted to a structured format.
Cannot recommend more highly.