Wednesday, March 29, 2017

Tabula --- Free tool to extract tables in PDF as CSV or Microsoft Excel spreadsheet!

"Tabula was created by journalists for journalists and anyone else working with data locked away in PDFs. Tabula will always be free and open source."
Want to copy a table in a PDF document and save it as a CSV file or Excel spreadsheet? Just try the free and open source tool Tabula! It has a very simple and nice user interface (clicking, dragging to draw the selecting box) and works well for most PDF files containing data tables (only text-based PDFs, not scanned documents, though). Tabula is available on Mac, Windows (no installation is needed, just unzip and run) and Linux.
Official Site: http://tabula.technology/
Source Code: https://github.com/tabulapdf/tabula
Links:
https://ropensci.org/blog/blog/2017/04/18/tabulizer (tabulizer: R bindings to the tabula-java library for processing PDF tables programmatically)

No comments:

Post a Comment