What is html2text?
html2text is a command line utility, written in C++, that converts HTML documents into plain text.
Each HTML document is loaded from a location indicated by a URI or read from standard input, and formatted into a stream of plain text characters that is written to standard output or into an output-file. The input-URI may specify a remote site, from that the documents are loaded via the Hypertext Transfer Protocol (HTTP).
The program is able to preserve the original positions of table fields, allows you to set the screen width (to a given number of output characters), and accepts also syntactically incorrect input (attempting to interpret it "reasonably"). Boldface and underlined text is rendered by default with backspace sequences (which is particulary usefull when piping the program's output into "less" or an other pager). All rendering properties can largely be costomised trough an RC-file.
- Developed byMartin Bayer
- LicensingOpen Source and Free product.
- Alternatives3 alternatives listed
Our users have written 0 comments and reviews about html2text, and it has gotten 1 likes
Comments and Reviews Post a comment/review
html2text platform details
It is reported to run on Cygwin.