[CentOS] converting .doc to html

Warren Young

warren at etr-usa.com
Fri Jun 22 20:11:14 UTC 2012

On 6/22/2012 8:40 AM, m.roth at 5-cent.us wrote:
> wvHtml works,
> but I don't like the output - it insists on <div>, and on &rhquo instead
> of plain, simple ".

You mean ”?

What's wrong with that?  You wanted HTML, and *any* browser will 
understand that HTML entity, even Lynx.

If you wanted "HTML I can read like an e-book", I'd say you should be 
converting to Markdown instead.  One path from Word to Markdown would be 
unrtf (https://www.gnu.org/software/unrtf/) to HTML, then HTML to 
Markdown via Pandoc (http://johnmacfarlane.net/pandoc/).

More information about the CentOS mailing list