[CentOS] OT: .doc,.xls,.pdf,.ppt (etc.) string parser/indexers

Dave tdbtdb+centos at gmail.com
Sun Aug 30 00:13:05 UTC 2009


On Fri, Aug 28, 2009 at 7:20 AM, Les Mikesell<lesmikesell at gmail.com> wrote:
> Does anyone have experience with linux tools to parse the text from
> common non-text file formats for searching?

http://www.google.com/url?q=http://en.wikipedia.org/wiki/Pdftotext&ei=qsOZSreGOI_WtgOWooiiAg&sa=X&oi=spellmeleon_result&resnum=2&ct=result&usg=AFQjCNENpVi7xahbHDxv1oQm-gde8G2qIw

?



More information about the CentOS mailing list