[CentOS] OT: .doc,.xls,.pdf,.ppt (etc.) string parser/indexers

Rajagopal Swaminathan raju.rajsand at gmail.com
Mon Aug 31 18:07:55 UTC 2009


On Mon, Aug 31, 2009 at 10:38 PM, Les Mikesell<lesmikesell at gmail.com> wrote:
> Wouldn't that have to be run under windows?
Indeed. That was where that particular requirement was: one app wanted
full-text search across a bunch of .doc (etc.) files.

But I demonstrated the POC on CentOS, with the Sun Java stack and the
other dependencies needed to run Apache Solr (a wrapper around the
Lucene API).

I know I am not being precise enough here, but you get the drift.
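
For illustration only, here is a minimal sketch of how a binary document
might be handed to Solr for full-text indexing. It assumes Solr's
extracting request handler (Solr Cell) is enabled at /update/extract,
which is not necessarily how the POC was set up. The host, port, file
name and document id are hypothetical.

```python
import urllib.parse
import urllib.request


def build_extract_url(solr_base, doc_id, commit=True):
    """Build the URL for Solr's extracting request handler.

    Assumes the handler is mapped at /update/extract. literal.id sets
    the unique key of the indexed document.
    """
    params = {"literal.id": doc_id}
    if commit:
        params["commit"] = "true"
    return solr_base.rstrip("/") + "/update/extract?" + urllib.parse.urlencode(params)


def index_file(solr_base, path, doc_id):
    """POST the raw file bytes to Solr and return the HTTP status code."""
    with open(path, "rb") as fh:
        body = fh.read()
    req = urllib.request.Request(
        build_extract_url(solr_base, doc_id),
        data=body,
        headers={"Content-Type": "application/octet-stream"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status


# Hypothetical usage against a local Solr instance:
# index_file("http://localhost:8983/solr", "report.doc", "doc1")
```

Once indexed this way, the extracted text should be searchable through
the normal Solr query interface.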

> I'm not sure anything does visio, though.

I have not tried that.

Thanks and Regards

