[CentOS] OT: .doc,.xls,.pdf,.ppt (etc.) string parser/indexers