[CentOS] excel parser (preferably perl)?

Wed Oct 20 21:46:27 UTC 2010
Les Mikesell <lesmikesell at gmail.com>

On 10/20/2010 3:49 PM, Kahlil Hodgson wrote:
> On 10/19/2010 09:13 AM, Les Mikesell wrote:
> ...
>> I haven't had much luck with
>> perl-Spreadsheet-ParseExcel (and find it odd that yum prefers the .32
>> version from epel over .57 from rpmforge anyway).  Is the current CPAN
>> version better?
> ...
>> Needs to deal with both xls and xlsx formats, the odd characters that
>> are confused with quotes even after csv conversion, numbers with $'s and
>> commas embedded, excel's date formatting nonsense, etc.
>
> Looking at the changelog, version .57 of perl-Spreadsheet-ParseExcel
> fixes some of the above issues.  Not clear from the above whether you
> have tried it or not.  The developers would probably be very interested
> in any examples that break the parser.

I only went as far as confirming that it won't take .xlsx (the default 
format in Excel 2007 and later).  The rpmforge .57 version does accept 
.xls files, although it seems slower than loading Excel and doing a 
'save as' to get the csv.  For numbers, $cell->unformatted() gives a 
real number, so there is no need to yank the commas out of the csv or 
$cell->value() output, but dates don't look like what SQL wants either 
way.
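
For what it's worth, here is a rough sketch of the cleanup arithmetic 
involved.  It is in Python only to show the conversion; the same steps 
should carry over to Perl.  It assumes the workbook uses Excel's 1900 
date system and that the unformatted value of a date cell is the raw 
serial day count, which I have not verified against every version.  The 
names excel_serial_to_sql and parse_amount are made up for illustration 
and are not part of any parser module.

```python
from datetime import datetime, timedelta
from decimal import Decimal

# Assumption: 1900 date system. Using 1899-12-30 as day zero absorbs
# Excel's phantom 1900-02-29, so results are only right for serials
# of 61 and up (1900-03-01 onward).
EXCEL_EPOCH = datetime(1899, 12, 30)


def excel_serial_to_sql(serial):
    """Turn an Excel serial date into 'YYYY-MM-DD' or 'YYYY-MM-DD HH:MM:SS'."""
    days = int(serial)
    # The fractional part is the time of day; round to whole seconds.
    seconds = round((serial - days) * 86400)
    moment = EXCEL_EPOCH + timedelta(days=days, seconds=seconds)
    if seconds % 86400 == 0:
        return moment.strftime("%Y-%m-%d")
    return moment.strftime("%Y-%m-%d %H:%M:%S")


def parse_amount(text):
    """Strip '$' and thousands separators from a csv-style amount."""
    return Decimal(text.replace("$", "").replace(",", "").strip())
```

With these, excel_serial_to_sql(40471) gives '2010-10-20' and 
parse_amount("$1,234.50") gives Decimal('1234.50').  Decimal rather 
than float keeps dollar amounts exact on the way into the database.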

-- 
   Les Mikesell
    lesmikesell at gmail.com