On 10/20/2010 3:49 PM, Kahlil Hodgson wrote:
> On 10/19/2010 09:13 AM, Les Mikesell wrote: ...
>> I haven't had much luck with perl-Spreadsheet-ParseExcel (and find it odd that yum prefers the .32 version from epel over .57 from rpmforge anyway). Is the current CPAN version better?
>> ...
>> Needs to deal with both xls and xlsx formats, the odd characters that are confused with quotes even after csv conversion, numbers with $'s and commas embedded, excel's date formatting nonsense, etc.
>
> Looking at the changelog, version .57 of perl-Spreadsheet-ParseExcel fixes some of the above issues. Not clear from the above whether you have tried it or not. The developers would probably be very interested in any examples that break the parser.
I only got as far as seeing that it wouldn't take .xlsx (the default format in Excel 2007 and later). The rpmforge .57 version does look like it will accept the .xls format file, although it seems slower than loading Excel and doing a 'save as' to get the csv. For numbers, $cell->unformatted() would give a real number instead of having to yank the commas out of the csv or $cell->value() versions, but dates don't look like what SQL wants either way.
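
For what it's worth, the cleanup itself is small once the values are in hand. Below is a rough sketch in Python (not Perl, and not tied to Spreadsheet::ParseExcel) of the two conversions: stripping $ and commas out of a formatted number, and turning a raw date value into something SQL will accept. The helper names parse_money and excel_serial_to_sql are made up for illustration. It assumes the raw date value is an Excel serial number in the default 1900 date system; workbooks saved with the 1904 date system would need a different epoch.

```python
from datetime import datetime, timedelta
from decimal import Decimal

# Day zero for Excel's 1900 date system (an assumption about the workbook).
# Only valid for serials >= 61 (1900-03-01), which sidesteps the
# fictitious 1900-02-29 that Excel counts.
EXCEL_EPOCH = datetime(1899, 12, 30)


def parse_money(text):
    """Turn a formatted string such as '$1,234.50' into a Decimal."""
    cleaned = text.strip().replace("$", "").replace(",", "")
    # Accounting formats show negatives in parentheses: (1,234.50)
    if cleaned.startswith("(") and cleaned.endswith(")"):
        cleaned = "-" + cleaned[1:-1]
    return Decimal(cleaned)


def excel_serial_to_sql(serial):
    """Convert an Excel date serial to a 'YYYY-MM-DD[ HH:MM:SS]' string."""
    moment = EXCEL_EPOCH + timedelta(days=serial)
    # A whole-number serial is a plain date; a fraction carries time of day.
    if float(serial).is_integer():
        return moment.strftime("%Y-%m-%d")
    return moment.strftime("%Y-%m-%d %H:%M:%S")


if __name__ == "__main__":
    print(parse_money("$1,234.50"))
    print(excel_serial_to_sql(40471))
    print(excel_serial_to_sql(40471.5))
```

Decimal is used rather than float so that currency amounts are not rounded on the way into the database.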