[CentOS] (no subject) many .eml files with attachments to process

Wed Oct 11 14:48:08 UTC 2006
Paul Malinowski <paul at fish4.co.uk>

That's quite helpful, thanks,

At the end of the day I would like to find some tool in Centos
repository which as output will give me attachment in plain/text format
(attachment is either plain text either html, but if I will have a
attachment as plaint/text that would be enough to process content of
attachment)

Mails are coming from exchange server, and they are in eml format. Any
mime tool in centos which I can use straight away rather the write a
perl script with using some module from cpan?

Thanks in advance,
Paul Malinowski

-----Original Message-----
From: centos-bounces at centos.org [mailto:centos-bounces at centos.org] On
Behalf Of Will McDonald
Sent: 11 October 2006 15:26
To: CentOS mailing list
Subject: Re: [CentOS] (no subject)

On 11/10/06, Paul Malinowski <paul at fish4.co.uk> wrote:
> Hello,
>
> I'm looking for some package in Centos which will provide en/decoding
of
> mime formats. I have over 1000 outlook mails to process, and it would
be
> great to script that, but centos 4.4 doesn't have a metamail package
and
> I couldn't find anything connected to decoding and encoding of MIME.
>
> Is there any tool in Centos repository ?

In your previous email you stated these messages were .eml files? This
implies your messages were extracted from Outlook EXPRESS rather than
Outlook, this is an important distinction as they store messages
extracted to Explorer (via dragging, for example) in different
formats.

Looking at an .eml file on a test system, they appear to be fairly
straightforward MIME encoded messages and I'd have thought any number
of Perl/Python/whatever libraries would be able to handle extraction
of their attachments.

http://search.cpan.org/search?query=mime&mode=all
http://docs.python.org/lib/module-email.html
http://www.broobles.com/eml2mbox/index.html

Outlook's extracted message format, .msg seems a little more
"special", http://www.fileformat.info/format/outlookmsg/ It contains
multiple streams which would need decoding.

http://mail.python.org/pipermail/python-list/2004-May/219401.html
http://www.matijs.net/software/msgconv/

It would also appear you can import them into mbox format using
Thunderbird.

I seem to remember there are also a couple of libraries and associated
tools for converting dbx/mbx/pst mail stores into mbox.

http://alioth.debian.org/projects/libpst/

Will.
_______________________________________________
CentOS mailing list
CentOS at centos.org
http://lists.centos.org/mailman/listinfo/centos

This e-mail and all attachments have been scanned by the HighSpeed
Office virus scanning service powered by MessageLabs and no known
viruses were detected.

This electronic message transmission contains information from Fish4
Trading Limited and/or other Fish4 affiliated companies 
(specifically Fish4Homes Ltd, Fish4Cars Ltd, Fish4Jobs Ltd and Fish4
Ltd) that may be confidential or privileged. 
The information is intended solely for the recipient and use by any
other party is not authorized. If you are not the intended recipient, 
be aware that any disclosure, copying, distribution or use of the
contents of this electronic message transmission is prohibited.
If you have received this electronic message transmission in error,
please notify Fish4 immediately by telephone (0208 600 7000)
or by electronic mail (administrator at fish4.co.uk).  Thank you.

This e-mail and all attachments have been scanned by the HighSpeed Office virus scanning service powered by MessageLabs and no known viruses were detected.