Greetings, On Wed, Feb 3, 2010 at 11:03 AM, Joseph L. Casale <jcasale at activenetwerx.com> wrote: > > You don't say much as to what bounds the words, spaces? Give more info, but > http://www.regular-expressions.info/unicode.html leads to some Perl solutions. Thanks for the quick reply. I have started perusing it. Perl is currently martian to me :) . Hope to gain fluency in that in the very near future. The said unicode strings (with multi-byte "points") may be bound by comma, single quotes, space etc. I am ready to sacrifice all characters except the [:alpha:] and unicode strings. Thanks again and Regards, Rajagopal