Log File Reviewing

List overview All Threads
Download

newer

older

CD burning issues & questions

Anyone got Diskless BOOT working...

Joseph L. Casale

5 Jan 2009 5 Jan '09

6:45 p.m.

I need to review a logfile with Sed and cut out all the lines that start with a certain word, problem is this word begins after some amount of whitespace and unless I search for whitespace at the beginning followed by "word" I may encounter "word" somewhere legitimately hence why I don't just search for "word" only...

Anyone know how to make sed accomplish this?

Thanks! jlc

Show replies by date

Bill Campbell

5 Jan 5 Jan

7:03 p.m.

On Mon, Jan 05, 2009, Joseph L. Casale wrote:

...

I need to review a logfile with Sed and cut out all the lines that start with a certain word, problem is this word begins after some amount of whitespace and unless I search for whitespace at the beginning followed by "word" I may encounter "word" somewhere legitimately hence why I don't just search for "word" only...

Anyone know how to make sed accomplish this?

There's always more than one way to do something like this:

sed -n '/^[ \t]*word\s/p' /var/log/messages

pcregrep '^\s*word\b' /var/log/messages

awk '$1 == "word"{print}' /var/log/messages

Bill

-- INTERNET: bill@celestial.com Bill Campbell; Celestial Software LLC URL: http://www.celestial.com/ PO Box 820; 6641 E. Mercer Way Voice: (206) 236-1676 Mercer Island, WA 98040-0820 Fax: (206) 232-9186 It is necessary for the welfare of society that genius should be privileged to utter sedition, to blaspheme, to outrage good taste, to corrupt the youthful mind, and generally to scandalize one's uncles. -- George Bernard Shaw

Spiro Harvey

7:23 p.m.

...

awk '$1 == "word"{print}' /var/log/messages

This example assumes that word is the first field and that it consists only of "word". If the first field is "word1" this won't match.

Fixes for this are

awk '$1 ~ "word"{print}'

(this matches any occurrance of "word" in the first field)

or:

awk '/^[[:space:]]*word/ {print}'

(this matches any line starting with whitespace followed immediately by "word")

-- Spiro Harvey Knossos Networks Ltd 021-295-1923 www.knossos.net.nz

Paul Heinlein

7:10 p.m.

On Mon, 5 Jan 2009, Joseph L. Casale wrote:

...

I need to review a logfile with Sed and cut out all the lines that start with a certain word, problem is this word begins after some amount of whitespace and unless I search for whitespace at the beginning followed by "word" I may encounter "word" somewhere legitimately hence why I don't just search for "word" only...

The regex you want is "^[[:space:]]*word"

-- Paul Heinlein <> heinlein@madboa.com <> http://www.madboa.com/

Joseph L. Casale

7:56 p.m.

...

The regex you want is "^[[:space:]]*word"

Wow, thanks everyone for the help! How does one modify this to also knock out lines that *must* have whitespace followed by a number [0-9]? I can do it using "^[[:space:]]*[0-9]" but it also takes out lines w/o whitespace that begin with numbers?

I have to buy a book on RegEx's and Sed :)

Thanks all! jlc

Spiro Harvey

8:14 p.m.

...

[0-9]? I can do it using "^[[:space:]]*[0-9]" but it also takes out lines w/o whitespace that begin with numbers?

to match one or more, use + instead of *.

* matches 0 or more, + matches 1 or more.

...

I have to buy a book on RegEx's and Sed :)

http://www.gnu.org/manual/gawk/gawk.pdf

(G)awk is pretty sh!t hot where I work; however we've extended it a bit. :)

-- Spiro Harvey Knossos Networks Ltd 021-295-1923 www.knossos.net.nz

Joseph L. Casale

8:40 p.m.

...

to match one or more, use + instead of *.

matches 0 or more, + matches 1 or more.

Thanks!

...

...
I have to buy a book on RegEx's and Sed :)

http://www.gnu.org/manual/gawk/gawk.pdf

(G)awk is pretty sh!t hot where I work; however we've extended it a bit. :)

So gawk does all that sed does and more? I suppose I can start with that in this case, I always wanted a book on regexe's so I think I am going to order O'Reilly's Mastering Regular Expressions, Third Edition. They also have a sed & awk, Second Edition book, but its 10+ years old, does that matter, has sed/awk changed any since then?

Thanks everyone! jlc

Spiro Harvey

8:57 p.m.

...

So gawk does all that sed does and more? I suppose I can start with

Can't really answer that. In 15 years of using UNIX systems, I've never touched sed. :)

With Gawk's BEGIN and END blocks you can use it to write full programs, which is kind of nice.

...

that in this case, I always wanted a book on regexe's so I think I am going to order O'Reilly's Mastering Regular Expressions, Third Edition. They also have a sed & awk, Second Edition book, but its 10+ years old, does that matter, has sed/awk changed any since then?

The link I sent you is the 3rd edition of that book. Dated 2004. The book (Effective AWK Programming) is available completely free, but is also available in dead-tree editions. I printed and bound my PDF and saved a few dollars.

-- Spiro Harvey Knossos Networks Ltd 021-295-1923 www.knossos.net.nz

William L. Maltby

9:03 p.m.

On Mon, 2009-01-05 at 13:40 -0700, Joseph L. Casale wrote:

...

...
to match one or more, use + instead of *.

matches 0 or more, + matches 1 or more.

Thanks!

<snip>

...

So gawk does all that sed does and more? I suppose I can start with

Tons. You can write fairly complex programs with (g)awk. It can combine command line expressions, scripts from files, has formatted print capability, conditional execution, multiple regex selection capabilities and mode.

A read of the man page would give you a lot of insight. Think of perl in an earlier form. The original awk was probably what inspired perl. That would be my guess.

Since (g)awk is regex based, what you learn for sed, vi(m), etc. is easily transferred into (g)awk, and vice-versa, to a limited degree.

...

that in this case, I always wanted a book on regexe's so I think I am going to order O'Reilly's Mastering Regular Expressions, Third Edition. They also have a sed & awk, Second Edition book, but its 10+ years old, does that matter, has sed/awk changed any since then?

The man pages will allow you to keep up easily once the fundamentals are in place. Of course, frequency of use affects that greatly.

...

Thanks everyone! jlc

<snip>

-- Bill

Les Mikesell

9:05 p.m.

Joseph L. Casale wrote:

...

...
to match one or more, use + instead of *.

matches 0 or more, + matches 1 or more.

Thanks!

...
...
I have to buy a book on RegEx's and Sed :)

http://www.gnu.org/manual/gawk/gawk.pdf

(G)awk is pretty sh!t hot where I work; however we've extended it a bit. :)

So gawk does all that sed does and more? I suppose I can start with that in this case, I always wanted a book on regexe's so I think I am going to order O'Reilly's Mastering Regular Expressions, Third Edition. They also have a sed & awk, Second Edition book, but its 10+ years old, does that matter, has sed/awk changed any since then?

Why not just start with perl which does more than sed/awk while using similar syntax (if you want)?

-- Les Mikesell lesmikesell@gmail.com

Spiro Harvey

9:12 p.m.

...

Why not just start with perl which does more than sed/awk while using similar syntax (if you want)?

This is why:

awk '/^[[:space:]]*word/ {print}' logfile

perl -ne 'if (/^\s*word/) { print $_; }' logfile

Which syntax is likely to be easier to remember?

-- Spiro Harvey Knossos Networks Ltd 021-295-1923 www.knossos.net.nz

Les Mikesell

9:58 p.m.

Spiro Harvey wrote:

...

...
Why not just start with perl which does more than sed/awk while using similar syntax (if you want)?

This is why:

awk '/^[[:space:]]*word/ {print}' logfile

vs

perl -ne 'if (/^\s*word/) { print $_; }' logfile

Which syntax is likely to be easier to remember?

I never remember the awk syntax because if it is really that simple I'd use grep with it's implied print. But it's almost never really that simple and you end up needing things that are difficult in awk but easy in perl. Perl can use the posix names for character classes too if you like to type and how can you forget the 'if (expresssion) {action}; syntax? Also you could have omitted the $_ argument to print, since it is assumed if you are looking for simplicity.

-- Les Mikesell lesmikesell@gmail.com

Bill Campbell

10:22 p.m.

On Tue, Jan 06, 2009, Spiro Harvey wrote:

...

...
Why not just start with perl which does more than sed/awk while using similar syntax (if you want)?

This is why:

awk '/^[[:space:]]*word/ {print}' logfile

vs

perl -ne 'if (/^\s*word/) { print $_; }' logfile

Which syntax is likely to be easier to remember?

It depends entirely on what you want to do. For on-liners, sed, awk, and grep, and pcregrep (grep using perl regular expression syntax which is considerably more concise than [:space:] and friends) are often the best tools. For anything more complex, scripting languages such as python and perl are generally more flexible and easier to use.

I used to some pretty complex shell and awk scripts before learning perl about 20 years ago. Perl allowed me to do most things in a single language including fairly low-level system calls that I previously had to do with compiled ``C'' programs.

I have switched all of my new development primarily to python which I find far cleaner than perl, and easier to use for large projects. Python uses perl regular expression syntax so the transition was pretty painless.

Bill

-- INTERNET: bill@celestial.com Bill Campbell; Celestial Software LLC URL: http://www.celestial.com/ PO Box 820; 6641 E. Mercer Way Voice: (206) 236-1676 Mercer Island, WA 98040-0820 Fax: (206) 232-9186 "The liberties of a people never were, nor ever will be, secure, when the transactions of their rulers may be concealed from them." -- Patrick Henry

Les Mikesell

10:46 p.m.

Bill Campbell wrote:

...

I used to some pretty complex shell and awk scripts before learning perl about 20 years ago. Perl allowed me to do most things in a single language including fairly low-level system calls that I previously had to do with compiled ``C'' programs.

And you can probably still run all of your perl scripts unchanged, with the possible exception of "@array" being interpolated in double-quoted strings which I think started in perl4.

...

I have switched all of my new development primarily to python which I find far cleaner than perl, and easier to use for large projects. Python uses perl regular expression syntax so the transition was pretty painless.

Don't count on the same stability with python. It has an annoying habit of changing syntax in non-backwards compatible ways with no provision for running old scripts. If you run your programs on more than one machine you'll end up having to maintain different versions to match the installed interpreters.

-- Les Mikesell lesmikesell@gmail.com

Spiro Harvey

11:49 p.m.

Les Mikesell lesmikesell@gmail.com wrote:

...

Don't count on the same stability with python. It has an annoying habit of changing syntax in non-backwards compatible ways with no

You seem to be hell-bent (excuse the pun) on turning this into a jihad on scripting languages. Please take the credo of your own favoured religion, sorry, language into account: There's more than one way to do it.

Cope.

-- Spiro Harvey Knossos Networks Ltd 021-295-1923 www.knossos.net.nz

Les Mikesell

6 Jan 6 Jan

5:11 p.m.

Spiro Harvey wrote:

...

Les Mikesell lesmikesell@gmail.com wrote:

...
Don't count on the same stability with python. It has an annoying habit of changing syntax in non-backwards compatible ways with no

You seem to be hell-bent (excuse the pun) on turning this into a jihad on scripting languages. Please take the credo of your own favoured religion, sorry, language into account: There's more than one way to do it.

Cope.

There are hard ways and easy ways. I tend to prefer the easy ways and thought others might too.

-- Les Mikesell lesmikesell@gmail.com

Bill Campbell

12:02 a.m.

On Mon, Jan 05, 2009, Les Mikesell wrote:

...

Bill Campbell wrote:

...
I used to some pretty complex shell and awk scripts before learning perl about 20 years ago. Perl allowed me to do most things in a single language including fairly low-level system calls that I previously had to do with compiled ``C'' programs.

And you can probably still run all of your perl scripts unchanged, with the possible exception of "@array" being interpolated in double-quoted strings which I think started in perl4.

I think that was perl-5, but I may well be mistaken. I have found some changes in perl along the way that have required fixing scripts since I started in perl-3.something, but not many.

...

...
I have switched all of my new development primarily to python which I find far cleaner than perl, and easier to use for large projects. Python uses perl regular expression syntax so the transition was pretty painless.

Don't count on the same stability with python. It has an annoying habit of changing syntax in non-backwards compatible ways with no provision for running old scripts. If you run your programs on more than one machine you'll end up having to maintain different versions to match the installed interpreters.

I have not run into many (any) compatibility issues with python, but then I have only been doing python for a bit over 4 years now. As I remember, there were some issues with the early versions of python-2.4, but those were in the python builds, not in the syntax of python itself.

I tend to stay away of the more esoteric features of languages that are likely to change so don't generally have problems of this type.

We don't have problems with multiple versions of packages as we use the ones from the OpenPKG portable packaging system which includes its own versions of python, perl, gcc, berkeley db, etc. avoiding most problems with the underlying distribution/vendor's packages. There were some issues when we moved to CentOS from SuSE in that SuSE ran ran python-2.3.x while CentOS has python-2.4.x which caused some interesting shared library issues with the OpenPKG python-2.4.x (which we are running for Zope compatibility as the version of Zope we're running doesn't work with python-2.5.x.

Python-3 definately has backwards compatibility issues, and there are lengthy explanations as to why this is so.

Bill

Kai Schaetzl

11:31 a.m.

com>

Bill Campbell wrote on Mon, 5 Jan 2009 16:02:29 -0800:

...

(which we are running for Zope compatibility as the version of Zope we're running doesn't work with python-2.5.x.

you did realize that this is another python compatibility issue, did you ;-)

Kai

-- Kai Schätzl, Berlin, Germany Get your web at Conactive Internet Services: http://www.conactive.com

Bill Campbell

4:52 p.m.

On Tue, Jan 06, 2009, Kai Schaetzl wrote:

...

com>

Bill Campbell wrote on Mon, 5 Jan 2009 16:02:29 -0800:

...
(which we are running for Zope compatibility as the version of Zope we're running doesn't work with python-2.5.x.

you did realize that this is another python compatibility issue, did you ;-)

True enough :-).

Bill

-- INTERNET: bill@celestial.com Bill Campbell; Celestial Software LLC URL: http://www.celestial.com/ PO Box 820; 6641 E. Mercer Way Voice: (206) 236-1676 Mercer Island, WA 98040-0820 Fax: (206) 232-9186 Make no laws whatever concerning speech and, speech will be free; so soon as you make a declaration on paper that speech shall be free, you will have a hundred lawyers proving that freedom does not mean abuse, nor liberty license; and they will define and define freedom out of existence. - Voltarine de Cleyre (1866-1912)

Steve Huff

5 Jan 5 Jan

8:15 p.m.

On Jan 5, 2009, at 2:56 PM, Joseph L. Casale wrote:

...

...
The regex you want is "^[[:space:]]*word"

Wow, thanks everyone for the help! How does one modify this to also knock out lines that *must* have whitespace followed by a number [0-9]? I can do it using "^[[:space:]]*[0-9]" but it also takes out lines w/o whitespace that begin with numbers?

^[[:space:]]+[[:digit:]]+

will hit numbers with one or more digits. to restrict the number of digits, use something like

^[[:space:]]+[[:digit:]]{2}[^[:digit:]]+

that, for example, should only hit lines that consist of at least one whitespace character, then exactly two digits, then at least one non- digit character.

-steve

-- If this were played upon a stage now, I could condemn it as an improbable fiction. - Fabian, Twelfth Night, III,v

Paul Heinlein

8:16 p.m.

On Mon, 5 Jan 2009, Joseph L. Casale wrote:

...

...
The regex you want is "^[[:space:]]*word"

Wow, thanks everyone for the help! How does one modify this to also knock out lines that *must* have whitespace followed by a number [0-9]? I can do it using "^[[:space:]]*[0-9]" but it also takes out lines w/o whitespace that begin with numbers?

Probably something like "^[[:space:]]+[0-9]"

-- though that assumes you're using gawk (since the + modifier is GNU-specific).

For non-GNU awks, "^[[:space:]][[:space:]]*[0-9]"

...

I have to buy a book on RegEx's and Sed :)

Good idea!

-- Paul Heinlein <> heinlein@madboa.com <> http://www.madboa.com/

Joshua Gimer

7:19 p.m.

What about:

perl -ne 'if (/^\s*word/) { print $_; }' logfile

any others?

On Mon, Jan 5, 2009 at 11:45 AM, Joseph L. Casale JCasale@activenetwerx.com wrote:

...

I need to review a logfile with Sed and cut out all the lines that start with a certain word, problem is this word begins after some amount of whitespace and unless I search for whitespace at the beginning followed by "word" I may encounter "word" somewhere legitimately hence why I don't just search for "word" only...

Anyone know how to make sed accomplish this?

Thanks! jlc _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos

-- Thx Joshua Gimer

6038

Age (days ago)

6039

Last active (days ago)

discuss@lists.centos.org

21 comments

9 participants

tags (0)

participants (9)

Bill Campbell
Joseph L. Casale
Joshua Gimer
Kai Schaetzl
Les Mikesell
Paul Heinlein
Spiro Harvey
Steve Huff
William L. Maltby