Re: [CentOS] Aide error "Caught SIGBUS/SEGV"

List overview All Threads
Download

newer

older

CentOS-announce Digest, Vol 63,...

Install of 5.4 i386 formatting is...

m.roth＠5-cent.us

7 May 2010 7 May '10

4:35 p.m.

Brian wrote:

...

...
-----Original Message----- From: centos-bounces@centos.org [mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey Sent: Friday, May 07, 2010 10:15 AM To: centos@centos.org Subject: Re: [CentOS] Aide error "Caught SIGBUS/SEGV"

Bowie Bailey wrote:

...
One of my servers has recently started giving an error every time I run "aide --check". I ran it manually twice today with the same results. The second time, I added the -V flag, but that didn't give me anything useful. The system is currently running CentOS 5.3.

Nothing on the system has changed recently (that I am aware of). The Aide database hasn't been updated in a few months.

This is the error I am seeing:

Caught SIGBUS/SEGV while mmapping. File was truncated while aide was running? Caught SIGBUS/SEGV. Exiting

<snip>

...

Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I *assume* you've made sure that the filesystem isn't full.

mark, who had to deal with someone doing that today

Show replies by date

Bowie Bailey

7 May 7 May

4:55 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

m.roth@5-cent.us wrote:

...

Brian wrote:

...
...
-----Original Message----- From: centos-bounces@centos.org [mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey Sent: Friday, May 07, 2010 10:15 AM To: centos@centos.org Subject: Re: [CentOS] Aide error "Caught SIGBUS/SEGV"

Bowie Bailey wrote:

...
One of my servers has recently started giving an error every time I run "aide --check". I ran it manually twice today with the same results. The second time, I added the -V flag, but that didn't give me anything useful. The system is currently running CentOS 5.3.

Nothing on the system has changed recently (that I am aware of). The Aide database hasn't been updated in a few months.

This is the error I am seeing:

Caught SIGBUS/SEGV while mmapping. File was truncated while aide was running? Caught SIGBUS/SEGV. Exiting

<snip>

...
Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I *assume* you've made sure that the filesystem isn't full.
     mark, who had to deal with someone doing that today

Yes, I have 46G available, so that shouldn't be an issue.

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

-- Bowie

Bowie Bailey

6:23 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

Bowie Bailey wrote:

...

m.roth@5-cent.us wrote:

...
Brian wrote:

...
...
-----Original Message----- From: centos-bounces@centos.org [mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey Sent: Friday, May 07, 2010 10:15 AM To: centos@centos.org Subject: Re: [CentOS] Aide error "Caught SIGBUS/SEGV"

Bowie Bailey wrote:

...
One of my servers has recently started giving an error every time I run "aide --check". I ran it manually twice today with the same results. The second time, I added the -V flag, but that didn't give me anything useful. The system is currently running CentOS 5.3.

Nothing on the system has changed recently (that I am aware of). The Aide database hasn't been updated in a few months.

This is the error I am seeing:

Caught SIGBUS/SEGV while mmapping. File was truncated while aide was running? Caught SIGBUS/SEGV. Exiting

<snip>

...
Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I *assume* you've made sure that the filesystem isn't full.
     mark, who had to deal with someone doing that today
Yes, I have 46G available, so that shouldn't be an issue.

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

No dice. I tried running 'aide --init' and it died with the exact same error.

Maybe I should just try reinstalling it. Any other ideas?

-- Bowie

m.roth＠5-cent.us

6:41 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

...

Bowie Bailey wrote:

...
m.roth@5-cent.us wrote:

...
Brian wrote:

...
...
[mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey Bowie Bailey wrote:

...
One of my servers has recently started giving an error every time I run "aide --check". I ran it manually twice today with the same results. The second time, I added the -V flag, but that didn't give me anything useful. The system is currently running CentOS 5.3.

<snip>

...

...
...
...
Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

No dice. I tried running 'aide --init' and it died with the exact same error.

Maybe I should just try reinstalling it. Any other ideas?

mysqldump. Have you looked at the logs for mysql itelf?

mark

Bowie Bailey

6:55 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

m.roth@5-cent.us wrote:

...

...
Bowie Bailey wrote:

...
m.roth@5-cent.us wrote:

...
Brian wrote:

...
...
[mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey Bowie Bailey wrote:

> One of my servers has recently started giving an error every time I > run "aide --check". I ran it manually twice today with the > same results. The second time, I added the -V flag, but that didn't > give me anything useful. The system is currently running CentOS > 5.3. >

<snip>

...
...
...
...
Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

No dice. I tried running 'aide --init' and it died with the exact same error.

Maybe I should just try reinstalling it. Any other ideas?

mysqldump. Have you looked at the logs for mysql itelf?

What does mysql have to do with it? I don't have mysql installed on this machine.

However, the comment about looking at the logs pointed me to a related issue. I am seeing this in my logs:

kernel: attempt to access beyond end of device kernel: dm-0: rw=0, want=4344463064, limit=126550016

Looks like I may have some corruption on the disk. When I get a chance, I'll take it down and run fsck to see if that will help.

-- Bowie

m.roth＠5-cent.us

7:56 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

...

m.roth@5-cent.us wrote:

...
...
Bowie Bailey wrote:

...
m.roth@5-cent.us wrote:

...
Brian wrote:

...
> [mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey > Bowie Bailey wrote: > >> One of my servers has recently started giving an error every time >> I run "aide --check". I ran it manually twice today with the >> same results. The second time, I added the -V flag, but that >> didn't give me anything useful. The system is currently running >> CentOS 5.3. >>

<snip>

...
...
...
...
Suggest: Rename your current database, and aide -i to build a new one, the aide -c to check it. If that works (aide -c on new database) I'd suspect (pulling stray thoughts out of /dev/chaos) that your current data base is corrupt enough that you can't check it.

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

No dice. I tried running 'aide --init' and it died with the exact same error.

Maybe I should just try reinstalling it. Any other ideas?

mysqldump. Have you looked at the logs for mysql itelf?

What does mysql have to do with it? I don't have mysql installed on this machine.

Sorry, don't know aide, but you mentioned a database. I was suggesting, in a broader sense, dumping the database to a backup and rebuilding the *entire* d/b, including the control files.

...

However, the comment about looking at the logs pointed me to a related issue. I am seeing this in my logs:
    kernel: attempt to access beyond end of device
    kernel: dm-0: rw=0, want=4344463064, limit=126550016
Looks like I may have some corruption on the disk. When I get a chance, I'll take it down and run fsck to see if that will help.

Ack! No, that doesn't look good at all. It's almost as though the disk is full, or there's something that makes the kernel think it is.

mark

Gary Greene

9:17 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

On 5/7/10 10:56 AM, "m.roth@5-cent.us" m.roth@5-cent.us wrote:

...

...
m.roth@5-cent.us wrote:

...
...
Bowie Bailey wrote:

...
m.roth@5-cent.us wrote:

...
Brian wrote: >> [mailto:centos-bounces@centos.org] On Behalf Of Bowie Bailey >> Bowie Bailey wrote: >> >>> One of my servers has recently started giving an error every time >>> I run "aide --check". I ran it manually twice today with the >>> same results. The second time, I added the -V flag, but that >>> didn't give me anything useful. The system is currently running >>> CentOS 5.3. >>>

<snip>

...
...
...
> Suggest: Rename your current database, and aide -i to build a new > one, the aide -c to check it. > If that works (aide -c on new database) I'd suspect (pulling stray > thoughts out of /dev/chaos) that your current data base is corrupt > enough that you can't check it. >

I will try re-initializing the database. That's a good idea that hadn't occurred to me for whatever reason... :)

No dice. I tried running 'aide --init' and it died with the exact same error.

Maybe I should just try reinstalling it. Any other ideas?

mysqldump. Have you looked at the logs for mysql itelf?

What does mysql have to do with it? I don't have mysql installed on this machine.

Sorry, don't know aide, but you mentioned a database. I was suggesting, in a broader sense, dumping the database to a backup and rebuilding the *entire* d/b, including the control files.

...
However, the comment about looking at the logs pointed me to a related issue. I am seeing this in my logs:
    kernel: attempt to access beyond end of device
    kernel: dm-0: rw=0, want=4344463064, limit=126550016
Looks like I may have some corruption on the disk. When I get a chance, I'll take it down and run fsck to see if that will help.
Ack! No, that doesn't look good at all. It's almost as though the disk is full, or there's something that makes the kernel think it is.
    mark
CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos

Check the inode allocation on this box, it might well be that you have _space_, just no available inodes.

-- Gary L. Greene, Jr. IT Operations Minerva Networks, Inc. Cell: (650) 704-6633 Phone: (408) 240-1239

Bowie Bailey

9:33 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

Gary Greene wrote:

...

On 5/7/10 10:56 AM, "m.roth@5-cent.us" m.roth@5-cent.us wrote:

...
...
m.roth@5-cent.us wrote:

...
...
Bowie Bailey wrote:

...
However, the comment about looking at the logs pointed me to a related issue. I am seeing this in my logs:
    kernel: attempt to access beyond end of device
    kernel: dm-0: rw=0, want=4344463064, limit=126550016
Looks like I may have some corruption on the disk. When I get a chance, I'll take it down and run fsck to see if that will help.
Ack! No, that doesn't look good at all. It's almost as though the disk is full, or there's something that makes the kernel think it is.
    mark
CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos
Check the inode allocation on this box, it might well be that you have _space_, just no available inodes.

No, only 2% of the inodes are in use.

-- Bowie

JohnS

8 May 8 May

3:11 p.m.

New subject: Aide error "Caught SIGBUS/SEGV"

On Fri, 2010-05-07 at 15:33 -0400, Bowie Bailey wrote:

...

...
...
...
    kernel: attempt to access beyond end of device
    kernel: dm-0: rw=0, want=4344463064, limit=126550016
Looks like I may have some corruption on the disk. When I get a chance, I'll take it down and run fsck to see if that will help.

--- No that would be a Bug. Look it up at rh bugzilla. Do a fsck on the file system and back it up.

John

5559

Age (days ago)

5560

Last active (days ago)

discuss@lists.centos.org

8 comments

4 participants

tags (0)

participants (4)

Bowie Bailey
Gary Greene
JohnS
m.roth＠5-cent.us