[CentOS] host controller halted... scarry error

Phil Schaffner Philip.R.Schaffner at NASA.GOV
Wed Aug 3 14:11:54 UTC 2005


On Wed, 2005-08-03 at 10:23 +0200, Ulrik S. Kofod wrote:
> Yesterday i was copying a few dump files (backups) about 1GB in size from my centOS
> 3.4, on a samba share, to my windows, when the centOS box stopped responding.
> 
> The HDD LED was on and after I connected a monitor all I could see was this error
> message over and over again:
> 
> "usb-uhci.c: host controller halted, trying to restart"
> 
> I wasn't able to login or anything, so I saw no other solution than pressing the
> reset button.
> 
> When it rebooted it forced a hdd check, but was unable to mount /dev/hde2 (that
> useually mounts on /var !) and pretty much nothing works without /var.
> The error message mount gave was something like "Unable to mount /dev/hde2: invalid
> argument".

Hard to see what that has to do with usb-uhci - hde is apparently on an
IDE controller and usb-uhci is USB.  Could be a memory or MB issue.  I'd
try running memtest86+ and monitor logs for errors.  Checking all disks
with smart is also indicated.

> The files I was copying are located on /dev/md0 (raid 0 over 4 disks) and that still
> worked fine, / is mounted on /dev/hde1 and that also worked as expected.
> 
> I removed /var from /etc/fstab and restored a backup of /var to the dir /var and
> then I could boot normally again.
> 
> Trying to save what was on /dev/hdde2 I ran a e2fsck -p /dev/hde2 and that corrected
> a ton of errors (deleted a lot of data), I then mounted /dev/hde2 to another folder
> and restored my backups so I only lost a few hours of data. After I added /var to
> /etc/fstab again everything worked as normal again.
> 
> My question is happened!? and what can I do to avoid this again?
> If /dev/hde2 had been a RAID 1 would it then have rebuild? Should I move /var to a
> RAID 1?

Wouldn't hurt, but why not do the whole system on RAID 1 if you're going
to that trouble?

> I have copied large files like this before without problems, was it just bad luck or
> should I expect it to do this again?

Again - I'd suspect some underlying hardware problems.

> /dev/hde is attached to a cheap ide ultra ata 133 pci controller card (Silicon
> Image) that has worked flawlessly for about a year. Can that be broken? Right now it
> seems OK again. I have a replacement for it but I would rather not replace it if it
> isn’t necessary.

I'd guess disks, memory, and MB before the controller - emphasis on
GUESS.

> PS. Try backups! You won’t regret it :)

Follow your own advice! :-)

Good luck,
Phil





More information about the CentOS mailing list