[CentOS] Help with disk server stability issues

Tue Feb 14 12:54:38 UTC 2006
Peter Kjellström <cap at nsc.liu.se>

On Monday 13 February 2006 22:44, Andrew Zahn wrote:
> Hi All,
> I am looking for advice on how to cure an NFS server that crashes at
> least every few days, and sometimes every few hours. The kernel
> log file (below) points toward NFS as a likely cause.
> The system disk is a 3ware 8000 series RAID1 mirror. The data disk is
> using a 3Ware 9000 controller to produce two RAID1 devices; these are
> then striped (RAID0) in software to form a RAID 10 device.  We're using
> a 2.6 kernel, xfs filesystem, and NFS3/UDP.
> We're running CentOS 4.2 with a 2.6.9- kernel.  This kernel
> has xfs extensions, and we're running the xfs filesystem for /home

One thing to consider: the xfs module in the current centosplus kernels is
the same code as in kernel.org 2.6.9, which is ancient. I never got 2.6.9 xfs
to run stably under non-trivial loads and configurations.
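
A rough sketch of how one might check whether a given box is in this
situation, that is, running a 2.6.9-based kernel with the xfs module loaded.
The helper names (is_2_6_9_kernel, xfs_loaded) are made up for illustration,
and the check assumes xfs is built as a module and therefore shows up in
/proc/modules; it says nothing about which patches a vendor kernel carries.

```python
import platform


def is_2_6_9_kernel(release: str) -> bool:
    """Return True if a kernel release string is based on 2.6.9.

    The part before the first '-' is taken as the upstream base version,
    so a hypothetical '2.6.9-1.example' counts as 2.6.9-based.
    """
    base = release.split("-", 1)[0]
    return base == "2.6.9"


def xfs_loaded(proc_modules_text: str) -> bool:
    """Return True if 'xfs' is listed as a module name in /proc/modules text."""
    # Each line of /proc/modules starts with the module name.
    return any(
        line.split()[:1] == ["xfs"] for line in proc_modules_text.splitlines()
    )


if __name__ == "__main__":
    release = platform.release()
    try:
        with open("/proc/modules") as fh:
            modules = fh.read()
    except OSError:
        # Not Linux, or /proc is unavailable: treat as "no modules listed".
        modules = ""

    if is_2_6_9_kernel(release) and xfs_loaded(modules):
        print(f"{release}: 2.6.9-based kernel with the xfs module loaded")
    else:
        print(f"{release}: no 2.6.9-based xfs module detected")
```

Running it on the server only confirms the kernel base version and that xfs
is loaded; whether that xfs code is the cause of the crashes is a separate
question.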


> (obtained from CentOS website).
> In "lsmod" I see both 3w_xxxx and 3w_9xxx modules.
> NFS is over UDP, jumbo frames (9000), 32k rsize/wsize, async server,
> async clients, noac.
> This system has been serving /home in this configuration since October
> 2005; we've seen it crash rarely, but uptimes were usually on the order
> of months.  This past week, it can't seem to remain up for much longer
> than about a day.
> Kernel log file containing the crash:
> ...

  Peter Kjellström               |
  National Supercomputer Centre  |
  Sweden                         | http://www.nsc.liu.se