On Mon, 11 May 2020 12:40:15 +0000 Gestió Servidors sysadmin.caos@uab.cat wrote:
Hello,
My server is running kernel 3.10.0-1062.12.1 on CentOS Linux release 7.7.1908. For some weeks now, the server has been restarting after XFS errors. Logs in /var/crash reported this information: [...]
...
[443804.295922] blk_update_request: I/O error, dev sda, sector 72607920
[443804.295969] sd 0:0:0:0: rejecting I/O to offline device
...
10332927480
[443804.321376] sd 0:0:0:0: rejecting I/O to offline device
[443804.321384] XFS (dm-2): metadata I/O error: block 0xf00001
...
It seems the problem is either the hard disks or the XFS module, but the physical RAID controller reports that all hard disks are OK, so I suppose the problem is the XFS module. Also, I have read about something similar in this forum (https://bugs.centos.org/view.php?id=16960).
Just like your logs, the thread you're referring to had SCSI errors before the XFS errors. The thread ended with the OP concluding that it was a problem on the hypervisor side (nothing to do with kernel/xfs on the guest).
If you have a hardware RAID controller serving that block device, I'd look at that (and possibly its driver).
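
To check that ordering across a full log rather than by eye, a minimal Python sketch along these lines could be used. It assumes dmesg-style lines with a leading [seconds] timestamp. The function names and the matched substrings are illustrative only, taken from the excerpts quoted above, and would need extending for other error messages.

```python
import re

# Assumption: each log line looks like "[seconds.micros] message" (dmesg style).
LINE_RE = re.compile(r"\[\s*(\d+\.\d+)\]\s*(.*)")


def classify(message):
    """Return 'xfs', 'block', or None for one kernel log message.

    The substrings below come from the quoted excerpts only; they are
    not a complete list of SCSI/block-layer or XFS error messages.
    """
    if message.startswith("XFS"):
        return "xfs"
    if "I/O error, dev" in message or "rejecting I/O to offline device" in message:
        return "block"
    return None


def first_errors(lines):
    """Map each error class to the timestamp of its first occurrence."""
    first = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip elided or truncated lines
        kind = classify(match.group(2))
        if kind and kind not in first:
            first[kind] = float(match.group(1))
    return first


# The three complete lines quoted in this thread.
sample = [
    "[443804.295922] blk_update_request: I/O error, dev sda, sector 72607920",
    "[443804.295969] sd 0:0:0:0: rejecting I/O to offline device",
    "[443804.321384] XFS (dm-2): metadata I/O error: block 0xf00001",
]

result = first_errors(sample)
if result.get("block", float("inf")) < result.get("xfs", float("inf")):
    print("block/SCSI errors were logged before the first XFS error")
```

With the quoted lines, the first block-layer error is logged before the first XFS error, which is consistent with the filesystem reacting to a device that has already gone offline.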
Could someone tell me whether the latest kernel version for CentOS-7 (3.10.0-1127) solves this problem?
Since neither is clear or confirmed, it is not likely that anyone can say...
/Peter