Hello Everyone,
Since rebooting my Centos 6.10 Openvz server "daisy" yesterday, I am
getting horrible system performance. /var/log/messages is full of
HDIO_GET_IDENTITY failed for /dev/sdb. The latest entries look like this:
Apr 22 08:51:32 daisy kernel: [141224.655699] CT: 1005: stopped
Apr 22 08:55:04 daisy ata_id[21513]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:00:05 daisy ata_id[21584]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:05:02 daisy ata_id[21644]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:10:01 daisy ata_id[22282]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:11:49 daisy kernel: [142441.721065] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:11:49 daisy kernel: [142441.721083] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:11:49 daisy kernel: [142441.721093] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:11:49 daisy kernel: [142441.721109] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:11:49 daisy kernel: [142441.721115] ffff88006654bcb8
0000000000000086 ffffffff8114f130 ffff88002821fa40
Apr 22 09:11:49 daisy kernel: [142441.721121] ffff88000004d238
ffff88006654bd70 ffff88006654bc88 ffffea00016ab7c0
Apr 22 09:11:49 daisy kernel: [142441.721125] ffff88011a707000
ffff880028321168 000000000001b7ea 0000816be9b3faa2
Apr 22 09:11:49 daisy kernel: [142441.721130] Call Trace:
Apr 22 09:11:49 daisy kernel: [142441.721139] [<ffffffff8114f130>] ?
sync_page+0x0/0x50
Apr 22 09:11:49 daisy kernel: [142441.721144] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:11:49 daisy kernel: [142441.721149] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:11:49 daisy kernel: [142441.721155] [<ffffffff81067432>] ?
check_preempt_curr+0x82/0xa0
Apr 22 09:11:49 daisy kernel: [142441.721159] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:11:49 daisy kernel: [142441.721162] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:11:49 daisy kernel: [142441.721167] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:11:49 daisy kernel: [142441.721172] [<ffffffff811f98d8>]
sync_inodes_sb_ub+0xa8/0x1d0
Apr 22 09:11:49 daisy kernel: [142441.721176] [<ffffffff8114fa6f>] ?
filemap_fdatawait+0x2f/0x40
Apr 22 09:11:49 daisy kernel: [142441.721181] [<ffffffff81200f85>]
__sync_filesystem+0x95/0xa0
Apr 22 09:11:49 daisy kernel: [142441.721184] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:11:49 daisy kernel: [142441.721188] [<ffffffff812016e5>]
sys_sync+0x155/0x1a0
Apr 22 09:11:49 daisy kernel: [142441.721192] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:13:49 daisy kernel: [142561.721069] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:13:49 daisy kernel: [142561.721087] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:13:49 daisy kernel: [142561.721096] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:13:49 daisy kernel: [142561.721112] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:13:49 daisy kernel: [142561.721118] ffff88006654bcb8
0000000000000086 ffffffff8114f130 ffff88002821fa40
Apr 22 09:13:49 daisy kernel: [142561.721123] ffff88000004d238
ffff88006654bd70 ffff88006654bc88 ffffea00016ab7c0
Apr 22 09:13:49 daisy kernel: [142561.721128] ffff88011a707000
ffff880028321168 000000000001b7ea 0000816be9b3faa2
Apr 22 09:13:49 daisy kernel: [142561.721133] Call Trace:
Apr 22 09:13:49 daisy kernel: [142561.721142] [<ffffffff8114f130>] ?
sync_page+0x0/0x50
Apr 22 09:13:49 daisy kernel: [142561.721148] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:13:49 daisy kernel: [142561.721153] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:13:49 daisy kernel: [142561.721158] [<ffffffff81067432>] ?
check_preempt_curr+0x82/0xa0
Apr 22 09:13:49 daisy kernel: [142561.721162] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:13:49 daisy kernel: [142561.721166] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:13:49 daisy kernel: [142561.721170] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:13:49 daisy kernel: [142561.721176] [<ffffffff811f98d8>]
sync_inodes_sb_ub+0xa8/0x1d0
Apr 22 09:13:49 daisy kernel: [142561.721180] [<ffffffff8114fa6f>] ?
filemap_fdatawait+0x2f/0x40
Apr 22 09:13:49 daisy kernel: [142561.721184] [<ffffffff81200f85>]
__sync_filesystem+0x95/0xa0
Apr 22 09:13:49 daisy kernel: [142561.721188] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:13:49 daisy kernel: [142561.721192] [<ffffffff812016e5>]
sys_sync+0x155/0x1a0
Apr 22 09:13:49 daisy kernel: [142561.721196] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:15:06 daisy ata_id[22299]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:15:49 daisy kernel: [142681.721085] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:15:49 daisy kernel: [142681.721104] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:15:49 daisy kernel: [142681.721113] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:15:49 daisy kernel: [142681.721129] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:15:49 daisy kernel: [142681.721136] ffff88006654bcb8
0000000000000086 ffffffff8114f130 ffff88002821fa40
Apr 22 09:15:49 daisy kernel: [142681.721141] ffff88000004d238
ffff88006654bd70 ffff88006654bc88 ffffea00016ab7c0
Apr 22 09:15:49 daisy kernel: [142681.721146] ffff88011a707000
ffff880028321168 000000000001b7ea 0000816be9b3faa2
Apr 22 09:15:49 daisy kernel: [142681.721150] Call Trace:
Apr 22 09:15:49 daisy kernel: [142681.721160] [<ffffffff8114f130>] ?
sync_page+0x0/0x50
Apr 22 09:15:49 daisy kernel: [142681.721166] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:15:49 daisy kernel: [142681.721172] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:15:49 daisy kernel: [142681.721178] [<ffffffff81067432>] ?
check_preempt_curr+0x82/0xa0
Apr 22 09:15:49 daisy kernel: [142681.721182] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:15:49 daisy kernel: [142681.721185] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:15:49 daisy kernel: [142681.721190] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:15:49 daisy kernel: [142681.721196] [<ffffffff811f98d8>]
sync_inodes_sb_ub+0xa8/0x1d0
Apr 22 09:15:49 daisy kernel: [142681.721200] [<ffffffff8114fa6f>] ?
filemap_fdatawait+0x2f/0x40
Apr 22 09:15:49 daisy kernel: [142681.721204] [<ffffffff81200f85>]
__sync_filesystem+0x95/0xa0
Apr 22 09:15:49 daisy kernel: [142681.721208] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:15:49 daisy kernel: [142681.721212] [<ffffffff812016e5>]
sys_sync+0x155/0x1a0
Apr 22 09:15:49 daisy kernel: [142681.721217] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:17:49 daisy kernel: [142801.721064] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:17:49 daisy kernel: [142801.721082] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:17:49 daisy kernel: [142801.721091] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:17:49 daisy kernel: [142801.721107] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:17:49 daisy kernel: [142801.721114] ffff88006654bcb8
0000000000000086 ffffffff8114f130 ffff88002821fa40
Apr 22 09:17:49 daisy kernel: [142801.721119] ffff88000004d238
ffff88006654bd70 ffff88006654bc88 ffffea00016ab7c0
Apr 22 09:17:49 daisy kernel: [142801.721124] ffff88011a707000
ffff880028321168 000000000001b7ea 0000816be9b3faa2
Apr 22 09:17:49 daisy kernel: [142801.721128] Call Trace:
Apr 22 09:17:49 daisy kernel: [142801.721137] [<ffffffff8114f130>] ?
sync_page+0x0/0x50
Apr 22 09:17:49 daisy kernel: [142801.721143] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:17:49 daisy kernel: [142801.721149] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:17:49 daisy kernel: [142801.721154] [<ffffffff81067432>] ?
check_preempt_curr+0x82/0xa0
Apr 22 09:17:49 daisy kernel: [142801.721158] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:17:49 daisy kernel: [142801.721162] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:17:49 daisy kernel: [142801.721166] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:17:49 daisy kernel: [142801.721172] [<ffffffff811f98d8>]
sync_inodes_sb_ub+0xa8/0x1d0
Apr 22 09:17:49 daisy kernel: [142801.721176] [<ffffffff8114fa6f>] ?
filemap_fdatawait+0x2f/0x40
Apr 22 09:17:49 daisy kernel: [142801.721180] [<ffffffff81200f85>]
__sync_filesystem+0x95/0xa0
Apr 22 09:17:49 daisy kernel: [142801.721184] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:17:49 daisy kernel: [142801.721188] [<ffffffff812016e5>]
sys_sync+0x155/0x1a0
Apr 22 09:17:49 daisy kernel: [142801.721192] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:20:01 daisy ata_id[22405]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:21:49 daisy kernel: [143041.721494] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:21:49 daisy kernel: [143041.721512] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:21:49 daisy kernel: [143041.721522] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:21:49 daisy kernel: [143041.721691] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:21:49 daisy kernel: [143041.721697] ffff88006654bcc8
0000000000000086 ffff88006654bc58 ffffffff810098af
Apr 22 09:21:49 daisy kernel: [143041.721702] ffff880028200000
000000001a42f238 ffff8800110101c0 ffff88011a42f200
Apr 22 09:21:49 daisy kernel: [143041.721706] ffff88006654bc68
ffffffff8107bbfe ffff8800110101c0 0000000000000000
Apr 22 09:21:49 daisy kernel: [143041.721711] Call Trace:
Apr 22 09:21:49 daisy kernel: [143041.721720] [<ffffffff810098af>] ?
__switch_to+0x16f/0x470
Apr 22 09:21:49 daisy kernel: [143041.721726] [<ffffffff8107bbfe>] ?
finish_task_switch+0xce/0x120
Apr 22 09:21:49 daisy kernel: [143041.721730] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:21:49 daisy kernel: [143041.721735] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:21:49 daisy kernel: [143041.721739] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:21:49 daisy kernel: [143041.721743] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:21:49 daisy kernel: [143041.721747] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:21:49 daisy kernel: [143041.721753] [<ffffffff811f9773>]
writeback_inodes_sb_nr_ub+0x83/0xb0
Apr 22 09:21:49 daisy kernel: [143041.721757] [<ffffffff811f9806>]
writeback_inodes_sb_ub+0x46/0x50
Apr 22 09:21:49 daisy kernel: [143041.721762] [<ffffffff81200f38>]
__sync_filesystem+0x48/0xa0
Apr 22 09:21:49 daisy kernel: [143041.721765] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:21:49 daisy kernel: [143041.721769] [<ffffffff812016d8>]
sys_sync+0x148/0x1a0
Apr 22 09:21:49 daisy kernel: [143041.721773] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:23:49 daisy kernel: [143161.721064] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:23:49 daisy kernel: [143161.721169] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:23:49 daisy kernel: [143161.721259] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:23:49 daisy kernel: [143161.721430] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:23:49 daisy kernel: [143161.721437] ffff88006654bcc8
0000000000000086 ffff88006654bc58 ffffffff810098af
Apr 22 09:23:49 daisy kernel: [143161.721442] ffff880028200000
000000001a42f238 ffff8800110101c0 ffff88011a42f200
Apr 22 09:23:49 daisy kernel: [143161.721447] ffff88006654bc68
ffffffff8107bbfe ffff8800110101c0 0000000000000000
Apr 22 09:23:49 daisy kernel: [143161.721451] Call Trace:
Apr 22 09:23:49 daisy kernel: [143161.721460] [<ffffffff810098af>] ?
__switch_to+0x16f/0x470
Apr 22 09:23:49 daisy kernel: [143161.721466] [<ffffffff8107bbfe>] ?
finish_task_switch+0xce/0x120
Apr 22 09:23:49 daisy kernel: [143161.721470] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:23:49 daisy kernel: [143161.721475] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:23:49 daisy kernel: [143161.721479] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:23:49 daisy kernel: [143161.721483] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:23:49 daisy kernel: [143161.721487] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:23:49 daisy kernel: [143161.721493] [<ffffffff811f9773>]
writeback_inodes_sb_nr_ub+0x83/0xb0
Apr 22 09:23:49 daisy kernel: [143161.721498] [<ffffffff811f9806>]
writeback_inodes_sb_ub+0x46/0x50
Apr 22 09:23:49 daisy kernel: [143161.721502] [<ffffffff81200f38>]
__sync_filesystem+0x48/0xa0
Apr 22 09:23:49 daisy kernel: [143161.721506] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:23:49 daisy kernel: [143161.721510] [<ffffffff812016d8>]
sys_sync+0x148/0x1a0
Apr 22 09:23:49 daisy kernel: [143161.721514] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:25:02 daisy ata_id[22445]: HDIO_GET_IDENTITY failed for '/dev/sdb'
Apr 22 09:25:49 daisy kernel: [143281.721066] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:25:49 daisy kernel: [143281.721159] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:25:49 daisy kernel: [143281.721244] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:25:49 daisy kernel: [143281.721408] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:25:49 daisy kernel: [143281.721415] ffff88006654bcc8
0000000000000086 ffff88006654bc58 ffffffff810098af
Apr 22 09:25:49 daisy kernel: [143281.721420] ffff880028200000
000000001a42f238 ffff8800110101c0 ffff88011a42f200
Apr 22 09:25:49 daisy kernel: [143281.721424] ffff88006654bc68
ffffffff8107bbfe ffff8800110101c0 0000000000000000
Apr 22 09:25:49 daisy kernel: [143281.721429] Call Trace:
Apr 22 09:25:49 daisy kernel: [143281.721438] [<ffffffff810098af>] ?
__switch_to+0x16f/0x470
Apr 22 09:25:49 daisy kernel: [143281.721444] [<ffffffff8107bbfe>] ?
finish_task_switch+0xce/0x120
Apr 22 09:25:49 daisy kernel: [143281.721448] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:25:49 daisy kernel: [143281.721453] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:25:49 daisy kernel: [143281.721457] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:25:49 daisy kernel: [143281.721461] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:25:49 daisy kernel: [143281.721465] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:25:49 daisy kernel: [143281.721471] [<ffffffff811f9773>]
writeback_inodes_sb_nr_ub+0x83/0xb0
Apr 22 09:25:49 daisy kernel: [143281.721476] [<ffffffff811f9806>]
writeback_inodes_sb_ub+0x46/0x50
Apr 22 09:25:49 daisy kernel: [143281.721480] [<ffffffff81200f38>]
__sync_filesystem+0x48/0xa0
Apr 22 09:25:49 daisy kernel: [143281.721484] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:25:49 daisy kernel: [143281.721487] [<ffffffff812016d8>]
sys_sync+0x148/0x1a0
Apr 22 09:25:49 daisy kernel: [143281.721492] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:27:49 daisy kernel: [143401.721072] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:27:49 daisy kernel: [143401.721165] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:27:49 daisy kernel: [143401.721253] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:27:49 daisy kernel: [143401.721421] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000080
Apr 22 09:27:49 daisy kernel: [143401.721427] ffff88006654bcc8
0000000000000086 ffff88006654bc58 ffffffff810098af
Apr 22 09:27:49 daisy kernel: [143401.721432] ffff880028200000
000000001a42f238 ffff8800110101c0 ffff88011a42f200
Apr 22 09:27:49 daisy kernel: [143401.721436] ffff88006654bc68
ffffffff8107bbfe ffff8800110101c0 0000000000000000
Apr 22 09:27:49 daisy kernel: [143401.721441] Call Trace:
Apr 22 09:27:49 daisy kernel: [143401.721450] [<ffffffff810098af>] ?
__switch_to+0x16f/0x470
Apr 22 09:27:49 daisy kernel: [143401.721456] [<ffffffff8107bbfe>] ?
finish_task_switch+0xce/0x120
Apr 22 09:27:49 daisy kernel: [143401.721460] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:27:49 daisy kernel: [143401.721465] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:27:49 daisy kernel: [143401.721469] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:27:49 daisy kernel: [143401.721473] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:27:49 daisy kernel: [143401.721477] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:27:49 daisy kernel: [143401.721483] [<ffffffff811f9773>]
writeback_inodes_sb_nr_ub+0x83/0xb0
Apr 22 09:27:49 daisy kernel: [143401.721487] [<ffffffff811f9806>]
writeback_inodes_sb_ub+0x46/0x50
Apr 22 09:27:49 daisy kernel: [143401.721492] [<ffffffff81200f38>]
__sync_filesystem+0x48/0xa0
Apr 22 09:27:49 daisy kernel: [143401.721495] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:27:49 daisy kernel: [143401.721499] [<ffffffff812016d8>]
sys_sync+0x148/0x1a0
Apr 22 09:27:49 daisy kernel: [143401.721503] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:29:49 daisy kernel: [143521.721059] INFO: task hdparm:22246
blocked for more than 120 seconds.
Apr 22 09:29:49 daisy kernel: [143521.721158] Not tainted
2.6.32-042stab142.1 #1
Apr 22 09:29:49 daisy kernel: [143521.721245] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 22 09:29:49 daisy kernel: [143521.721415] hdparm D
ffff88000c778300 0 22246 20845 0 0x00000084
Apr 22 09:29:49 daisy kernel: [143521.721421] ffff88006654bcc8
0000000000000086 ffff88006654bc58 ffffffff810098af
Apr 22 09:29:49 daisy kernel: [143521.721426] ffff880028200000
000000001a42f238 ffff8800110101c0 ffff88011a42f200
Apr 22 09:29:49 daisy kernel: [143521.721431] ffff88006654bc68
ffffffff8107bbfe ffff8800110101c0 0000000000000000
Apr 22 09:29:49 daisy kernel: [143521.721436] Call Trace:
Apr 22 09:29:49 daisy kernel: [143521.721445] [<ffffffff810098af>] ?
__switch_to+0x16f/0x470
Apr 22 09:29:49 daisy kernel: [143521.721451] [<ffffffff8107bbfe>] ?
finish_task_switch+0xce/0x120
Apr 22 09:29:49 daisy kernel: [143521.721455] [<ffffffff8107c851>] ?
update_curr+0xe1/0x1f0
Apr 22 09:29:49 daisy kernel: [143521.721460] [<ffffffff81566c55>]
schedule_timeout+0x215/0x2f0
Apr 22 09:29:49 daisy kernel: [143521.721465] [<ffffffff815669b4>]
wait_for_completion+0xe4/0x120
Apr 22 09:29:49 daisy kernel: [143521.721469] [<ffffffff81071ce0>] ?
default_wake_function+0x0/0x20
Apr 22 09:29:49 daisy kernel: [143521.721473] [<ffffffff815694db>] ?
_spin_unlock_bh+0x1b/0x20
Apr 22 09:29:49 daisy kernel: [143521.721479] [<ffffffff811f9773>]
writeback_inodes_sb_nr_ub+0x83/0xb0
Apr 22 09:29:49 daisy kernel: [143521.721483] [<ffffffff811f9806>]
writeback_inodes_sb_ub+0x46/0x50
Apr 22 09:29:49 daisy kernel: [143521.721487] [<ffffffff81200f38>]
__sync_filesystem+0x48/0xa0
Apr 22 09:29:49 daisy kernel: [143521.721491] [<ffffffff8120151d>]
sync_filesystems+0x30d/0x350
Apr 22 09:29:49 daisy kernel: [143521.721495] [<ffffffff812016d8>]
sys_sync+0x148/0x1a0
Apr 22 09:29:49 daisy kernel: [143521.721499] [<ffffffff81571424>]
system_call_fastpath+0x22/0x3a
Apr 22 09:30:04 daisy ata_id[22489]: HDIO_GET_IDENTITY failed for '/dev/sdb'
------------------
I tried running hdparm -tT /dev/sda, but after waiting 5+ minutes for
any command output I cancelled it.
I am rsyncing the data from this system over to another system now,
clearly something is wrong, but I can't tell what.
The system is an older AMD Opteron 180 processor (dual core) 4 GB ram,
RAID controller with RAID 5 set up with 4x 4TB Western Digital Drives.
I rebooted the system day before yesterday, and that's when the timeout
messages started pouring into the log.
when I run tw_cli /c8 show, all four drives say they are ok
[root@daisy cron.daily]# tw_cli /c8 show
Unit UnitType Status %RCmpl %V/I/M Stripe Size(GB) Cache
AVrfy
------------------------------------------------------------------------------
u0 RAID-5 OK - - 256K 11175.8 Ri ON
VPort Status Unit Size Type Phy Encl-Slot Model
------------------------------------------------------------------------------
p0 OK u0 3.63 TB SATA 0 - WDC
WD4005FZBX-00K5
p1 OK u0 3.63 TB SATA 1 - WDC
WD4005FZBX-00K5
p2 OK u0 3.63 TB SATA 2 - WDC
WD4005FZBX-00K5
p3 OK u0 3.63 TB SATA 3 - WDC
WD4005FZBX-00K5
Logical Volumes appear active:
[root@daisy cron.daily]# lvscan
ACTIVE '/dev/vg_daisy/lv_root' [10.89 TiB] inherit
ACTIVE '/dev/vg_daisy/lv_swap' [3.88 GiB] inherit
ACTIVE '/dev/vg_daisy/lv_home' [20.00 GiB] inherit
[root@daisy cron.daily]#
[root@daisy cron.daily]# lvmdiskscan
/dev/ram0 [ 16.00 MiB]
/dev/root [ 10.89 TiB]
/dev/ram1 [ 16.00 MiB]
/dev/sda1 [ 2.82 TiB]
/dev/vg_daisy/lv_swap [ 3.88 GiB]
/dev/ram2 [ 16.00 MiB]
/dev/vg_daisy/lv_home [ 20.00 GiB]
/dev/ram3 [ 16.00 MiB]
/dev/sda3 [ 842.87 GiB]
/dev/ram4 [ 16.00 MiB]
/dev/ram5 [ 16.00 MiB]
/dev/ram6 [ 16.00 MiB]
/dev/ram7 [ 16.00 MiB]
/dev/ram8 [ 16.00 MiB]
/dev/ram9 [ 16.00 MiB]
/dev/ram10 [ 16.00 MiB]
/dev/ram11 [ 16.00 MiB]
/dev/ram12 [ 16.00 MiB]
/dev/ram13 [ 16.00 MiB]
/dev/ram14 [ 16.00 MiB]
/dev/ram15 [ 16.00 MiB]
/dev/sdb1 [ 1.82 TiB] LVM physical volume
/dev/sdc1 [ 500.00 MiB]
/dev/sdc2 [ 4.00 TiB] LVM physical volume
/dev/sdd1 [ 4.00 TiB] LVM physical volume
/dev/sde1 [ 2.91 TiB] LVM physical volume
3 disks
19 partitions
0 LVM physical volume whole disks
4 LVM physical volumes
[root@daisy cron.daily]#
grub.conf:
[root@daisy grub]# cat grub.conf
# grub.conf generated by anaconda
#
# Note that you do not have to rerun grub after making changes to this file
# NOTICE: You have a /boot partition. This means that
# all kernel and initrd paths are relative to /boot/, eg.
# root (hd0,0)
# kernel /vmlinuz-version ro root=/dev/mapper/vg_daisy-lv_root
# initrd /initrd-[generic-]version.img
#boot=/dev/sdb
default=0
timeout=5
splashimage=(hd0,0)/grub/splash.xpm.gz
hiddenmenu
title OpenVZ (2.6.32-042stab142.1)
root (hd0,0)
kernel /vmlinuz-2.6.32-042stab142.1 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-042stab142.1.img
title OpenVZ (2.6.32-042stab141.3)
root (hd0,0)
kernel /vmlinuz-2.6.32-042stab141.3 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-042stab141.3.img
title OpenVZ (2.6.32-042stab140.4)
root (hd0,0)
kernel /vmlinuz-2.6.32-042stab140.4 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-042stab140.4.img
title OpenVZ (2.6.32-042stab140.1)
root (hd0,0)
kernel /vmlinuz-2.6.32-042stab140.1 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-042stab140.1.img
title OpenVZ (2.6.32-042stab139.1)
root (hd0,0)
kernel /vmlinuz-2.6.32-042stab139.1 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-042stab139.1.img
title CentOS 6 (2.6.32-754.el6.x86_64)
root (hd0,0)
kernel /vmlinuz-2.6.32-754.el6.x86_64 ro
root=/dev/mapper/vg_daisy-lv_root rd_NO_LUKS rd_LVM_LV=vg_daisy/lv_swap
LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 crashkernel=auto
rd_LVM_LV=vg_daisy/lv_root KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet
initrd /initramfs-2.6.32-754.el6.x86_64.img
-------------
Top is not showing anything out of the ordinary:
----------
[root@daisy grub]#
top - 09:41:57 up 1 day, 16:04, 3 users, load average: 5.89, 5.83, 5.43
Tasks: 369 total, 1 running, 368 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.2%us, 1.2%sy, 0.0%ni, 25.0%id, 73.5%wa, 0.0%hi, 0.2%si,
0.0%st
Mem: 3894628k total, 3861280k used, 33348k free, 95608k buffers
Swap: 4063228k total, 34888k used, 4028340k free, 3139272k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1266 root 20 0 0 0 0 D 1.0 0.0 12:27.75 flush-253:0
21041 1153 20 0 3188 1840 1012 D 0.7 0.0 0:00.72 imap
21599 97 20 0 5160 1940 1568 S 0.7 0.0 0:01.06 imap-login
22636 root 20 0 15272 1524 964 R 0.7 0.0 0:00.06 top
1977 root 20 0 2096 644 360 S 0.3 0.0 0:27.92 dovecot
22528 97 20 0 5160 2044 1672 S 0.3 0.1 0:00.35 imap-login
22578 1155 20 0 2904 1528 940 D 0.3 0.0 0:00.22 imap
1 root 20 0 19236 268 136 S 0.0 0.0 0:00.68 init
2 root 20 0 0 0 0 S 0.0 0.0 0:00.00 kthreadd
3 root RT 0 0 0 0 S 0.0 0.0 0:00.04 migration/0
4 root 20 0 0 0 0 S 0.0 0.0 0:01.88 ksoftirqd/0
5 root RT 0 0 0 0 S 0.0 0.0 0:00.00 stopper/0
6 root RT 0 0 0 0 S 0.0 0.0 0:00.19 watchdog/0
7 root RT 0 0 0 0 S 0.0 0.0 0:00.07 migration/1
8 root RT 0 0 0 0 S 0.0 0.0 0:00.00 stopper/1
9 root 20 0 0 0 0 S 0.0 0.0 0:03.17 ksoftirqd/1
10 root RT 0 0 0 0 S 0.0 0.0 0:00.20 watchdog/1
11 root 20 0 0 0 0 S 0.0 0.0 0:07.23 events/0
12 root 20 0 0 0 0 S 0.0 0.0 0:08.55 events/1
13 root 20 0 0 0 0 S 0.0 0.0 0:00.00 events/0
14 root 20 0 0 0 0 S 0.0 0.0 0:00.00 events/1
15 root 20 0 0 0 0 S 0.0 0.0 0:00.00 events_long/0
16 root 20 0 0 0 0 S 0.0 0.0 0:00.00 events_long/1
17 root 20 0 0 0 0 S 0.0 0.0 0:00.00
events_power_ef
18 root 20 0 0 0 0 S 0.0 0.0 0:00.00
events_power_ef
19 root 20 0 0 0 0 S 0.0 0.0 0:00.00 cgroup
20 root 20 0 0 0 0 S 0.0 0.0 0:00.00 khelper
21 root 20 0 0 0 0 S 0.0 0.0 0:00.01 netns
22 root 20 0 0 0 0 S 0.0 0.0 0:00.00 async/mgr
23 root 20 0 0 0 0 S 0.0 0.0 0:00.00 pm
24 root 20 0 0 0 0 S 0.0 0.0 0:00.29 sync_supers
------------
This is a company production mail server, and I can't find the solution,
I need help, as soon as someone is able, thank you!