0% found this document useful (0 votes)
11 views4 pages

System Becomes Unresponsive

Uploaded by

April Sky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views4 pages

System Becomes Unresponsive

Uploaded by

April Sky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

System becomes unresponsive with message "INFO: task <process>… blocked for more than 120 seconds".

- Red Hat Customer Portal 27/02/24, 11:10 AM

Subscriptions Downloads Containers Support Cases

Products & Services Knowledgebase System becomes unresponsive with message "INFO: task
<process>:<pid> blocked for more than 120 secon...

System becomes unresponsive with


message "INFO: task <process>:<pid>
blocked for more than 120 seconds".
! SOLUTION VERIFIED - Updated May 10 2023 at 12:09 PM - English

Environment
Red Hat Enterprise Linux (RHEL) 9
Red Hat Enterprise Linux (RHEL) 8
Red Hat Enterprise Linux (RHEL) 7
Red Hat Enterprise Linux (RHEL) 6
Red Hat Enterprise Linux (RHEL) 5.5 (kernel-2.6.18-194) or above
Processes in uninterruptible sleep (D state)

Issue
/var/log/messages had a series of following messages before the system
became unresponsive:

INFO: task <process>:<pid> blocked for more than 120 seconds

What is the reason behind above messages and what kind of information is
needed to troubleshoot them?

Resolution

https://access.redhat.com/solutions/31453 Page 1 of 12
System becomes unresponsive with message "INFO: task <process>… blocked for more than 120 seconds". - Red Hat Customer Portal 27/02/24, 11:10 AM

These messages typically mean that the system is experiencing resource


congestion like memory,I/O,network and semaphore,etc. and processes are
being starved of available resources.

See CAUTIONS section with "How do I use hung task check?" for
possible situations that can cause false triggering of stalled task logic
due to misconfiguration of the system.

These messages serve as a warning that something may not be operating


optimally. They do not necessarily indicate a serious problem and any
blocked processes should eventually proceed when the system recovers.
To troubleshoot further, generate a vmcore as described in the article How
do I use hung task check in RHEL ? and engage Red Hat Support for analysis
of the vmcore.
If the hung task is a third-party application, please engage the application
vendor in the issue meanwhile.
If the hung task messages are known to be erroneous, they can be disabled.
This is not advised, and it will not prevent any hangs. To disable the
messages, use

# sysctl kernel.hung_task_timeout_secs=0

If it causes the crash and the causes are known to be erroneous, please
disable it by putting the below in /etc/sysctl.conf. Once have the below value
in the file, please run 'sysctl -p' to apply the change.

kernel.hung_task_panic = 0

Root Cause
In kernel version 2.6.18-194 of Red Hat Enterprise Linux 5.5, the Detect
Hung Task kernel thread ( khungtaskd ) was added, which provides the
ability to detect tasks stuck in D-state ( Uninterruptible Sleep (UN) ) longer
than a specified time period (120 seconds by default) and results in following
type of message in system log file ( i.e /var/log/messages ).

https://access.redhat.com/solutions/31453 Page 2 of 12
System becomes unresponsive with message "INFO: task <process>… blocked for more than 120 seconds". - Red Hat Customer Portal 27/02/24, 11:10 AM

INFO: task <process>:<pid> blocked for more than 120 seconds

The khungtaskd kernel thread monitors the process states and checks if
there are any processes stuck in uninterruptible state for the period set in
"kernel.hung_task_timeout_secs" sysctl parameter (default is 120 seconds).
If so it will log the message along with a call trace of the process that is
blocked. The hung task watchdog will by default only report the first 10
instances of detected hung tasks and then will disable reporting. This is to
prevent the system log from being flooded with messages. This means that
even though no more blocked task messages are being reported there may
still be processes becoming stuck for long periods.

Diagnostic Steps
Look for log messages like:

INFO: task syslogd:2643 blocked for more than 120 seconds.


"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
message.
syslogd D ffff81000237eaa0 0 2643 1 2646 2634
(NOTLB)
ffff8101352c3d88 0000000000000086 ffff8101352c3d98
ffffffff80063ff8
0000000000001000 0000000000000009 ffff81013d2c57e0
ffff810102ac1820
0000340b30708992 0000000000000571 ffff81013d2c59c8
000000010000089f
Call Trace:
[<ffffffff80063ff8>] thread_return+0x62/0xfe
[<ffffffff88036d8a>] :jbd:log_wait_commit+0xa3/0xf5
[<ffffffff800a1ba4>] autoremove_wake_function+0x0/0x2e
[<ffffffff8803178a>] :jbd:journal_stop+0x1cf/0x1ff
[<ffffffff8002ff40>] __writeback_single_inode+0x1e9/0x328
[<ffffffff800e1464>] do_readv_writev+0x26e/0x291
[<ffffffff800f3d9d>] sync_inode+0x24/0x33
[<ffffffff8804c36d>] :ext3:ext3_sync_file+0xc9/0xdc
[<ffffffff8005073a>] do_fsync+0x52/0xa4
[<ffffffff800e1ce9>] __do_fsync+0x23/0x36
[<ffffffff8005e28d>] tracesys+0xd5/0xe0

Capture the following data while the issue is occurring:

https://access.redhat.com/solutions/31453 Page 3 of 12
System becomes unresponsive with message "INFO: task <process>… blocked for more than 120 seconds". - Red Hat Customer Portal 27/02/24, 11:10 AM

# uname -a > /tmp/uname.out


# ifconfig > /tmp/ifcongfig.out
# top -n 5 -b > /tmp/top.out
# vmstat 1 50 > /tmp/vm.out
# iostat -x 2 10 > /tmp/io.out
# ps aux > /tmp/ps.out
# ps auxH > /tmp/psh.out
# sar -A > /tmp/sar.out
# free > /tmp/free.out
# lsof > /tmp/lsof.out

Then provide:

# tar -cjvf outputs.tar.bz2 /tmp/*.out


# tar -cjvf message.tar.bz2 /var/log/message*

Check if it's using remote filesystem such as NFS which can cause of delay
when there's high number of requests on NFS server

$ grep MYAPP lsof | grep oracle


MYAPP 7096 1062 cwd DIR 0,18
0 558301 /oracle/prd/fs_ne/inst/conc/log
...
$ grep nfs mount
remotehost@:/ on /oracle/prd/fs_ne/inst type nfs (rw,nosuid)

Product(s) Red Hat Enterprise Linux Component kernel

Category Troubleshoot

Tags hang messaging nfs rhel_5 rhel_6 rhel_7

This solution is part of Red Hat’s fast-track publication program, providing a huge
library of solutions that Red Hat engineers have created while supporting our
customers. To give you the knowledge you need the instant it becomes available,
these articles may be presented in a raw and unedited form.

https://access.redhat.com/solutions/31453 Page 4 of 12

You might also like