SUMMARY: Urgent Response Required (DECthreads bugcheck terminated process execution)

From: Butler, Mathew (ButlerGM@logica.com)
Date: Tue Oct 15 2002 - 20:23:30 EDT


Thanks to Bobby Mackun (HP Support). THere was very little information on
this in any of the doco (DECthreads guide mentions th ebug check utility).

It is resource related and I am investigating our kernel parameters to fix
the problem.

[SNIP...]
The error from your application logs indicate a kernel resource shortage.
There are many factors to such a situation and the best approach to target
each one to determine whether or not it may be responsible.

Firstly, I would like to know how your system is currently tuned. Please use
"/sbin/sysconfig -q proc" to extract the values for the kernel "proc"
subsystem and email the output to me. I can identify if certain parameters
require modification.

Secondly, as mentioned before, the problem is most likely attributed to a
kernel resource issue. You may need to simply tune your OS based on the
needs of your application. The parameters that most likely needs adjustment
are:

per-proc-data-size
max-per-proc-data-size
max_threads_per_user

Thirdly, another factor could be a lack of sufficient swap area for this
system. Swap space low warnings and events are reported in your system log.
Find if there are any swap issues and add additional swap space if
necessary.

[SNIP...]

QUESTION
==========

Whilst running one of our recently upgraded systems (4.0D to 5.1, Oracle 7.3
-> 9.0.1.4.0), We encountered the following in a generated log files. The
process did not run.

[SNIP...]
%DECthreads bugcheck (version V3.18-042d), terminating execution.
% Reason: vpInitMultithread: (os/kern) resource shortage (6)
nxm_task_init(11fff3a40,3ffc01b7e90)
% Running on OSF1 V5.1(732) on AlphaServer 4100 5/533 4MB, 1024Mb; 2 CPUs,
pid 465847-directed.
DEBUG 08:57:24.176 15/9/102>> job log file is
/usr/users/nemint/N7PREI/build/server/log/job_1452080_pid_475969.log.
DEBUG 08:57:24.176 15/9/102>> job_start: 1452080, pid is 475969.
[...SNIP]

I am investigating this now, but need a quick turnaround to this. Can anyone
please answer the following:

Where else would this resource issue be logged?
Is it a machine configuration issue?
What does it indicate (too many open threads)?

Can anyone provide any additional information?

I need a quick workaround and longer term solution, all help and advise is
welcomed.

Thanks in advance.

Mat

Mathew Butler
butlergm@logica.com

This e-mail and any attachment is for authorised use by the intended
recipient(s) only. It may contain proprietary material, confidential
information and/or be subject to legal privilege. It should not be copied,
disclosed to, retained or used by, any other party. If you are not an
intended recipient then please promptly delete this e-mail and any
attachment and all copies and inform the sender. Thank you.

This e-mail and any attachment is for authorised use by the intended recipient(s) only. It may contain proprietary material, confidential information and/or be subject to legal privilege. It should not be copied, disclosed to, retained or used by, any other party. If you are not an intended recipient then please promptly delete this e-mail and any attachment and all copies and inform the sender. Thank you.



This archive was generated by hypermail 2.1.7 : Sat Apr 12 2008 - 10:48:56 EDT