Usage Note 32683: Jobs in a flow might appear to continue running after a kill command is issued
Jobs might appear to continue running in Flow Manager when you attempt to kill the job (whether via the Flow Manager GUI or via the jkill command from a command prompt). When this happens, a message similar to the following appears in the jfd.log file:
Jan 17 14:41:45 16537 38 3 JFLSFExecutionAgent::checkReturnStatus: Failed to execute command , exited with <255>. Job <9999>: Job has already finished
When a job finishes, if you use the LSF command BHIST -L job-id, you will see two events that are similar to the following:
-----------
Fri Mar 7 14:27:39: Done successfully. The CPU time used is 0.0 seconds;
Fri Mar 7 14:27:40: Post job process done successfully;
-----------
The first message, Done successfully, means that the execution of the job itself is done. The second message, Post job process done successfully, means that the post-job clean up finished successfully as well.
The problem occurs because Platform Process Manager (the jfd executable file) does not receive the signal from the second LSF event, which indicates that the job is done. Because the application does not receive the signal, it does not consider the job complete. As such, the job still appears to be running.
To circumvent the issue, force Process Manager to look for the first event. Then edit the js.conf file and add the following line:
JS_JOB_STAT_DONE=true
You must restart Process Manager (via the jfd executable file) for this new behavior to take effect. This change does not negatively affect the use of Process Manager or of any job flows.
Operating System and Release Information
SAS System | Platform Process Manager | Microsoft® Windows® for 64-Bit Itanium-based Systems | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows Server 2003 Datacenter 64-bit Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows Server 2003 Enterprise 64-bit Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows XP 64-bit Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft® Windows® for x64 | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows 2000 Advanced Server | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows 2000 Datacenter Server | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows 2000 Server | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows 2000 Professional | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows NT Workstation | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows Server 2003 Datacenter Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows Server 2003 Enterprise Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows Server 2003 Standard Edition | 3.0 | | 9.1 TS1M3 SP4 | |
Microsoft Windows XP Professional | 3.0 | | 9.1 TS1M3 SP4 | |
Windows Vista | 3.0 | | 9.1 TS1M3 SP4 | |
64-bit Enabled AIX | 3.0 | | 9.1 TS1M3 SP4 | |
64-bit Enabled HP-UX | 3.0 | | 9.1 TS1M3 SP4 | |
64-bit Enabled Solaris | 3.0 | | 9.1 TS1M3 SP4 | |
HP-UX IPF | 3.0 | | 9.1 TS1M3 SP4 | |
Linux | 3.0 | | 9.1 TS1M3 SP4 | |
Linux on Itanium | 3.0 | | 9.1 TS1M3 SP4 | |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
Jobs might appear to continue running state in Flow Manager when you attempt to kill the job (whether via the Flow Manager GUI or via the jkill command from a command prompt).
Date Modified: | 2008-09-08 14:15:15 |
Date Created: | 2008-07-16 09:31:05 |