Problem Note 69499: Scheduled jobs in SAS® Platform LSF fail to start and return "Error: Pending: Failed in talking to server to start the job"
When you try to schedule jobs in SAS Platform LSF, they fail to start, and the following error is generated in Object Spawner log:
INFO [00068696] :user- Created grid <job-id> using credentials user (child id <>).
ERROR [00068609] :user- The specified uuid XXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX did not match any process managed by this spawner.
ERROR [00068609] :user- Objspawn was unable to launch the server SASApp - Workspace Server () because the Grid provider has exceeded the specified wait time.
INFO [00068609] :user- Client connection 8227 for user closed.
You can run the bhist command for a grid job from any machine to get more information:
bhist -u all -n 0 -l <Job -id>
In this scenario, the following error is generated when you run this command:
Wed Aug 17 14:18:38: Pending: Failed in talking to server to start the job;
This issue is caused by the sbatchd daemon being busy handling too many concurrent jobs on the execution host.
To eliminate the daemon being too busy as a possible cause, add the following to the end of your lsf.conf file:
LSF_CALL_PIM_SELECT_TIMEOUT=30
ego.conf:
EGO_PIM_SLEEPTIME_UPDATE=Y
EGO_PIM_SLEEPTIME=28800
Then run the following as root:
# lsadmin limrestart
# badmin mbdrestart
# badmin hrestart
Make sure that users that connect to the grid or validate the server have execute permissions on /home and /tmp.
Operating System and Release Information
| SAS System | Platform LSF | Microsoft® Windows® for x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8 Enterprise 32-bit | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8 Enterprise x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8 Pro 32-bit | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8 Pro x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8.1 Enterprise 32-bit | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8.1 Enterprise x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8.1 Pro 32-bit | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 8.1 Pro x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows 10 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2008 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2008 R2 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2008 for x64 | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2012 Datacenter | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2012 R2 Datacenter | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2012 R2 Std | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2012 Std | 10.1 | | 9.4 TS1M5 | |
| Microsoft Windows Server 2016 | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Enterprise 32 bit | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Enterprise x64 | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Home Premium 32 bit | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Home Premium x64 | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Professional 32 bit | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Professional x64 | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Ultimate 32 bit | 10.1 | | 9.4 TS1M5 | |
| Windows 7 Ultimate x64 | 10.1 | | 9.4 TS1M5 | |
| 64-bit Enabled AIX | 10.1 | | 9.4 TS1M5 | |
| 64-bit Enabled Solaris | 10.1 | | 9.4 TS1M5 | |
| HP-UX IPF | 10.1 | | 9.4 TS1M5 | |
| Linux for x64 | 10.1 | | 9.4 TS1M5 | |
| Solaris for x64 | 10.1 | | 9.4 TS1M5 | |
*
For software releases that are not yet generally available, the Fixed
Release is the software release in which the problem is planned to be
fixed.
| Type: | Problem Note |
| Priority: | medium |
| Date Modified: | 2022-08-30 11:01:59 |
| Date Created: | 2022-08-24 12:55:21 |