This example demonstrates how to run the tuner in distributed mode. The example is similar to Example 14.2. The only difference between single-machine and distributed mode is that the PERFORMANCE statement specifies the number of threads and nodes to be used. The following statement changes the operating mode to distributed:
/* set the numbers of nodes and threads and get performance details */ performance nodes=5 nthreads=4 details;
The performance information and procedure task timing tables are displayed in Output 14.3.1. The NODES=5 and NTHREADS=4 options in the PERFORMANCE statement cause the tuner to run in distributed mode, where each computer node processes up to four threads simultaneously.
Output 14.3.1: Performance Information in Distributed Mode: Output
Tuner Output |
Performance Information | |
---|---|
Host Node | < your grid host > |
Execution Mode | Distributed |
Grid Mode | Symmetric |
Number of Compute Nodes | 5 |
Number of Threads per Node | 4 |
Procedure Task Timing | ||
---|---|---|
Task | Time (sec.) |
% Time |
Data Loading | 0.83 | 0.02% |
Data Transfer | 0.08 | 0.00% |
Tuner | 0.27 | 0.01% |
Solver | 3994.81 | 99.95% |
Idle | 0.96 | 0.02% |