This example demonstrates how to run the tuner in distributed mode. The example is similar to Example 14.2. The only difference between single-machine and distributed mode is that the PERFORMANCE statement specifies the number of threads and nodes to be used. The following statement changes the operating mode to distributed:
/* set the numbers of nodes and threads and get performance details */ performance nodes=5 nthreads=4 details;
The performance information and procedure task timing tables are displayed in Output 14.3.1. The NODES=5 and NTHREADS=4 options in the PERFORMANCE statement cause the tuner to run in distributed mode, where each computer node processes up to four threads simultaneously.
Output 14.3.1: Performance Information in Distributed Mode: Output
| Tuner Output |
| Performance Information | |
|---|---|
| Host Node | < your grid host > |
| Execution Mode | Distributed |
| Grid Mode | Symmetric |
| Number of Compute Nodes | 5 |
| Number of Threads per Node | 4 |
| Procedure Task Timing | ||
|---|---|---|
| Task | Time (sec.) |
% Time |
| Data Loading | 0.83 | 0.02% |
| Data Transfer | 0.08 | 0.00% |
| Tuner | 0.27 | 0.01% |
| Solver | 3994.81 | 99.95% |
| Idle | 0.96 | 0.02% |