================================================================================ HPLinpack 2.1 -- High-Performance Linpack benchmark -- October 26, 2012 Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK Modified by Julien Langou, University of Colorado Denver ================================================================================ An explanation of the input/output parameters follows: T/V : Wall time / encoded variant. N : The order of the coefficient matrix A. NB : The partitioning blocking factor. P : The number of process rows. Q : The number of process columns. Time : Time in seconds to solve the linear system. Gflops : Rate of execution for solving the linear system. The following parameter values will be used: N : 80640 NB : 896 PMAP : Row-major process mapping P : 2 Q : 1 PFACT : Left NBMIN : 2 NDIV : 2 RFACT : Right BCAST : 2ringM DEPTH : 0 SWAP : Spread-roll (long) L1 : no-transposed form U : transposed form EQUIL : yes ALIGN : 8 double precision words -------------------------------------------------------------------------------- - The matrix A is randomly generated for each test. - The following scaled residual check will be computed: ||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N ) - The relative machine precision (eps) is taken to be 1.110223e-16 - Computational tests pass if scaled residuals are less than 16.0 displaying Prog:%complete, N:columns, Time:seconds iGF:instantaneous GF, GF:avg GF, GF_per: process GF trsm_cutoff from environment variable 16000 gpu_dgemm_split from environment variable 0.970  Prog= 3.30% N_left= 79744 Time= 7.08 iGF= 1628.24 GF= 1628.24 iGF_per= 814.12 GF_per= 814.12   Prog= 6.52% N_left= 78848 Time= 11.17 iGF= 2041.17 GF= 2041.17 iGF_per= 1020.59 GF_per= 1020.59   Prog= 9.67% N_left= 77952 Time= 15.57 iGF= 2171.00 GF= 2171.00 iGF_per= 1085.50 GF_per= 1085.50   Prog= 12.75% N_left= 77056 Time= 19.45 iGF= 2670.74 GF= 2291.42 iGF_per= 1335.37 GF_per= 1145.71   Prog= 15.76% N_left= 76160 Time= 23.39 iGF= 2641.73 GF= 2355.05 iGF_per= 1320.86 GF_per= 1177.52   Prog= 18.70% N_left= 75264 Time= 27.13 iGF= 2729.53 GF= 2408.97 iGF_per= 1364.76 GF_per= 1204.49   Prog= 21.57% N_left= 74368 Time= 30.80 iGF= 2714.78 GF= 2447.44 iGF_per= 1357.39 GF_per= 1223.72   Prog= 24.37% N_left= 73472 Time= 34.45 iGF= 2721.34 GF= 2472.63 iGF_per= 1360.67 GF_per= 1236.32   Prog= 27.10% N_left= 72576 Time= 38.00 iGF= 2703.82 GF= 2493.28 iGF_per= 1351.91 GF_per= 1246.64   Prog= 29.77% N_left= 71680 Time= 41.37 iGF= 2714.32 GF= 2515.59 iGF_per= 1357.16 GF_per= 1257.79   Prog= 32.37% N_left= 70784 Time= 44.81 iGF= 2700.88 GF= 2525.39 iGF_per= 1350.44 GF_per= 1262.69   Prog= 34.90% N_left= 69888 Time= 48.03 iGF= 2718.35 GF= 2540.31 iGF_per= 1359.18 GF_per= 1270.15   Prog= 37.38% N_left= 68992 Time= 51.22 iGF= 2698.91 GF= 2550.86 iGF_per= 1349.46 GF_per= 1275.43   Prog= 39.78% N_left= 68096 Time= 54.29 iGF= 2733.33 GF= 2561.72 iGF_per= 1366.66 GF_per= 1280.86   Prog= 42.13% N_left= 67200 Time= 57.24 iGF= 2743.13 GF= 2572.94 iGF_per= 1371.57 GF_per= 1286.47   Prog= 44.41% N_left= 66304 Time= 60.29 iGF= 2714.11 GF= 2575.41 iGF_per= 1357.06 GF_per= 1287.70   Prog= 46.64% N_left= 65408 Time= 63.07 iGF= 2729.40 GF= 2585.05 iGF_per= 1364.70 GF_per= 1292.53   Prog= 48.80% N_left= 64512 Time= 65.70 iGF= 2758.79 GF= 2596.85 iGF_per= 1379.40 GF_per= 1298.43   Prog= 50.90% N_left= 63616 Time= 68.28 iGF= 2837.45 GF= 2606.10 iGF_per= 1418.73 GF_per= 1303.05   Prog= 52.95% N_left= 62720 Time= 70.75 iGF= 2872.96 GF= 2616.31 iGF_per= 1436.48 GF_per= 1308.16   Prog= 54.94% N_left= 61824 Time= 73.85 iGF= 2630.39 GF= 2600.55 iGF_per= 1315.19 GF_per= 1300.28   Prog= 56.87% N_left= 60928 Time= 76.18 iGF= 2641.11 GF= 2609.72 iGF_per= 1320.55 GF_per= 1304.86   Prog= 58.74% N_left= 60032 Time= 78.47 iGF= 2624.63 GF= 2617.13 iGF_per= 1312.31 GF_per= 1308.56   Prog= 60.56% N_left= 59136 Time= 80.67 iGF= 2883.22 GF= 2624.46 iGF_per= 1441.61 GF_per= 1312.23   Prog= 62.33% N_left= 58240 Time= 82.82 iGF= 2872.93 GF= 2630.84 iGF_per= 1436.46 GF_per= 1315.42   Prog= 64.04% N_left= 57344 Time= 84.93 iGF= 2867.78 GF= 2636.19 iGF_per= 1433.89 GF_per= 1318.09   Prog= 65.70% N_left= 56448 Time= 87.24 iGF= 2732.84 GF= 2632.62 iGF_per= 1366.42 GF_per= 1316.31   Prog= 67.31% N_left= 55552 Time= 89.15 iGF= 2749.81 GF= 2639.29 iGF_per= 1374.90 GF_per= 1319.64   Prog= 68.86% N_left= 54656 Time= 91.20 iGF= 2685.89 GF= 2639.61 iGF_per= 1342.95 GF_per= 1319.80   Prog= 70.37% N_left= 53760 Time= 93.17 iGF= 2753.83 GF= 2640.33 iGF_per= 1376.91 GF_per= 1320.17   Prog= 71.83% N_left= 52864 Time= 95.11 iGF= 2651.07 GF= 2640.02 iGF_per= 1325.54 GF_per= 1320.01   Prog= 73.24% N_left= 51968 Time= 96.86 iGF= 2700.74 GF= 2643.18 iGF_per= 1350.37 GF_per= 1321.59   Prog= 74.60% N_left= 51072 Time= 98.64 iGF= 2701.53 GF= 2643.72 iGF_per= 1350.76 GF_per= 1321.86   Prog= 75.91% N_left= 50176 Time= 100.32 iGF= 2742.38 GF= 2645.33 iGF_per= 1371.19 GF_per= 1322.67   Prog= 77.18% N_left= 49280 Time= 101.98 iGF= 2692.29 GF= 2645.65 iGF_per= 1346.15 GF_per= 1322.82   Prog= 78.40% N_left= 48384 Time= 103.54 iGF= 2717.00 GF= 2647.19 iGF_per= 1358.50 GF_per= 1323.59   Prog= 79.58% N_left= 47488 Time= 105.05 iGF= 2709.47 GF= 2648.22 iGF_per= 1354.74 GF_per= 1324.11   Prog= 80.71% N_left= 46592 Time= 106.50 iGF= 2735.62 GF= 2649.46 iGF_per= 1367.81 GF_per= 1324.73   Prog= 81.80% N_left= 45696 Time= 107.93 iGF= 2708.34 GF= 2649.68 iGF_per= 1354.17 GF_per= 1324.84   Prog= 82.85% N_left= 44800 Time= 109.27 iGF= 2714.79 GF= 2650.79 iGF_per= 1357.39 GF_per= 1325.40   Prog= 83.86% N_left= 43904 Time= 110.59 iGF= 2693.17 GF= 2651.08 iGF_per= 1346.58 GF_per= 1325.54   Prog= 84.83% N_left= 43008 Time= 111.79 iGF= 2741.45 GF= 2652.85 iGF_per= 1370.72 GF_per= 1326.42   Prog= 85.76% N_left= 42112 Time= 113.01 iGF= 2714.18 GF= 2652.89 iGF_per= 1357.09 GF_per= 1326.45   Prog= 86.65% N_left= 41216 Time= 114.14 iGF= 2740.42 GF= 2653.86 iGF_per= 1370.21 GF_per= 1326.93   Prog= 87.50% N_left= 40320 Time= 115.26 iGF= 2691.87 GF= 2654.02 iGF_per= 1345.94 GF_per= 1327.01   Prog= 88.31% N_left= 39424 Time= 116.29 iGF= 2726.67 GF= 2654.97 iGF_per= 1363.34 GF_per= 1327.49   Prog= 89.09% N_left= 38528 Time= 117.29 iGF= 2716.40 GF= 2655.54 iGF_per= 1358.20 GF_per= 1327.77   Prog= 89.84% N_left= 37632 Time= 118.23 iGF= 2745.49 GF= 2656.32 iGF_per= 1372.75 GF_per= 1328.16   Prog= 90.55% N_left= 36736 Time= 119.17 iGF= 2710.48 GF= 2656.31 iGF_per= 1355.24 GF_per= 1328.16   Prog= 91.22% N_left= 35840 Time= 120.02 iGF= 2723.56 GF= 2657.09 iGF_per= 1361.78 GF_per= 1328.54   Prog= 91.86% N_left= 34944 Time= 120.84 iGF= 2713.05 GF= 2657.55 iGF_per= 1356.53 GF_per= 1328.77   Prog= 92.47% N_left= 34048 Time= 121.62 iGF= 2749.10 GF= 2658.18 iGF_per= 1374.55 GF_per= 1329.09   Prog= 93.05% N_left= 33152 Time= 122.25 iGF= 2873.35 GF= 2661.03 iGF_per= 1436.67 GF_per= 1330.51   Prog= 93.60% N_left= 32256 Time= 122.98 iGF= 2846.63 GF= 2660.83 iGF_per= 1423.31 GF_per= 1330.41   Prog= 94.12% N_left= 31360 Time= 123.69 iGF= 2776.82 GF= 2660.17 iGF_per= 1388.41 GF_per= 1330.08   Prog= 94.61% N_left= 30464 Time= 124.36 iGF= 2577.92 GF= 2659.61 iGF_per= 1288.96 GF_per= 1329.81   Prog= 95.07% N_left= 29568 Time= 124.99 iGF= 2549.72 GF= 2659.04 iGF_per= 1274.86 GF_per= 1329.52   Prog= 95.51% N_left= 28672 Time= 125.57 iGF= 2571.11 GF= 2658.83 iGF_per= 1285.56 GF_per= 1329.42   Prog= 95.91% N_left= 27776 Time= 126.15 iGF= 2546.45 GF= 2658.01 iGF_per= 1273.22 GF_per= 1329.00   Prog= 96.30% N_left= 26880 Time= 126.66 iGF= 2575.88 GF= 2657.94 iGF_per= 1287.94 GF_per= 1328.97   Prog= 96.65% N_left= 25984 Time= 127.15 iGF= 2546.57 GF= 2657.44 iGF_per= 1273.28 GF_per= 1328.72   Prog= 96.99% N_left= 25088 Time= 127.61 iGF= 2575.34 GF= 2657.06 iGF_per= 1287.67 GF_per= 1328.53   Prog= 97.30% N_left= 24192 Time= 128.04 iGF= 2540.64 GF= 2656.68 iGF_per= 1270.32 GF_per= 1328.34   Prog= 97.59% N_left= 23296 Time= 128.42 iGF= 2564.88 GF= 2656.52 iGF_per= 1282.44 GF_per= 1328.26   Prog= 97.86% N_left= 22400 Time= 128.81 iGF= 2532.47 GF= 2655.90 iGF_per= 1266.24 GF_per= 1327.95   Prog= 98.10% N_left= 21504 Time= 129.13 iGF= 2570.06 GF= 2655.94 iGF_per= 1285.03 GF_per= 1327.97   Prog= 98.33% N_left= 20608 Time= 129.45 iGF= 2525.74 GF= 2655.48 iGF_per= 1262.87 GF_per= 1327.74   Prog= 98.54% N_left= 19712 Time= 129.75 iGF= 2541.59 GF= 2655.08 iGF_per= 1270.80 GF_per= 1327.54   Prog= 98.73% N_left= 18816 Time= 130.06 iGF= 2362.67 GF= 2653.86 iGF_per= 1181.34 GF_per= 1326.93   Prog= 98.90% N_left= 17920 Time= 130.32 iGF= 2295.73 GF= 2653.08 iGF_per= 1147.86 GF_per= 1326.54   Prog= 99.06% N_left= 17024 Time= 130.57 iGF= 2207.05 GF= 2652.25 iGF_per= 1103.52 GF_per= 1326.13   Prog= 99.20% N_left= 16128 Time= 130.71 iGF= 2507.03 GF= 2653.12 iGF_per= 1253.51 GF_per= 1326.56   Prog= 99.33% N_left= 15232 Time= 130.90 iGF= 2547.95 GF= 2652.61 iGF_per= 1273.97 GF_per= 1326.31   Prog= 99.44% N_left= 14336 Time= 131.09 iGF= 2563.53 GF= 2651.90 iGF_per= 1281.76 GF_per= 1325.95   Prog= 99.54% N_left= 13440 Time= 131.25 iGF= 2183.09 GF= 2651.19 iGF_per= 1091.55 GF_per= 1325.59   Prog= 99.62% N_left= 12544 Time= 131.39 iGF= 2120.89 GF= 2650.63 iGF_per= 1060.45 GF_per= 1325.31   Prog= 99.70% N_left= 11648 Time= 131.53 iGF= 2034.96 GF= 2649.80 iGF_per= 1017.48 GF_per= 1324.90   Prog= 99.76% N_left= 10752 Time= 131.64 iGF= 2032.63 GF= 2649.36 iGF_per= 1016.31 GF_per= 1324.68   Prog= 99.82% N_left= 9856 Time= 131.74 iGF= 1941.21 GF= 2648.75 iGF_per= 970.61 GF_per= 1324.37   Prog= 99.86% N_left= 8960 Time= 131.84 iGF= 1889.25 GF= 2648.05 iGF_per= 944.62 GF_per= 1324.02   Prog= 99.90% N_left= 8064 Time= 131.93 iGF= 1642.40 GF= 2647.13 iGF_per= 821.20 GF_per= 1323.57   Prog= 99.93% N_left= 7168 Time= 132.01 iGF= 1441.76 GF= 2646.26 iGF_per= 720.88 GF_per= 1323.13   Prog= 99.95% N_left= 6272 Time= 132.10 iGF= 1209.02 GF= 2645.21 iGF_per= 604.51 GF_per= 1322.61   Prog= 99.97% N_left= 5376 Time= 132.17 iGF= 1037.40 GF= 2644.25 iGF_per= 518.70 GF_per= 1322.12   Prog= 99.98% N_left= 4480 Time= 132.24 iGF= 825.21 GF= 2643.16 iGF_per= 412.61 GF_per= 1321.58   Prog= 99.99% N_left= 3584 Time= 132.30 iGF= 669.24 GF= 2642.22 iGF_per= 334.62 GF_per= 1321.11   Prog= 100.00% N_left= 2688 Time= 132.36 iGF= 485.23 GF= 2641.20 iGF_per= 242.61 GF_per= 1320.60   Prog= 100.00% N_left= 1792 Time= 132.40 iGF= 348.25 GF= 2640.37 iGF_per= 174.13 GF_per= 1320.18   Prog= 100.00% N_left= 896 Time= 132.44 iGF= 208.61 GF= 2639.56 iGF_per= 104.31 GF_per= 1319.78  ================================================================================ T/V N NB P Q Time Gflops -------------------------------------------------------------------------------- WR03R2L2 80640 896 2 1 132.84 2.632e+03 -------------------------------------------------------------------------------- ||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0032065 ...... PASSED ================================================================================ Finished 1 tests with the following results: 1 tests completed and passed residual checks, 0 tests completed and failed residual checks, 0 tests skipped because of illegal input values. -------------------------------------------------------------------------------- End of Tests. ================================================================================