Sunway TaihuLight - China's newly crowned supercomputer
The latest Top500 list of the world’s fastest supercomputers was released on Monday 20th June 2016. The Top500 list was first introduced in June 1993, and it has been compiled twice a year ever since, pitting the world’s most powerful machines against each other.
China maintained its top spot as the home of the fastest supercomputer in the planet, thanks to the Sunway TaihuLight supercomputer, which scored a staggering 93 PETAFLOP/s on the LINPACK benchmark. This is roughly three times faster than the Tianhe-2 supercomputer, which has been the pacesetter for the past three years.
So let us dip into the data, which you can download from here.
The above table shows the five fastest supercomputers. The ‘Rpeak’ and ‘Rmax’ columns represent the advertized peak performance and the maximum performance achieved in TeraFLOPS.
So, the world’s fastest computer can perform at about 93 PetaFLOPS! For some reason, clock cycle per second (=Hz) is favored when it comes to measuring performance of computers. So let’s make a (highly approximative) conversion between FLOPS and Hz. Assume that the Sunway TaihuLight processes one operation per clock cycle, this would correspond to 93PHz. To put it into perspective, let’s see what it means in comparison to an off-the-shelf consumer laptop computer having a 2.4GHz double core processor. Furthermore, let’s generously assume that the laptop completes one operation per cycle, and that it can perfectly parallelize (so that its maximum total speed would be 4.8GHz :)). Even with such relaxed assumptions, the Sunway TaihuLight would be 93015000/4.8 ≈ 20 × times faster. In other words, a complex operation that takes 10 minutes on the poor laptop would be completed within 30 microseconds by the Sunway TaihuLight.
Enough with the performance. Let’s look at the “Power” and “Mflops/Watt” columns of the above dataframe a bit more closely. The former gives the power consumption values in kWatts, while the latter highlights energy efficiency of the supercomputers. For example, the Sunway TaihuLight can perform about 6051 million floating point operations per second for each Watt of power it consumes.
It goes without saying that such supercomputers are expensive to build, which justifies the hefty price for getting one. But the purchase price of supercomputers is only one portion of the Total Cost of Ownership (TCO). Energy consumption is another factor to be considered. Again taking the world’s fastest supercomputer as an example, it consumes a whopping 15371 kWatt. This amounts to the electricity consumption of about 12,000 American homes. To get a rough estimate of the electricity bill, let’s assume a modest 0.05 cents/kWh and that the machine will be running at peak power for 10 hours/day. This will result in
!
By now it should be obvious that one needs to keep a tab on the energy consumption of supercomputers for a complete comparison. In fact there are organizations solely focusing on energy efficiency, like The Green500 and The Green Grid So, now, let’s rank the supercomputers based on their energy efficiency, that is, sort them based on the “Mflops/Watt” column and introduce the “GreenRank” column.
So, the Shoubu supercomputer from Japan wins the “Mflops/Watt” battle. But the Sunway TaihuLight is not far off, coming third next to the Satsuki. The next couple of plots show how the maximum performance achieved (Rmax) relates to the power consumption.
It comes as no surprise that there is a strong relationship between power consumption and performance. Thus, one needs to plan carefully in order to meet computation requirements while still having a reasonable power consumption. Pick a system way faster than your needs, and your energy consumption would shoot up unnecessarily. On the other hand, underprovisioning for the sake of energy savings may end up impairing the performance. Thus one needs to optimize between performance and energy.
For this, lets consider a ranking scheme where the rank of a supercomputer would be the sum of its performance rank (the ‘Rank’ column) and its energy efficiency rank (the ‘GreenRank column’). Although this is an over symplified and synthetic ranking scheme, it would at least depict where a supercomputer lies in the spectrum when both performance and energy efficiency are given equal weight. The resulting rank is depicted in the following bar plot (Shorter bars correspond to better rank).
Concluding the Performance-Energy thing, let’s look at the “Processor Generation” used in these supercomputers. As shown below the Intel Xeon E5 and E7 processors take the lion’s share, although the world’s fastest supercomputer runs on the ‘sunway’ processors.
Processor family | Count |
---|---|
Intel Xeon E5 (Broadwell) | 21 |
Intel Xeon E5 (Haswell) | 221 |
Intel Xeon E5 (IvyBridge) | 118 |
Intel Xeon E5 (SandyBridge) | 66 |
Intel Xeon E7 (Haswell-Ex) | 7 |
Intel Xeon E7 (IvyBridge) | 8 |
Intel Xeon Phi | 1 |
Opteron 6100-series | 3 |
Opteron 6200 Series | 5 |
Opteron 6300 Series | 5 |
POWER7 | 4 |
Power BQC | 19 |
SPARC64 IXfx | 1 |
SPARC64 VIIIfx | 1 |
SPARC64 XIfx | 5 |
ShenWei | 1 |
Sunway | 1 |
Xeon 5500-series (Nehalem-EP | 2 |
Xeon 5500-series (Nehalem-EX) | 1 |
Xeon 5600-series (Westmere-EP) | 9 |
Xeon E7 (Westmere-EX) | 1 |