AMD's Lisa Su displayed the world's first 7nm GPU die at Computex 2018. The GPU, which is complemented by 32GB of HBM2 memory, drops into the company's new Radeon Instinct Vega GPUs. Unfortunately for the enthusiasts among us, AMD designed the Instinct cards to capture some of the exploding AI and machine learning applications in the data center, but Su assured the crowd that the company plans to bring the new process to consumer GPUs in the future. Su didn't specify if the new gaming graphics card would feature the Vega architecture, but we expect it to come with the next-gen Navi architecture.
AMD's David Wang also announced that the company is striving to produce a new graphics product every year for the next three years, which bodes well for a consumer graphics market that has become somewhat stagnant in terms of recently released high-end graphics cards.
AMD is currently sampling the 7nm Vega GPU to its partners and will launch it to the general market in the second half of 2018. That's a full quarter ahead of expectations. The card comes bearing the fifth-gen GCN microarchitecture, but it will bring many of the benefits borne of the 7nm process. In addition, true to the GPUs' stated mission, the card has a number of optimizations specifically for AI workloads.
AMD claims the new 7nm process is twice as dense as its 14nm process, and the 7nm Vega die appears to be roughly 40% smaller than its predecessor. The new process also affords a 2x reduction in power efficiency and AMD also claims it provides a 1.35x increase in performance. This very specific metric indicates that AMD is relatively far down the development path with its working silicon. The company seems increasingly confident with the new 7nm process, as evidenced by its demo of a 7nm EPYC CPU at the same event.
Another fundamental change comes in the form of the Infinity Fabric. AMD designed this coherent interconnect to facilitate communication between its components inside discrete packages, as we see with the Ryzen and Threadripper processors. It also makes an appearance in the Vega 10 for intra-device communication.
AMD's new tactic extends the fabric outside of the GPU to speed peer-to-peer (P2P) communication between GPUs. This approach is similar to Nvidia's NVLink implementation, so it purportedly reduces latency and boosts throughput between graphics cards. We've seen several other P2P implementations take root in the broader component landscape, such as P2P fabrics that speed communication between storage devices and GPUs, but it appears the industry is stratifying into proprietary solutions.
In either case, it's logical to expect the Infinity Fabric to eventually extend to communication between the CPU and GPU, which could provide AMD yet another advantage as the only producer of both x86 processors and GPUs. Interconnects are increasingly becoming more of a strategic asset as companies transition to heterogeneous computing architectures, and it's a positive sign that AMD's Infinity Fabric continues to pay dividends.
AMD has also infused the GPU with support for a new set of deep learning operations that are likely designed to boost performance and efficiency.
AMD originally planned to release 12nm GPUs, which it announced last year, but made the strategic decision to remove those products from the roadmap and skip to the 7nm process instead. AMD has 7nm Navi up next on its roadmap, followed by new graphics cards with a 7nm+ process that should arrive before the end of 2020.
AMD demoed the GPU running a Cinema4D rendering workload with the company's open-source Radeon Pro Render ray tracing solution. AMD continues to leverage the open source community for its Radeon Open Ecosystem (ROCm) software solutions, which stands in stark contrast to other vendors' proprietary software, like CUDA.
Neural networks are becoming increasingly complex, and, as a byproduct, voluminous, so AI-driven architectures increasingly require more memory capacity to keep pace. AMD's 32GB of HBM2 should put the company on par with Nvidia's recent adjustment that boosts the Tesla V100 up to 32GB, but we aren't sure if the company will also offer versions with 16GB of HBM2. That could be an asset given the current sky-high pricing for HBM2 memory. AMD didn't reveal specifications or pricing for the new cards or a definitive timeline for mainstream graphics cards.