---------Tech News Updates----


Breaking

Have You Heard of NEWSPAY

NewsPay.ng

Tech tutorials

Game news update

Social media trends

Latest News on social media

Tech infos

Updates on tech Tutorials

Free browsing and android tricks update

Tech news update

Thursday, 11 May 2017

Nvidia Details Volta GV100 GPU, Tesla V100 Accelerator

 


The Volta GV100 GPU Architecture


The Volta GV100 GPU uses the 12nm TSMC FFN process, has over 21 billion transistors, and is designed for deep learning applications.

We're talking about an 815mm2 die here, which pushes the limits of TSMC's current capabilities. Nvidia said it's not possible to build a larger GPU on the current process technology.

 The GP100 was the largest GPU that Nvidia ever produced before the GV100. It took up a 610mm2 surface area and housed 15.3 billion transistors. The GV100 is more than 30% larger.

 

Like the GP100, we get two SMs per TPC; 42 TPC overall in GV100. And that rolls up into six GPCs.

GV100 also features four HBM2 memory emplacements, like GP100, with each stack controlled by a pair of memory controllers. Speaking of which, there are eight 512-bit memory controllers (giving this GPU a total memory bus width of 4,096-bit).

Each memory controller is attached to 768KB of L2 cache, for a total of 6MB of L2 cache (vs 4MB for Pascal).

Tesla V100


The new Nvidia Tesla V100 features 80 SMs for a total of 5,120 CUDA cores. However, it has the potential to reach 7.5, 15, and 120 TFLOPs in FP64, FP32, and Tensor computations, respectively.

The Tesla V100 sports 16GB of HBM2 memory, which is capable of reaching up to 900 GB/s. The Samsung memory that Nvidia installed on the Tesla V100 is also 180 GB/s faster than the memory found on the Tesla P100 cards.

Nvidia said it used the fastest memory available on the market.

The Tesla V100 also introduces the second generation of NVLink, which allows for up to 300 GB/s over six 25GB/s NVLinks per GPU. 

 


V100P100
  • SMs
8056
  • Cores
- 5,120 (FP32)
- 2,560 (FP64)
- 3,584 (FP32)

- 1,792 (FP64)
  • Boost Clock
1,455MHz1,480MHz
  • TFLOPs
- 7.5 (FP64)
- 15 (FP32)
- 120 Tensor
- 5.3 (FP64)

- 10.3 (FP32)
  • Texture Units
320
224
  • Memory
16GB 4096-bit HBM216GB 4096-bit HBM2
  • Data Rate
900 GB/s720 GB/s
  • Transistors
21.1 Billion
15.3 Billion
  • Manufacturing Process
12nm FFN16nm FinFET+

 

 

 

 

No comments:

Post a Comment

Make money now why reading news

NewsPay.ng