Motivation for and Evaluation of the First Tensor Processing Unit
IEEE Micro, 2018, Vol. 38(3), pp. 10–19
Top 1% of 2018 papers by citations
Abstract
The first-generation tensor processing unit (TPU) runs deep neural network (DNN) inference 15–30 times faster, with 30–80 times better energy efficiency, than contemporary CPUs and GPUs in similar semiconductor technologies. This domain-specific architecture (DSA) is a custom chip that has been deployed in Google datacenters since 2015, where it serves billions of people.