The Recurrent Processing Unit: Hardware for High Speed Machine Learning

Published 12 Dec 2019 in cs.ET | (1912.07363v1)

Abstract: Machine learning applications are computationally demanding and power intensive. Hardware acceleration of these software tools is a natural step being explored using various technologies. A recurrent processing unit (RPU) is fast and power-efficient hardware for machine learning under development at the University of Maryland. It comprises a recurrent neural network and a trainable output vector, which together form a hardware implementation of a reservoir computer. The reservoir is currently realized on both Xilinx 7-series and UltraScale+ Zynq SoCs using an autonomous Boolean network for processing and a Python-based software API. The RPU is capable of classifying up to 40M MNIST images per second with the reservoir consuming under 261 mW of power. Using an array of 2048 unclocked gates with roughly 100 ps transition times, we achieve about 20 TOPS and 75 TOPS/W.
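
The throughput figures in the abstract can be sanity-checked with a short calculation. The sketch below assumes that each gate contributes one operation per transition time and that the 261 mW upper bound is the power used for the efficiency figure; the abstract does not state the authors' exact accounting, so this is one plausible reading rather than their method. All names in the code are illustrative.

```python
# Back-of-the-envelope check of the abstract's throughput figures.
# Assumption (not stated in the abstract): one operation per gate
# per transition time, with all gates switching continuously.

GATES = 2048                # unclocked gates in the reservoir
TRANSITION_TIME_S = 100e-12 # roughly 100 ps per gate transition
RESERVOIR_POWER_W = 0.261   # upper bound quoted for the reservoir


def peak_ops_per_second(gates: int, transition_time_s: float) -> float:
    """Peak operation rate if every gate switches once per transition time."""
    return gates / transition_time_s


def to_tera(value: float) -> float:
    """Convert a raw count to units of 10^12."""
    return value / 1e12


tops = to_tera(peak_ops_per_second(GATES, TRANSITION_TIME_S))
tops_per_watt = tops / RESERVOIR_POWER_W

print(f"peak throughput: {tops:.2f} TOPS")
print(f"efficiency at 261 mW: {tops_per_watt:.1f} TOPS/W")
```

Under these assumptions the estimate lands near 20 TOPS, matching the abstract, and in the high 70s for TOPS/W, the same range as the quoted 75 TOPS/W. The small gap suggests the authors used slightly different rounding or power accounting.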
